Data Scientist Skill Set
af 喆 徐

1. Data Munging
1.1. Informatica
1.2. SSIS
1.3. Open Source ones
2. Big Data
2.1. Apache Hadoop
2.1.1. HDFS
2.1.2. MapReduce
2.1.3. Flume
2.1.4. Hive
2.1.5. Pig
2.1.6. Hbase
2.1.7. YARN
2.1.8. Oozie
2.2. NoSQL
2.2.1. MongoDB
2.2.2. Couchbase
2.2.3. Neo4J
2.2.4. Cassandra