Data Scientist Skill Set
da 喆 徐

1. Data Munging
1.1. Informatica
1.2. SSIS
1.3. Open Source ones
2. Big Data
2.1. Apache Hadoop
2.1.1. HDFS
2.1.2. MapReduce
2.1.3. Flume
2.1.4. Hive
2.1.5. Pig
2.1.6. Hbase
2.1.7. YARN
2.1.8. Oozie
2.2. NoSQL
2.2.1. MongoDB
2.2.2. Couchbase
2.2.3. Neo4J
2.2.4. Cassandra