Data Scientist Skill Set
Door 喆 徐

1. Data Munging
1.1. Informatica
1.2. SSIS
1.3. Open Source ones
2. Big Data
2.1. Apache Hadoop
2.1.1. HDFS
2.1.2. MapReduce
2.1.3. Flume
2.1.4. Hive
2.1.5. Pig
2.1.6. Hbase
2.1.7. YARN
2.1.8. Oozie
2.2. NoSQL
2.2.1. MongoDB
2.2.2. Couchbase
2.2.3. Neo4J
2.2.4. Cassandra