Data Scientist Skill Set
by 喆 徐
1. Data Munging
1.1. Informatica
1.2. SSIS
1.3. Open-source tools
2. Big Data
2.1. Apache Hadoop
2.1.1. HDFS
2.1.2. MapReduce
2.1.3. Flume
2.1.4. Hive
2.1.5. Pig
2.1.6. HBase
2.1.7. YARN
2.1.8. Oozie
2.2. NoSQL
2.2.1. MongoDB
2.2.2. Couchbase
2.2.3. Neo4j
2.2.4. Cassandra