Big Data Architecture
by Thanachart Numnonda

1. Data Ingestion
1.1. Sqoop
1.2. Flume
1.3. Kafka
2. Hadoop Tools
2.1. Cluster Management
2.2. Search
3. Storage
3.1. Hadoop HDFS
3.2. NoSQL
3.2.1. HBase
3.3. RDBMS
3.4. Object Storage
4. Processing
4.1. Batch
4.1.1. MapReduce
4.1.2. Hive
4.1.3. Pig
4.2. SQL
4.3. Real-Time
4.3.1. Impala
4.3.2. Spark
4.4. Machine Learning
4.4.1. Mahout
4.4.2. Spark