Big Data Architecture
Door Thanachart Numnonda

1. Data Ingestion
1.1. Sqoop
1.2. Flume
1.3. KafKa
2. Hadoop Tools
2.1. Cluster Management
2.2. Search
3. Storage
3.1. Hadoop HDFS
3.2. NoSQL
3.2.1. HBase
3.3. RDBMS
3.4. Object Storage
4. Processing
4.1. Batch
4.1.1. MapReduce
4.1.2. Hive
4.1.3. Pig
4.2. SQL
4.3. RealTime
4.3.1. Impala
4.3.2. Spark
4.4. Machine Learning
4.4.1. Mahout
4.4.2. Spark