登録は簡単!. 無料です
または 登録 あなたのEメールアドレスで登録
Hadoop World により Mind Map: Hadoop World

1. Lambda

1.1. Twitter Summingbird

1.2. Lambdoop

2. Architectures

3. Relational Databases

3.1. General

3.1.1. Cloudera Impala

3.2. Warehouse (OLAP)

3.2.1. Apache Hive

3.2.1.1. Apache Tajo

3.2.1.2. Spark SQL

3.3. Query Language

3.3.1. Hive-QL

3.3.2. SQL

3.4. Read-Only/Low Latency

3.4.1. SploutSQL

3.5. Transactions

3.5.1. Row-Based ACID

3.5.2. ACID

3.5.3. Eventually Consistent

3.5.4. Eventually Durable

3.6. Interfaces

3.6.1. Software

3.6.1.1. Apache Thrift

3.6.2. JDBC

3.6.3. ODBC

3.7. Transactional

3.7.1. Splice

3.7.2. Stinger.next/Apache Hive

4. Event Processing

4.1. Spark Streaming

5. Use Cases

5.1. Large-Scale Logging and Failure Analysis

5.1.1. Apache Chukwa

5.2. Predictive Maintenance

5.3. Personalized Advertisement

5.4. Master Data Management

5.5. Preference Learning

5.6. Gamification

5.7. Business Warehouse

6. NoSQL

6.1. Data Storage Type

6.1.1. Key/Value

6.1.1.1. Apache HBase

6.1.1.1.1. SQL

6.1.1.2. Apache Accumulo

6.1.2. Graph

6.1.2.1. Neo4J

6.1.2.2. Apache Giraph

6.1.2.2.1. Bagel

6.1.2.3. GraphX

6.1.3. Columnar

6.1.3.1. Parquet (Storageformat)

6.1.3.2. Apache Drill

6.1.3.3. Apache Cassandra

6.1.4. GIS

6.1.4.1. GIS Tools for Hadoop

6.1.4.2. Spatial Hadoop

6.2. Query Language

6.2.1. Apache Pig

7. Processing Paradigm

7.1. Batch-Processing

7.1.1. Map-Reduce

7.1.2. TEZ

7.1.3. Spark

7.2. Stream-Processing(Realtime)

7.2.1. Software

7.2.1.1. Apache Spark

7.2.1.2. Apache Flink

7.3. Integration of both

7.3.1. Software

7.3.1.1. Twitter Summingbird

7.4. in-memory

7.4.1. Apache Tez

7.4.2. Apache Tachyon

7.4.3. Apache Ignite

7.4.4. Apache Flink

7.5. Libraries

7.5.1. Apache Crunch

8. Statistical Analytics/Machine Learning

8.1. Software

8.1.1. RHadoop

8.1.2. RHipe

8.1.3. Apache Mahout

8.1.4. SparkR

8.1.5. Apache Hama

8.1.6. mllib

8.1.7. Weka (distributedWekaHadoop)

8.1.8. DDF.io - Distributed Data Frame

8.1.9. Kepler

8.2. Languages

8.2.1. R

8.2.2. Java

8.2.3. Python

8.3. GUI

8.3.1. Browser

8.3.1.1. RStudioWeb

8.3.1.2. Cloudera Hue

8.3.1.3. Apache Zeppelin

8.4. in-database analytics

8.4.1. hivemall

9. Alternatives

9.1. Event-Processing

9.1.1. Apache Storm

10. Data Import/Export

10.1. Software

10.1.1. Apache Flume

10.1.2. Apache Sqoop

11. Reporting

11.1. Software

11.1.1. R

11.1.1.1. MarkDown

11.1.1.2. Knit

12. Workflows

12.1. Software

12.1.1. Apache Oozie

12.1.1.1. Apache Falcon

12.1.2. Apache Flink (Stratosphere)

12.1.3. Spotify Luigi

12.2. Run-time / Query Optimization

12.3. Data transformation

13. Security

13.1. Cluster

13.1.1. Software

13.1.1.1. Apache Knox

13.2. Data

13.2.1. Authorization

13.2.1.1. Software

13.2.1.1.1. Apache Sentry

14. Legacy Software Integration

14.1. Apache Slider

15. Data Cleaning

15.1. Openrefine

15.2. Netflix Zeno

16. NewSQL

16.1. BayesDB

16.2. H-Store

17. MetaData

17.1. Kite

17.2. Hive MetaStore

18. OLAP

18.1. Kylin

19. Distributed Framework Manager

19.1. Large-Scale Datatransfer

19.1.1. DistCp

19.2. Scheduling Type

19.2.1. monolithic

19.2.2. two-level

19.2.3. shared state

19.3. Software

19.3.1. Apache Mesos

19.3.1.1. Scheduling

19.3.1.2. Monitoring

19.3.2. Apache Ambari

19.3.2.1. Monitoring

19.3.2.2. Manage Cluster

19.3.2.3. Automated Deployment

19.3.3. Ganglia

19.3.3.1. Monitoring

19.3.4. Ooyala Spark Job-Server

19.3.5. Google Kubernetes

20. Configuration Management

20.1. Software

20.1.1. Apache Zookeeper

21. Core

21.1. Distributed Filesystem

21.1.1. HDFS

21.2. Scheduling Big Data Jobs

21.2.1. Yarn

21.2.1.1. Map-Reduce

21.2.2. Job Schedule Manager

21.2.2.1. Apache Reef

22. Search

22.1. Solr

23. Managing Environments

23.1. Software

23.1.1. Puppet

23.1.2. Chef

23.1.3. Google Kubernetes

23.2. Software Container

23.2.1. Software

23.2.1.1. Docker

23.3. Deploy

23.3.1. Software

23.3.1.1. Apache Slider

24. Cloud Manager

24.1. Software

24.1.1. Apache Delta Cloud

24.1.2. Ubuntu Juju

24.1.3. Apache Whirr

24.1.4. Cloudera Cloud Manager

24.1.5. OpenStack

24.1.5.1. Apache Savanna

25. Packaging/Distribution

25.1. Cloud

25.1.1. Amazon Elastic MapReduce (EMR)

25.1.2. Microsoft Azure HDInsight

25.1.3. Google Compute Hadoop

25.1.4. Altiscale

25.2. On-Premise

25.2.1. MapR

25.2.2. Apache BigTop

25.2.3. HortonWorks

25.2.4. Microsoft HDInsight

25.2.5. Cloudera Enterprise

25.2.6. Buildoop

25.2.6.1. Lambda Architecture

26. Distributed File Systems

26.1. Windows Azure Blob Storage

26.2. CassandraFS

26.3. CephFS

26.4. CleverSafe Object Store

26.5. Google Cloud Storage Connector

26.6. ClusterFS

26.7. GridGrain

26.8. Lustre

26.9. MapR FileSystem

26.10. OrangeFS

26.11. Quantcast File System

26.12. Symantec Veritas Cluster File System

26.13. Amazon S3

27. Messaging

27.1. Software

27.1.1. Apache Kafka

27.1.1.1. Apache Samza

27.1.2. Akka

28. System Tools

28.1. JVM Garbage Collection

28.1.1. GCViewer

28.2. HDFS live Statistics

28.2.1. Twitter HDFS Du

28.3. Disk Image Analytics

28.3.1. HDFS FSImage

28.4. UserMonitor

28.4.1. LinkedIn White Elephant

28.5. MapReduce Monitor

28.5.1. Twitter Hraven