Unlock the full potential of your projects.
Try MeisterTask for free.
Vous n'avez pas de compte ?
Inscription gratuite
Parcourir
Cartes en vedette
Catégories
Gestion de projet
Objectifs d'affaires
Ressources humaines
Brainstorming et analyse
Marketing et contenu
Éducation et remarques
Loisirs
Vie courante
Technologie
Design
Résumés
Autre
Langues
English
Deutsch
Français
Español
Português
Nederlands
Dansk
Русский
日本語
Italiano
简体中文
한국어
Autre
Montrer carte totale
Copier éditer carte
Copier
Hadoop Ecosystem
Objectifs d'affaires
Ed Sarausad
Suivre
Lancez-Vous.
C'est gratuit
S'inscrire avec Google
ou
s'inscrire
avec votre adresse e-mail
Cartes mentales similaires
Plan de carte mentale
Hadoop Ecosystem
par
Ed Sarausad
1. Column store
2. Pig
2.1. Scripting for Hadoop
3. HBase
3.1. Transactional lookups
4. Flume
4.1. Log collector
4.2. Integrates into Hadoop
5. Avro
5.1. Data parsing
5.2. Binary data serialization
5.3. RPC
5.4. language-neutral
5.5. optional codegen
5.6. schema evolution
5.7. untagged data
5.8. dynamic typing
6. Mahout
6.1. Machine learning
6.2. Applied to MR
7. Ambari
7.1. Cluster deployment and admin
7.2. Driven by Hortonworks
8. ZooKeeper
8.1. Coordinator of shared state between apps
8.2. Naming, configuration, and synchronization services
9. YARN
9.1. cluster management
9.2. Hadoop 2
9.3. resource manager
9.4. job scheduler
10. BigTop
10.1. Package Hadoop ecosys
10.2. Test Hadoop ecosys package
11. Query data stored in HDFS and HBase
12. Non-relational
13. Maps query onto nodes
14. Coordinator jobs are recurrent Oozie Workflow jobs that are triggered by time and data availability.
15. Reduces aggregated results into answers
16. Links jobs
16.1. Workflow processing
17. Bundle provides a way to package multiple coordinator and workflow jobs and to manage the lifecycle of those jobs
17.1. Connects non-Hadoop stores (RDBMS)
17.2. Moves data to & from RDBMS to Hadoop
18. Workflow jobs are Directed Acyclical Graphs (DAGs), specifying a sequence of actions to execute. The Workflow job has to wait
19. Hive
19.1. SQL-like querying
19.2. Combiner can be used to optimize reducer performance
19.3. Structured data warehousing
19.4. Partition columns instead of indexes
20. Oozie
21. Sqoop
21.1. Autogens Java InputFormat code for data access
22. MapReduce
22.1. Distributed compute
23. Related Apache Ecosystems
24. HDFS
24.1. Distributed storage
25. Spark
26. Impala
26.1. SQL query egnine
26.2. Real time
27. Cascading
27.1. Higher abstraction from MR
27.2. Creates Flow that assembles Map/Reduce jobs
Lancez-vous. C'est gratuit!
Connectez-vous avec Google
ou
S'inscrire