In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-01-21 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Servers >
Share
Shulou(Shulou.com)05/31 Report--
This article mainly explains "what are Ambari and ClouderaManager". Friends who are interested may wish to have a look. The method introduced in this paper is simple, fast and practical. Let's let the editor take you to learn what Ambari and ClouderaManager are.
1. What is CDH,Ambari?
Apache Ambari is a Web-based tool that supports provisioning, management, and monitoring of Apache Hadoop clusters. Ambari already supports most Hadoop components, including HDFS, MapReduce, Hive, Pig, Hbase, Zookeper, Sqoop, and Hcatalog.
Apache Ambari supports centralized management of HDFS, MapReduce, Hive, Pig, Hbase, Zookeper, Sqoop, and Hcatalog. It is also one of the top five hadoop management tools. Ambari can install secure (Kerberos-based) Hadoop clusters to support Hadoop security, provide role-based user authentication, authorization and audit functions, and integrate LDAP and Active Directory for user management.
Introduction to CDH
Cloudera's Distribution, including Apache Hadoop
Is one of the many branches of Hadoop, maintained by Cloudera and built on a stable version of Apache Hadoop
Provides the core of Hadoop
-Extensible Stora
-distributed computing
User interface based on Web
Advantages of CDH
The version is clearly divided.
The version is updated quickly.
Support for Kerberos security authentication
The document is clear
Support multiple installation methods (Cloudera Manager mode)
2. Why do you need them
For a cluster of 1000 servers, at least how long will it take to build a Hadoop cluster, including Hive, Hbase, Flume, Kafka, Spark, etc.
Just give you one day to finish the above work?
For the above clusters to upgrade the hadoop version, which upgrade plan will you choose and how long will it take at least?
The new version of Hadoop is compatible with Hive, Hbase, Flume, Kafka, Spark, etc.
Big data cluster management can be divided into manual mode (Apache hadoop) and tool mode (Ambari + hdp and Cloudera Manger + CDH).
Manual deployment, you need to configure too many parameters, however, to understand its principle, it is recommended to learn to do so, you can learn a lot. This method, ah, has to be carried out by the user, and there are too many details. When designing multiple components, users have to solve the problem of version compatibility between components themselves.
Tool deployment, such as Ambari or Cloudera Manger. (at present, the two most mainstream cluster management tools, the former is Hortonworks and the latter is Cloudera.) using the tool can be said to be an one-click operation, and the difficulty lies in the deployment of the tool Ambari or Cloudera Manger itself.
Comparison between manual and tool methods:
Comparison point
Manual method
Tool mode
Degree of difficulty
Difficult, almost impossible to succeed
Simple and easy to do
Compatibility
Solve component compatibility issues on your own
Automatically install compatible components
Number of components supported
Support for all components
Support common components
Advantages
Deep understanding of component and cluster management
Simple, easy and feasible
Shortcoming
Too complicated to succeed
Block too many details, hindering the understanding of the component
Tool comparison:
The detailed comparison is as follows:
Publisher:
Hortonworks developed big data analysis integration platform of Ambari and hdp
Cloudera developed the cloudera manger and cdh big data analysis integration platform.
Stability:
Cloudera is relatively stable.
Ambari is relatively unstable (the page opens slowly)
Resource consumption:
The server Xmx of cloudera manager is 2G, and the agent is 1G, but with host monitor and service monitor, the total is about 1G.
The server Xmx of ambari is 2G, the ams of ams and the env of hbase, which is about 2G.
Cluster restart:
Cloudera supports rolling restart (hdfs needs to be designed as ha before rolling restart)
Ambari supports rolling restart (hdfs needs to be designed as ha before rolling restart)
Cluster upgrade (generally speaking, do not upgrade the cluster easily):
Cloudera does not support rolling upgrade service
Ambari supports rolling upgrade service (this is the advantage of ambari, hdfs must be ha)
Secondary development:
Cloudera does not support
Ambari support
Service version:
Cloudera is older
Ambari is newer
Service integration:
Cloudera is weak
Ambari is strong, supporting es, redis, presto, kylin, etc.
Experience effect:
Hello, cloudera.
Ambari relative difference
Installation process:
Cloudera is complex.
Ambari is simple
Email alarm:
Cloudera support is not good.
Ambari support is good.
Install the package:
Cloudera is the parcel package
Ambari is the rpm package
At this point, I believe you have a deeper understanding of "what Ambari and ClouderaManager are". You might as well do it in practice. Here is the website, more related content can enter the relevant channels to inquire, follow us, continue to learn!
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
"Every 5-10 years, there's a rare product, a really special, very unusual product that's the most un
© 2024 shulou.com SLNews company. All rights reserved.