Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

What are Ambari and ClouderaManager?

2025-01-21 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Servers >

Share

Shulou(Shulou.com)05/31 Report--

This article mainly explains "what are Ambari and ClouderaManager". Friends who are interested may wish to have a look. The method introduced in this paper is simple, fast and practical. Let's let the editor take you to learn what Ambari and ClouderaManager are.

1. What is CDH,Ambari?

Apache Ambari is a Web-based tool that supports provisioning, management, and monitoring of Apache Hadoop clusters. Ambari already supports most Hadoop components, including HDFS, MapReduce, Hive, Pig, Hbase, Zookeper, Sqoop, and Hcatalog.

Apache Ambari supports centralized management of HDFS, MapReduce, Hive, Pig, Hbase, Zookeper, Sqoop, and Hcatalog. It is also one of the top five hadoop management tools. Ambari can install secure (Kerberos-based) Hadoop clusters to support Hadoop security, provide role-based user authentication, authorization and audit functions, and integrate LDAP and Active Directory for user management.

Introduction to CDH

Cloudera's Distribution, including Apache Hadoop

Is one of the many branches of Hadoop, maintained by Cloudera and built on a stable version of Apache Hadoop

Provides the core of Hadoop

-Extensible Stora

-distributed computing

User interface based on Web

Advantages of CDH

The version is clearly divided.

The version is updated quickly.

Support for Kerberos security authentication

The document is clear

Support multiple installation methods (Cloudera Manager mode)

2. Why do you need them

For a cluster of 1000 servers, at least how long will it take to build a Hadoop cluster, including Hive, Hbase, Flume, Kafka, Spark, etc.

Just give you one day to finish the above work?

For the above clusters to upgrade the hadoop version, which upgrade plan will you choose and how long will it take at least?

The new version of Hadoop is compatible with Hive, Hbase, Flume, Kafka, Spark, etc.

Big data cluster management can be divided into manual mode (Apache hadoop) and tool mode (Ambari + hdp and Cloudera Manger + CDH).

Manual deployment, you need to configure too many parameters, however, to understand its principle, it is recommended to learn to do so, you can learn a lot. This method, ah, has to be carried out by the user, and there are too many details. When designing multiple components, users have to solve the problem of version compatibility between components themselves.

Tool deployment, such as Ambari or Cloudera Manger. (at present, the two most mainstream cluster management tools, the former is Hortonworks and the latter is Cloudera.) using the tool can be said to be an one-click operation, and the difficulty lies in the deployment of the tool Ambari or Cloudera Manger itself.

Comparison between manual and tool methods:

Comparison point

Manual method

Tool mode

Degree of difficulty

Difficult, almost impossible to succeed

Simple and easy to do

Compatibility

Solve component compatibility issues on your own

Automatically install compatible components

Number of components supported

Support for all components

Support common components

Advantages

Deep understanding of component and cluster management

Simple, easy and feasible

Shortcoming

Too complicated to succeed

Block too many details, hindering the understanding of the component

Tool comparison:

The detailed comparison is as follows:

Publisher:

Hortonworks developed big data analysis integration platform of Ambari and hdp

Cloudera developed the cloudera manger and cdh big data analysis integration platform.

Stability:

Cloudera is relatively stable.

Ambari is relatively unstable (the page opens slowly)

Resource consumption:

The server Xmx of cloudera manager is 2G, and the agent is 1G, but with host monitor and service monitor, the total is about 1G.

The server Xmx of ambari is 2G, the ams of ams and the env of hbase, which is about 2G.

Cluster restart:

Cloudera supports rolling restart (hdfs needs to be designed as ha before rolling restart)

Ambari supports rolling restart (hdfs needs to be designed as ha before rolling restart)

Cluster upgrade (generally speaking, do not upgrade the cluster easily):

Cloudera does not support rolling upgrade service

Ambari supports rolling upgrade service (this is the advantage of ambari, hdfs must be ha)

Secondary development:

Cloudera does not support

Ambari support

Service version:

Cloudera is older

Ambari is newer

Service integration:

Cloudera is weak

Ambari is strong, supporting es, redis, presto, kylin, etc.

Experience effect:

Hello, cloudera.

Ambari relative difference

Installation process:

Cloudera is complex.

Ambari is simple

Email alarm:

Cloudera support is not good.

Ambari support is good.

Install the package:

Cloudera is the parcel package

Ambari is the rpm package

At this point, I believe you have a deeper understanding of "what Ambari and ClouderaManager are". You might as well do it in practice. Here is the website, more related content can enter the relevant channels to inquire, follow us, continue to learn!

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Servers

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report