Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

I am studying Java, want to try big data development, how to plan to learn?

2025-01-14 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Internet Technology >

Share

Shulou(Shulou.com)06/03 Report--

I am studying Java, want to try big data development, how to plan to learn?

one。 Introduction to big data's work

two。 The skill requirements of big data engineer

three。 Big data's learning path

four。 Learning resource recommendations (books, blogs, websites)

one。 Introduction to big data's work

Big data's work is currently divided into three main directions:

01. Engineer big data

02. Data analyst

03. Big data scientist

04. Other (data mining is essentially machine learning, but related to data, it can also be understood as a direction of big data)

two。 The skill requirements of big data engineer

Attach two more authoritative big data engineer skill maps (picture invasion and deletion)

The summary is as follows:

There are 10 skills required:

01.Java Advanced (virtual machine, concurrent)

Basic operation of 02.Linux

03.Hadoop (here chivalrous concept refers only to HDFS+MapReduce+Yarn)

04.HBase (JavaAPI operation + Phoenix)

05.Hive (basic operation and principle understanding of Hql)

06.Kafka

07.Storm

08.Scala needs

09.Python

10.Spark (Core+sparksql+Spark streaming)

11. Some gadgets (Sqoop, etc.)

6 advanced skills:

11. Machine learning algorithm and mahout library plus MLlib

12.R language

13.Lambda architecture

14.Kappa architecture

15.Kylin

16.Aluxio

three。 Learning path

Because I am from Java development through about 3 months of self-study transferred to big data development. So I mainly share my learning drive.

The first stage:

01.Linux study (learn from Bird and ok)

02.Java Advanced Learning ("in-depth understanding of Java Virtual Machine", "Java High concurrency practice")

The second stage:

03.Hadoop (book by Dong Xicheng)

04.HBase ("HBase authoritative Guide")

05.Hive ("Hive Development Guide")

06.Scala ("learn Scala")

07.Spark ("Spark Fast big data Analysis")

08.Python (learn from Liao Xuefeng's blog and ok)

The third stage:

In response to the skill demand, I will ok by collecting more information on the Internet. I will tell you the most important thing (what to learn)

The rest is for you to collect the corresponding materials and learn ok.

Of course, if you think your reading efficiency is too slow, you can collect some courses online and follow the course OK. This is entirely based on your own situation. If you are not efficient in reading, you will have online lessons, on the contrary, you will read by yourself.

four。 Learning resource recommendations:

01.Apache official website

02.Stackoverflow

04.github

03.Cloudra official website

04.Databrick official website

05. Big data Learning 1 (Wechat official account)

06.CSDN,51CTO

07. There will be a lot of books in Dangdang, but in fact, the contents are all the same.

Last but not least: pay more attention to technological trends and keep learning.

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Internet Technology

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report