In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-09-15 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Servers >
Share
Shulou(Shulou.com)05/31 Report--
This article mainly explains "what is the method of mapreduce performance optimization". The content of the article is simple and clear, and it is easy to learn and understand. Please follow the editor's train of thought to study and learn "what is the method of mapreduce performance optimization".
The number of map task seriously affects the performance of mr. There are a lot of materials on the Internet, which explain the principles and methods, but I will not elaborate on them here. I have summed up two revision methods, which I hope will be helpful to you.
Method 1: modify the blocksize of hdfs, which is undesirable and requires a restart of the cluster. The key is that the previous data is still the same size as the previous blocksize, and the new data will be the size of the new blocksize. For example, it used to be 128m, which was modified to 512m. The previous data has not changed, and the blocksize is still 128m.
Method 2: recommend the method. Code settings, the most flexible, according to different requirements, set different values. Add FileInputFormat.setMinInputSplitSize to the code that sets the size of mapred.min.split.size in bytes, such as my code:
Job.setMapOutputValueClass (IntWritable.class)
Job.setNumReduceTasks (1)
/ / set the minimum shard to 512m
FileInputFormat.setMinInputSplitSize (job, 1024,1024,512)
FileInputFormat.addInputPath (job, new Path ("/ usr/keyword/input"))
Supplementary note:
TextInputFormat.setMinInputSplitSize (job,1024L); / / set the minimum shard size
TextInputFormat.setMaxInputSplitSize (job,1024 × 1024 × 10L); / / set the maximum shard size
Thank you for your reading, the above is the content of "what is the method of mapreduce performance optimization". After the study of this article, I believe you have a deeper understanding of what the method of mapreduce performance optimization is, and the specific use needs to be verified in practice. Here is, the editor will push for you more related knowledge points of the article, welcome to follow!
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
The market share of Chrome browser on the desktop has exceeded 70%, and users are complaining about
The world's first 2nm mobile chip: Samsung Exynos 2600 is ready for mass production.According to a r
A US federal judge has ruled that Google can keep its Chrome browser, but it will be prohibited from
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
About us Contact us Product review car news thenatureplanet
More Form oMedia: AutoTimes. Bestcoffee. SL News. Jarebook. Coffee Hunters. Sundaily. Modezone. NNB. Coffee. Game News. FrontStreet. GGAMEN
© 2024 shulou.com SLNews company. All rights reserved.