2025-01-19 Update From: SLTechnology News&Howtos
Shulou(Shulou.com)06/01 Report--
This article explains how a Python crawler can collect the read counts and watching counts of WeChat official account articles. The editor finds it quite practical and shares it here for you to learn from. I hope you get something out of it. Without further ado, let's take a look.
Self-media began to rise in 2013. By 2014 it had started to make money, and it has gradually become a trend of this era.
As the official account platform kept growing in popularity, self-media platforms of all kinds sprang up. The vigorous development of self-media has largely reshaped the pattern of information dissemination and has also created a huge market dividend, so more and more people are entering the field.
This is a good thing for economic development, but not for the vertical field of data collection.
Almost all the information on self-media platforms is delivered through apps, and few platforms have a web version. Smart recommendation is also becoming more and more common, so what we see is only the information the platform pushes to us based on our browsing habits, not everything that is published. This undoubtedly adds another obstacle to collection.
If you do not want to miss what self-media accounts publish, the only option is to monitor the accounts themselves. There are many platforms, collection is difficult, costs are high and efficiency is low, and this has recently become the biggest pain point in data acquisition.
The WeChat official account platform launched in 2012 and was upgraded to version 5.0 in August 2013. After the account types were adjusted at the same time (subscription accounts and service accounts), the platform developed better and better, and the total number of official accounts has exceeded 30 million. How can we monitor the posts, read counts, watching counts and other information of these accounts?
Today, I will introduce to you four methods of collection.
The first method: use a third-party platform
The main third-party platforms are data service providers such as New List (Xinbang), Qingbo and Tuotu Data. On New List and Qingbo, read counts and watching counts are updated with some lag, while Tuotu Data returns them in real time. I just tested it with my own official account: in about a minute or two it retrieved all the historical data (my account has published few articles, only about 20), and both the read counts and the watching counts were correct.
Third-party platforms did a lot of groundwork early on. If you want to use their services you naturally have to buy a membership, top up your balance and so on; they will not be free. If you only need to check a small number of official accounts and only need short-term monitoring of up to a year or so, a third-party platform is the most cost-effective option for individuals and companies alike. Building your own collection system is quite difficult, and without technical staff it is really not feasible.
The second method: simulate clicks on a PC
If you are an individual who does not want to pay for an account, knows some Python, and needs read counts for only a small amount of data, this is the most suitable method. The main libraries involved are pymouse, PyKeyboard and pyperclip. Note that PyKeyboard has trouble inputting Chinese characters, so the text needs to be converted first. Please refer to my previous article.
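The Chinese-input limitation can be illustrated with a small sketch. It shows one common workaround, pasting through the clipboard instead of typing, which is not necessarily the conversion described in the author's previous article. The function names `paste_text` and `paste_with_real_devices` are hypothetical, and the Ctrl+V shortcut assumes Windows or Linux.

```python
import time


def paste_text(text, keyboard, copy_to_clipboard, settle_seconds=0.1):
    """Enter text (including Chinese) by pasting it instead of typing it.

    `keyboard` is any object with the PyKeyboard-style methods press_key,
    tap_key and release_key plus a control_key attribute.
    `copy_to_clipboard` is a callable such as pyperclip.copy.
    """
    copy_to_clipboard(text)                      # put the text on the clipboard
    time.sleep(settle_seconds)                   # give the clipboard a moment to update
    keyboard.press_key(keyboard.control_key)     # hold Ctrl (assumes Windows/Linux)
    keyboard.tap_key("v")                        # press V to paste
    keyboard.release_key(keyboard.control_key)   # release Ctrl


def paste_with_real_devices(text):
    """Wire paste_text to the real libraries (needs a desktop session)."""
    # Imported lazily so the helper above stays usable without a GUI.
    import pyperclip
    from pykeyboard import PyKeyboard

    paste_text(text, PyKeyboard(), pyperclip.copy)
```

Passing the keyboard and clipboard in as arguments keeps the paste logic easy to check without a screen. On a real desktop, click into the search box first and then call `paste_with_real_devices("公众号")`.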
The third method: use third-party tools
Third-party tools such as Octopus (Bazhuayu) and Jane Number (Jianshu) all have an official account collection feature. You can download them and try them yourself.
The fourth method: work through the WeChat official account platform
This approach is actually the most difficult, because there is a lot to analyze: it involves packet capture tools, data flow analysis and so on. The main process is as follows:
1: Log in to the WeChat official account platform. In the menu bar, go to Material Management -> New Material; the following page appears.
2: Click the hyperlink button and choose to select another official account.
3: Enter the name of the official account, search, and click the account to get its list of articles.
4: Click an article and capture the traffic with a tool such as Fiddler.
5: Use code to replay the captured request and get the read count and the watching count.
In short, this method is the most difficult. If you are a technical expert, you can give it a try.
Because the official account platform is revised frequently, an analysis you finish today may stop working tomorrow, so you have to redo the analysis again and again.
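The capture-and-replay step can be sketched offline as follows. This is a minimal sketch under stated assumptions, not the platform's documented API: the query parameter names (`__biz`, `mid`, `idx`, `sn`), the JSON field names (`appmsgstat`, `read_num`, `like_num`) and the host `example.invalid` are all placeholders, so check every one of them against your own Fiddler capture. The function names are hypothetical too.

```python
import json
from urllib.parse import urlparse, parse_qs


def article_params(url):
    """Return the query parameters that identify one captured article URL."""
    query = parse_qs(urlparse(url).query)
    # parse_qs returns a list per name; keep the first value of each.
    return {name: values[0] for name, values in query.items()}


def read_counts(captured_body, stats_key="appmsgstat",
                read_key="read_num", watching_key="like_num"):
    """Pull the two counts out of a captured JSON response body.

    The default key names are assumptions; override them if your
    capture uses different field names.
    """
    stats = json.loads(captured_body).get(stats_key, {})
    return {"read": stats.get(read_key), "watching": stats.get(watching_key)}


# Placeholder values standing in for a real capture.
sample_url = "https://example.invalid/s?__biz=BIZ123&mid=100&idx=1&sn=abc"
sample_body = '{"appmsgstat": {"read_num": 120, "like_num": 7}}'

params = article_params(sample_url)
counts = read_counts(sample_body)
```

Keeping the field names as parameters means only the call site has to change when the platform is revised and the captured response looks different.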
With either the second or the fourth method, the read counts and watching counts of an official account can only be obtained through the app's interface. Because restrictions on official accounts keep getting stricter, each WeChat account or official account can only access 8000,000 articles per day. So if you want to collect at scale, you still need a large number of WeChat accounts.
That is how a Python crawler collects the read counts and watching counts of WeChat official accounts. The editor believes some of these points may come up in your daily work, and I hope this article helps you learn more about them. For more details, please follow the industry information channel.
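The arithmetic behind "a large number of WeChat accounts" can be sketched as below. The quota is passed in as a parameter because the right value depends on the platform's current limit; the figure of 1000 in the example is purely illustrative, and the function names are hypothetical.

```python
import math


def accounts_needed(total_articles, daily_quota_per_account, days=1):
    """Smallest number of accounts that can cover the workload in `days`."""
    if daily_quota_per_account <= 0 or days <= 0:
        raise ValueError("quota and days must be positive")
    return math.ceil(total_articles / (daily_quota_per_account * days))


def split_workload(article_ids, daily_quota_per_account):
    """Cut the article list into per-account batches no larger than the quota."""
    if daily_quota_per_account <= 0:
        raise ValueError("quota must be positive")
    return [
        article_ids[start:start + daily_quota_per_account]
        for start in range(0, len(article_ids), daily_quota_per_account)
    ]


# Illustrative numbers only: 2500 articles, 1000 requests per account per day.
needed = accounts_needed(2500, 1000)
batches = split_workload(list(range(2500)), 1000)
```

Each batch can then be assigned to one account for the day, so no single account goes over the limit.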
© 2024 shulou.com SLNews company. All rights reserved.