Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

Video on how Python climbs watermelons

2025-04-05 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Development >

Share

Shulou(Shulou.com)06/02 Report--

Python how to climb watermelon video, in response to this question, this article introduces in detail the corresponding analysis and answers, hoping to help more partners who want to solve this problem to find a more simple and easy way.

First, write in front

Really, why do so many people watch the game? I posted it twice and it adds up to only a hundred.

Forget it, forget the game, you don't like it anyway.

Today, let's try to climb those girls who are hot on the headlines. I don't know if I can stand it.

2. Preparation work 1. Environment python 3.8pycharm 2021.2 Professional Edition 2. Third-party module seleniumrequestsparsel to be used. 3. General process.

Since you don't like me to talk, but the process, I still want to write it out for you, so I listed it separately.

1. Website analysis (clear requirements) find the link corresponding to embedUrl in the video page source code; find the video playback address in the link, in the element panel; find that the groupby_id on the regular embedUrl is actually the id on the current video link. When downloading a video, you only need an id to download the video.

2. Code implementation process Construction embedUrl uses selenium to access the link, extract video link address, splice video link address, use requests to send request and get video binary data to save video.

Fourth, code display analysis

First, import the module.

Import requestsfrom selenium import webdriver

Enter browser settings

Options = webdriver.ChromeOptions ()

1. Build embedUrl

Group_id = input ("Please enter the id:") url = 'https://www.ixigua.com/embed?group_id=' + group_id

Headless browser

Options.add_argument ("--headless")

Add a disguise.

Options.add_argument ('User-Agent= "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/96.0.4664.45 Safari/537.36"')

2. Use selenium to access the link driver: browser

Driver = webdriver.Chrome (executable_path= "chromedriver.exe", options=options)

Open a web-driven configuration: a middleman for code manipulation browsers

Driver.get (url)

Implicit wait: wait up to five seconds if you finish loading in one second and continue to execute

Driver.implicitly_wait (5)

3. Extract video link address

Info = driver.find_elements_by_xpath ('/ / * [@ id= "player_default"] / xg-controls/xg-definition/ul/li [1]') video_url = info [0] .get _ attribute ("url")

4. Stitching video link address

Video_url = 'http:' + video_url

5. Use requests to send requests and get video binary data

Video_data = requests.get (video_url). Contentwith open ('1.mp4mm, mode='wb') as f: f.write (video_data)

All codes

Import requestsfrom selenium import webdriver# goes to the browser and sets options = webdriver.ChromeOptions () # 1. Build embedUrl https://www.ixigua.com/embed?group_id=7029910152576926238group_id = input ("Please enter the id you want to download the video:") url = 'https://www.ixigua.com/embed?group_id=' + group_id# headless browser options.add_argument ("--headless") # plus a disguised options.add_argument (' User-Agent= "Mozilla/5.0 (Windows NT 10.0; Win64) X64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/96.0.4664.45 Safari/537.36 "') # 2. Use selenium to access the link # driver: browser driver = webdriver.Chrome (executable_path= "chromedriver.exe", options=options) # Open a web page # driver configuration: code to operate a middleman in the browser driver.get (url) # implicit wait: wait up to five seconds if you finish loading in one second and continue to execute driver.implicitly_wait (5) # 3. Extract the video link address info = driver.find_elements_by_xpath ('/ * [@ id= "player_default"] / xg-controls/xg-definition/ul/li [1]') video_url = info [0] .get _ attribute ("url") # 4. Splicing video link address video_url = 'http:' + video_url# 5. Use requests to send a request and get video binary data video_data = requests.get (video_url). Contentwith open ('1.mp4climb, mode='wb') as f: f.write (video_data) print ("crawl succeeded!!") # leave an error and see if everyone is smart enough to find out the answer to the video question about how Python crawls watermelons. I hope the above content can be helpful to you. If you still have a lot of questions to solve, you can follow the industry information channel to learn more about it.

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Development

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report