In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-04-05 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Internet Technology >
Share
Shulou(Shulou.com)06/02 Report--
What is the role of pubmed in Python? aiming at this question, this article introduces the corresponding analysis and solution in detail, hoping to help more partners who want to solve this problem to find a more simple and feasible way.
Import installation package
The first step, of course, is to guide the package. Remember, the package partner needed for automation is selenium, which is as follows:
From selenium import webdriver from selenium.webdriver.support.ui import Selectfrom selenium.webdriver.common.action_chains import ActionChains
① code explanation
Code 1: call the webdriver module in the selenium package. I believe the partners have installed the webdriver software in the Google browser installation directory. This module will enable the webdriver software to automate the Google browser operation.
Code 2: call the Select module, which is called when the page needs to operate on the checkbox, check box, and drop-down box, because we need to select the CSV format in Format (see figure) in the next operation, so we can import this module in advance.
Code 3: ActionChains module is a real simulation of keyboard and mouse operations, such as mouse hover, double-click, right-click and other operations, can be said to be a universal weapon in selenium.
PS: if you need Python learning materials, you can add the group below to go to the free administrator to get it.
Set parameters
Entering the preparation stage, some parameters are set in advance to facilitate the formal operation of the third step.
Find = input ('Please enter what you want to find, enter to confirm:') url = 'https://pubmed.ncbi.nlm.nih.gov/?term='+find+'&size=200&page=51'driver = webdriver.Chrome () driver.get (url) driver.maximize_window () driver.implicitly_wait (10)
① code explanation
Code 1: is through the input function to achieve human-computer interaction, we need to find the content assigned to a variable called "find" (partners can customize variables, if modified in Code 2, then remember to change to a new variable name).
Run this code alone, python will appear the following interface: you can see that the content in input () will appear in the terminal, play a prompt role, partners can customize the content; enter the content in the green box, will be assigned to the "find" variable.
Here the editor takes "T2DM" as an example, enter "T2DM" enter, at this time "find" equals "T2DM", and then you can assign a value to the "url" variable through Code 2, which is the next step we want to automate the page's links.
Matters needing attention
If some partners feel that human-computer interaction is not necessary, they can also assign values directly, and still take "T2DM" as an example, as follows: the effect is the same.
Note: except for text input, all symbols in python are English symbols. The quotation marks here should be "" instead of "".
Find = "T2DM" url = 'https://pubmed.ncbi.nlm.nih.gov/?term='+find+'&size=200&page=51'
② code explanation
Code 1 Code 2: we can get the corresponding link address, and then we can access the page by calling the browser.
Code 3: call the Chrome browser through the webdriver module (I believe the partner has installed the webdriver software in the Google browser installation directory through the previous tutorial), and then let Google browser access the link address through code 4. When you run to code 4, you can see that python automatically opens the following interface:
③ code explanation
For the open page, we can further optimize the settings. With code 5, maximize the window, and then set the hidden wait to 10s. (partners who don't remember the difference between explicit waiting and implicit waiting can refer to the previous python tutorial.)
Automated process
Before you begin, you need to know one concept:
① python automates operations based on the logic we set up, so instead of saying "go download these documents" directly to python, which is not the logic that python can understand, but to python, "you click here first, then click here, select this, then click here, wait a minute, and then click here", that is, if we want to automate python, we have to split what we have to do to python.
So, first manually manipulate → to record each operation → into code, and then modify the python language. At this time, when you open the pubmed interface just now, there are many kinds of operations that can be downloaded in batches. The editor is just giving an example, but in fact it is the same. Friends can draw examples from others.
② manual operation. Click Save →, click Format →, select CSV → Creat file, and you can save the whole page at this time.
③ is converted to code. Through the 3.2steps, we can say to python, "order this first, then this and then this", but how to convert it into code, friends, remember a sentence of code.
ActionChains (driver). Move_to_element (driver.find_element_by_xpath ('xpath_content')). Click (). Perform ()
Keep in mind that this code can almost walk sideways in subsequent automation, this code is a bit long, split it:
ActionChains (driver) move_to_element (A) driver.find_element_by_xpath ('B') click () perform ()
Code explanation:
Code 1: call ActionChains to operate driver, how to do it? Look at Code 2.
Code 2: move the mouse to element A, what is element A? Look at code 3.
Code 3: find a "B" xpath through driver, and the result is A.
Code 4: mouse click
Code 5: do the above in turn.
Another way to interpret it is to find a xpath called "B", then move the mouse over the xpath and click, which simulates the real mouse movement and click.
Paste the code written by the small.
For i in range (51 Select 56): driver.implicitly_wait (10) ActionChains (driver). Move_to_element (driver.find_element_by_xpath ('/ * [@ id= "save-results-panel-trigger"]')) .click () .perform () Select (driver.find_element_by_xpath ('/ * [@ id= "save-action-selection"]')) .select_by_visible_text ("All results on this page") ActionChains (driver) .move_to_element (driver.find_element_by_xpath ('/ * [@ id= "save-action-format"]')) .click () .perform () Select (driver.find_element_by_xpath ('/ html/body/main/div [1] / div/form/div [2] / select')) .select_by_visible_text ("CSV") ActionChains (driver). Move_to_element (driver.find_element_by) _ xpath ('/ / * [@ id= "save-action-panel-form"] / div [3] / button [1]') .click () .perform () target = driver.find_element_by_xpath ('/ / * [@ id= "search-results"] / section/div [3] / a Uniba span') driver.execute_script ("arguments [0]. () " Target) ActionChains (driver) .move_to_element (driver.find_element_by_xpath ('/ / * [@ id= "search-results"] / section/div [3] / a _ Acer span') .click () .perform () ActionChains (driver) .move_to_element (driver.find_element_by_xpath ('/ * [@ id= "search-page"] / div [12] / div/form/button')) .click (). Perform () Print ('page' + str (I) + 'page downloaded successfully') print ('jump first' + str (item1) + 'page') driver.quit () print ('download completed Exit automatically.')
As you can see, this code appears the most times, and each time it only modifies the content of xpath, that is to say, the key content of this key code is how to find the xpath, and the rest is just copy.
Find = input ('Please enter what you are looking for Enter to confirm:') url = 'https://pubmed.ncbi.nlm.nih.gov/?term='+find+'&size=200&page=51'from selenium import webdriverfrom selenium.webdriver.support.ui import Selectfrom selenium.webdriver.common.action_chains import ActionChainsdriver = webdriver.Chrome () driver.get (url) driver.maximize_window () driver.implicitly_wait (10) for i in range (51 driver 56): driver.implicitly_wait (10) ActionChains (driver). Move_to_element (driver.find_) Element_by_xpath ('/ * [@ id= "save-results-panel-trigger"]') .click () .perform () Select (driver.find_element_by_xpath ('/ * [@ id= "save-action-selection"]')) .select_by_visible_text ("All results on this page") ActionChains (driver) .move_to_element (driver.find_element_by_xpath ('/ / * [@ id= "save-action-format") ]) .click () .perform () Select (driver.find_element_by_xpath ('/ html/body/main/div [1] / div/form/div [2] / select')) .select_by_visible_text ("CSV") ActionChains (driver) .move_to_element (driver.find_element_by_xpath ('/ * [@ id= "save-action-panel-form"] / div [3] / button [1]')) .click (). Perform () target = driver.find_element_by_xpath ('/ / * [@ id= "search-results"] / section/div [3] / an Accord span') driver.execute_script ("arguments [0]. () " Target) ActionChains (driver) .move_to_element (driver.find_element_by_xpath ('/ / * [@ id= "search-results"] / section/div [3] / a _ Acer span') .click () .perform () ActionChains (driver) .move_to_element (driver.find_element_by_xpath ('/ * [@ id= "search-page"] / div [12] / div/form/button')) .click (). Perform () Print ('page' + str (I) + 'page downloaded successfully') print ('jump first' + str (item1) + 'page') driver.quit () print ('download completed Exit automatically.') The answer to the question about the role of pubmed in Python is shared here. I hope the above content can be of some help to you. If you still have a lot of doubts to be solved, you can follow the industry information channel to learn more about it.
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
"Every 5-10 years, there's a rare product, a really special, very unusual product that's the most un
© 2024 shulou.com SLNews company. All rights reserved.