In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-03-01 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Internet Technology >
Share
Shulou(Shulou.com)06/01 Report--
In this issue, the editor will bring you about how to carry out PyQuery theoretical analysis. The article is rich in content and analyzes and narrates it from a professional point of view. I hope you can get something after reading this article.
Hello, everyone, Hello, last time we learned the basic knowledge of html, then some people had doubts. I looked up the information on html, and I wanted to get it, but how can I get the information I want from it? Do you have any tools? Well, it shows that this student is still very thoughtful. At present, there are many related tools that can be used to obtain html in the market, and most of them are used in industry: BeautifulSoup, xpath, pyquery. What we introduce today is pyquery, which is also the most frequently used tool in my work, which can be said to be very handy. OK, next let's take a look at this tool.
Introduction
The pyquery library is the Python implementation of jQuery and can be used to parse the content of HTML web pages. The official document address is http://packages.python.org/pyquery/.
Pyquery allows you to use the syntax of jQuery to operate on xml. This I is very similar to jQuery. If you use lxml,pyquery to process xml and html, it will be faster.
This library is not (at least not yet) a code base that can interact with JavaScript, it's just very much like jQuery API.
Installation
Pip install pyquery
Or download and install: https://pypi.python.org/pypi/pyquery/#downloads
Initialization
Import library: from pyquery import PyQuery as pq
1. Direct string
The doc=pq ("") pq parameter can be passed directly into the HTML code, and doc is now equivalent to the $symbol in jQuery.
2 、 lxml.etree
Doc=pq (etree.fromstring ("))
You can first use lxml's etree to deal with the code, so that if your HTML code is incomplete or omitted, it will be automatically converted into complete and clearly structured HTML code.
3. Send URL directly
Doc=pq ('http://www.baidu.com')
It's like requesting a web page directly, similar to requesting the link directly with urllib2 to get the HTML code.
4. Transfer documents
Doc=pq (filename='hello.html')
You can directly pass the file name of a path.
The above is the editor for you to share how to carry out the theoretical analysis of PyQuery, if there happen to be similar doubts, you might as well refer to the above analysis to understand. If you want to know more about it, you are welcome to follow the industry information channel.
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
"Every 5-10 years, there's a rare product, a really special, very unusual product that's the most un
© 2024 shulou.com SLNews company. All rights reserved.