Python crawler preparation: understanding HTML

Updated 2025-02-24. From: SLTechnology News&Howtos

Shulou(Shulou.com)06/02 Report--

Web page source code

Open a web page in the browser and press the shortcut key [Ctrl+U] to view its source code.
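A crawler obtains the same source text programmatically. The following is a minimal sketch using only the Python standard library; the function name fetch_source and the UTF-8 fallback are assumptions made for illustration, not part of the original article.

```python
from urllib.request import urlopen


def fetch_source(url: str, timeout: float = 10.0) -> str:
    """Return the raw source of the page at `url` as text."""
    with urlopen(url, timeout=timeout) as response:
        # Use the charset declared in the response headers, else assume UTF-8.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


# Example call (needs network access; https://example.com is a placeholder):
# print(fetch_source("https://example.com")[:200])
```

The returned string should correspond to what the browser shows in its source view, before any scripts on the page have run.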

HTML

HTML defines the structure of a web page and acts as the frame of the whole website. Anything written between the "<" and ">" symbols is an HTML tag, and most tags appear in pairs: an opening tag and a matching closing tag.

Common tags are as follows:

<html>..</html> marks the content between the tags as a web page.
<body>..</body> holds the content that is visible to the user.
<div>..</div> represents a frame (container).
<p>..</p> represents a paragraph.
<li>..</li> represents a list item.
<img> represents a picture.
<h1>..</h1> represents a heading.
<a href="">..</a> represents a hyperlink.
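To see how tags open and close in practice, the sketch below counts opening and closing tags with the standard-library html.parser module. The class name TagCounter, the helper count_tags, and the SNIPPET string are hypothetical names introduced here for illustration.

```python
from collections import Counter
from html.parser import HTMLParser


class TagCounter(HTMLParser):
    """Count the opening and closing tags seen in an HTML document."""

    def __init__(self):
        super().__init__()
        self.opened = Counter()
        self.closed = Counter()

    def handle_starttag(self, tag, attrs):
        self.opened[tag] += 1

    def handle_endtag(self, tag):
        self.closed[tag] += 1


def count_tags(html_text):
    """Parse `html_text` and return (opened, closed) tag counters."""
    parser = TagCounter()
    parser.feed(html_text)
    parser.close()
    return parser.opened, parser.closed


# Hypothetical snippet; pic.png is a placeholder file name.
SNIPPET = '<html><body><div><p>text</p><img src="pic.png"></div></body></html>'
opened, closed = count_tags(SNIPPET)
for tag in opened:
    print(tag, "opened:", opened[tag], "closed:", closed[tag])
```

In this snippet every tag except img is closed once for each time it is opened, because img is written without a closing tag.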

HTML example

The example page contains the following elements:

A page title: "This is the theme of the HTML test page."
A heading: "This is the title."
A paragraph: "This is the text."
A list: "This is a list."
A web hyperlink and a local hyperlink. The address of a local hyperlink can be either relative or absolute.
A picture. The address of the picture can be a relative path or an absolute path. If the image cannot be displayed, its alternative text is shown instead.
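The kind of page described above can be sketched and inspected from Python. SAMPLE_PAGE below is a hypothetical reconstruction, not the article's original markup: every URL and file name in it is a placeholder. The sketch collects the hyperlink and picture addresses and, as a simplifying assumption, treats an address as absolute when it carries a URL scheme such as https.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse

# Hypothetical test page; all addresses are placeholders.
SAMPLE_PAGE = """
<html>
<head><title>This is the theme of the HTML test page</title></head>
<body>
<h1>This is the title</h1>
<p>This is the text.</p>
<ul>
<li>This is a list</li>
</ul>
<a href="https://example.com/">This is a web hyperlink</a>
<a href="pages/local.html">This is a local hyperlink</a>
<img src="images/picture.png" alt="Shown if the image cannot be displayed">
</body>
</html>
"""


class AddressExtractor(HTMLParser):
    """Collect the href of each <a> tag and the src of each <img> tag."""

    def __init__(self):
        super().__init__()
        self.addresses = []  # list of (tag, address) pairs

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag == "a" and attributes.get("href"):
            self.addresses.append((tag, attributes["href"]))
        elif tag == "img" and attributes.get("src"):
            self.addresses.append((tag, attributes["src"]))


def is_absolute(address):
    """Assumption: an address with a URL scheme counts as absolute."""
    return bool(urlparse(address).scheme)


def extract_addresses(html_text):
    """Return (tag, address, is_absolute) for every link and picture."""
    parser = AddressExtractor()
    parser.feed(html_text)
    parser.close()
    return [(tag, addr, is_absolute(addr)) for tag, addr in parser.addresses]


for tag, address, absolute in extract_addresses(SAMPLE_PAGE):
    print(tag, address, "absolute" if absolute else "relative")
```

A crawler usually needs to resolve relative addresses against the page's own URL before following them; urllib.parse.urljoin is one standard-library way to do that.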

After entering the HTML code in Notepad, save the file, then change its name and extension to "HTML.html". Opening the file in a browser shows the rendered page.

The legality of crawlers

Most websites publish a file called robots.txt, although some sites do not have one. On a site without robots.txt, a web crawler can fetch any data that is not protected by a password, that is, all publicly accessible page data of the site. If the site does have a robots.txt file, check it first to determine which data visitors are forbidden to fetch.

A robots.txt file can allow particular crawlers to access particular paths. Paths that are not allowed must not be crawled at all.
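Such a check can be automated with the standard-library urllib.robotparser module. The sketch below parses an inline, hypothetical robots.txt rather than a real site's file; the crawler names GoodBot and OtherBot, the paths, and the example.com URLs are all placeholders.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: GoodBot may read /articles/, nothing else is allowed.
SAMPLE_ROBOTS = """\
User-agent: GoodBot
Allow: /articles/
Disallow: /

User-agent: *
Disallow: /
"""


def build_rules(robots_text):
    """Parse robots.txt text and return a RobotFileParser holding its rules."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


rules = build_rules(SAMPLE_ROBOTS)
# GoodBot on an allowed path, GoodBot on a forbidden path, and another crawler.
print(rules.can_fetch("GoodBot", "https://example.com/articles/1.html"))
print(rules.can_fetch("GoodBot", "https://example.com/private/data.html"))
print(rules.can_fetch("OtherBot", "https://example.com/articles/1.html"))
```

For a live site, RobotFileParser also offers set_url() and read() to download the file before calling can_fetch().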
