Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

How to use regular expressions to extract Special Information in python

2025-04-06 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Internet Technology >

Share

Shulou(Shulou.com)06/02 Report--

How do I use regular expressions to extract special information in python? I believe that many inexperienced people are at a loss about this, so this article summarizes the causes and solutions of the problem. Through this article, I hope you can solve this problem.

1. Delete the Python comment in the string

Case study:

Import re time = "2020-01-01 # this is a date" num = re.sub (ringing dates, ", time) # separated by #, removing the following information print (" this time is: ", num)

Results:

This time is: 2020-01-01

2. Intercept the content before and after a symbol

Case 1:

Txt ='my phone number is: 131My 246-XXX19'a = txt.split (':') [0] # 0 indicates that the result before the symbol b = txt.split (':') [1] # [- 1] is the same as the result of [1], indicating the print after the symbol ("a") print ("b")

Results:

The result of an is: my phone is b, and the result is: 131b-246-XXX19

Case 2:

Txt = "I love python. I love python" text = re.sub (r'. * $', ", txt) # to. For separation,\ is the escape character, with the following. Distinguish print ("this intercept result is:", text)

Results:

The result of this interception is: I love python

3. Delete non-numeric strings

Case study:

Import re time = "2020-01-01 # this is a date" num = re.sub (r'\ Downs, ", time) print (" this time is: ", num)

Results:

The time is: 20200101

4. Keep only Chinese

Case study:

Reg = "[^\ u4e00 -\ u9fa5]" text = "all right! e We aa $$to start 34 learning-python!" Print (re.sub (reg,', text))

Results:

All right, let's start learning.

5. Keep only Chinese, uppercase and lowercase letters and Arabic numerals

Case study:

Reg = "[^ 0-9A-Za-z\ u4e00 -\ u9fa5]" text = "all right! we aa $$to start 34 learning-python!" Print (re.sub (reg,', text))

Results:

All right, E, aa, let's start 34 to learn python.

6. Remove uppercase and lowercase letters and numbers

Case study:

Import retxt= "Ayouleyang A ~ Youle-Yang" text = re.sub ('[a-zA-Z0-9]',', txt) print (text)

Results:

A ~ Youle-Yang 1

7. Remove a special character

Case list:

Import re txt ='A * a # excellent $Le. : Yang;: youle+'text = re.sub ('[,;; youle::.;, $* #] +', ", txt) # [] will be replaced, similar to replace () print (text)

Results:

An a you Leyang + 1

8. Keep English, numbers and special symbols

Case study:

Reg = "[^ 0-9A-Za-z\ u4e00 -]" # keep only English, numbers and -, u4e00 "-" is the symbol that needs to be retained txt = "all right! my number is 131e00!:" text = re.sub (reg,', txt) print (text)

Results:

131C-246-XXX19

After reading the above, have you mastered how to use regular expressions to extract special information in python? If you want to learn more skills or want to know more about it, you are welcome to follow the industry information channel, thank you for reading!

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Internet Technology

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report