Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

How to install and use pycorrector

2025-02-23 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Internet Technology >

Share

Shulou(Shulou.com)06/01 Report--

In this article, the editor introduces in detail "how to install and use pycorrector". The content is detailed, the steps are clear, and the details are handled properly. I hope this article "how to install and use pycorrector" can help you solve your doubts.

Pycorrector mainly solves the following Chinese input scenarios.

Homophonic words, such as a pair of eyes-with glasses.

Confuse phonetic words, such as wandering Weaver Girl-Cowherd and Weaver Girl.

The order of words is reversed, such as Woody Allen Allen Woody.

Word completion, such as love has providence-if love has providence.

The shape is incorrect, such as sorghum-sorghum.

Chinese pinyin spelling, such as xingfu- Happiness.

Chinese pinyin abbreviations, such as sz- Shenzhen.

Grammatical errors, such as unimaginable-unimaginable.

Installation

You can install using pip.

Pip install pycorrector

You can also download the source code and install it manually.

Git clone https://github.com/shibing624/pycorrector.gitcd pycorrectorpython setup.py install

Dependent library kenlm installation.

Pip install https://github.com/kpu/kenlm/archive/master.zip

Other dependent library packages are installed.

Pip install-r requirements.txt

Use examples

1. Text error correction

Import pycorrectorcorrected_sent, detail = pycorrector.correct ('Young Pioneers should make room for the elderly') print (corrected_sent, detail) output: young pioneers should give up their seats for the elderly [('due', 'should', 4, 6)], [('sit', 'seat', 10, 11)]

The rule method loads the kenlm language model file from the path ~ /. Pycorrector/datasets/zh_giga.no_cna_cmn.prune01244.klm by default, and if the file is not detected, the program will automatically download it online. Of course, you can also download the model file (2.8g) manually and place it in that location.

two。 Error detection

Import pycorrectoridx_errors = pycorrector.detect ('Young Pioneers should make way for the elderly') print (idx_errors) output: ['cause', 4, 6, 'word'], [' sit', 10, 11, 'char']]

The return type is list, [error_word, begin_pos, end_pos, error_type], and the pos index position starts with 0.

After reading this, the article "how to install and use pycorrector" has been introduced. If you want to master the knowledge points of this article, you still need to practice and use it yourself. If you want to know more about related articles, welcome to follow the industry information channel.

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Internet Technology

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report