In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-04-12 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Development >
Share
Shulou(Shulou.com)06/02 Report--
This article focuses on "what are the characteristics of IK Analyzer 2012". Interested friends may wish to have a look at it. The method introduced in this paper is simple, fast and practical. Let's let the editor take you to learn "what are the features of IK Analyzer 2012"?
IK Analyzer is an open source, java-based lightweight Chinese word segmentation toolkit. Since the release of version 1.0 in December 2006, IKAnalyzer has launched four major versions. At first, it is a Chinese word segmentation component based on the open source project Luence, which combines dictionary word segmentation and grammar analysis algorithm. Since version 3. 0, IK has evolved into a common word segmentation component for Java, independent of the Lucene project, and provides a default optimization implementation for Lucene. In version 2012, IK implements a simple ambiguity elimination algorithm for word segmentation, which marks the evolution of IK word segmentation from simple dictionary segmentation to simulated semantic word segmentation.
IK Analyzer 2012 features:
A unique "forward iterative finest granularity segmentation algorithm" is adopted, which supports fine-grained segmentation and intelligent word segmentation.
In the system environment: Core2 i7 3.4G dual core, 4G memory, window 7 64-bit, Sun JDK 1.6-29 64-bit ordinary pc environment test, IK2012 has 1.6 million words per second (3000KB/S) high-speed processing capacity.
The 2012 version of intelligent word segmentation mode supports simple word segmentation and ambiguity processing and quantifier merge output.
The use of multi-processor analysis mode, support: English letters, numbers, Chinese vocabulary and other word segmentation processing, compatible with Korean and Japanese characters.
Optimized dictionary storage, smaller memory footprint. Support for user dictionary extension definitions. In particular, in version 2012, the dictionary supports mixed words in Chinese, English and numbers.
At this point, I believe you have a deeper understanding of "what are the characteristics of IK Analyzer 2012". You might as well do it in practice. Here is the website, more related content can enter the relevant channels to inquire, follow us, continue to learn!
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
"Every 5-10 years, there's a rare product, a really special, very unusual product that's the most un
© 2024 shulou.com SLNews company. All rights reserved.