Shulou (Shulou.com), 05/31 report. Updated 2025-01-19, from SLTechnology News&Howtos.
This article explains what Solr is by walking through its features, grouped by area: schema, query, core, SolrCloud, and the admin interface.
## Solr Features

Solr is a standalone enterprise search server with a REST-like API. You put documents into it (called "indexing") via XML, JSON, CSV, or binary over HTTP. You query it via HTTP GET and receive XML, JSON, CSV, or binary results.
Advanced full-text search capabilities
Optimized for high-volume web traffic
Standards-based open interfaces: XML, JSON, and HTTP
Comprehensive HTML administration interfaces
Server statistics exposed over JMX for monitoring
Linearly scalable, with automatic index replication, failover, and recovery
Near real-time indexing
Flexible and adaptable through XML configuration
Extensible plugin architecture
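
The REST-like API described above can be sketched in a few lines of Python. This is only an illustration that builds the HTTP requests without sending them; the base URL `http://localhost:8983/solr`, the collection name `articles`, and the helper names are assumptions, not something the article specifies.

```python
import json
from urllib.parse import urlencode

# Assumptions for illustration: a local Solr and a collection called "articles".
SOLR_BASE = "http://localhost:8983/solr"
COLLECTION = "articles"


def build_update_request(docs, commit=True):
    """Return (url, headers, body) for posting a JSON array of documents to /update."""
    url = f"{SOLR_BASE}/{COLLECTION}/update"
    if commit:
        # commit=true makes the new documents visible to searches right away.
        url += "?" + urlencode({"commit": "true"})
    headers = {"Content-Type": "application/json"}
    body = json.dumps(docs).encode("utf-8")
    return url, headers, body


def build_select_url(q, wt="json", rows=10):
    """Return the HTTP GET URL for a query; wt selects the response format."""
    params = {"q": q, "wt": wt, "rows": rows}
    return f"{SOLR_BASE}/{COLLECTION}/select?{urlencode(params)}"
```

The tuple from `build_update_request` can be handed to any HTTP client (for example `urllib.request.Request(url, data=body, headers=headers)`), and the URL from `build_select_url` can be fetched with a plain GET.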
## Solr Uses the Lucene Search Library and Extends It
A real data schema, with numeric types, dynamic fields, and unique keys
Powerful extensions to the Lucene query language
Faceted search and filtering
Geospatial search, with support for multiple points per document and geo-polygons
Advanced, configurable text analysis
Highly configurable and user-extensible caching
Performance optimizations
External configuration via XML
An AJAX-based administration interface
Monitorable logging
Fast near-real-time incremental indexing and index replication
Highly scalable distributed search with the index sharded across multiple hosts
JSON, XML, CSV/delimited-text, and binary update formats
Easy ways to pull in data from databases, XML files on local disk, and HTTP sources
Rich document parsing and indexing (PDF, Word, HTML, etc.) using Apache Tika
Apache UIMA integration for configurable metadata extraction
Multiple search indices
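
As a small illustration of the geospatial item in the list above, the sketch below builds a filter that keeps only documents within a given distance of a point, using Solr's `geofilt` query parser. The field name `store` and the coordinates are made up for the example, and the helper names are hypothetical.

```python
from urllib.parse import urlencode


def geofilt(sfield, lat, lon, distance_km):
    """Build a geofilt filter: documents whose sfield lies within distance_km of (lat, lon)."""
    return f"{{!geofilt sfield={sfield} pt={lat},{lon} d={distance_km}}}"


def build_geo_query(sfield, lat, lon, distance_km):
    """Return URL-encoded parameters that match everything, then filter by distance."""
    params = [("q", "*:*"), ("fq", geofilt(sfield, lat, lon, distance_km))]
    return urlencode(params)
```

The filter goes in `fq` rather than `q`, so it restricts the result set without affecting relevance scores.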
#### Schema
Defines the fields of documents and their field types
Can drive more intelligent processing
Declarative Lucene analyzer specification
Dynamic fields allow new fields to be added on the fly
CopyField functionality allows indexing a single field in multiple ways, or combining multiple fields into a single searchable field
Explicit types eliminate the need to guess field types
External file-based configuration of stopword lists, synonym lists, and protected word lists
Many additional text analysis components, including word splitting, regular expressions, and sounds-like filters
Pluggable similarity model per field
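
To make dynamic fields and copy fields from the list above more concrete, here is a minimal sketch of a JSON payload in the shape the Solr Schema API accepts (`add-field`, `add-dynamic-field`, `add-copy-field`). The field names (`title`, `text_all`, `*_i`) and the field types `text_general` and `pint` are assumptions; they exist in Solr's default configset but may not exist in yours. The pattern-matching helper is a local approximation, not Solr's own code.

```python
import json
from fnmatch import fnmatchcase


def schema_commands():
    """Return Schema API commands: two explicit fields, a dynamic field, and a copy field."""
    return {
        "add-field": [
            {"name": "title", "type": "text_general", "stored": True},
            # Catch-all search field; populated by the copy field below, so not stored.
            {"name": "text_all", "type": "text_general", "stored": False, "multiValued": True},
        ],
        # Any field whose name ends in _i is created on the fly as an integer field.
        "add-dynamic-field": {"name": "*_i", "type": "pint", "stored": True},
        # Index title a second time into the catch-all field.
        "add-copy-field": {"source": "title", "dest": ["text_all"]},
    }


def matches_dynamic_field(field_name, pattern):
    """Approximate dynamic-field matching with a glob; Solr patterns use a leading or trailing '*'."""
    return fnmatchcase(field_name, pattern)


payload = json.dumps(schema_commands(), indent=2)
```

The `payload` string would be POSTed to the collection's `/schema` endpoint with a JSON content type.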
#### Query
HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby, PHP, Velocity, CSV, binary)
Sort by any number of fields, and by complex functions of numeric fields
Advanced DisMax query parser for highly relevant results from user-entered queries
Highlighted context snippets
Faceted search based on unique field values, explicit queries, date ranges, numeric ranges, or pivot
Multi-select faceting by tagging and selectively excluding filters
Spelling suggestions for user queries
"More like this" suggestions for a given document
Function Query: influence the score with user-specified complex functions of numeric fields or query relevancy scores
Range filter over Function Query results
Date Math: specify dates relative to "NOW" in queries and updates
Dynamic search results clustering using Carrot2
Numeric field statistics, such as minimum, maximum, average, and standard deviation
Combine queries derived from different syntaxes
Auto-suggest functionality for completing user queries
Allows configuring the top results for a query, overriding normal scoring and sorting
Simple join capability between two document types
Performance optimizations
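
Several of the query features listed above map directly onto request parameters. The sketch below assembles one such request: DisMax parsing, a Date Math filter, a range filter over a function query, sorting, faceting, and highlighting. The field names (`title`, `body`, `published`, `category`, `user_rating`, `editor_rating`) are hypothetical; the parameter names are standard Solr query parameters.

```python
from urllib.parse import urlencode


def build_search_params(user_query):
    """Return an ordered list of (name, value) pairs; fq may appear more than once."""
    return [
        ("q", user_query),
        ("defType", "dismax"),                      # DisMax parser for user-entered text
        ("qf", "title^2 body"),                     # search both fields, boosting title
        ("fq", "published:[NOW-7DAYS TO NOW]"),     # Date Math relative to NOW
        ("fq", "{!frange l=0 u=5}sum(user_rating,editor_rating)"),  # range over a function
        ("sort", "score desc, published desc"),     # sort by several criteria
        ("facet", "true"),
        ("facet.field", "category"),                # facet counts per unique field value
        ("hl", "true"),
        ("hl.fl", "body"),                          # highlighted snippets from body
        ("wt", "json"),                             # response format
    ]


def build_query_string(user_query):
    """URL-encode the parameters for use in an HTTP GET to the /select handler."""
    return urlencode(build_search_params(user_query))
```

A list of pairs is used instead of a dict because `fq` is repeated; each `fq` is applied as an independent filter.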
#### Core
Dynamically create and delete document collections without restarting
Pluggable query handlers and extensible XML data format
Pluggable user functions for Function Query
Customizable component-based request handler with distributed search support
Document uniqueness enforcement based on the unique key field
Duplicate document detection, including fuzzy near-duplicates
Custom index processing chains, allowing document manipulation before indexing
User configurable commands triggered on index changes
Ability to control where docs with the sort field missing will be placed
"Luke" request handler for corpus information
#### SolrCloud
Centralized configuration based on Apache ZooKeeper
Automated distributed indexing/sharding: send a document to any node and it is forwarded to the correct shard
Near real-time indexing with immediate push-based replication (slower pull-based replication is also supported)
The transaction log ensures that no updates are lost, even if the documents have not yet been indexed to disk
Automated query failover, index leader election, and recovery in case of failure
No single point of failure
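
The "send a document to any node" behaviour above rests on every node being able to compute which shard owns a document. The sketch below shows the general idea of hash-range routing only: it uses `zlib.crc32` as a stand-in hash and evenly split ranges, both of which are assumptions for illustration. Solr's actual router and hash function differ.

```python
import zlib

HASH_SPACE = 2 ** 32  # crc32 yields an unsigned 32-bit value in Python 3


def make_ranges(num_shards):
    """Split the hash space into num_shards contiguous (name, low, high) ranges."""
    step = HASH_SPACE // num_shards
    ranges = []
    for i in range(num_shards):
        low = i * step
        # The last shard absorbs any remainder so the whole space is covered.
        high = HASH_SPACE - 1 if i == num_shards - 1 else (i + 1) * step - 1
        ranges.append((f"shard{i + 1}", low, high))
    return ranges


def route(doc_id, ranges):
    """Return the name of the shard whose range contains the hash of doc_id."""
    h = zlib.crc32(doc_id.encode("utf-8"))
    for name, low, high in ranges:
        if low <= h <= high:
            return name
    raise ValueError("hash ranges do not cover the hash space")
```

Because the mapping depends only on the document id and the shard ranges, any node that knows the ranges reaches the same answer and can forward the document without consulting a central coordinator.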
#### Admin Interface
Comprehensive statistics on cache utilization, updates, and queries
Interactive schema browser that includes index statistics
Replication monitoring
SolrCloud dashboard for graphical status of cluster nodes
Full logging control
Text analysis debugger, showing the result of every stage in an analyzer
Web query interface with debugging output
Parsed query output
Lucene explain() document score detailing
Explain score for documents outside of the requested range, to debug why a given document wasn't ranked higher
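
The debugging output described above is also available over HTTP, outside the admin interface. The sketch below builds a query that asks for parsed-query and score-explanation output via `debugQuery`, and uses `explainOther` to request an explanation for a document that may fall outside the returned rows. The document id `doc42` and the helper name are hypothetical.

```python
from urllib.parse import urlencode


def build_debug_params(q, other_doc_query=None, rows=10):
    """Return URL-encoded parameters for a query with debugging output enabled."""
    params = [
        ("q", q),
        ("rows", rows),
        ("debugQuery", "true"),  # adds parsed query and per-document explain output
        ("wt", "json"),
    ]
    if other_doc_query is not None:
        # explainOther takes a query selecting the documents to explain,
        # whether or not they appear in the requested range of results.
        params.append(("explainOther", other_doc_query))
    return urlencode(params)
```

For example, `build_debug_params("title:solr", other_doc_query="id:doc42")` asks why `doc42` scored as it did against the query `title:solr`.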