In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-01-19 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Servers >
Share
Shulou(Shulou.com)05/31 Report--
This article mainly explains "Storm how to write a crawler", the content of the article is simple and clear, easy to learn and understand, the following please follow the editor's ideas slowly in depth, together to study and learn "Storm how to write a crawler" bar!
Package com.digitalpebble.storm.crawler.bolt.indexing;import java.util.Map;import org.slf4j.LoggerFactory;import backtype.storm.task.OutputCollector;import backtype.storm.task.TopologyContext;import backtype.storm.topology.OutputFieldsDeclarer;import backtype.storm.topology.base.BaseRichBolt;import backtype.storm.tuple.Tuple;import com.digitalpebble.storm.crawler.StormConfiguration;import com.digitalpebble.storm.crawler.util.Configuration;/** * A generic bolt for indexing documents which determines which endpoint to use * based on the configuration and delegates the indexing to it. * * / @ SuppressWarnings ("serial") public class IndexerBolt extends BaseRichBolt {private Configuration config; private BaseRichBolt endpoint; private static final org.slf4j.Logger LOG = LoggerFactory .getLogger (IndexerBolt.class); public void prepare (Map conf, TopologyContext context, OutputCollector collector) {config = StormConfiguration.create (); / / get the implementation to use / / and instanciate it String className = config.get ("stormcrawler.indexer.class") If (className = = null) {throw new RuntimeException ("No configuration found for indexing");} try {final Class implClass = (Class) Class .forName (className); endpoint = implClass.newInstance ();} catch (final Exception e) {throw new RuntimeException ("Couldn't create" + className, e) } if (endpoint! = null) endpoint.prepare (conf, context, collector);} public void execute (Tuple tuple) {if (endpoint! = null) endpoint.execute (tuple);} public void declareOutputFields (OutputFieldsDeclarer declarer) {if (endpoint! = null) endpoint.declareOutputFields (declarer) }} Thank you for your reading, the above is the content of "how to write a crawler in Storm". After the study of this article, I believe you have a deeper understanding of how to write a crawler in Storm, and the specific use needs to be verified in practice. Here is, the editor will push for you more related knowledge points of the article, welcome to follow!
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
"Every 5-10 years, there's a rare product, a really special, very unusual product that's the most un
© 2024 shulou.com SLNews company. All rights reserved.