2025-01-15 Update From: SLTechnology News&Howtos
Shulou(Shulou.com)11/24 Report--
CTOnews.com June 26, a research team at the Massachusetts Institute of Technology (MIT) recently published a paper arguing that existing third-party tools for automatically detecting Twitter bot accounts are not accurate, because the datasets they are built on are too simple and do not generalize.
Earlier reports said that the large number of bot accounts was one of the obstacles to Musk's purchase of Twitter. Twitter claimed at the time that 5% of its daily active users were bot accounts, but Musk said the real figure was much higher.
Twitter has its own bot account identification system, but it has not been made public, so for the general public third-party tools are the more practical way to detect bots. These tools apply machine learning models to datasets collected from Twitter in order to spot suspicious signs of bot activity. Many such tools and models have been used to study bot activity on social media, and thousands of related papers have been published.
▲ Public benchmark datasets for Twitter bot detection
Most of the benchmark datasets in these papers were collected from different samples of tweets, many of them from a narrow slice of Twitter (such as tweets with a specific hashtag), with each entry manually labeled as a bot or a human. However, a bot detection model trained this way performs well only within that narrow domain. It does not cover all domains, and it relies heavily on the specifics of its data rather than on fundamental differences between bots and humans.
When these models are tested on datasets from other domains, their accuracy is very poor, close to random guessing. At the same time, on many datasets even a relatively simple model is as accurate as the most advanced, state-of-the-art (SOTA) machine learning models.
▲ Comparison of the performance of simple models and SOTA models on benchmark datasets
In other words, a model trained on one dataset does not generalize to other datasets, and existing bot detection datasets lack generality because they were collected in overly simple ways.
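The cross-dataset failure described above can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the two datasets are synthetic, the two features are hypothetical, and the one-rule threshold classifier only stands in for a "relatively simple model" (the article does not say which models the paper used). In dataset A only feature 0 separates bots from humans; in dataset B only feature 1 does.

```python
import random


def make_dataset(n, informative, seed):
    """Build n synthetic accounts with two features in [0, 1).

    Only the feature at index `informative` separates bots (label 1)
    from humans (label 0); the other feature is pure noise.
    This is toy data, not real Twitter data.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        label = rng.randint(0, 1)
        features = [rng.random(), rng.random()]
        # Humans land in [0, 0.5), bots in [0.5, 1) on the informative feature.
        features[informative] = 0.5 * rng.random() + (0.5 if label else 0.0)
        rows.append((features, label))
    return rows


def accuracy(stump, rows):
    """Fraction of rows a one-rule classifier labels correctly."""
    feature, threshold, bot_if_above = stump
    correct = 0
    for features, label in rows:
        predicted = int((features[feature] >= threshold) == bot_if_above)
        correct += predicted == label
    return correct / len(rows)


def train_stump(rows):
    """Pick the single (feature, threshold, direction) rule with the best training accuracy."""
    candidates = [
        (feature, t / 10, bot_if_above)
        for feature in range(2)
        for t in range(1, 10)
        for bot_if_above in (True, False)
    ]
    return max(candidates, key=lambda stump: accuracy(stump, rows))


dataset_a = make_dataset(1000, informative=0, seed=1)  # hypothetical "domain A"
dataset_b = make_dataset(1000, informative=1, seed=2)  # hypothetical "domain B"

stump = train_stump(dataset_a)
in_domain = accuracy(stump, dataset_a)
cross_domain = accuracy(stump, dataset_b)

print("learned rule:", stump)
print("accuracy on the training domain:", in_domain)
print("accuracy on the other domain:", cross_domain)
```

The rule learned on dataset A keys on feature 0, which carries no signal in dataset B, so its accuracy there falls to roughly chance. This shows only the mechanism in miniature; it says nothing about the size of the effect on real benchmarks.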
Finally, the researchers warn that anyone using existing bot detection datasets should carefully consider what kinds of bias they may contain. They believe the fundamental solution is for social media platforms such as Twitter to provide researchers with rich, reliable data and high-quality ground-truth labels.
CTOnews.com encloses the address of the paper: click here to