Bot Datasets on Twitter: Analysis and Challenges

Samper-Escalante, Luis Daniel; Loyola-González, Octavio; Monroy, Raúl; Medina-Pérez, Miguel Angel

Published in

MDPI, Applied Sciences, 9(11), p. 4105, 2021

DOI: 10.3390/app11094105

Tools

Export citation

Search in Google Scholar

Bot Datasets on Twitter: Analysis and Challenges

Journal article published in 2021 by Luis Daniel Samper-Escalante

, Octavio Loyola-González

, Raúl Monroy

, Miguel Angel Medina-Pérez

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

The reach and influence of social networks over modern society and its functioning have created new challenges and opportunities to prevent the misuse or tampering of such powerful tools of social interaction. Twitter, a social networking service that specializes in online news and information exchange involving billions of users world-wide, has been infested by bots for several years. In this paper, we analyze both public and private databases from the literature of bot detection on Twitter. We summarize their advantages, disadvantages, and differences, recommending which is more suitable to work with depending on the necessities of the researcher. From this analysis, we present five distinct behaviors in automated accounts exhibited across all the bot datasets analyzed from these databases. We measure their level of presence in each dataset using a radar chart for visual comparison. Finally, we identify four challenges that researchers of bot detection on Twitter have to face when using these databases from the literature.

Published in

Links

Tools

Bot Datasets on Twitter: Analysis and Challenges

Abstract