Abstract
In 2022, the European Union introduced the Digital Services Act (DSA), a new legislation to report and moderate harmful content from online social networks. Trusted flaggers are mandated to identify harmful content, which platforms must remove within a set delay (currently 24 h). Here, we analyze the likely effectiveness of EU-mandated mechanisms for regulating highly viral online content with short half-lives. We deploy self-exciting point processes to determine the relationship between the regulated moderation delay and the likely harm reduction achieved. We find that harm reduction is achievable for the most harmful content, even for fast-paced platforms such as Twitter. Our method estimates moderation effectiveness for a given platform and provides a rule of thumb for selecting content for investigation and flagging, managing flaggers’ workload.
Author supplied keywords
Cite
CITATION STYLE
Schneider, P. J., & Rizoiu, M. A. (2023). The effectiveness of moderating harmful online content. Proceedings of the National Academy of Sciences of the United States of America, 120(34). https://doi.org/10.1073/pnas.2307360120
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.