The effectiveness of moderating harmful online content

Philipp J. Schneider; Marian Andrei Rizoiu

Journal Article, Open Access

Proceedings of the National Academy of Sciences of the United States of America (2023) 120(34)

DOI: 10.1073/pnas.2307360120

13 Citations

26 Readers

Abstract

In 2022, the European Union introduced the Digital Services Act (DSA), new legislation for reporting and moderating harmful content on online social networks. Trusted flaggers are mandated to identify harmful content, which platforms must remove within a set delay (currently 24 h). Here, we analyze the likely effectiveness of EU-mandated mechanisms for regulating highly viral online content with short half-lives. We deploy self-exciting point processes to determine the relationship between the regulated moderation delay and the likely harm reduction achieved. We find that harm reduction is achievable for the most harmful content, even on fast-paced platforms such as Twitter. Our method estimates moderation effectiveness for a given platform and provides a rule of thumb for selecting content for investigation and flagging, which helps manage flaggers’ workload.
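The link between moderation delay and content half-life can be illustrated with a small sketch. It is not the authors' model: it assumes a self-exciting process with an exponential memory kernel, and the function names and example half-lives are hypothetical. Under that assumption, the share of a post's expected direct reshares that would arrive after a removal delay Δ is exp(-θΔ), where θ = ln 2 / half-life.

```python
import math


def decay_rate(half_life_h: float) -> float:
    """Rate theta of an exponential kernel with the given half-life (hours)."""
    if half_life_h <= 0:
        raise ValueError("half_life_h must be positive")
    return math.log(2) / half_life_h


def fraction_prevented(delay_h: float, half_life_h: float) -> float:
    """Share of expected direct reshares that falls after delay_h.

    Assumption (not from the paper): the kernel is
    phi(t) = n * theta * exp(-theta * t), so the tail integral from delay_h
    to infinity divided by the full integral equals exp(-theta * delay_h).
    """
    if delay_h < 0:
        raise ValueError("delay_h must be non-negative")
    return math.exp(-decay_rate(half_life_h) * delay_h)


if __name__ == "__main__":
    delay_h = 24.0  # removal delay named in the abstract
    # Hypothetical half-lives, chosen only to show the trend.
    for half_life_h in (0.5, 6.0, 24.0, 72.0):
        share = fraction_prevented(delay_h, half_life_h)
        print(f"half-life {half_life_h:5.1f} h -> share prevented {share:.4f}")
```

The sketch counts only direct reshares of the removed post and ignores reshares of reshares, so it indicates the trend (shorter half-life, less to prevent at a fixed delay) and is not a harm estimate.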

Cite

Schneider, P. J., & Rizoiu, M. A. (2023). The effectiveness of moderating harmful online content. Proceedings of the National Academy of Sciences of the United States of America, 120(34). https://doi.org/10.1073/pnas.2307360120
