Tootfinder

@arXiv_csCY_bot@mastoxiv.page
2025-06-05 07:16:58

Improving Regulatory Oversight in Online Content Moderation
Benedetta Tessa, Denise Amram, Anna Monreale, Stefano Cresci
https://arxiv.org/abs/2506.04145 h…

The European Union introduced the Digital Services Act (DSA) to address the risks associated with digital platforms and promote a safer online environment. However, despite the potential of components such as the Transparency Database, Transparency Reports, and Article 40 of the DSA to improve platform transparency, significant challenges remain. These include data inconsistencies and a lack of detailed information, which hinder transparency in content moderation practices. Additionally, the ab…

@arXiv_csCY_bot@mastoxiv.page
2025-06-03 16:04:44

This https://arxiv.org/abs/2409.03219 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCY_…

Content Moderation by LLM: From Accuracy to Legitimacy
One trending application of LLM (large language model) is to use it for content moderation in online platforms. Most current studies on this application have focused on the metric of accuracy -- the extent to which LLMs make correct decisions about content. This article argues that accuracy is insufficient and misleading because it fails to grasp the distinction between easy cases and hard cases, as well as the inevitable trade-offs in achieving higher accuracy. Closer examination reveals that …

@tinoeberl@mastodon.online
2025-06-10 10:12:03

Did you catch this? Besides X and #Meta, now #Youtube too:
YouTube is adjusting its #Moderation and will leave more videos online in the future, even if they partly violate its guidelines.
Content with possible …

YouTube has loosened its content moderation policies
YouTube has loosened its content moderation policies, and has told reviewers not to remove potentially policy-violating videos if they’re in the public interest, according to a report from The New York Times.

@wfryer@mastodon.cloud
2025-07-25 15:12:43

#JournalArticle: The End of Trust & Safety?: Examining the Future of Content Moderation and Upheavals in Professional Online Safety Efforts
https://dl.acm.org/doi/10.1145/3706598.3713662

Four-panel comic:
1. Trust & Safety Teams – Woman with headset monitors harmful content online.
2. Layoffs 2021–2023 – Man receives layoff notice, symbolizing industry cuts.
3. Partisan Pressure – Hand points at content moderation icons amid COVID and election misinformation.
4. Future of T&S? – Researcher thinks, surrounded by lightbulbs and words: Design, Policy, Research.

@arXiv_csHC_bot@mastoxiv.page
2025-06-18 08:21:17

"I Cannot Write This Because It Violates Our Content Policy": Understanding Content Moderation Policies and User Experiences in Generative AI Products
Lan Gao, Oscar Chen, Rachel Lee, Nick Feamster, Chenhao Tan, Marshini Chetty
https://arxiv.org/abs/2506.14018

While recent research has focused on developing safeguards for generative AI (GAI) model-level content safety, little is known about how content moderation to prevent malicious content performs for end-users in real-world GAI products. To bridge this gap, we investigated content moderation policies and their enforcement in GAI online tools -- consumer-facing web-based GAI applications. We first analyzed content moderation policies of 14 GAI online tools. While these policies are comprehensive i…

@arXiv_csCY_bot@mastoxiv.page
2025-07-03 09:24:00

From Reports to Reality: Testing Consistency in Instagram's Digital Services Act Compliance Data
Marie-Therese Sekwenz, Ben Wagner, Hans De Bruijn
https://arxiv.org/abs/2507.01787

The Digital Services Act (DSA) introduces harmonized rules for content moderation and platform governance in the European Union, mandating robust compliance mechanisms, particularly for very large online platforms and search engines. This study examined compliance with DSA requirements, focusing on Instagram as a case study. We develop and apply a multi-level consistency framework to evaluate DSA compliance. Our findings contribute to the broader discussion on empirically-based regulation, prov…

@arXiv_csCL_bot@mastoxiv.page
2025-06-12 09:20:52

Large Language Models for Toxic Language Detection in Low-Resource Balkan Languages
Amel Muminovic, Amela Kadric Muminovic
https://arxiv.org/abs/2506.09992

Online toxic language causes real harm, especially in regions with limited moderation tools. In this study, we evaluate how large language models handle toxic comments in Serbian, Croatian, and Bosnian, languages with limited labeled data. We built and manually labeled a dataset of 4,500 YouTube and TikTok comments drawn from videos across diverse categories, including music, politics, sports, modeling, influencer content, discussions of sexism, and general topics. Four models (GPT-3.5 Turbo, G…

@arXiv_csHC_bot@mastoxiv.page
2025-07-10 09:13:31

Civil Society in the Loop: Feedback-Driven Adaptation of (L)LM-Assisted Classification in an Open-Source Telegram Monitoring Tool
Milena Pustet, Elisabeth Steffen, Helena Mihaljević, Grischa Stanjek, Yannis Illies
https://arxiv.org/abs/2507.06734

The role of civil society organizations (CSOs) in monitoring harmful online content is increasingly crucial, especially as platform providers reduce their investment in content moderation. AI tools can assist in detecting and monitoring harmful content at scale. However, few open-source tools offer seamless integration of AI models and social media monitoring infrastructures. Given their thematic expertise and contextual understanding of harmful content, CSOs should be active partners in co-dev…
