HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation
Accepted at ACL (Findings) 2025
Naquee Rizwan, Seid Muhie Yimam, Daryna Dementieva, Florian Skupin, Tim Fischer, Daniil Moskovskiy, Aarushi Ajay Borkar, Robert Geislinger, Punyajoy Saha, Sarthak Roy, Martin Semmann, Alexander Panchenko, Chris Biemann, and Animesh Mukherjee: [Paper] [Arxiv] [Slides]
Despite regulations imposed by nations and social media platforms, e.g. (Government of India, 2021; European Parliament and Council of the European Union, 2022), inter alia, hateful content persists as a significant challenge. Existing approaches primarily rely on reactive measures such as blocking or suspending offensive messages, with emerging strategies focusing on proactive measurements like detoxification and counterspeech. In our work, which we call HATEPRISM, we conduct a comprehensive examination of hate speech regulations and strategies from three perspectives: country regulations, social platform policies, and NLP research datasets. Our findings reveal significant inconsistencies in hate speech definitions and moderation practices across jurisdictions and platforms, alongside a lack of alignment with research efforts. Based on these insights, we suggest ideas and research direction for further exploration of a unified framework for automated hate speech moderation incorporating diverse strategies.
Survey.xlsx file contains the detailed list of questionnaire spanning country regulations, platform policies and research datasets curated with the collaborative effort of NLP researchers and legal expert.
@inproceedings{rizwan-etal-2025-hateprism,
title = "{H}ate{PRISM}: Policies, Platforms, and Research Integration. Advancing {NLP} for Hate Speech Proactive Mitigation",
author = "Rizwan, Naquee and
Yimam, Seid Muhie and
Dementieva, Daryna and
Skupin, Dr. Florian and
Fischer, Tim and
Moskovskiy, Daniil and
Borkar, Aarushi Ajay and
Geislinger, Robert and
Saha, Punyajoy and
Roy, Sarthak and
Semmann, Martin and
Panchenko, Alexander and
Biemann, Chris and
Mukherjee, Animesh",
editor = "Che, Wanxiang and
Nabende, Joyce and
Shutova, Ekaterina and
Pilehvar, Mohammad Taher",
booktitle = "Findings of the Association for Computational Linguistics: ACL 2025",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.findings-acl.824/",
pages = "16008--16022",
ISBN = "979-8-89176-256-5",
abstract = "Despite regulations imposed by nations and social media platforms, e.g. (Government of India, 2021; European Parliament and Council of the European Union, 2022), inter alia, hateful content persists as a significant challenge. Existing approaches primarily rely on reactive measures such as blocking or suspending offensive messages, with emerging strategies focusing on proactive measurements like detoxification and counterspeech. In our work, which we call HATEPRISM, we conduct a comprehensive examination of hate speech regulations and strategies from three perspectives: country regulations, social platform policies, and NLP research datasets. Our findings reveal significant inconsistencies in hate speech definitions and moderation practices across jurisdictions and platforms, alongside a lack of alignment with research efforts. Based on these insights, we suggest ideas and research direction for further exploration of a unified framework for automated hate speech moderation incorporating diverse strategies."
}@misc{rizwan2025hateprismpoliciesplatformsresearch,
title={HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation},
author={Naquee Rizwan and Seid Muhie Yimam and Daryna Dementieva and Florian Skupin and Tim Fischer and Daniil Moskovskiy and Aarushi Ajay Borkar and Robert Geislinger and Punyajoy Saha and Sarthak Roy and Martin Semmann and Alexander Panchenko and Chris Biemann and Animesh Mukherjee},
year={2025},
eprint={2507.04350},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2507.04350},
}For any questions or issues, please contact: [email protected], [email protected], [email protected]
