HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation

Accepted at ACL (Findings) 2025

Naquee Rizwan, Seid Muhie Yimam, Daryna Dementieva, Florian Skupin, Tim Fischer, Daniil Moskovskiy, Aarushi Ajay Borkar, Robert Geislinger, Punyajoy Saha, Sarthak Roy, Martin Semmann, Alexander Panchenko, Chris Biemann, and Animesh Mukherjee: [Paper] [Arxiv] [Slides]

Abstract

Despite regulations imposed by nations and social media platforms, e.g. (Government of India, 2021; European Parliament and Council of the European Union, 2022), inter alia, hateful content persists as a significant challenge. Existing approaches primarily rely on reactive measures such as blocking or suspending offensive messages, with emerging strategies focusing on proactive measurements like detoxification and counterspeech. In our work, which we call HATEPRISM, we conduct a comprehensive examination of hate speech regulations and strategies from three perspectives: country regulations, social platform policies, and NLP research datasets. Our findings reveal significant inconsistencies in hate speech definitions and moderation practices across jurisdictions and platforms, alongside a lack of alignment with research efforts. Based on these insights, we suggest ideas and research direction for further exploration of a unified framework for automated hate speech moderation incorporating diverse strategies.

File/Folder Description 📂

Survey.xlsx file contains the detailed list of questionnaire spanning country regulations, platform policies and research datasets curated with the collaborative effort of NLP researchers and legal expert.

Please cite our paper

@inproceedings{rizwan-etal-2025-hateprism,
    title = "{H}ate{PRISM}: Policies, Platforms, and Research Integration. Advancing {NLP} for Hate Speech Proactive Mitigation",
    author = "Rizwan, Naquee  and
      Yimam, Seid Muhie  and
      Dementieva, Daryna  and
      Skupin, Dr. Florian  and
      Fischer, Tim  and
      Moskovskiy, Daniil  and
      Borkar, Aarushi Ajay  and
      Geislinger, Robert  and
      Saha, Punyajoy  and
      Roy, Sarthak  and
      Semmann, Martin  and
      Panchenko, Alexander  and
      Biemann, Chris  and
      Mukherjee, Animesh",
    editor = "Che, Wanxiang  and
      Nabende, Joyce  and
      Shutova, Ekaterina  and
      Pilehvar, Mohammad Taher",
    booktitle = "Findings of the Association for Computational Linguistics: ACL 2025",
    month = jul,
    year = "2025",
    address = "Vienna, Austria",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.findings-acl.824/",
    pages = "16008--16022",
    ISBN = "979-8-89176-256-5",
    abstract = "Despite regulations imposed by nations and social media platforms, e.g. (Government of India, 2021; European Parliament and Council of the European Union, 2022), inter alia, hateful content persists as a significant challenge. Existing approaches primarily rely on reactive measures such as blocking or suspending offensive messages, with emerging strategies focusing on proactive measurements like detoxification and counterspeech. In our work, which we call HATEPRISM, we conduct a comprehensive examination of hate speech regulations and strategies from three perspectives: country regulations, social platform policies, and NLP research datasets. Our findings reveal significant inconsistencies in hate speech definitions and moderation practices across jurisdictions and platforms, alongside a lack of alignment with research efforts. Based on these insights, we suggest ideas and research direction for further exploration of a unified framework for automated hate speech moderation incorporating diverse strategies."
}

@misc{rizwan2025hateprismpoliciesplatformsresearch,
      title={HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation}, 
      author={Naquee Rizwan and Seid Muhie Yimam and Daryna Dementieva and Florian Skupin and Tim Fischer and Daniil Moskovskiy and Aarushi Ajay Borkar and Robert Geislinger and Punyajoy Saha and Sarthak Roy and Martin Semmann and Alexander Panchenko and Chris Biemann and Animesh Mukherjee},
      year={2025},
      eprint={2507.04350},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2507.04350}, 
}

Correspondance

For any questions or issues, please contact: [email protected], [email protected], [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
HatePRISM_Poster.jpg		HatePRISM_Poster.jpg
HatePRISM_Poster.pdf		HatePRISM_Poster.pdf
LICENSE		LICENSE
README.md		README.md
Survey.xlsx		Survey.xlsx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation

Abstract

File/Folder Description 📂

Please cite our paper

Correspondance

About

Uh oh!

Releases

Packages

License

hate-alert/platforms_policies_research

Folders and files

Latest commit

History

Repository files navigation

HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation

Abstract

File/Folder Description 📂

Please cite our paper

Correspondance

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages