ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark
-
Updated
Sep 2, 2025 - Python
ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark
Accepted at Transaction of Machine Learning Research (TMLR)
Intercept AI - Empowering safe and secure online experiences through cutting-edge content filtering technology.
Agent Shield is an advanced AI agent designed to safeguard digital environments by detecting and categorizing harmful content, including hate speech and personal information.
Open Bengali (Bangla) slang & harmful-language dataset for moderation and safety research
Add a description, image, and links to the harmful-content-detection topic page so that developers can more easily learn about it.
To associate your repository with the harmful-content-detection topic, visit your repo's landing page and select "manage topics."