A sentiment resource for under-represented Eastern Indo-Aryan languages (Bhojpuri, Maithili), built via a disagreement-guided annotation pipeline: a committee of models labels scraped text, and items on which the models disagree strongly are routed to a native-speaker expert.
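As a rough illustration of the routing step, the sketch below scores each item by how much the committee's labels disagree and sends high-disagreement items to an expert queue. Everything here is an assumption made for illustration and not the project's actual code: the function names `vote_entropy` and `route`, the label set (`pos`, `neg`, `neu`), the choice of vote entropy as the disagreement measure, and the 1.0-bit threshold.

```python
import math
from collections import Counter


def vote_entropy(labels):
    """Shannon entropy (in bits) of the committee's label votes for one item."""
    counts = Counter(labels)
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def route(items, threshold=1.0):
    """Split items into auto-accepted labels and an expert-review queue.

    `items` is a list of (text, committee_labels) pairs. Items whose vote
    entropy exceeds `threshold` (a hypothetical cutoff) go to the expert;
    the rest are accepted with the committee's majority label.
    """
    accepted, expert_queue = [], []
    for text, labels in items:
        if vote_entropy(labels) > threshold:
            expert_queue.append(text)
        else:
            majority = Counter(labels).most_common(1)[0][0]
            accepted.append((text, majority))
    return accepted, expert_queue


# Placeholder item ids stand in for scraped sentences.
sample = [
    ("item-1", ["pos", "pos", "pos"]),  # unanimous
    ("item-2", ["pos", "neg", "neu"]),  # three-way split
    ("item-3", ["pos", "pos", "neg"]),  # 2-1 split
]
accepted, expert_queue = route(sample)
```

With a three-model committee and this threshold, a 2-1 split is accepted with the majority label, while a three-way split is sent to the expert. Lowering the threshold would route 2-1 splits as well, at the cost of more expert time.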
Early development (pilot stage): validating the pipeline on a small sample before scaling.
MIT