File size: 4,649 Bytes

340cedb

---

language:
- id
- ace
- ban
- bjn
- bug
- jav
- mad
- min
- sun
- bbc
- eng
library_name: transformers
pipeline_tag: text-classification
tags:
- text-classification
- hate-speech-detection
- abusive-language-detection
- multilabel-classification
- indonesian
- multilingual
- social-media
- natural-language-processing
- xlm-roberta
license: apache-2.0
metrics:
- accuracy
- f1
base_model:
- FacebookAI/xlm-roberta-base
---


# Hate Speech & Abusive Language Detection (Multilabel)  
**Multilingual Indonesian & English — XLM-RoBERTa**

This repository provides a fine-tuned **XLM-RoBERTa** model for **MULTILABEL HATE CONTENT DETECTION** in social media text.  
The model is designed to identify **Hate Speech** and **Abusive Language** simultaneously across **Indonesian**, **regional Indonesian languages**, and **English**, particularly in noisy and informal online conversations.

---

## 🚀 Highlights

- Multilabel classification: **Hate Speech** & **Abusive Language**
- Supports overlapping labels in a single text
- Multilingual (Indonesia + English)
- Robust on informal and user-generated content
- Ready-to-use with Hugging Face `pipeline`
- Suitable for content moderation and safety systems

---

## 🌍 Supported Languages

- 🇮🇩 Bahasa Indonesia  
- Bahasa Melayu  
- Indonesian regional languages (Aceh, Banjar, Bugis, Jawa, Madura, Minang, Sunda, dll.)  
- 🇬🇧 English  

---

## 📊 Model Performance

> Performance metrics are reported on a held-out validation set.

| Metric          | Score  |
|-----------------|--------|
| Precision       | 0.9249 |
| Recal           | 0.9300 |
| F1 (Macro)      | 0.9274 |
| F1 (Weighted)   | 0.9269 |
| Training Loss   | 0.1181 |
| Validation Loss | 0.2070 |

*(Exact scores may vary depending on evaluation split and threshold.)*

---

## ⚙️ Usage

### Installation
```bash

pip install transformers torch

````

### Single Prediction

```python

from transformers import pipeline



classifier = pipeline(

    task="text-classification",

    model="nahiar/hatespeech-abusive-xlm-roberta-v1",

    return_all_scores=True

)



result = classifier("Dasar bodoh, otak udang!")

print(result)

```

**Output**

```text

[

  {'label': 'HATESPEECH', 'score': 0.9123},

  {'label': 'ABUSIVE', 'score': 0.9841}

]

```

> Because this is a **multilabel model**, more than one label can be active for a single input.

---

## 🏷️ Label Definitions

```text

HATESPEECH → Content that attacks or demeans a group based on identity

ABUSIVE    → Insulting, offensive, or aggressive language without protected targets

```

---

## 📦 Batch Inference

```python

texts = [

    "Dasar kaum ini selalu bikin rusuh",

    "Kamu memang bodoh dan tidak berguna",

    "Saya tidak setuju dengan pendapat kamu"

]



results = classifier(texts)



for text, preds in zip(texts, results):

    labels = [(p["label"], round(p["score"], 4)) for p in preds]

    print(text, "→", labels)

```

---

## 🏗️ Training Configuration

| Parameter         | Value                     |
| ----------------- | ------------------------- |
| Base Model        | xlm-roberta-base          |
| Task Type         | Multilabel Classification |
| Training Strategy | Fine-tuning               |
| Epochs            | Multiple                  |
| Learning Rate     | 2e-5                      |
| Batch Size        | 16                        |
| Training Date     | 2025-12-18                |

---

## 🎯 Intended Use

* Hate speech & abusive language moderation
* Content safety and compliance systems
* Social media monitoring dashboards
* Pre-filtering before sentiment or topic analysis

---

## ⚠️ Limitations

* Limited to **Hate Speech** and **Abusive Language** labels
* Does not identify specific hate targets or protected attributes
* Context-dependent sarcasm may be misclassified
* Not suitable for legal or policy enforcement without human review

---

## 📜 License

This model is released under the **Apache License 2.0**
Free for research and commercial use.

---

## 📚 Citation

```bibtex

@misc{djunaedi2025hatespeech_multilabel,

  author    = {Raihan Hidayatulloh Djunaedi},

  title     = {Multilabel Hate Speech and Abusive Language Detection for Social Media Text},

  year      = {2025},

  publisher = {Hugging Face},

  url       = {https://huggingface.co/nahiar/hatespeech-xlmr-v4}

}

```

---

## 🙌 Acknowledgements

* Hugging Face Transformers
* Facebook AI Research — XLM-RoBERTa