FilBench: Can LLMs Understand and Generate Filipino?
Paper • 2508.03523 • Published • 1
My collection of works and collabs related to Filipino NLP. For more info, please visit: https://ljvmiranda921.github.io/filipino-nlp/
Note We created a benchmark, FilBench, for evaluating language models on several Filipino-centric tasks (EMNLP 2025).
An Open LLM Leaderboard for Filipino
Note Here, we introduce TLUnified-NER, a gold-standard dataset for Tagalog NER (SEALP 2023).
Note We created an open-source tool for Tagalog NLP based on spaCy (NLP-OSS 2023).
Note We introduce the largest Tagalog treebank to date, 100x larger than previous treebanks (ACL 2025).