iceman2434/xlm-roberta-base-fake-news-detection-tl

Text Classification

fake-news-detection


iceman2434 committed on Oct 21, 2024

Commit ad9e209 · verified · 1 Parent(s): 0f29c74

Create README.md

Files changed (1)

README.md +30 -0

README.md ADDED

	@@ -0,0 +1,30 @@

+# Tagalog Fake News Detection Model
+## Overview
+This project implements a fake news detection model for Tagalog/Filipino, built on the XLM-RoBERTa base model. It reaches 95.46% accuracy on the evaluation set.
+### Dataset
+- Total Size: 18,522 samples
+- Composition: 50/50 split of real and fake news
+- Languages: Filipino and English
+#### Dataset Split
+- Train Set: ~12,968 samples
+- Validation Set: ~2,784 samples
+- Test Set: ~2,770 samples
+### Performance Metrics (on Evaluation Set)
+- Accuracy: 95.46%
+- F1 Score: 95.40%
+- Precision: 95.40%
+- Recall: 95.40%
+## Data Sources
+The model was trained on a combined dataset from two primary sources:
+1. [Fake News Filipino Dataset](https://huggingface.co/datasets/jcblaise/fake_news_filipino)
+   - 3,206 rows used
+2. [Philippine Fake News Corpus](https://github.com/aaroncarlfernandez/Philippine-Fake-News-Corpus)
+   - 15,312 rows used out of 22,458 available
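
As a quick sanity check on the figures in this README, the following minimal Python sketch recomputes the split proportions and the combined source row count. The numbers are copied from the README; the variable and function names (`TOTAL`, `SPLITS`, `SOURCES`, `split_shares`) are illustrative assumptions and do not come from the repository.

```python
# Figures copied from the README above; all names here are illustrative only.
TOTAL = 18_522
SPLITS = {"train": 12_968, "validation": 2_784, "test": 2_770}
SOURCES = {
    "fake_news_filipino": 3_206,
    "philippine_fake_news_corpus": 15_312,
}


def split_shares(splits, total):
    """Return each split's share of the total as a percentage (2 decimals)."""
    return {name: round(100 * n / total, 2) for name, n in splits.items()}


shares = split_shares(SPLITS, TOTAL)
split_sum = sum(SPLITS.values())    # rows across train/validation/test
source_sum = sum(SOURCES.values())  # rows across the two listed sources

print("split shares (%):", shares)
print("split sum:", split_sum)
print("source sum:", source_sum)
print("total minus source sum:", TOTAL - source_sum)
```

The three splits add up to the stated total of 18,522 and correspond to roughly a 70/15/15 division. The two source row counts add up to 18,518, four fewer than the stated total; the README does not explain the difference.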