๐ฎ๐ฉ FinBERT Indonesia โ Sentiment Classification for Financial News in Bahasa Indonesia
This model is a fine-tuned version of ProsusAI/finbert on a custom dataset of ~500 financial news headlines written in Bahasa Indonesia. The task is 3-class sentiment classification: positive, neutral, and negative.
๐๏ธ Model Architecture
The base model is FinBERT, which itself is a BERT model pre-trained on financial texts. It has been fine-tuned using the Hugging Face transformers library with the following modifications:
- Multilingual financial context adaptation via custom labeled data in Bahasa Indonesia
- Classification head for 3 sentiment labels
๐งพ Dataset
The training dataset consists of 500 manually labeled financial news titles from Indonesian sources. Each entry is categorized as:
positiveโ bullish or growth-related headlinesneutralโ factual or event-based reportingnegativeโ bearish or risk-indicative headlines
Example:
| Title | Label |
|---|---|
| IHSG diperkirakan rebound minggu ini | positive |
| BI umumkan suku bunga tetap | neutral |
| Rupiah melemah terhadap dolar AS | negative |
๐งช Evaluation
Evaluation is based on accuracy using a stratified train/test split.
| Metric | Score |
|---|---|
| Accuracy | TBD |
To reproduce the benchmark or compare other models, see the sample inference code below.
๐งช Usage
from transformers import pipeline
classifier = pipeline(
"text-classification",
model="michaelmanurung/finbert-indonesia",
tokenizer="michaelmanurung/finbert-indonesia"
)
result = classifier("IHSG turun tipis karena aksi ambil untung investor.")
print(result)
# Output: [{'label': 'LABEL_2', 'score': 0.89}] -> e.g. 'positive'
- Downloads last month
- 40
Model tree for michaelmanurung/finbert-indonesia
Base model
ProsusAI/finbert