grammatical_mt5_model
This model is a fine-tuned version of google/mt5-small on an unknown dataset. It achieves the following results on the evaluation set:
- Epoch: 0
- Train Loss: 49.2615
- Train Accuracy: 0.0051
- Validation Loss: 46.5507
- Validation Accuracy: 0.0313
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 3e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.99, 'epsilon': 0.1, 'amsgrad': False, 'weight_decay_rate': 0.01}
- training_precision: float32
Training results
Epoch | Train Loss | Train Accuracy | Validation Loss | Validation Accuracy |
---|---|---|---|---|
0 | 49.2615 | 0.0051 | 46.5507 | 0.0313 |
Framework versions
- Transformers 4.47.0
- TensorFlow 2.17.1
- Datasets 3.2.0
- Tokenizers 0.21.0
- Downloads last month
- 11
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
Model tree for rishikeshgautam/grammatical_mt5_model
Base model
google/mt5-small