File size: 1,683 Bytes
cdfa862 02274fc cdfa862 9b460db cdfa862 6b874d7 9b460db 6b874d7 cdfa862 6b874d7 cdfa862 6b874d7 cdfa862 6b874d7 cdfa862 ee3197b cdfa862 de7662a cdfa862 de7662a cdfa862 de7662a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 |
---
tags:
- fastai
- text-generation
language: ml
widget:
- text: "ഓഹരി വിപണി തകരുമ്പോള് നിക്ഷേപം എങ്ങനെ സുരക്ഷിതമാക്കാം"
example_title: "Malayalam Casual Language Model"
datasets:
- rajeshradhakrishnan/malayalam_wiki
---
# Blurr x Casual Machine Learning Model trained on Malayalam (മലയാളം) text. (Working in Progress)
[![മലയാളം: notebook](https://img.shields.io/badge/മലയാളം%20-notebook-green.svg)](https://nbviewer.org/github/rajeshradhakrishnanmvk/kitchen2.0/blob/main/ml/malayalam_blurr_xlm_roberta_base.ipynb)
---
# malayalam-blurr-xlm-roberta-base (base-sized model)
malayalam-blurr-xlm-roberta-base model is pre-trained on [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) using the library [blurr](https://ohmeow.github.io/blurr/) Language Model using fastai x huggingface frameworks.
Ref: [Causal Language Modeling](https://ohmeow.github.io/blurr/text-modeling-language-modeling.html#Causal-language-modeling).
## Usage
```
!pip install -Uqq huggingface_hub["fastai"] ohmeow-blurr
from huggingface_hub import from_pretrained_fastai
learner = from_pretrained_fastai(repo_id)
learner.blurr_generate("ബ്ളൂർ പഠിക്കാൻ വളെരെ എളുപ്പമാണ് എന്തുകൊണ്ട് എന്നാൽ", max_length=50, do_sample=True, top_k=25)
```
## Intended uses & limitations
It's not fine tuned to the state of the art accuracy
## Training and evaluation data
[Wiki 2020 Malayalam Dataset ](https://huggingface.co/datasets/rajeshradhakrishnan/malayalam_wiki)
|