This model was created by ilnikolaev

Trained from scratch using Tensorflow Keras

200mb Russian Comments from 2ch dataset used

  • Type: decoder-only
  • Tokenizer: BPE
  • Vocabulary size: 32000
  • Max sequence length: 120
  • Hidden size: 768
  • FFN size: 3072
  • Attention heads: 24
  • Decoder layers: 4
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.