Interview request: genAI evaluation & documentation
#61 opened 5 months ago
by
meggymuggy
language dependency
#60 opened 7 months ago
by
Jay369
[AUTOMATED] Model Memory Requirements
#59 opened 9 months ago
by
model-sizer-bot
Deployments to Azure and Inference Endpoints
#55 opened 10 months ago
by
mo2024
Very sensitve to any repetition penalty!
#52 opened 10 months ago
by
jukofyork
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65995c45539c808e84c38bf1/k0y3ULloWQEMvosQwHgrE.png)
Text2SQL2Output
#51 opened 10 months ago
by
Sudipta179002
The generated response cannot stop.
1
#50 opened 10 months ago
by
shaohuay
Saving dbrx model and tokenizer in dbfs
5
#49 opened 10 months ago
by
pro-shep
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/lD2D0Chg3jkJ1n42CJbWD.png)
OSError: Unable to load vocabulary from file
7
#47 opened 10 months ago
by
khurramnaseem
TypeError: __init__() got an unexpected keyword argument 'bias'
2
#46 opened 10 months ago
by
dainesn1
[DO NOT REVIEW] Mixtral like config
#45 opened 10 months ago
by
Pernekhan
Why clamp qkv_states, is it common?
#44 opened 10 months ago
by
jay68
Chat template
9
#43 opened 10 months ago
by
ehartford
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63111b2d88942700629f5771/u2a9y-yx6TG0N31OhMSHI.png)
GGUF quants?
1
#41 opened 10 months ago
by
Iommed
Does the tokenizer of this model have a network to load successfully?
3
#40 opened 10 months ago
by
Rnake
VRAM Requirements?
8
#39 opened 10 months ago
by
dounykim
How to get hands on experience as a newbie
1
#38 opened 10 months ago
by
kimsia
Text2sql template and examples
3
#34 opened 10 months ago
by
daxiongshu
Continuation of the Discussion: More than 10 minutes the status is in Setting `pad_token_id` to `eos_token_id`:100257 for open-end generation. #28
7
#31 opened 10 months ago
by
Madhugraj
Errors During Training for the Original Implementation and the Fixes for the Errors
2
#24 opened 10 months ago
by
v2ray
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/fTCV7VLY0eK4OXbwgIT2n.png)
Instruct dataset
#23 opened 10 months ago
by
Andriy
How to Fine Tune DBRX-Instruct?
7
#18 opened 10 months ago
by
elysiia
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/BnvZ-iL8S6QDi_lkejspP.jpeg)
Bug on AMD MI 250 with flash-attention
3
#13 opened 11 months ago
by
PierreColombo
The fused expert parameters means load_in_4bit doesn't work properly, nor does LoRA
31
#10 opened 11 months ago
by
tdrussell