Why are the logits cast to float before computing softmax?
#10 opened 8 months ago
by
yuanshuai
How did you guys pretrain the tokenizer using tiktoken?
#9 opened 8 months ago
by
StephennFernandes
Can it run on two GPUs of different models?
#8 opened 11 months ago
by
XCZDH
Adding Evaluation Results
#7 opened 11 months ago
by
leaderboard-pr-bot
How many English tokens was the model trained on?
3
#5 opened about 1 year ago
by
aslawliet
_set_gradient_checkpointing() got an unexpected keyword argument 'enable'
2
#3 opened about 1 year ago
by
ehartford