Shane Tian
ShaneTian
AI & ML interests
None yet
Recent Activity
Liked a model about 1 month ago: deepseek-ai/DeepSeek-V3-Base
New activity about 2 months ago: OpenCoder-LLM/opc-annealing-corpus: Question About the Completeness of the Released Dataset
Organizations
None yet
ShaneTian's activity
Question About the Completeness of the Released Dataset
#5 opened about 2 months ago by ShaneTian
What is the FIM template for the base model?
2
#4 opened 3 months ago by ShaneTian
Optimization details
2
#16 opened 11 months ago by ShaneTian
About the compressed file size < 10MB
2
#7 opened 11 months ago by ShaneTian
`CpmTokenizer` is different from the original CPM-1 tokenizer in GitHub
3
#1 opened over 2 years ago by ShaneTian
Not found `the-stack-v2-train-extras`
2
#5 opened 11 months ago by ShaneTian
Training loss or logs?
#15 opened 11 months ago by ShaneTian
ctx window & languages?
4
#1 opened about 1 year ago by JosephusCheung
Why does Code-Llama-34B not support infilling mode, i.e. FIM
1
#18 opened over 1 year ago by ShaneTian
Are there plans to include some models that use OctoPack to fine-tune, like OctoCoder, etc
2
#7 opened over 1 year ago by ShaneTian