arxiv:2501.00958
Yongliang Shen
tricktreat
AI & ML interests
None yet
Recent Activity
liked a model 29 minutes ago: unsloth/DeepSeek-V3-GGUF
liked a model about 3 hours ago: simplescaling/s1-32B
liked a model 1 day ago: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Organizations
models 25
tricktreat/iChainGPT-Instruct
tricktreat/ichaingpt • 7
tricktreat/llama-2-7b-chat-merged-with-llama-2-7b-chat-12layers-T6-25000steps-lora612-hhrlhf • Text Generation • 10
tricktreat/llama-2-7b-chat-merged-with-llama-2-7b-chat-12layers-T6-25000steps-peft-lora-orpo • Text Generation • 13
tricktreat/llama-2-7b-chat-12layers-T6-25000steps-llama-2-7b-chat-12layers-T6-25000steps-peft-lora-orpo • Text Generation • 11
tricktreat/llama-2-7b-chat-12layers-T6-25000steps-llama-2-7b-chat-12layers-T6-25000steps-lora612-hhrlhf • Text Generation • 8
tricktreat/llama-2-7b-chat-12layers-T6-25000steps-llama-2-7b-chat-12layers-T6-25000steps-peft-lora-orpo-2 • Text Generation • 131
tricktreat/llama-2-7b-chat-merged-with-llama-2-7b-chat-12layers-T6-25000steps-peft-lora-orpo-2 • Text Generation • 8
tricktreat/llama-2-7b-chat-12layers-T6-merged-with-llama-2-7b-chat-peft-lora-orpo • Text Generation • 8
tricktreat/llama-2-7b-chat-merged-with-llama-2-7b-chat-peft-lora-orpo • Text Generation • 6
datasets
None public yet