PenutChen

penut85420

AI & ML interests

LLM, Quantization

Recent Activity

updated a Space about 12 hours ago

DaOppaiLoli/JpVocab

liked a Space 1 day ago

DaOppaiLoli/JpVocab

published a Space 2 days ago

DaOppaiLoli/JpVocab

penut85420's activity

updated a Space about 12 hours ago

JpVocab

✏

Take a Japanese vocabulary quiz

liked a Space 1 day ago

JpVocab

✏

Take a Japanese vocabulary quiz

published a Space 2 days ago

JpVocab

✏

Take a Japanese vocabulary quiz

liked 2 models 28 days ago

jinaai/ReaderLM-v2

Text Generation • Updated 8 days ago • 32.7k • 500

sentence-transformers/static-similarity-mrl-multilingual-v1

commented a paper about 2 months ago

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published Nov 26, 2024 • 12

upvoted a paper about 2 months ago

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published Nov 26, 2024 • 12

liked a model about 2 months ago

IamCreateAI/Ruyi-Mini-7B

Image-to-Video • Updated Dec 25, 2024 • 3.47k • 589

upvoted a collection about 2 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 134

updated a Space 2 months ago

KanaQuiz

📝

Take a kana quiz and learn romaji

upvoted a paper 3 months ago

Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs

Paper • 2410.10739 • Published Oct 14, 2024 • 2

commented a paper 3 months ago

Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs

Paper • 2410.10739 • Published Oct 14, 2024 • 2

New activity in yentinglin/Llama-3-Taiwan-8B-Instruct 4 months ago

Was the tokenizer retrained?

#9 opened 7 months ago by tedslin

commented a paper 5 months ago

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 29

upvoted a paper 5 months ago

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 29

liked a model 5 months ago

MediaTek-Research/Breeze-7B-FC-v1_0

Updated about 1 month ago • 324 • 20

commented a paper 5 months ago

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Paper • 2409.11055 • Published Sep 17, 2024 • 17

liked 2 models 5 months ago

jinaai/reader-lm-1.5b

Text Generation • Updated 28 days ago • 1.77k • 586

jinaai/jina-embeddings-v3

Feature Extraction • Updated Jan 6 • 1.86M • 737

commented a paper 6 months ago

Transforming LLMs into Cross-modal and Cross-lingual Retrieval Systems

Paper • 2404.01616 • Published Apr 2, 2024