Onnx format?
#49 opened 9 days ago
by
pylotlight
请问task的编写有什么原则吗
1
#45 opened 29 days ago
by
zhusl-cpu
Inquiry about Future Plans for GTE-Qwen Models Based on Qwen2.5
#43 opened about 1 month ago
by
goldhorn1975
欢迎大家使用我们的开源代码来进一步微调gte模型
#42 opened about 1 month ago
by
jcli0606
![](https://cdn-avatars.huggingface.co/v1/production/uploads/66928a2ce32997bdf7113e43/QyntftxFO2Z-9zh5nzwEy.jpeg)
Fine-tuning Alibaba-NLP/gte-Qwen2-7B-instruct for Domain-Specific Retrieval with Query, Positive, and Hard Negatives
3
#41 opened 2 months ago
by
wilfoderek
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bfa4237f75bb4d92557db9/8Vu3xJkqI59GrtoFrZbwj.jpeg)
测试效果bad case
1
#40 opened 2 months ago
by
jwww123
微调后验证和训练阶段验证不一致
#37 opened 3 months ago
by
penghui1
使用scripts/eval_mteb.py 无法复现效果
2
#35 opened 4 months ago
by
whucai
bfloat16 vs. float32
#34 opened 5 months ago
by
tanliboy
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6448b3266ffed6ece10335ba/HLC0SfOHjssWXB99eyxt8.png)
Fix eval_mteb.py of undefined variables
#33 opened 5 months ago
by
tongyx361
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/eG4R9-3hgrimttP7ep3dN.jpeg)
Add base_model metadata
#32 opened 5 months ago
by
davanstrien
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1627505688463-60107b385ac3e86b3ea4fc34.jpeg)
请问如何对这个模型进行微调?
1
#31 opened 6 months ago
by
Doublebear
关于bidirectional attention的问题
1
#30 opened 6 months ago
by
JoeBlack18
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/RKU7UnWg4narRtljEzGb5.png)
会考虑发布到ollama上吗?
1
#29 opened 6 months ago
by
huyueeer
请问flash-attn可以关闭吗?是否可以直接使用transformers库里提供的qwen2模型加载?
1
#28 opened 6 months ago
by
shizue
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1637396421928-6198af3a04b5da0c05211dd8.jpeg)
Recommanded hyperparameters?
1
#27 opened 6 months ago
by
zhilinw6
Retrieval 效果一般,仅和bm25持平
8
#26 opened 6 months ago
by
Stefan8
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/X0jRpQ3wEhLD6pfKlXIQe.jpeg)
Pooling method: mean vs last?
2
#25 opened 6 months ago
by
alexzhou689
Padding token for batched embedding in Transformers?
1
#24 opened 6 months ago
by
ChrisCrass
请问支持fp16推理吗?
2
#23 opened 6 months ago
by
DJCan
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62f0babaef9cc6810cec02ff/WnLWjyIchnfNm6ytqw-bT.jpeg)
测试效果一般
3
#22 opened 6 months ago
by
dhajshdkajs
Parameters for peak performance
1
#21 opened 7 months ago
by
cvdbdo
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64634fbeefb4e8550484ca67/gOnIj1A3aqmCQfLuQoFuX.png)
What languages are supported?
1
#20 opened 7 months ago
by
jasonrayles
config_sentence_transformers.json does not include the prompt to embeddings mappings
1
#19 opened 7 months ago
by
Tonic
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a3bb1cd0d8c2c2169f0b88/eT2TS0IlQbZtz-F_zHLz9.jpeg)
[AUTOMATED] Model Memory Requirements
#18 opened 7 months ago
by
model-sizer-bot
Running this model without flash-attn
2
#17 opened 7 months ago
by
lisa-tse
输出的向量维度可以压缩吗?
1
#16 opened 7 months ago
by
sen63
AWS Sagemaker Deployment error
2
#14 opened 7 months ago
by
gauravsirola
The performance of full-parameter finetuning
1
#13 opened 7 months ago
by
stephenshuang
question about quants
3
#12 opened 7 months ago
by
prudant
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
Model architecture should be Qwen2Model instead of Qwen2ForCausalLM?
#11 opened 8 months ago
by
kavin1337
need gguf q4km
#10 opened 8 months ago
by
windkkk
Training embadding Issues.
2
#8 opened 8 months ago
by
Imran1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62846faa99bff5076f0a93b4/QO7sgRWOXS6nlQ-GcEg94.jpeg)
Hi, can you tell me how to train?
3
#7 opened 8 months ago
by
EEEmpty
输出的embedding size是多少
3
#6 opened 8 months ago
by
seleven11
模型太耗内存了,有量化版本吗?flashatt是不是可以关闭,对显卡限制太多
8
#3 opened 8 months ago
by
fukai
Is it consistent with the multi-language support of qwen2, or only Chinese and English?
1
#2 opened 8 months ago
by
fukai