Cannot load model post agreement to new terms and using access token
8
#104 opened 10 months ago
by
CTJP
not working
11
#103 opened 10 months ago
by
snieunny
mistrall down
3
#102 opened 10 months ago
by
giodeleo
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65d60d9f43700aceb25d2115/B2bQsRsQQNpESpyrFjJ8R.png)
Service unavailable
#101 opened 10 months ago
by
fyp-llm
Is it down?
6
#99 opened 10 months ago
by
hprakashproj
there is an error!!
35
#98 opened 10 months ago
by
Issafre
Update README.md
1
#96 opened 10 months ago
by
XIX181
Is the model down?
2
#95 opened 10 months ago
by
hvkkvh
How do I successfully merge adater weights to this base model correctly? And then siccessfulyl convert to GGUF
#94 opened 10 months ago
by
uyiosa
Cannot access gated repo You must be authenticated to access it.
44
#93 opened 10 months ago
by
liketheflower
deepspeed inference tensor parallelism memory footprint doesn't decrease with deepspeed tp_size increase.
6
#92 opened 10 months ago
by
jiangtaozh
why put MistralRotaryEmbedding in each attention layer instead of putting only once before the first attention layer?
#91 opened 10 months ago
by
liougehooa
How to use this model in next js?
2
#90 opened 10 months ago
by
shreyassihasane
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65eac172cc962f82a4fe26fb/g-1m-MkxfIbsrGFutNaBs.png)
Model doesn't stop generation after answering the user question.
2
#88 opened 10 months ago
by
jerinjude
How does v0.2 manages to support 32k token context without Sliding Window Attention?
4
#85 opened 10 months ago
by
Andriy
will Mistral-7B-Instruct-v0.2 let me generate a response of around 8k tokens in one go?
#84 opened 10 months ago
by
akshat1311
How to prune layers in AutoModelForCausalModel
5
#83 opened 10 months ago
by
badri369
[AUTOMATED] Model Memory Requirements
#82 opened 10 months ago
by
model-sizer-bot
Update README.md
#81 opened 11 months ago
by
Austinc2003
Quantized version taking too long with CPU's
#80 opened 11 months ago
by
SukanyaM
Model inconsistency Issue
#79 opened 11 months ago
by
adityar23
LangChain Agent with Mistral-7B-Instruct-v0.2
12
#78 opened 11 months ago
by
deeplearner123
Training Data difference from v0.1
#77 opened 11 months ago
by
tsavage68
Update README.md
#76 opened 11 months ago
by
mixxz
Why was Sliding-Window Attention deprecated?
#75 opened 11 months ago
by
matrixssy
Update config.json to accurately reflect the 32k context window.
4
#73 opened 11 months ago
by
Kearm
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655dc641accde1bbc8b41aec/9sR2Mm7mMsyh_SpSH7ilq.jpeg)
Was this model based of Mistral-7B-v0.2 from the start?
4
#72 opened 11 months ago
by
stduhpf
Can someone from Mistral comment on what the knowledge cutoff is?
1
#69 opened 11 months ago
by
MarginallyEffective
Mistral-7B-Instruct-v0.2 loopy text generation with custom chat template
4
#68 opened 11 months ago
by
ercanucan
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651ab2d76a6b822b88dd5b9b/m39KXMqY4J-cdXyE00Dit.jpeg)
User input repetition after finetuning
1
#67 opened 11 months ago
by
nuratamton
What is the max context length of this model?
1
#66 opened 11 months ago
by
flexwang
![](https://cdn-avatars.huggingface.co/v1/production/uploads/646e62cd15ac6f207df96edb/LtfvCFSt9thXHdHmwXaQ2.png)
Inference API
1
#65 opened 11 months ago
by
Shivkumar27
cm_test
#64 opened 11 months ago
by
chenmin2001
FIne tuned model generating both user and assistant dialogues during inference
1
#63 opened 11 months ago
by
sabber
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653193760191-noauth.jpeg)
Has anybody gotten this example to work for converting string data into valid JSON?
2
#62 opened 11 months ago
by
capnchat
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64a8bf955fc663fee2203646/D-G6lxn1T7uIlzXAeamhH.jpeg)
Is mistral7b instruct v0.2 down for everybody?
2
#61 opened 11 months ago
by
SzymonSt2808
Friendly Reminder
#60 opened 11 months ago
by
AnzaniAI
Is it possible to see embeddinges once you have fine tuned it ??
#59 opened 12 months ago
by
RikoteMaster
ValueError: Bfloat16 is only supported on GPUs with compute capability of at least 8.0
2
#58 opened 12 months ago
by
itod
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6432f3a82bfb2b0ec75901d8/q2Sg1L_OC5KLNbquIYhw_.jpeg)
instruction fine tuning template
2
#57 opened 12 months ago
by
Iamexperimenting
sliding_window appears to be None. TypeError: bad operand type for unary -: 'NoneType'
4
#56 opened 12 months ago
by
narai
value for sliding_window in config.json updated
1
#55 opened 12 months ago
by
manaschauhan
Fix the command format of "Installing transformers from source"
#53 opened 12 months ago
by
musfiqdehan
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1652197428055-625a6e0c535747b1a15be2de.jpeg)
System prompt
4
#52 opened 12 months ago
by
VladimirNGIT
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/8zD2js-CKkxIdEkSO43fL.png)
Process finished with exit code -1073741819 (0xC0000005)
1
#51 opened 12 months ago
by
aminev
Is there any vllm support for this version?
9
#49 opened 12 months ago
by
Aloukik21
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63dd6dd65ea8577c8d5a41dc/rOKZ9XOlJVYcbHFPQG5eZ.jpeg)
Mistral does not finish the answers
9
#48 opened about 1 year ago
by
expiderman
Special token( </s>) not generating in the model.generate() method
7
#47 opened about 1 year ago
by
Pradeep1995
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1599822346546-noauth.jpeg)
Can we save the finetuned Mistral model by exporting to TorchScript
1
#46 opened about 1 year ago
by
Pradeep1995
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1599822346546-noauth.jpeg)
deploying on aws sagemaker.
3
#45 opened about 1 year ago
by
adhiltortil