Phil

phil111

AI & ML interests

None yet

Recent Activity

new activity 12 days ago

mistralai/Mistral-Small-24B-Instruct-2501:This Mistral Small has FAR less knowledge than the last.

liked a model 25 days ago

deepseek-ai/DeepSeek-R1

new activity 28 days ago

internlm/internlm3-8b-instruct:English tests and tasks are absurdly overfit.

Organizations

None yet

phil111's activity

New activity in mistralai/Mistral-Small-24B-Instruct-2501 12 days ago

This Mistral Small has FAR less knowledge than the last.

#5 opened 15 days ago by

New activity in internlm/internlm3-8b-instruct 28 days ago

English tests and tasks are absurdly overfit.

#8 opened 30 days ago by

New activity in microsoft/phi-4 about 1 month ago

A heavily filtered corpus simply doesn't work.

#19 opened about 1 month ago by

I Don't Understand This Model

#9 opened about 1 month ago by

New activity in matteogeniaccio/phi-4 about 2 months ago

Notably better than Phi3.5 in many ways, but something is wrong.

#5 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-V3-Base about 2 months ago

Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.

#27 opened about 2 months ago by

New activity in NyxKrage/Microsoft_Phi-4 about 2 months ago

SimpleQA score

#1 opened 2 months ago by

New activity in ibm-granite/granite-3.1-8b-instruct about 2 months ago

Exceptional creative writer

#1 opened about 2 months ago by

New activity in tiiuae/Falcon3-7B-Instruct about 2 months ago

Very High English MMLU scores, Yet Extremely Low Broad English Knowledge

#8 opened about 2 months ago by

New activity in CohereForAI/c4ai-command-r7b-12-2024 about 2 months ago

How was r7b?

#3 opened 2 months ago by

Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks

#1 opened 2 months ago by

New activity in meta-llama/Llama-3.3-70B-Instruct 2 months ago

local Llama + GPU(cuda)

#34 opened 2 months ago by

Base Model?

#32 opened 2 months ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 2 months ago

Add Hymba-1.5B to the leaderboard

#1030 opened 2 months ago by

New activity in mistralai/Ministral-8B-Instruct-2410 3 months ago

Hallucinates more than Mistral 7b

#13 opened 3 months ago by

New activity in mistralai/Ministral-8B-Instruct-2410 4 months ago

Looks like not as good as Qwen2.5 7B

#5 opened 4 months ago by MonolithFoundation

This LLM is hallucinating like crazy. Can someone verify these prompts?

#3 opened 4 months ago by
