Kalle Hilsenbek

Bachstelze

AI & ML interests

Combining BERT with instructions for explainable AI: gitlab.com/Bachstelze/instructionbert

Recent Activity

Organizations

None yet

Bachstelze's activity

commented on Announcing AI Energy Score Ratings 3 days ago

Thanks for your work on energy efficiency. You've piqued my curiosity!
Why do SmolLM-135M and SmolLM-1.7B have nearly the same score despite a roughly tenfold difference in model size? Is the identical context size the main cause?
Could you please enable encoder-decoder models? In theory they should be more efficient, because the input has to be encoded only once and the encoder output can be reused in every decoding step.
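
To illustrate the "encode once, reuse in every step" pattern mentioned above, here is a minimal sketch in plain Python. `ToyEncoderDecoder`, `encode`, `decode_step`, and `generate` are hypothetical names made up for this example, not the API of any real library, and the arithmetic inside them is a placeholder for real model computation. The sketch only counts calls; it says nothing about measured energy use, and decoder-only models with a key-value cache also avoid recomputing the prompt.

```python
class ToyEncoderDecoder:
    """Hypothetical stand-in for a seq2seq model that counts how often each part runs."""

    def __init__(self):
        self.encoder_calls = 0
        self.decoder_calls = 0

    def encode(self, input_ids):
        # Processes the whole input sequence in a single pass.
        self.encoder_calls += 1
        return [2 * token_id for token_id in input_ids]  # placeholder "hidden states"

    def decode_step(self, encoder_states, generated_ids):
        # Produces one new token from the cached encoder states and the output so far.
        self.decoder_calls += 1
        return (sum(encoder_states) + len(generated_ids)) % 100  # placeholder "next token"


def generate(model, input_ids, max_new_tokens):
    # The encoder output is computed once, before the decoding loop starts.
    encoder_states = model.encode(input_ids)
    generated_ids = []
    for _ in range(max_new_tokens):
        # Every step reuses the same encoder_states instead of re-encoding the input.
        generated_ids.append(model.decode_step(encoder_states, generated_ids))
    return generated_ids


model = ToyEncoderDecoder()
output_ids = generate(model, input_ids=[1, 2, 3], max_new_tokens=5)
print(model.encoder_calls, model.decoder_calls, len(output_ids))
```

In this toy setup the encoder runs once no matter how many tokens are generated, while the decoder runs once per generated token.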

upvoted an article 16 days ago

Is Attention Interpretable in Transformer-Based Large Language Models? Let’s Unpack the Hype

4
New activity in answerdotai/ModernBERT-base 27 days ago

ModernBART wen?

6
#38 opened about 1 month ago by Fizzarolli
New activity in Nart/monolingual_ab about 2 months ago

Goldfish model

#5 opened about 2 months ago by Bachstelze
New activity in HuggingFaceTB/SmolLM2-360M-Instruct 3 months ago
New activity in HuggingFaceTB/SmolLM-135M 4 months ago

Benchmark results

#17 opened 4 months ago by Bachstelze
New activity in Slim205/mmlu_ift 5 months ago

Readme

#1 opened 5 months ago by Bachstelze