ronnie robinson
atorsvn
1 follower · 1 following
AI & ML interests
AI for education
Recent Activity
replied to hexgrad's post 4 days ago
Technical question: Is Abliteration still an effective method for uncensoring LLMs? More generally, what are the most effective methods to uncensor LLMs? An effective uncensoring method would ideally be low-cost and data-efficient and, above all, would uncensor an LLM with minimal benchmark regressions.
"Tiananmen Square", "Winnie-the-Pooh", and, more broadly, "China influence/censorship" are some common criticisms leveled at DeepSeek.
I am vaguely aware of "Abliteration", a technique coined by @failspy (apologies if that attribution is incorrect) and originally described in a mid-2024 paper titled "Refusal in Language Models Is Mediated by a Single Direction": https://arxiv.org/abs/2406.11717
Abliteration is proposed as a relatively cheap and effective way to bypass censorship in models. However, it is not without criticism: https://www.reddit.com/r/LocalLLaMA/comments/1f07b4b/abliteration_fails_to_uncensor_models_while_it/
Curious to hear people's takes on Abliteration or other uncensoring methods, especially as they relate to DeepSeek.
new activity 9 months ago
Crataco/distilgpt2-82M-GGUF: How did you manage this conversion?
updated a model about 1 year ago
atorsvn/TinyLlama-1.1B-Chat-v0.6-gptq-4bit
Organizations
None yet
models
9
Sort: Recently updated
atorsvn/TinyLlama-1.1B-Chat-v0.6-gptq-4bit • Text Generation • Updated Nov 24, 2023 • 6
atorsvn/TinyLlama-1.1B-Chat-v0.4-gptq-4bit • Text Generation • Updated Nov 18, 2023 • 8
atorsvn/TinyLlama-1.1B-Chat-v0.3-gptq-4bit • Text Generation • Updated Oct 3, 2023 • 3.6k • 1
atorsvn/TinyLlama-1.1B-Chat-v0.1-gptq-4bit • Text Generation • Updated Sep 27, 2023 • 75
atorsvn/TinyLlama-1.1B-step-50K-105b-gptq-4bit • Text Generation • Updated Sep 6, 2023 • 77 • 1
atorsvn/LaMini-GPT-124M-gptq-4bit • Text Generation • Updated Sep 5, 2023 • 75
atorsvn/RedPajama-INCITE-Chat-3B-v1-gptq-4bit • Text Generation • Updated Sep 4, 2023 • 75
atorsvn/LaMini-GPT-774M-gptq-4bit • Text Generation • Updated Sep 4, 2023 • 76
atorsvn/distilgpt2-gptq-4bit • Text Generation • Updated Sep 4, 2023 • 74
datasets
None public yet