Hugging Face
Akbir Khan
akbir
https://akbir.dev
akbirkhan
akbir
AI & ML interests
None yet
Recent Activity
Authored a paper about 2 months ago: "Alignment faking in large language models"
Authored a paper 2 months ago: "BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games"
Authored a paper 5 months ago: "Language Models Learn to Mislead Humans via RLHF"
Organizations
None yet
Papers (4)
arXiv:2412.14093
arXiv:2411.13543
arXiv:2409.12822
arXiv:2311.10090
models
None public yet
datasets
None public yet