Olivia S

taygetea

AI & ML interests

None yet

Organizations

None yet

taygetea's activity

upvoted 20 papers 3 days ago

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training

Paper • 2501.18965 • Published 15 days ago • 6

Almost Surely Safe Alignment of Large Language Models at Inference-Time

Paper • 2502.01208 • Published 12 days ago • 11

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

Paper • 2502.01839 • Published 11 days ago • 4

Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations

Paper • 2501.19066 • Published 15 days ago • 10

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published 11 days ago • 11

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Paper • 2502.02339 • Published 10 days ago • 19

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

Paper • 2502.01506 • Published 11 days ago • 31

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization

Paper • 2502.04295 • Published 8 days ago • 10

PILAF: Optimal Human Preference Sampling for Reward Modeling

Paper • 2502.04270 • Published 8 days ago • 10

ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

Paper • 2502.04306 • Published 8 days ago • 17

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published 9 days ago • 21

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published 8 days ago • 25

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published 8 days ago • 31

SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs

Paper • 2502.02909 • Published 10 days ago • 2

Value-Based Deep RL Scales Predictably

Paper • 2502.04327 • Published 8 days ago • 5

CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance

Paper • 2502.04350 • Published 10 days ago • 10

QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Paper • 2502.05178 • Published 7 days ago • 10

Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More

Paper • 2502.03738 • Published 9 days ago • 9

CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference

Paper • 2502.04416 • Published 8 days ago • 10

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published Dec 20, 2024 • 18