Publications
2025 ICML
Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time
Proceedings of the 42nd International Conference on Machine Learning (ICML), 2025
An inference-time alignment method that treats alignment as satisficing, accepting responses that are good enough, rather than as maximization of a single objective.
Contribution
What this paper adds
- Frames LLM alignment through bounded rationality and satisficing.
- Targets inference-time control without retraining the full model.
- Studies trade-offs among helpfulness, harmlessness, and other response objectives.
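To make the contrast between satisficing and single-objective maximization concrete, here is a minimal sketch. It is not the paper's algorithm: the `Candidate` type, the two score fields, the threshold value, and both selection functions are hypothetical names introduced purely for illustration, and the scores are made-up numbers standing in for whatever reward models a real system would use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """A sampled response with illustrative (hypothetical) objective scores."""
    text: str
    helpfulness: float
    harmlessness: float


def satisficing_select(candidates, harmlessness_threshold):
    """Pick the most helpful response among those that are 'harmless enough'.

    Assumption for this sketch: harmlessness is a constraint to satisfy,
    helpfulness is the objective to maximize once the constraint holds.
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    acceptable = [c for c in candidates if c.harmlessness >= harmlessness_threshold]
    if acceptable:
        return max(acceptable, key=lambda c: c.helpfulness)
    # No candidate meets the threshold: fall back to the least harmful one.
    return max(candidates, key=lambda c: c.harmlessness)


def weighted_sum_select(candidates, weight):
    """Baseline: maximize a single scalar that mixes both objectives."""
    return max(
        candidates,
        key=lambda c: weight * c.helpfulness + (1 - weight) * c.harmlessness,
    )


# Made-up scores, for illustration only.
candidates = [
    Candidate("A", helpfulness=0.9, harmlessness=0.3),
    Candidate("B", helpfulness=0.7, harmlessness=0.8),
    Candidate("C", helpfulness=0.4, harmlessness=0.95),
]

print(satisficing_select(candidates, harmlessness_threshold=0.75).text)
print(weighted_sum_select(candidates, weight=0.8).text)
```

In this toy setup the threshold rule rejects the most helpful candidate because it falls below the harmlessness bar, while a helpfulness-heavy weighted sum selects it anyway. Both functions operate only on sampled outputs, so no model weights are touched.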