CROP: Conservative Reward for Model-based Offline Policy Optimization

Measuring and reducing surgical staff stress in a realistic operating room setting using EDA monitoring and smart hearing protection

BackgroundStress is a critical factor in the operating room (OR) and affects both the performance and well-being of surgical staff. Measuring and mitigating this stress

Bioethical considerations in deploying mobile mental health apps in LMIC settings: insights from the MITHRA pilot study in rural India

IntroductionIn India, untreated depression among women contributes significantly to morbidity and mortality, underscoring an urgent need for accessible and ethically grounded mental health interventions. Mobile

ARGOS: Who, Where, and When in Agentic Multi-Camera Person Search

arXiv:2604.12762v1 Announce Type: cross Abstract: We introduce ARGOS, the first benchmark and framework that reformulates multi-camera person search as an interactive reasoning problem requiring an

BayMOTH: Bayesian optiMizatiOn with meTa-lookahead — a simple approacH

arXiv:2604.12005v1 Announce Type: cross Abstract: Bayesian optimization (BO) has for sequential optimization of expensive black-box functions demonstrated practicality and effectiveness in many real-world settings. Meta-Bayesian

Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following

arXiv:2510.14420v4 Announce Type: replace-cross Abstract: Language models often struggle to follow multi-constraint instructions that are crucial for real-world applications. Existing reinforcement learning (RL) approaches suffer