CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling

Depression subtype classification from social media posts: few-shot prompting vs. fine-tuning of large language models

Background: Social media provides timely proxy signals of mental health, but reliable tweet-level classification of depression subtypes remains challenging due to short, noisy text, overlapping symptomatology,

Efficient Failure Management for Multi-Agent Systems with Reasoning Trace Representation

arXiv:2603.21522v1 Announce Type: cross Abstract: Large Language Model (LLM)-based Multi-Agent Systems (MASs) have emerged as a new paradigm in software system design, increasingly demonstrating strong

Efficient Zero-Shot AI-Generated Image Detection

arXiv:2603.21619v1 Announce Type: cross Abstract: The rapid progress of text-to-image models has made AI-generated images increasingly realistic, posing significant challenges for accurate detection of generated

ProMAS: Proactive Error Forecasting for Multi-Agent Systems Using Markov Transition Dynamics

arXiv:2603.20260v1 Announce Type: new Abstract: The integration of Large Language Models into Multi-Agent Systems (MAS) has enabled the solution of complex, long-horizon tasks through collaborative

Bounded Coupled AI Learning Dynamics in Tri-Hierarchical Drone Swarms

arXiv:2603.20333v1 Announce Type: cross Abstract: Modern autonomous multi-agent systems combine heterogeneous learning mechanisms operating at different timescales. An open question remains: can one formally guarantee