Post Content
Performance of federated versus centralized learning for mammography classification across film–digital domain shift
IntroductionLarge, diverse datasets are essential for reliable deep learning in mammography, yet clinical data remain siloed due to privacy and governance constraints. Federated learning enables collaborative training without sharing raw data, but its robustness under strong imaging-domain heterogeneity, such as film–digital shifts, remains uncertain.MethodsWe conducted a comparative evaluation of centralized learning and cross-silo federated learning […]
Explainable and reproducible AI: culturally responsive AI for health equity in minoritized groups
Artificial intelligence (AI) is transforming healthcare by enabling advanced diagnostics, personalized treatments, and improved operational efficiencies. By identifying complex data patterns and correlations, AI could supplement clinical decision-making, enabling more rapid diagnoses and treatment decisions tailored to meet the unique needs of diverse communities. However, realizing these benefits requires that clinical AI models be consistent, […]
Robust Multi-Agent Reinforcement Learning for Small UAS Separation Assurance under GPS Degradation and Spoofing
arXiv:2603.28900v1 Announce Type: cross Abstract: We address robust separation assurance for small Unmanned Aircraft Systems (sUAS) under GPS degradation and spoofing via Multi-Agent Reinforcement Learning (MARL). In cooperative surveillance, each aircraft (or agent) broadcasts its GPS-derived position; when such position broadcasts are corrupted, the entire observed air traffic state becomes unreliable. We cast this state […]
UniRank: End-to-End Domain-Specific Reranking of Hybrid Text-Image Candidates
arXiv:2603.29897v1 Announce Type: cross Abstract: Reranking is a critical component in many information retrieval pipelines. Despite remarkable progress in text-only settings, multimodal reranking remains challenging, particularly when the candidate set contains hybrid text and image items. A key difficulty is the modality gap: a text reranker is intrinsically closer to text candidates than to image […]
Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards
arXiv:2510.16187v2 Announce Type: replace-cross Abstract: Real-world multi-agent systems may require ad hoc teaming, where an agent must coordinate with other previously unseen teammates to solve a task in a zero-shot manner. Prior work often either selects a pretrained policy based on an inferred model of the new teammates or pretrains a single policy that is […]
When Rubrics Fail: Error Enumeration as Reward in Reference-Free RL Post-Training for Virtual Try-On
arXiv:2603.05659v2 Announce Type: replace-cross Abstract: Reinforcement learning with verifiable rewards (RLVR) and Rubrics as Rewards (RaR) have driven strong gains in domains with clear correctness signals and even in subjective domains by synthesizing evaluation criteria from ideal reference answers. But many real-world tasks admit multiple valid outputs and lack the single ideal answer that rubric […]
A Multi-Modal Dataset for Ground Reaction Force Estimation Using Consumer Wearable Sensors
arXiv:2603.28784v1 Announce Type: cross Abstract: This Data Descriptor presents a fully open, multi-modal dataset for estimating vertical ground reaction force (vGRF) from consumer-grade Apple Watch sensors with laboratory force plate ground truth. Ten healthy adults aged 26–41 years performed five activities: walking, jogging, running, heel drops, and step drops, while wearing two Apple Watches positioned […]
Wildfire Suppression: Complexity, Models, and Instances
arXiv:2603.29865v1 Announce Type: cross Abstract: Wildfires cause major losses worldwide, and the frequency of fire-weather conditions is likely to increase in many regions. We study the allocation of suppression resources over time on a graph-based representation of a landscape to slow down fire propagation. Our contributions are theoretical and methodological. First, we prove that this […]
Impact of enriched meaning representations for language generation in dialogue tasks: A comprehensive exploration of the relevance of tasks, corpora and metrics
arXiv:2603.29518v1 Announce Type: cross Abstract: Conversational systems should generate diverse language forms to interact fluently and accurately with users. In this context, Natural Language Generation (NLG) engines convert Meaning Representations (MRs) into sentences, directly influencing user perception. These MRs usually encode the communicative function (e.g., inform, request, confirm) via DAs and enumerate the semantic content […]
Enhancing Structural Mapping with LLM-derived Abstractions for Analogical Reasoning in Narratives
arXiv:2603.29997v1 Announce Type: cross Abstract: Analogical reasoning is a key driver of human generalization in problem-solving and argumentation. Yet, analogies between narrative structures remain challenging for machines. Cognitive engines for structural mapping are not directly applicable, as they assume pre-extracted entities, whereas LLMs’ performance is sensitive to prompt format and the degree of surface similarity […]
AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents
arXiv:2603.12564v5 Announce Type: replace-cross Abstract: Tool-augmented LLM agents increasingly operate as multi-turn advisors in high-stakes domains, yet their evaluation relies on ranking metrics that measure what is recommended but not whether it is safe for the user. We present a paired-trajectory protocol that replays real financial dialogues under clean and contaminated tool-output conditions across eight […]