Towards Batch-to-Streaming Deep Reinforcement Learning for Continuous Control

From Causal Discovery to Dynamic Causal Inference in Neural Time Series

arXiv:2603.20980v1 Announce Type: cross Abstract: Time-varying causal models provide a powerful framework for studying dynamic scientific systems, yet most existing approaches assume that the underlying

Decoupling Numerical and Structural Parameters: An Empirical Study on Adaptive Genetic Algorithms via Deep Reinforcement Learning for the Large-Scale TSP

arXiv:2603.20702v1 Announce Type: cross Abstract: Proper parameter configuration is a prerequisite for the success of Evolutionary Algorithms (EAs). While various adaptive strategies have been proposed,

RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation

arXiv:2603.20882v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly evaluated and sometimes trained using automated graders such as LLM-as-judges that output scalar scores

Detecting Neurovascular Instability from Multimodal Physiological Signals Using Wearable-Compatible Edge AI: A Responsible Computational Framework

arXiv:2603.20442v1 Announce Type: cross Abstract: We propose Melaguard, a multimodal ML framework (Transformer-lite, 1.2M parameters, 4-head self-attention) for detecting neurovascular instability (NVI) from wearable-compatible physiological

MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

arXiv:2603.20586v1 Announce Type: cross Abstract: As long-context language modeling becomes increasingly important, the cost of maintaining and attending to large Key/Value (KV) caches grows rapidly,