arXiv:2604.24428v1 Announce Type: cross Abstract: Electroencephalography (EEG) is highly susceptible to artifact contamination, such as electrooculographic (EOG) and electromyographic (EMG) interference, which severely degrades signal quality and hinders reliable interpretation in applications such as neurological diagnosis and brain-computer interfaces (BCIs). Effective EEG denoising remains challenging because different artifact sources exhibit diverse and temporally varying distributions, together […]
Task-guided Spatiotemporal Network with Diffusion Augmentation for EEG-based Dementia Diagnosis and MMSE Prediction
arXiv:2604.23964v1 Announce Type: cross Abstract: Patients with dementia typically exhibit cognitive impairment, which is routinely assessed using the Mini-Mental State Examination (MMSE). Concurrently, their underlying neurophysiological abnormalities are reflected in electroencephalography (EEG), providing a basis for joint modeling. However, traditional multi-task approaches suffer from feature entanglement, which leads to inter-task interference when handling heterogeneous objectives. To […]
Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models
arXiv:2604.24608v1 Announce Type: cross Abstract: Large Language Models (LLMs) have recently been explored as fine-grained zero-shot re-rankers by leveraging attention signals to estimate document relevance. However, existing methods either aggregate attention signals across all heads or rely on a statically selected subset identified by heuristic rules. This solution can be suboptimal because the informative heads […]
EVE: A Domain-Specific LLM Framework for Earth Intelligence
arXiv:2604.13071v2 Announce Type: replace-cross Abstract: We introduce Earth Virtual Expert (EVE), the first open-source, end-to-end initiative for developing and deploying domain-specialized LLMs for Earth Intelligence. At its core is EVE-Instruct, a domain-adapted 24B model built on Mistral Small 3.2 and optimized for reasoning and question answering. On newly constructed Earth Observation and Earth Sciences benchmarks, […]
Personalized Worked Example Generation from Student Code Submissions using Pattern-based Knowledge Components
arXiv:2604.24758v1 Announce Type: cross Abstract: Adaptive programming practice often relies on fixed libraries of worked examples and practice problems, which require substantial authoring effort and may not correspond well to the logical errors and partial solutions students produce while writing code. As a result, students may receive learning content that does not directly address the […]
Viewport-Unaware Blind Omnidirectional Image Quality Assessment: A Unified and Generalized Approach
arXiv:2604.23953v1 Announce Type: cross Abstract: Blind omnidirectional image quality assessment (BOIQA) presents a great challenge to the visual quality assessment community, due to different storage formats and diverse user viewing behaviors. The main paradigm of BOIQA models includes two steps, i.e., viewport generation and quality prediction, which brings an extra computational burden and is hard […]
MERIT: Modular Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning
arXiv:2510.17590v2 Announce Type: replace Abstract: We present MERIT, an inference-time modular framework for multimodal misinformation detection that decomposes verification into four specialized modules: visual forensics, cross-modal alignment, retrieval-augmented claim verification, and calibrated judgment. On MMFakeBench, MERIT with GPT-4o-mini achieves 81.65% F1, outperforming all reported zero-shot baselines including GPT-4V with MMD-Agent (74.0% F1). A controlled same-model […]
Towards Long-horizon Agentic Multimodal Search
arXiv:2604.12890v2 Announce Type: replace-cross Abstract: Multimodal deep search agents have shown great potential in solving complex tasks by iteratively collecting textual and visual evidence. However, managing the heterogeneous information and high token costs associated with multimodal inputs over long horizons remains a critical challenge, as existing methods often suffer from context explosion or the loss […]
A dialog between cell adhesion and topology at the core of morphogenesis
arXiv:2602.09867v3 Announce Type: replace Abstract: During the development of an organism, cells must coordinate and organize to generate the correct shape, structure, and spatial patterns of tissues and organs, a process known as morphogenesis. The morphogenesis of embryonic tissues is supported by multiple processes that induce the precise physical deformations required for tissues to ultimately […]
MegaScale-Data: Scaling Dataloader for Multisource Large Foundation Model Training
arXiv:2504.09844v4 Announce Type: replace-cross Abstract: Modern frameworks for training large foundation models (LFMs) employ dataloaders in a data-parallel manner, with each loader processing a disjoint subset of training data. When preparing data for LFM training that originates from multiple, distinct sources, two fundamental challenges arise. First, due to the quadratic computational complexity of the attention […]
How Much Heavy Lifting Can an Agent Harness Do?: Measuring the LLM’s Residual Role in a Planning Agent
arXiv:2604.07236v3 Announce Type: replace Abstract: Agent harnesses — the stateful programs that wrap a language model and decide what it sees at each step — are now known to change end-to-end performance on a fixed model by as much as six times. That observation raises a question asked less often than it should be: once […]
Scalable Agentic Reasoning for Designing Biologics Targeting Intrinsically Disordered Proteins
arXiv:2512.15930v2 Announce Type: replace Abstract: Intrinsically disordered proteins (IDPs) represent crucial therapeutic targets due to their significant role in disease — approximately 80% of cancer-related proteins contain long disordered regions — but their lack of stable secondary/tertiary structures makes them “undruggable”. While recent computational advances, such as diffusion models, can design high-affinity IDP binders, translating […]