arXiv:2604.26302v1 Announce Type: cross Abstract: Experiments show that isochoric (constant-volume) conditions enhance supercooling stability relative to isobaric (constant-pressure) conditions. Here, combining Helmholtz equilibrium thermodynamics with a first-order perturbation methodology, we derive an inequality governing nucleation stability under volumetric constraint. The derivation provides a general thermodynamic proof that for any substance undergoing phase transformation in which […]
A modelling perspective on mosquito infectiousness: time-varying transmission competence in arbovirus vector
arXiv:2604.25714v2 Announce Type: replace Abstract: Mosquito vector competence is usually represented as a process in which once virus is detected in saliva, mosquitoes are assumed to remain infectious for life, implying an irreversible transition to the transmitting state. However, some experiments report declines in the proportion of transmitting mosquitoes at late times post-exposure, suggesting transmission […]
ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance
arXiv:2604.26348v1 Announce Type: cross Abstract: Diffusion models have achieved remarkable success in image generation, yet their training is predominantly driven by full-reference objectives that enforce pixel-wise similarity to ground-truth images.Such supervision, while effective for fidelity, may insufficient in terms of subjective visual perception quality and text-image semantic consistency. In this work, we investigate the problem […]
DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent
arXiv:2604.26311v1 Announce Type: new Abstract: We introduce DreamProver, an agentic framework that leverages a “wake-sleep” program induction paradigm to discover reusable lemmas for formal theorem proving. Existing approaches either rely on fixed lemma libraries, which limit adaptability, or synthesize highly specific intermediate lemmas tailored to individual theorems, thereby lacking generality. DreamProver addresses this gap through […]
Hybrid Diffusion for Simultaneous Symbolic and Continuous Planning
arXiv:2509.21983v2 Announce Type: replace-cross Abstract: Constructing robots to accomplish long-horizon tasks is a long-standing challenge within artificial intelligence. Approaches using generative methods, particularly Diffusion Models, have gained attention due to their ability to model continuous robotic trajectories for planning and control. However, we show that these models struggle with long-horizon tasks that involve complex decision-making […]
The Fools are Certain; the Wise are Doubtful: Exploring LLM Confidence in Code Completion
arXiv:2508.16131v2 Announce Type: replace-cross Abstract: Code completion entails the task of providing missing tokens given a surrounding context. It can boost developer productivity while providing a powerful code discovery tool. Following the Large Language Model (LLM) wave, code completion has been approached with diverse LLMs fine-tuned on code (code LLMs). The performance of code LLMs […]
Benchmarks for Trajectory Safety Evaluation and Diagnosis in OpenClaw and Codex: ATBench-Claw and ATBench-Codex
arXiv:2604.14858v2 Announce Type: replace Abstract: As agent systems move into increasingly diverse execution settings, trajectory-level safety evaluation and diagnosis require benchmarks that evolve with them. ATBench is a diverse and realistic agent trajectory benchmark for safety evaluation and diagnosis. This report presents ATBench-Claw and ATBench-Codex, two domain-customized extensions that carry ATBench into the OpenClaw and […]
Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents
arXiv:2505.02077v2 Announce Type: replace-cross Abstract: AI agents are beginning to interact with each other directly and across internet platforms and physical environments, creating security challenges beyond traditional cybersecurity and AI safety frameworks. Free-form protocols are essential for AI’s task generalization but enable new threats like secret collusion and coordinated swarm attacks. Network effects can rapidly […]
IDOBE: Infectious Disease Outbreak forecasting Benchmark Ecosystem
arXiv:2604.18521v2 Announce Type: replace-cross Abstract: Epidemic forecasting has become an integral part of real-time infectious disease outbreak response. While collaborative ensembles composed of statistical and machine learning models have become the norm for real-time forecasting, standardized benchmark datasets for evaluating such methods are lacking. Further, there is limited understanding on performance of these methods for […]
Stochastic dynamics at the back of a gene drive propagation wave
arXiv:2502.21268v2 Announce Type: replace Abstract: Gene drive alleles bias their own inheritance to offspring. They can fix in a wild-type population in spite of a fitness cost, and even lead to the eradication of the target population if the fitness cost is high. However, this outcome may be prevented or delayed if areas previously cleared […]
AdaFRUGAL: Adaptive Memory-Efficient Training with Dynamic Control
arXiv:2601.11568v2 Announce Type: replace-cross Abstract: Training Large Language Models (LLMs) is highly memory-intensive due to optimizer state overhead. The FRUGAL framework mitigates this with gradient splitting, but its static hyperparameters — the subspace ratio ($rho$) and update frequency ($T$) — require costly manual tuning, limiting adaptability. We present AdaFRUGAL, which automates this process by introducing […]
Unilateral Relationship Revision Power in Human-AI Companion Interaction
arXiv:2603.23315v5 Announce Type: replace-cross Abstract: When providers update AI companions, users report grief, betrayal, and loss. A growing literature asks whether the norms governing personal relationships extend to these interactions. So what, if anything, is morally significant about them? I argue that this debate has missed a prior structural question: who controls the relationship, and […]