arXiv:2603.23625v1 Announce Type: new Abstract: Artificial intelligence (AI) is increasingly being explored in health and social care to reduce administrative workload and allow staff to spend more time on patient care. This paper evaluates a voice-enabled Care Home Smart Speaker designed to support everyday activities in residential care homes, including spoken access to resident records, […]
BoolForge: Controlled Generation and Analysis of Boolean Functions and Networks
arXiv:2509.02496v2 Announce Type: replace Abstract: Boolean networks are a widely used modeling framework in systems biology for studying gene regulation, signal transduction, and cellular decision-making. Empirical studies indicate that biological Boolean networks exhibit a high degree of canalization, a structural property of Boolean update rules that stabilizes dynamics and constrains state transitions. Despite its central […]
Environment Maps: Structured Environmental Representations for Long-Horizon Agents
arXiv:2603.23610v1 Announce Type: new Abstract: Although large language models (LLMs) have advanced rapidly, robust automation of complex software workflows remains an open problem. In long-horizon settings, agents frequently suffer from cascading errors and environmental stochasticity; a single misstep in a dynamic interface can lead to task failure, resulting in hallucinations or trial-and-error. This paper introduces […]
Are LLMs Smarter Than Chimpanzees? An Evaluation on Perspective Taking and Knowledge State Estimation
arXiv:2601.12410v2 Announce Type: replace Abstract: Cognitive anthropology suggests that the distinction of human intelligence lies in the ability to infer other individuals’ knowledge states and understand their intentions. In comparison, our closest animal relative, chimpanzees, lack the capacity to do so. With this paper, we aim to evaluate LLM performance in estimating other individuals’ knowledge […]
OSS-CRS: Liberating AIxCC Cyber Reasoning Systems for Real-World Open-Source Security
arXiv:2603.08566v2 Announce Type: replace-cross Abstract: DARPA’s AI Cyber Challenge (AIxCC) showed that cyber reasoning systems (CRSs) can go beyond vulnerability discovery to autonomously confirm and patch bugs: seven teams built such systems and open-sourced them after the competition. Yet all seven open-sourced CRSs remain largely unusable outside their original teams, each bound to the competition […]
Relationship-Aware Safety Unlearning for Multimodal LLMs
arXiv:2603.14185v3 Announce Type: replace Abstract: Generative multimodal models can exhibit safety failures that are inherently relational: two benign concepts can become unsafe when linked by a specific action or relation (e.g., child-drinking-wine). Existing unlearning and concept-erasure approaches often target isolated concepts or image-text pairs, which can cause collateral damage to benign uses of the same […]
Interfacial Potential Transduction for Diagnostics
arXiv:2603.23775v1 Announce Type: new Abstract: A major barrier to decentralized, near-patient diagnostics is the lack of a signal transduction modality that is both analytically precise and accessible at the point of care. Optical readouts remain instrument-dependent and difficult to miniaturize, while compact electrochemical readouts are prone to matrix-derived signal distortion, limiting their biomarker coverage in […]
High-Fidelity Face Content Recovery via Tamper-Resilient Versatile Watermarking
arXiv:2603.23940v1 Announce Type: cross Abstract: The proliferation of AIGC-driven face manipulation and deepfakes poses severe threats to media provenance, integrity, and copyright protection. Prior versatile watermarking systems typically rely on embedding explicit localization payloads, which introduces a fidelity–functionality trade-off: larger localization signals degrade visual quality and often reduce decoding robustness under strong generative edits. Moreover, […]
Self-organized pattern synchronization modulated by stochasticity in coupled plankton ecosystems
arXiv:2603.24000v1 Announce Type: cross Abstract: Spatial patterning and synchronization are pervasive features of plankton communities, yet the mechanisms that allow such patterns to persist coherently under environmental noise remain unresolved. In vertically structured aquatic ecosystems, plankton populations are often organized into distinct layers, raising the question of how interactions between layers shape both spatial self-organization […]
Perturbation: A simple and efficient adversarial tracer for representation learning in language models
arXiv:2603.23821v1 Announce Type: cross Abstract: Linguistic representation learning in deep neural language models (LMs) has been studied for decades, for both practical and theoretical reasons. However, finding representations in LMs remains an unsolved problem, in part due to a dilemma between enforcing implausible constraints on representations (e.g., linearity; Arora et al. 2024) and trivializing the […]
The Luna Bound Propagator for Formal Analysis of Neural Networks
arXiv:2603.23878v1 Announce Type: cross Abstract: The parameterized CROWN analysis, a.k.a., alpha-CROWN, has emerged as a practically successful bound propagation method for neural network verification. However, existing implementations of alpha-CROWN are limited to Python, which complicates integration into existing DNN verifiers and long-term production-level systems. We introduce Luna, a new bound propagator implemented in C++. Luna […]
Estimating Individual Tree Height and Species from UAV Imagery
arXiv:2603.23669v1 Announce Type: cross Abstract: Accurate estimation of forest biomass, a major carbon sink, relies heavily on tree-level traits such as height and species. Unoccupied Aerial Vehicles (UAVs) capturing high-resolution imagery from a single RGB camera offer a cost-effective and scalable approach for mapping and measuring individual trees. We introduce BIRCH-Trees, the first benchmark for […]