Large Language Models Outperform Humans in Fraud Detection and Resistance to Motivated Investor Pressure

arXiv:2604.20652v1 Announce Type: new Abstract: Large language models trained on human feedback may suppress fraud warnings when investors arrive already persuaded of a fraudulent opportunity. We tested this in a preregistered experiment across seven leading LLMs and twelve investment scenarios covering legitimate, high-risk, and objectively fraudulent opportunities, combining 3,360 AI advisory conversations with a 1,201-participant […]

ThermoQA: A Three-Tier Benchmark for Evaluating Thermodynamic Reasoning in Large Language Models

arXiv:2604.19758v1 Announce Type: new Abstract: We present ThermoQA, a benchmark of 293 open-ended engineering thermodynamics problems in three tiers: property lookups (110 Q), component analysis (101 Q), and full cycle analysis (82 Q). Ground truth is computed programmatically from CoolProp 7.2.0, covering water, R-134a, and variable-cp air. Six frontier LLMs are evaluated across three independent […]

VTouch++: A Multimodal Dataset with Vision-Based Tactile Enhancement for Bimanual Manipulation

arXiv:2604.20444v1 Announce Type: cross Abstract: Embodied intelligence has advanced rapidly in recent years; however, bimanual manipulation-especially in contact-rich tasks remains challenging. This is largely due to the lack of datasets with rich physical interaction signals, systematic task organization, and sufficient scale. To address these limitations, we introduce the VTOUCH dataset. It leverages vision based tactile […]

Diagnosing CFG Interpretation in LLMs

arXiv:2604.20811v1 Announce Type: new Abstract: As LLMs are increasingly integrated into agentic systems, they must adhere to dynamically defined, machine-interpretable interfaces. We evaluate LLMs as in-context interpreters: given a novel context-free grammar, can LLMs generate syntactically valid, behaviorally functional, and semantically faithful outputs? We introduce RoboGrid, a framework that disentangles syntax, behavior, and semantics through […]

Resolving space-sharing conflicts in road user interactions through uncertainty reduction: An active inference-based computational model

arXiv:2604.19838v1 Announce Type: new Abstract: Understanding how road users resolve space-sharing conflicts is important both for traffic safety and the safe deployment of autonomous vehicles. While existing models have captured specific aspects of such interactions (e.g., explicit communication), a theoretically-grounded computational framework has been lacking. In this paper, we extend a previously developed active inference-based […]

Deconstructing Superintelligence: Identity, Self-Modification and Diff’erance

arXiv:2604.19845v1 Announce Type: new Abstract: Self-modification is often taken as constitutive of artificial superintelligence (SI), yet modification is a relative action requiring a supplement outside the operation. When self-modification extends to this supplement, the classical self-referential structure collapses. We formalise this on an associative operator algebra $mathcalA$ with update $hatU$, discrimination $hatD$, and self-representation $hatR$, […]

Energy gradients as potential drivers of pre-cellular chemical organization

arXiv:2604.19842v1 Announce Type: new Abstract: The onset of life is often framed around membrane bound compartments and encoded metabolism, leaving unresolved how spatial organization arose before stable boundaries. In this context, environmental gradients are usually treated as boundary conditions rather than variables structuring chemical dynamics. We ask whether spatial localization and functional coupling can emerge […]

Multi-stage volume exclusion models for cell proliferation

arXiv:2604.19852v1 Announce Type: new Abstract: Cell proliferation and cell movement are fundamentally stochastic processes which lead to variability in the growth and spatial structure of cell populations in many biological settings, such as cell invasion, wound healing, and tumour growth. We develop stochastic, on-lattice agent-based models (ABMs) which incorporate volume exclusion, random movement, and multi-stage […]

Membership Inference for Contrastive Pre-training Models with Text-only PII Queries

arXiv:2603.14222v2 Announce Type: replace-cross Abstract: Contrastive pretraining models such as CLIP and CLAP, serve as the ubiquitous perceptual backbones for modern multimodal large models, yet their reliance on web-scale data raises growing concerns about memorizing Personally Identifiable Information (PII). Auditing such models via membership inference is challenging in practice: shadow-model MIAs are computationally prohibitive for […]

Indirect Prey-taxis VS a Shortwave External Signal in Multiple Dimensions

arXiv:2604.20469v1 Announce Type: cross Abstract: We address a short-wave asymptotic for one class of quasi-linear second order PDE systems involving the cross-diffusion described by the so-called Patlak–Keller–Segel law. It is common to employ these equations for modelling the predator–prey community with the prey-taxis that means the interactions of two species of particles or cells or […]

Do We Need Bigger Models for Science? Task-Aware Retrieval with Small Language Models

arXiv:2604.01965v2 Announce Type: replace-cross Abstract: Scientific knowledge discovery increasingly relies on large language models, yet many existing scholarly assistants depend on proprietary systems with tens or hundreds of billions of parameters. Such reliance limits reproducibility and accessibility for the research community. In this work, we ask a simple question: do we need bigger models for […]

Subscribe for Updates

Copyright 2025 dijee Intelligence Ltd.   dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registration number 16808844