Do Language Models Need Sleep? Offline Recurrence for Improved Online Inference

arXiv:2605.26099v2 Announce Type: replace-cross Abstract: Transformer-based large language models are increasingly used for long-horizon tasks; however, their attention mechanism scales poorly with context length. To handle this, we study a sleep-like consolidation mechanism in which a model periodically converts recent context into persistent fast weights before clearing its key-value cache. During sleep, the model performs […]

Anomaly as Non-Conformity via Training-Free Graph Laplacian Energy Minimization

arXiv:2605.28428v1 Announce Type: cross Abstract: Detecting subtle visual anomalies in images remains challenging, particularly when only normal samples are available a priori. Such unsupervised anomaly detection is typically solved by measuring feature similarity of a query patch to a memory of normal patches. However, similarity alone does not reveal how strongly a query patch violates […]

Periodic RoPE for Infinite Context LLMs

arXiv:2605.27980v1 Announce Type: cross Abstract: The ability to process ultra-long contexts is crucial for large language models (LLMs) to perform long-horizon tasks. While recent efforts have extended context windows to 1M and beyond, model performance degrades when sequence length exceeds the pre-trained range of positional encodings (e.g., RoPE), i.e., position exhaustion. This fundamental limitation must […]

Towards Reliable Multilingual LLMs-as-a-Judge: An Empirical Study

arXiv:2605.28710v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly used for the automatic evaluation of generated text, yet most prior work focuses on English. Despite the growing demand for multilingual evaluation, extending LLM-based evaluators to multilingual settings remains challenging, particularly for low-resource languages and scenarios where in-domain data is scarce. This work explores […]

CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation

arXiv:2605.25378v2 Announce Type: replace-cross Abstract: Customized image editing aims to equip pre-trained diffusion models with specific visual effects using limited paired data, typically via Low-Rank Adaptation (LoRA). As the number of desired effects grows, storing and dynamically loading numerous these effect LoRAs significantly increases deployment overhead. Furthermore, current pipelines typically cascade these effect LoRAs with […]

How Much Can a Few Engine Moves Help? Quantifying Limited Cheating in Chess

arXiv:2601.05386v2 Announce Type: replace Abstract: Cheating in chess, by using advice from powerful software, has become a major problem, reaching the highest levels. As opposed to the large majority of previous work, which concerned em detection of cheating, here we try to evaluate the possible gain in performance, obtained by cheating a limited number of […]

Semantic Flow Regularization: Teaching LLMs to Generate Diverse Yet Coherent Responses

arXiv:2605.27971v1 Announce Type: cross Abstract: When large language models are fine-tuned to generate persona- or tone-conditioned responses, their output diversity is severely limited–a failure we term Cross-Style Collapse. We trace this collapse to the cross-entropy objective, which under shared representations tends to suppress diverse continuations. We propose Semantic Flow Regularization (SFR), a lightweight auxiliary objective […]

From Accuracy to Auditability: A Survey of Determinism in Financial AI Systems

arXiv:2605.23955v2 Announce Type: replace Abstract: Deploying machine learning in regulated financial environments — credit risk, fraud detection, and anti-money laundering — exposes critical vulnerabilities in algorithmic reproducibility. While early financial ML addressed statistical challenges such as backtest overfitting, deep neural networks and Generative AI have introduced mechanical nondeterminism rooted in hardware and architecture. This survey […]

Grimlock: Guarding High-Agency Systems with eBPF and Attested Channels

arXiv:2605.27488v1 Announce Type: cross Abstract: Agentic systems increasingly run user-authored orchestration code that invokes tools, spawns subtasks, and delegates work across machines and clouds. Although this high agency is productive, it creates a security problem: identity, authorization, provenance, and delegation are often pushed into application code, where they become difficult to enforce consistently and difficult […]

Knowledge Graph-Driven Expert-Level Reasoning for Neuroscience

arXiv:2605.25183v2 Announce Type: replace-cross Abstract: Knowledge graph (KG) is an abstraction that can be extracted from text corpora and used for in-depth reasoning. Prior work has leveraged KGs to fine-tune language models (LMs), enabling domain-specific superintelligence. In this work, we explore whether KG-driven in-depth reasoning capabilities can emerge in neuroscience using only information contained within […]

Developing an Intelligent Job Recommendation System Using Semantic Retrieval and Explainable AI Techniques

arXiv:2605.27656v1 Announce Type: cross Abstract: Online recruitment platforms require recommendation methods capable of retrieving relevant job opportunities from large and heterogeneous collections of job postings. Keyword-based search is efficient and interpretable, but it may fail to retrieve relevant postings when equivalent roles are expressed using different terminology. This study presents a metadata-driven job recommendation system […]

PICACO: Pluralistic In-Context Value Alignment of LLMs via Total Correlation Optimization

arXiv:2507.16679v4 Announce Type: replace-cross Abstract: In-Context Learning has shown great potential for aligning Large Language Models (LLMs) with human values, helping reduce harmful outputs and accommodate diverse preferences without costly post-training, known as In-Context Alignment (ICA). However, LLMs’ comprehension of input prompts remains agnostic, limiting ICA’s ability to address value tensions–human values are inherently pluralistic, […]

Subscribe for Updates

Copyright 2025 dijee Intelligence Ltd.   dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registration number 16808844