arXiv:2603.23069v1 Announce Type: cross Abstract: The task of authorship style transfer involves rewriting text in the style of a target author while preserving the meaning of the original text. Existing style transfer methods train a single model on large corpora to model all target styles at once: this high-cost approach offers limited flexibility for target-specific […]
Mi:dm K 2.5 Pro
arXiv:2603.18788v2 Announce Type: replace-cross Abstract: The evolving LLM landscape requires capabilities beyond simple text generation, prioritizing multi-step reasoning, long-context understanding, and agentic workflows. This shift challenges existing models in enterprise environments, especially in Korean-language and domain-specific scenarios where scaling is insufficient. We introduce Mi:dm K 2.5 Pro, a 32B parameter flagship LLM designed to address […]
A Sobering Look at Tabular Data Generation via Probabilistic Circuits
arXiv:2603.23016v1 Announce Type: cross Abstract: Tabular data is more challenging to generate than text and images, due to its heterogeneous features and much lower sample sizes. On this task, diffusion-based models are the current state-of-the-art (SotA) model class, achieving almost perfect performance on commonly used benchmarks. In this paper, we question the perception of progress […]
Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems
arXiv:2503.04945v3 Announce Type: replace-cross Abstract: The proliferation of generative models has presented significant challenges in distinguishing authentic human-authored content from deepfake content. Collaborative human efforts, augmented by AI tools, present a promising solution. In this study, we explore the potential of DeepFakeDeLiBot, a deliberation-enhancing chatbot, to support groups in detecting deepfake text. Our findings reveal […]
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
arXiv:2510.26865v2 Announce Type: replace-cross Abstract: Reading measurement instruments is effortless for humans and requires relatively little domain expertise, yet it remains surprisingly challenging for current vision-language models (VLMs) as we find in preliminary evaluation. In this work, we introduce MeasureBench, a benchmark on visual measurement reading covering both real-world and synthesized images of various types […]
Streaming Attention Approximation via Discrepancy Theory
arXiv:2502.07861v3 Announce Type: replace-cross Abstract: Large language models (LLMs) have achieved impressive success, but their high memory requirements present challenges for long-context token generation. In this paper we study the streaming complexity of attention approximation, a key computational primitive underlying token generation. Our main contribution is BalanceKV, a streaming algorithm for $epsilon$-approximating attention computations based […]
An Accurate and Interpretable Framework for Trustworthy Process Monitoring
arXiv:2302.10426v3 Announce Type: replace Abstract: Trustworthy process monitoring seeks to build an accurate and interpretable monitoring framework, which is critical for ensuring the safety of energy conversion plant (ECP) that operates under extreme working conditions such as high pressure and temperature. Contemporary self-attentive models, however, fall short in this domain for two main reasons. First, […]
VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions
arXiv:2603.23495v1 Announce Type: cross Abstract: Existing approaches for improving the efficiency of Large Vision-Language Models (LVLMs) are largely based on the concept of visual token reduction. This approach, however, creates an information bottleneck that impairs performance, especially on challenging tasks that require fine-grained understanding and reasoning. In this work, we challenge this paradigm by introducing […]
EVA: Efficient Reinforcement Learning for End-to-End Video Agent
arXiv:2603.22918v1 Announce Type: cross Abstract: Video understanding with multimodal large language models (MLLMs) remains challenging due to the long token sequences of videos, which contain extensive temporal dependencies and redundant frames. Existing approaches typically treat MLLMs as passive recognizers, processing entire videos or uniformly sampled frames without adaptive reasoning. Recent agent-based methods introduce external tools, […]
The EU AI Act and the Rights-based Approach to Technological Governance
arXiv:2603.22920v1 Announce Type: cross Abstract: The EU AI Act constitutes an important development in shaping the Union’s digital regulatory architecture. The Act places fundamental rights at the heart of a risk-based governance framework. The article examines how the AI Act institutionalises a human-centric approach to AI and how the AI Act’s provisions explicitly and implicitly […]
Beyond Matching to Tiles: Bridging Unaligned Aerial and Satellite Views for Vision-Only UAV Navigation
arXiv:2603.22153v2 Announce Type: replace-cross Abstract: Recent advances in cross-view geo-localization (CVGL) methods have shown strong potential for supporting unmanned aerial vehicle (UAV) navigation in GNSS-denied environments. However, existing work predominantly focuses on matching UAV views to onboard map tiles, which introduces an inherent trade-off between accuracy and storage overhead, and overlooks the importance of the […]
Sketching a Space of Brain States
arXiv:2603.22296v1 Announce Type: new Abstract: Brain functional connectivity alterations, that is, pathological changes in the signal exchange between areas of the brain, occur in several neurological diseases, including neurodegenerative and neuropsychiatric ones. They consist in changes in how brain functional networks operate. By conceptualising a brain space as a space whose points are connectome configurations […]