Fourier Compressor: Frequency-Domain Visual Token Compression for Vision-Language Models

arXiv:2508.06038v3 Announce Type: replace-cross Abstract: Vision-Language Models (VLMs) incur substantial computational overhead and inference latency due to the large number of vision tokens introduced by high-resolution image and video inputs. Existing parameter-free token compression methods typically rely on token selection or merging, yet they risk discarding substantial visual information or distorting the original representation distribution, […]

Toward Robust Multilingual Adaptation of LLMs for Low-Resource Languages

arXiv:2510.14466v3 Announce Type: replace-cross Abstract: Large language models (LLMs) continue to struggle with low-resource languages, primarily due to limited training data, translation noise, and unstable cross-lingual alignment. To address these challenges, we propose LiRA (Linguistic Robust Anchoring for LLMs)-a plug-and-play framework that requires only lightweight fine-tuning on top of existing pretrained backbones. LiRA jointly optimizes […]

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

arXiv:2510.18941v2 Announce Type: replace-cross Abstract: Evaluating progress in large language models (LLMs) is often constrained by the challenge of verifying responses, limiting assessments to tasks like mathematics, programming, and short-form question-answering. However, many real-world applications require evaluating LLMs in processing professional documents, synthesizing information, and generating comprehensive reports in response to user queries. We introduce […]

Supervised sparse auto-encoders for interpretable and compositional representations

arXiv:2602.00924v3 Announce Type: replace Abstract: Sparse auto-encoders (SAEs) have re-emerged as a prominent method for mechanistic interpretability, yet they face two significant challenges: the non-smoothness of the $L_1$ penalty, which hinders reconstruction and scalability, and a lack of alignment between learned features and human semantics. In this paper, we address these limitations by adapting unconstrained […]

Flowette: Flow Matching with Graphette Priors for Graph Generation

arXiv:2602.23566v3 Announce Type: replace-cross Abstract: We study generative modeling of graphs with recurring subgraph motifs. We propose Flowette, a continuous flow matching framework that employs a graph neural network-based transformer to learn a velocity field over graph representations with node and edge attributes. Our model promotes topology-aware alignment through optimal transport-based coupling and encourages global […]

MoleCode unlocks structural intelligence in large language models

arXiv:2605.16480v1 Announce Type: new Abstract: Molecules are graphs, but large language models~(LLMs) are usually asked to reason about them through linear strings. The most popular molecular representation, SMILES, compresses atoms, bonds, branches and rings into a compact sequence in which topology is implicit, forcing LLMs to reconstruct molecular structure before performing the requested chemical operation. […]

CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels

arXiv:2603.21071v2 Announce Type: replace-cross Abstract: As one of the most important underwater sensing technologies, forward-looking sonar exhibits unique imaging characteristics. Sonar images are often affected by severe speckle noise, low texture contrast, acoustic shadows, and geometric distortions. These factors make it difficult for traditional teacher-student frameworks to achieve satisfactory performance in sonar semantic segmentation tasks […]

From Prompts to Protocols: An AI Agent for Laboratory Automation

arXiv:2605.16552v1 Announce Type: new Abstract: Automating science laboratories enables faster, safer, more accurate, and more reproducible execution of protocols, accelerating the discovery and testing of new materials, drugs, and more. However, setting up and running autonomous labs requires coordinating numerous instruments and robots, forcing scientists to write code, manage configuration files, and navigate complex software […]

Bridging the Modality Bottleneck in Pathology MIL through Virtual Molecular Staining

arXiv:2605.16392v1 Announce Type: new Abstract: Multiple instance learning (MIL) is the dominant framework for whole-slide image analysis in computational pathology, typically combining a frozen patch encoder, a projection layer, and a slide-level aggregator. While encoders and aggregators have been extensively studied, the projection layer remains a largely morphology-only bottleneck. This limits endpoints such as biomarker […]

An Amortized Efficiency Threshold for Comparing Neural and Heuristic Solvers in Combinatorial Optimization

arXiv:2605.14624v2 Announce Type: replace-cross Abstract: A common critique of neural combinatorial-optimization solvers is that they are less energy-efficient than CPU metaheuristics, given the operational energy cost of training them on GPUs. This paper examines the inferential step from “training is expensive” to “neural solvers are net-inefficient”, which is where the critique actually goes wrong. Training […]

An exponential logarithmic measure of drug receptor binding and saturation

arXiv:2605.16466v1 Announce Type: new Abstract: Ligand receptor interactions are commonly assessed through equilibrium occupancy and pharmacodynamic measures that describe binding and saturation by means of bounded response curves. Thermodynamic approaches relate binding affinity to logarithmic concentration scaling, while probabilistic descriptions of occupancy arise from exponential relations. We introduce an exponential logarithmic descriptor (ELD) that integrates […]

Subscribe for Updates

Copyright 2025 dijee Intelligence Ltd.   dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registration number 16808844