arXiv:2508.06038v3 Announce Type: replace-cross Abstract: Vision-Language Models (VLMs) incur substantial computational overhead and inference latency due to the large number of vision tokens introduced by high-resolution image and video inputs. Existing parameter-free token compression methods typically rely on token selection or merging, yet they risk discarding substantial visual information or distorting the original representation distribution, […]
Toward Robust Multilingual Adaptation of LLMs for Low-Resource Languages
arXiv:2510.14466v3 Announce Type: replace-cross Abstract: Large language models (LLMs) continue to struggle with low-resource languages, primarily due to limited training data, translation noise, and unstable cross-lingual alignment. To address these challenges, we propose LiRA (Linguistic Robust Anchoring for LLMs)-a plug-and-play framework that requires only lightweight fine-tuning on top of existing pretrained backbones. LiRA jointly optimizes […]
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
arXiv:2510.18941v2 Announce Type: replace-cross Abstract: Evaluating progress in large language models (LLMs) is often constrained by the challenge of verifying responses, limiting assessments to tasks like mathematics, programming, and short-form question-answering. However, many real-world applications require evaluating LLMs in processing professional documents, synthesizing information, and generating comprehensive reports in response to user queries. We introduce […]
ADMEDTAGGER: an annotation framework for distillation of expert knowledge for the Polish medical language
arXiv:2601.09722v2 Announce Type: replace-cross Abstract: In this work, we present an annotation framework that demonstrates how a multilingual LLM pretrained on a large corpus can be used as a teacher model to distill the expert knowledge needed for tagging medical texts in Polish. This work is part of a larger project called ADMEDVOICE, within which […]
Supervised sparse auto-encoders for interpretable and compositional representations
arXiv:2602.00924v3 Announce Type: replace Abstract: Sparse auto-encoders (SAEs) have re-emerged as a prominent method for mechanistic interpretability, yet they face two significant challenges: the non-smoothness of the $L_1$ penalty, which hinders reconstruction and scalability, and a lack of alignment between learned features and human semantics. In this paper, we address these limitations by adapting unconstrained […]
Flowette: Flow Matching with Graphette Priors for Graph Generation
arXiv:2602.23566v3 Announce Type: replace-cross Abstract: We study generative modeling of graphs with recurring subgraph motifs. We propose Flowette, a continuous flow matching framework that employs a graph neural network-based transformer to learn a velocity field over graph representations with node and edge attributes. Our model promotes topology-aware alignment through optimal transport-based coupling and encourages global […]
MoleCode unlocks structural intelligence in large language models
arXiv:2605.16480v1 Announce Type: new Abstract: Molecules are graphs, but large language models~(LLMs) are usually asked to reason about them through linear strings. The most popular molecular representation, SMILES, compresses atoms, bonds, branches and rings into a compact sequence in which topology is implicit, forcing LLMs to reconstruct molecular structure before performing the requested chemical operation. […]
CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels
arXiv:2603.21071v2 Announce Type: replace-cross Abstract: As one of the most important underwater sensing technologies, forward-looking sonar exhibits unique imaging characteristics. Sonar images are often affected by severe speckle noise, low texture contrast, acoustic shadows, and geometric distortions. These factors make it difficult for traditional teacher-student frameworks to achieve satisfactory performance in sonar semantic segmentation tasks […]
From Prompts to Protocols: An AI Agent for Laboratory Automation
arXiv:2605.16552v1 Announce Type: new Abstract: Automating science laboratories enables faster, safer, more accurate, and more reproducible execution of protocols, accelerating the discovery and testing of new materials, drugs, and more. However, setting up and running autonomous labs requires coordinating numerous instruments and robots, forcing scientists to write code, manage configuration files, and navigate complex software […]
Bridging the Modality Bottleneck in Pathology MIL through Virtual Molecular Staining
arXiv:2605.16392v1 Announce Type: new Abstract: Multiple instance learning (MIL) is the dominant framework for whole-slide image analysis in computational pathology, typically combining a frozen patch encoder, a projection layer, and a slide-level aggregator. While encoders and aggregators have been extensively studied, the projection layer remains a largely morphology-only bottleneck. This limits endpoints such as biomarker […]
An Amortized Efficiency Threshold for Comparing Neural and Heuristic Solvers in Combinatorial Optimization
arXiv:2605.14624v2 Announce Type: replace-cross Abstract: A common critique of neural combinatorial-optimization solvers is that they are less energy-efficient than CPU metaheuristics, given the operational energy cost of training them on GPUs. This paper examines the inferential step from “training is expensive” to “neural solvers are net-inefficient”, which is where the critique actually goes wrong. Training […]
An exponential logarithmic measure of drug receptor binding and saturation
arXiv:2605.16466v1 Announce Type: new Abstract: Ligand receptor interactions are commonly assessed through equilibrium occupancy and pharmacodynamic measures that describe binding and saturation by means of bounded response curves. Thermodynamic approaches relate binding affinity to logarithmic concentration scaling, while probabilistic descriptions of occupancy arise from exponential relations. We introduce an exponential logarithmic descriptor (ELD) that integrates […]