Three immunoregulatory signatures define non-productive HIV infection in CD4+ T memory stem cells

The persistent HIV reservoir constitutes the main obstacle to curing HIV/AIDS disease. Our understanding of how non-productive HIV infections are established in primary human CD4+

Dispersal, adaptation and persistence of H5N1 in the sub-Antarctic and Antarctica

High pathogenicity avian influenza virus (HPAIV) H5N1 reached the sub-Antarctic and Antarctica in 2023, subsequently spreading to remote locations within this region where it had

ApeA cleaves genomic RNA to defend against RNA phage infection

To protect themselves against bacteriophage infection, bacteria encode a vast diversity of antiphage defense systems. However, the mechanisms of action of most of these systems

FASTERCC: Accelerating Flux Consistency Testing and Context-Specific Reconstruction for Large-Scale Metabolic Network Models

The increase in size of metabolic network models especially with the advent of single-cell data calls for scalable reconstruction and analysis tools. Such models, often

Acceleration and Velocity Dissociate Temporal Phases of Postural Control in Rhesus Macaques

Maintaining balance requires the nervous system to transform sensory signals about unexpected postural perturbations into precisely timed motor commands. Although human studies have established that

Are a Thousand Words Better Than a Single Picture? Beyond Images — A Framework for Multi-Modal Knowledge Graph Dataset Enrichment

March 19, 2026

arXiv:2603.16974v1 Announce Type: cross
Abstract: Multi-Modal Knowledge Graphs (MMKGs) benefit from visual information, yet large-scale image collection is hard to curate and often excludes ambiguous but relevant visuals (e.g., logos, symbols, abstract scenes). We present Beyond Images, an automatic data-centric enrichment pipeline with optional human auditing. This pipeline operates in three stages: (1) large-scale retrieval of additional entity-related images, (2) conversion of all visual inputs into textual descriptions to ensure that ambiguous images contribute usable semantics rather than noise, and (3) fusion of multi-source descriptions using a large language model (LLM) to generate concise, entity-aligned summaries. These summaries replace or augment the text modality in standard MMKG models without changing their architectures or loss functions. Across three public MMKG datasets and multiple baseline models, we observe consistent gains (up to 7% Hits@1 overall). Furthermore, on a challenging subset of entities with visually ambiguous logos and symbols, converting images into text yields large improvements (201.35% MRR and 333.33% Hits@1). Additionally, we release a lightweight Text-Image Consistency Check Interface for optional targeted audits, improving description quality and dataset reliability. Our results show that scaling image coverage and converting ambiguous visuals into text is a practical path to stronger MMKG completion. Code, datasets, and supplementary materials are available at https://github.com/pengyu-zhang/Beyond-Images.

Subscribe for Updates

Copyright 2025 dijee Intelligence Ltd. dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registration number 16808844