Three immunoregulatory signatures define non-productive HIV infection in CD4+ T memory stem cells

The persistent HIV reservoir constitutes the main obstacle to curing HIV/AIDS disease. Our understanding of how non-productive HIV infections are established in primary human CD4+

Dispersal, adaptation and persistence of H5N1 in the sub-Antarctic and Antarctica

High pathogenicity avian influenza virus (HPAIV) H5N1 reached the sub-Antarctic and Antarctica in 2023, subsequently spreading to remote locations within this region where it had

ApeA cleaves genomic RNA to defend against RNA phage infection

To protect themselves against bacteriophage infection, bacteria encode a vast diversity of antiphage defense systems. However, the mechanisms of action of most of these systems

FASTERCC: Accelerating Flux Consistency Testing and Context-Specific Reconstruction for Large-Scale Metabolic Network Models

The increase in size of metabolic network models especially with the advent of single-cell data calls for scalable reconstruction and analysis tools. Such models, often

Acceleration and Velocity Dissociate Temporal Phases of Postural Control in Rhesus Macaques

Maintaining balance requires the nervous system to transform sensory signals about unexpected postural perturbations into precisely timed motor commands. Although human studies have established that

Parameterizing Dataset Distillation via Gaussian Splatting

March 19, 2026

arXiv:2509.26219v3 Announce Type: replace-cross
Abstract: Dataset distillation aims to compress training data while preserving training-aware knowledge, alleviating the reliance on large-scale datasets in modern model training. Dataset parameterization provides a more efficient storage structure for dataset distillation, reducing redundancy and accommodating richer information. However, existing methods either rely on complex auxiliary modules or fail to balance representational capacity and efficiency. In this paper, we propose GSDD, a simple, novel, and effective dataset parameterization technique for Dataset Distillation based on Gaussian Splatting. We adapt CUDA-based splatting operators for parallel training in batch, enabling high-quality rendering with minimal computational and memory overhead. Gaussian primitives can effectively capture meaningful training features, allowing a sparse yet expressive representation of individual images. Leveraging both high representational capacity and efficiency, GSDD substantially increases the diversity of distilled datasets under a given storage budget, thereby improving distillation performance. Beyond achieving competitive results on multiple standard benchmarks, GSDD also delivers significant performance gains on large-scale datasets such as ImageNet-1K and on video distillation tasks. In addition, we conduct comprehensive benchmarks to evaluate the computational efficiency, memory footprint, and cross-GPU architectural stability of GSDD. Code is available on https://github.com/j-cyoung/GSDatasetDistillation

Subscribe for Updates

Copyright 2025 dijee Intelligence Ltd. dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registration number 16808844