DGAT1-dependent lipid droplet synthesis in microglia attenuates neuroinflammatory responses to lipopolysaccharides.

Lipid droplets (LD) are dynamic storage organelles for triglycerides (TG). LD act as a hub that modulates the availability of fatty acids to sustain metabolic

Helix: a structure-aware deep learning model for accurate prediction of A-to-I RNA editing by endogenous ADARs

Adenosine deaminase acting on RNA (ADAR) converts adenosine to inosine within double-stranded RNA (dsRNA) and can be co-opted for therapeutic RNA editing by introducing dsRNA

CAD-C reveals centromere pairing and near-perfect alignment of sister chromatids

Three-dimensional (3D) genome organization plays a central role in gene regulation, chromatin folding, and genome stability. Although chromosome-conformation capture (3C)-derived methods have revolutionized our understanding

5′ RNA Aminoacylation via Interstrand Acyl-Transfer

The catalytic transformations driving coded protein synthesis revolve around linkage of amino acids to molecules of RNA as 2’/3′-aminoacyl esters. This defining molecular species of

Quinazolinone and Phthalazinone Inhibitors of the HDAC6/Ubiquitin Protein-Protein Interaction

Histone deacetylase 6 (HDAC6) is a class IIb histone deacetylase that regulates diverse cytosolic acetylation through its two catalytic deacetylase domains and a C-terminal zinc

Memo2496: Expert-Annotated Dataset and Dual-View Adaptive Framework for Music Emotion Recognition

December 18, 2025

arXiv:2512.13998v2 Announce Type: replace-cross
Abstract: Music Emotion Recogniser (MER) research faces challenges due to limited high-quality annotated datasets and difficulties in addressing cross-track feature drift. This work presents two primary contributions to address these issues. Memo2496, a large-scale dataset, offers 2496 instrumental music tracks with continuous valence arousal labels, annotated by 30 certified music specialists. Annotation quality is ensured through calibration with extreme emotion exemplars and a consistency threshold of 0.25, measured by Euclidean distance in the valence arousal space. Furthermore, the Dual-view Adaptive Music Emotion Recogniser (DAMER) is introduced. DAMER integrates three synergistic modules: Dual Stream Attention Fusion (DSAF) facilitates token-level bidirectional interaction between Mel spectrograms and cochleagrams via cross attention mechanisms; Progressive Confidence Labelling (PCL) generates reliable pseudo labels employing curriculum-based temperature scheduling and consistency quantification using Jensen Shannon divergence; and Style Anchored Memory Learning (SAML) maintains a contrastive memory queue to mitigate cross-track feature drift. Extensive experiments on the Memo2496, 1000songs, and PMEmo datasets demonstrate DAMER’s state-of-the-art performance, improving arousal dimension accuracy by 3.43%, 2.25%, and 0.17%, respectively. Ablation studies and visualisation analyses validate each module’s contribution. Both the dataset and source code are publicly available.

Subscribe for Updates

Copyright 2025 dijee Intelligence Ltd. dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registeration number 16808844