Mixture of Attention Schemes (MoAS): Learning to Route Between MHA, GQA, and MQA

AI Wrapped: The 14 AI terms you couldn’t avoid in 2025

If the past 12 months have taught us anything, it’s that the AI hype train is showing no signs of slowing. It’s hard to believe

FEM-Bench: A Structured Scientific Reasoning Benchmark for Evaluating Code-Generating LLMs

arXiv:2512.20732v1 Announce Type: cross Abstract: As LLMs advance their reasoning capabilities about the physical world, the absence of rigorous benchmarks for evaluating their ability to

Forward Only Learning for Orthogonal Neural Networks of any Depth

arXiv:2512.20668v1 Announce Type: cross Abstract: Backpropagation is still the de facto algorithm used today to train neural networks. With the exponential growth of recent architectures,

Revisiting the Learning Objectives of Vision-Language Reward Models

arXiv:2512.20675v1 Announce Type: cross Abstract: Learning generalizable reward functions is a core challenge in embodied intelligence. Recent work leverages contrastive vision language models (VLMs) to

Forecasting N-Body Dynamics: A Comparative Study of Neural Ordinary Differential Equations and Universal Differential Equations

arXiv:2512.20643v1 Announce Type: cross Abstract: The n body problem, fundamental to astrophysics, simulates the motion of n bodies acting under the effect of their own