Revisiting the Learning Objectives of Vision-Language Reward Models

FEM-Bench: A Structured Scientific Reasoning Benchmark for Evaluating Code-Generating LLMs

arXiv:2512.20732v1 Announce Type: cross Abstract: As LLMs advance their reasoning capabilities about the physical world, the absence of rigorous benchmarks for evaluating their ability to

Forward Only Learning for Orthogonal Neural Networks of any Depth

arXiv:2512.20668v1 Announce Type: cross Abstract: Backpropagation is still the de facto algorithm used today to train neural networks. With the exponential growth of recent architectures,

Forecasting N-Body Dynamics: A Comparative Study of Neural Ordinary Differential Equations and Universal Differential Equations

arXiv:2512.20643v1 Announce Type: cross Abstract: The n-body problem, fundamental to astrophysics, simulates the motion of n bodies acting under the effect of their own

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

arXiv:2512.20848v1 Announce Type: cross Abstract: We present Nemotron 3 Nano 30B-A3B, a Mixture-of-Experts hybrid Mamba-Transformer language model. Nemotron 3 Nano was pretrained on 25 trillion

Bridging Efficiency and Safety: Formal Verification of Neural Networks with Early Exits

arXiv:2512.20755v1 Announce Type: cross Abstract: Ensuring the safety and efficiency of AI systems is a central goal of modern research. Formal verification provides guarantees of