Predictive and Prescriptive AI toward Optimizing Wildfire Suppression

Towards Robust LLM Post-Training: Automatic Failure Management for Reinforcement Fine-Tuning

arXiv:2605.04431v1 Announce Type: cross Abstract: Reinforcement fine-tuning (RFT) has become a core paradigm for post-training large language models, yet its training process remains highly fragile.

CAR: Query-Guided Confidence-Aware Reranking for Retrieval-Augmented Generation

arXiv:2605.04495v1 Announce Type: cross Abstract: Retrieval-Augmented Generation (RAG) depends on document ranking to provide useful evidence for generation, but conventional reranking methods mainly optimize query-document

Memory as a Markov Matrix: Sample Efficient Knowledge Expansion via Token-to-Dictionary Mapping

arXiv:2605.04308v1 Announce Type: cross Abstract: Continual incorporation of new knowledge is essential for the long-term evolution of large language models (LLMs). Existing approaches typically rely

A Zero-Inflated Beta Mixture Model for Marginal Mediation Analysis with Compositional Microbiome Mediators

arXiv:2605.04372v1 Announce Type: cross Abstract: The role of the microbiome in disease pathogenesis is an emerging field with strong evidence suggesting that dysbiosis is associated

From Beats to Breaches: How Offensive AI Infers Sensitive User Information from Playlists

arXiv:2605.04724v1 Announce Type: cross Abstract: The pervasive integration of AI has enabled Offensive AI: the exploitation of AI for malicious ends across the cyber-kill chain.