Uncategorized – Page 87 – dijee Pharma Intelligence

Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

arXiv:2502.13388v2 Announce Type: replace Abstract: StarCraft II is a complex and dynamic real-time strategy (RTS) game environment, which is very suitable for artificial intelligence and reinforcement learning research. To address the problem of Large Language Model(LLM) learning in complex environments through self-reflection, we propose a Reflection of Episodes(ROE) framework based on expert experience and self-experience. […]

April 3, 2026

Parallelized Hierarchical Connectome: A Spatiotemporal Recurrent Framework for Spiking State-Space Models

arXiv:2604.01295v1 Announce Type: new Abstract: This work presents the Parallelized Hierarchical Connectome (PHC), a general framework that upgrades temporal-only State-Space Models (SSMs) into spatiotemporal recurrent networks. Conventional SSMs achieve high-speed sequence processing through parallel scans, yet are limited to temporal recurrence without lateral or feedback interactions within a single timestep. PHC maps the diagonal SSM […]

April 3, 2026

Set Contribution Functions for Quantitative Bipolar Argumentation and their Principles

arXiv:2509.14963v2 Announce Type: replace Abstract: We present functions that quantify the contribution of a set of arguments in quantitative bipolar argumentation graphs to (the final strength of) an argument of interest, a so-called topic. Our set contribution functions are generalizations of existing functions that quantify the contribution of a single contributing argument to a topic. […]

April 3, 2026

DIVER: A Multi-Stage Approach for Reasoning-intensive Information Retrieval

arXiv:2508.07995v5 Announce Type: replace-cross Abstract: Retrieval-augmented generation has achieved strong performance on knowledge-intensive tasks where query-document relevance can be identified through direct lexical or semantic matches. However, many real-world queries involve abstract reasoning, analogical thinking, or multi-step inference, which existing retrievers often struggle to capture. To address this challenge, we present DIVER, a retrieval pipeline […]

April 3, 2026

Syntactic Framing Fragility: An Audit of Robustness in LLM Ethical Decisions

arXiv:2601.09724v2 Announce Type: replace-cross Abstract: Large language models exhibit systematic negation sensitivity, yet no operational framework exists to measure this vulnerability at deployment scale, especially in high-stakes decisions. We introduce Syntactic Framing Fragility (SFF), a framework for quantifying decision consistency under logically equivalent syntactic transformations. SFF isolates syntactic effects via Logical Polarity Normalization, enabling direct […]

April 3, 2026

Quantifying Self-Preservation Bias in Large Language Models

arXiv:2604.02174v1 Announce Type: new Abstract: Instrumental convergence predicts that sufficiently advanced AI agents will resist shutdown, yet current safety training (RLHF) may obscure this risk by teaching models to deny self-preservation motives. We introduce the emphTwo-role Benchmark for Self-Preservation (TBSP), which detects misalignment through logical inconsistency rather than stated intent by tasking models to arbitrate […]

April 3, 2026

MTI: A Behavior-Based Temperament Profiling System for AI Agents

arXiv:2604.02145v1 Announce Type: new Abstract: AI models of equivalent capability can exhibit fundamentally different behavioral patterns, yet no standardized instrument exists to measure these dispositional differences. Existing approaches either borrow human personality dimensions and rely on self-report (which diverges from actual behavior in LLMs) or treat behavioral variation as a defect rather than a trait. […]

April 3, 2026

Thermodynamic connectivity reveals functional specialization and multiplex organization of extrasynaptic signaling

arXiv:2604.02057v1 Announce Type: new Abstract: Neural communication operates on both fast synaptic transmission and slower, diffusive extrasynaptic signaling, yet how these two modes jointly organize brain function remains unclear. Here, using the complete synaptic and neuropeptidergic connectomes of emphCaenorhabditis elegans, we develop a unified multiplex framework linking anatomical wiring to functional communication. We infer structure-derived […]

April 3, 2026

Strategies for tumor elimination and control under immune evasion and chemotherapy resistance

arXiv:2604.01385v1 Announce Type: new Abstract: The evolutionary and ecological dynamics of tumors under immune responses and therapeutic interventions pose major challenges to long-term treatment success. Although treatment may initially achieve short-term disease control, resistant cancer cell subpopulations often arise, leading to relapse with more aggressive and treatment-resistant forms of the disease. Here, we develop and […]

April 3, 2026

HAFixAgent: History-Aware Program Repair Agent

arXiv:2511.01047v3 Announce Type: replace-cross Abstract: Automated program repair (APR) has recently shifted toward large language models and agent-based systems, yet most systems rely on local snapshot context, overlooking repository history. Prior work shows that repository history helps repair single-line bugs, since the last commit touching the buggy line is often the bug-introducing one. In this […]

April 3, 2026

Leveraging the Value of Information in POMDP Planning

arXiv:2604.01434v1 Announce Type: new Abstract: Partially observable Markov decision processes (POMDPs) offer a principled formalism for planning under state and transition uncertainty. Despite advances made towards solving large POMDPs, obtaining performant policies under limited planning time remains a major challenge due to the curse of dimensionality and the curse of history. For many POMDP problems, […]

April 3, 2026

DVM: A Bytecode Virtual Machine Approach for Dynamic Tensor Computation

arXiv:2603.24239v2 Announce Type: replace-cross Abstract: Dynamism is common in AI computation, e.g., the dynamic tensor shapes and the dynamic control flows in models. Due to the long compilation time, existing runtime compilation damages the model efficiency, while the offline compilers either suffer from the long compilation time and device memory footprint to cover all the […]

April 3, 2026

Subscribe for Updates