May 7, 2026 – Page 26 – dijee Pharma Intelligence

MP-ISMoE: Mixed-Precision Interactive Side Mixture-of-Experts for Efficient Transfer Learning

arXiv:2605.04058v1 Announce Type: cross Abstract: Parameter-efficient transfer learning (PETL) has emerged as a pivotal paradigm for adapting pre-trained foundation models to downstream tasks, significantly reducing trainable parameters yet suffering from substantial memory overhead caused by gradient backpropagation during fine-tuning. While memory-efficient transfer learning (METL) circumvents this challenge by bypassing backbone gradient computation via lightweight small […]

May 7, 2026

Designing a double deep reinforcement learning selection tool for resilient demand prediction

arXiv:2605.04068v1 Announce Type: cross Abstract: The use of artificial intelligence in supply chain forecasting has attracted many scientific studies for several decades. However, the process of selecting an appropriate forecasting solution becomes a daunting task. This complexity arises due to the distinct features inherent to each dataset. Research to tackle this issue has been performed […]

May 7, 2026

Sparse Autoencoder Decomposition of Clinical Sequence Model Representations: Feature Complexity, Task Specialisation, and Mortality Prediction

arXiv:2605.04072v1 Announce Type: cross Abstract: Sparse autoencoders (SAEs) have been applied to large language models and protein language models, but not systematically to electronic health record (EHR) foundation models. We train TopK SAEs on FlatASCEND, a 14.5-million-parameter autoregressive clinical sequence model, at all 10 residual stream extraction points on INSPECT (outpatient) and MIMIC-IV (ICU). SAE […]

May 7, 2026

A Regulatory Governance Framework for AI-Driven Financial Fraud Detection in U.S. Banking: Integrating OCC, SR 11-7, CFPB, and FinCEN Compliance Requirements for Model Development, Validation, and Monitoring Lifecycles

arXiv:2605.04076v1 Announce Type: cross Abstract: U.S. financial institutions deploying AI-based fraud detection face a fragmented compliance landscape spanning four regulatory frameworks — OCC Bulletin 2011-12, SR 11-7, the CFPB AI circular, and FinCEN BSA/SAR requirements — with no integrated governance life cycle connecting these requirements to model development, validation, and monitoring practice. This paper presents […]

May 7, 2026

Connecting online criminal behavior with machine learning: Using authorship attribution to analyze and link potential online traffickers

arXiv:2605.04080v1 Announce Type: cross Abstract: This research investigated how online criminal activities can be better understood and connected using data-driven machine learning methods. Many illegal activities, such as human trafficking and illicit trade, have moved to online platforms where offenders hide behind anonymous accounts and frequently change identities. This makes it difficult for authorities to […]

May 7, 2026

Evaluating Patient Safety Risks in Generative AI: Development and Validation of a FMECA Framework for Generated Clinical Content

arXiv:2605.04085v1 Announce Type: cross Abstract: Objectives: Large language models (LLMs) are increasingly used for clinical text summarization, yet structured methods to assess associated patient safety risks remain limited. Failure Mode, Effects, and Criticality Analysis (FMECA) provides a proactive framework for systematic risk identification but has not been adapted to LLM-generated clinical content. This study aimed […]

May 7, 2026

Resource Utilization of Differentiable Logic Gate Networks Deployed on FPGAs

arXiv:2605.04109v1 Announce Type: cross Abstract: On-edge machine learning (ML) often strives to maximize the intelligence of small models while miniaturizing the circuit size and power needed to perform inference. Meeting these needs, differentiable Logic Gate Networks (LGN) have demonstrated nanosecond-scale prediction speeds while reducing the required resources as compares to traditional binary neural networks. Despite […]

May 7, 2026

Think-Aloud Reshapes Automated Cognitive Model Discovery Beyond Behavior

arXiv:2605.05091v1 Announce Type: new Abstract: Computational cognitive models discovered using large language models have so far relied solely on behavioral data. However, it is well-known that models produced from the behavioral trajectory alone are typically under-determined. In this work, we explore the use of Think Aloud traces as an additional form of data constraint during […]

May 7, 2026

LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents

arXiv:2605.05191v1 Announce Type: new Abstract: Long-horizon search agents must manage a rapidly growing working context as they reason, call tools, and observe information. Naively accumulating all intermediate content can overwhelm the agent, increasing costs and the risk of errors. We propose that effective context management should be adaptive: parts of the agent’s trajectory are maintained […]

May 7, 2026

A large language model-type architecture for high-dimensional molecular potential energy surfaces

arXiv:2412.03831v2 Announce Type: cross Abstract: Computing high-dimensional potential energy surfaces for molecular systems and materials is considered to be a great challenge in computational chemistry with potential impact in a range of areas including the fundamental prediction of reaction rates. In this paper, we design and discuss an algorithm that has similarities to large language […]

May 7, 2026

Deep Wave Network for Modeling Multi-Scale Physical Dynamics

arXiv:2605.04198v1 Announce Type: cross Abstract: Performance of deep learning models is strongly governed by architectural capacity, with width and depth as primary controls. However, in physical-science applications, models are often compared at a single fixed size or by separating accuracy and computational cost, which can be misleading since architectures exhibit different accuracy-cost scaling as width […]

May 7, 2026

A Dialogue-Based Framework for Correcting Multimodal Errors in AI-Assisted STEM Education

arXiv:2605.04131v1 Announce Type: cross Abstract: Large Language Models (LLMs) are democratizing access to personalized tutoring; however, their effectiveness is hindered by challenges in processing multimodal content, which limits AI’s potential to provide equitable, high-quality STEM support. This study evaluates LLM performance on multimodal physics problems, identifies specific failure modes through an empirical error taxonomy, and […]

May 7, 2026

Subscribe for Updates