TTP: Test-Time Padding for Adversarial Detection and Robust Adaptation on Vision-Language Models

arXiv:2512.16523v2 Announce Type: replace-cross Abstract: Vision-Language Models (VLMs), such as CLIP, have achieved impressive zero-shot recognition performance but remain highly susceptible to adversarial perturbations, posing significant risks in safety-critical scenarios. Previous training-time defenses rely on adversarial fine-tuning, which requires labeled data and costly retraining, while existing test-time strategies fail to reliably distinguish between clean and […]

Long-Term Outlier Prediction Through Outlier Score Modeling

arXiv:2603.20993v1 Announce Type: cross Abstract: This study addresses an important gap in time series outlier detection by proposing a novel problem setting: long-term outlier prediction. Conventional methods primarily focus on immediate detection by identifying deviations from normal patterns. As a result, their applicability is limited when forecasting outlier events far into the future. To overcome […]

DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation

arXiv:2603.20470v1 Announce Type: new Abstract: The rapid growth of the text-to-image (T2I) community has fostered a thriving online ecosystem of expert models, which are variants of pretrained diffusion models specialized for diverse generative abilities. Yet, existing model merging methods remain limited in fully leveraging abundant online expert resources and still struggle to meet diverse in-the-wild […]

A Two-stage Transformer Framework for Temporal Localization of Distracted Driver Behaviors

arXiv:2603.21048v1 Announce Type: cross Abstract: The identification of hazardous driving behaviors from in-cabin video streams is essential for enhancing road safety and supporting the detection of traffic violations and unsafe driver actions. However, current temporal action localization techniques often struggle to balance accuracy with computational efficiency. In this work, we develop and evaluate a temporal […]

PhysMem: Self-Evolving Physical Memory for Robot Manipulation

arXiv:2602.20323v3 Announce Type: replace-cross Abstract: Reliable object manipulation requires understanding physical properties that vary across objects and environments. Vision-language model (VLM) planners can reason about friction and stability in general terms; however, they often cannot predict how a specific ball will roll on a particular surface or which stone will provide a stable foundation without […]

DMMRL: Disentangled Multi-Modal Representation Learning via Variational Autoencoders for Molecular Property Prediction

arXiv:2603.21108v1 Announce Type: cross Abstract: Molecular property prediction constitutes a cornerstone of drug discovery and materials science, necessitating models capable of disentangling complex structure-property relationships across diverse molecular modalities. Existing approaches frequently exhibit entangled representations–conflating structural, chemical, and functional factors–thereby limiting interpretability and transferability. Furthermore, conventional methods inadequately exploit complementary information from graphs, sequences, and […]

Transcranial Alternating Current Stimulation (tACS) for patients with Post-Stroke Anomia: Preliminary Data on Picture Naming Performance

arXiv:2603.20476v1 Announce Type: new Abstract: The present study evaluated the effectiveness of transcranial alternating current stimulation (tACS) treating patients with post-stroke anomia using a picture-naming task and a Single-Case Experimental Design (SCED). A right-handed 38-year-old woman with a left-hemisphere stroke and a left-handed 54-year-old man with a right-hemisphere stroke underwent an eight-week treatment program. Specifically, […]

LLM-based Automated Architecture View Generation: Where Are We Now?

arXiv:2603.21178v1 Announce Type: cross Abstract: Architecture views are essential for software architecture documentation, yet their manual creation is labor intensive and often leads to outdated artifacts. As systems grow in complexity, the automated generation of views from source code becomes increasingly valuable. Goal: We empirically evaluate the ability of LLMs and agentic approaches to generate […]

FinTradeBench: A Financial Reasoning Benchmark for LLMs

arXiv:2603.19225v2 Announce Type: replace-cross Abstract: Real-world financial decision-making is a challenging problem that requires reasoning over heterogeneous signals, including company fundamentals derived from regulatory filings and trading signals computed from price dynamics. Recently, with the advancement of Large Language Models (LLMs), financial analysts have begun to use them for financial decision-making tasks. However, existing financial […]

Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conversations

arXiv:2603.21278v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed for extended, multi-topic conversations, yet the flat, append-only structure of current conversation interfaces introduces a fundamental limitation: all context accumulates in a single unbounded window, causing topically distinct threads to bleed into one another and progressively degrade response quality. We term this failure […]

Efficient Counterfactual Reasoning in ProbLog via Single World Intervention Programs

arXiv:2603.20505v1 Announce Type: new Abstract: Probabilistic Logic Programming (PLP) languages, like ProbLog, naturally support reasoning under uncertainty, while maintaining a declarative and interpretable framework. Meanwhile, counterfactual reasoning (i.e., answering “what if” questions) is critical for ensuring AI systems are robust and trustworthy; however, integrating this capability into PLP can be computationally prohibitive and unstable in […]

B-jet Tagging Using a Hybrid Edge Convolution and Transformer Architecture

arXiv:2603.21326v1 Announce Type: cross Abstract: Jet flavor tagging plays an important role in precise Standard Model measurement enabling the extraction of mass dependence in jet-quark interaction and quark-gluon plasma (QGP) interactions. They also enable inferring the nature of particles produced in high-energy particle collisions that contain heavy quarks. The classification of bottom jets is vital […]

Subscribe for Updates

Copyright 2025 dijee Intelligence Ltd.   dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registration number 16808844