AIR-ML
Home
Research
News
Team
Project
Publication
Position
Contact
Article
GREAT: Generalizable Backdoor Attacks in RLHF via Emotion-Aware Trigger Synthesis
we develop a novel framework for crafting generalizable backdoors in RLHF through emotion-aware trigger synthesis
Subrat Kishore Dutta
,
Yuelin Xu
,
Piyush Pant
,
Xiao Zhang
PDF
Cite
ArXiv
ContextFlow: Context-Aware Flow Matching For Trajectory Inference From Spatial Omics Data
We propose ContextFlow, a novel context-aware flow matching framework that incorporates prior knowledge to guide the inference of structural tissue dynamics from spatially resolved omics data.
Santanu Rathod
,
Francesco Ceccarelli
,
Sean B. Holden
,
Pietro Liò
,
Xiao Zhang
,
Jovan Tanevski
PDF
Cite
Code
ArXiv
Controllable Adversarial Makeup for Privacy via Text-Guided Diffusion
We introduce MASQUE, a diffusion-based framework that generates localized adversarial makeup guided by user-defined text prompts.
Youngjin Kwon
,
Xiao Zhang
PDF
Cite
Code
ArXiv
Generalizable Targeted Data Poisoning against Varying Physical Objects
We take the first step toward understanding the real-world threats of TDP by studying its generalizability across varying physical conditions.
Zhizhen Chen
,
Zhengyu Zhao
,
Subrat Kishore Dutta
,
Chenhao Lin
,
Chao Shen
,
Xiao Zhang
PDF
Cite
Code
ArXiv
Safe in Isolation, Dangerous Together: Agent-Driven Multi-Turn Decomposition Jailbreaks on LLMs
We propose a multi-agent, multi-turn jailbreak strategy that systematically bypasses LLM safety mechanisms by decomposing harmful queries into seemingly benign sub-tasks.
Devansh Srivastav
,
Xiao Zhang
PDF
Cite
Source Document
DiffCAP: Diffusion-based Cumulative Adversarial Purification for Vision Language Models
This paper introduces DiffCAP, a novel diffusion-based purification strategy that can effectively neutralize adversarial corruptions in VLMs.
Jia Fu
,
Yongtao Wu
,
Yihang Chen
,
Kunyu Peng
,
Xiao Zhang
,
Volkan Cevher
,
Sepideh Pashami
,
Anders Holst
PDF
Cite
ArXiv
Predicting Time-varying Flux and Balance in Metabolic Systems using Structured Neural ODE Processes
We propose a structured neural ODE process model to estimate flux and balance samples using gene-expression time-series data
Santanu Rathod
,
Pietro Liò
,
Xiao Zhang
PDF
Cite
ArXiv
OpenReview
Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks
We propose a general framework to craft invisibility cloak for human pose estimation models
Minxing Zhang
,
Michael Backes
,
Xiao Zhang
PDF
Cite
ArXiv
Improving the Efficiency of Self-Supervised Adversarial Training through Latent Clustering-based Selection
We introduce a Latent Clustering-based Selection method to choose a core subset from the entire unlabeled dataset, aiming to improve the efficiency of self-supervised adversarial training while preserving robustness.
Somrita Ghosh
,
Yuelin Xu
,
Xiao Zhang
PDF
Cite
OpenReview
Understanding Adversarially Robust Generalization via Weight-Curvature Index
We introduce the Weight-Curvature Index (WCI), a novel metric that captures the interplay between model parameters and loss landscape curvature to better understand and improve adversarially robust generalization in deep learning.
Yuelin Xu
,
Xiao Zhang
PDF
Cite
ArXiv
OpenReview
»
Cite
×