PureGen

PureGen: Universal Data Purification for Train-Time Poison Defense via Generative Model Dynamics

PureGen purifies poisoned training data using iterative Langevin dynamics of Energy-Based Models and Denoising Diffusion Probabilistic Models, achieving state-of-the-art poison defense with minimal impact on classifier generalization.

June 2024 · Sunay Bhat, Jeffrey Jiang, Omead Pooladzandi, Alexander Branch, Gregory Pottie
Causal Structural Hypothesis Testing

Causal Structural Hypothesis Testing and Data Generation Models

CSHTEST and CSVHTEST use non-parametric structural causal knowledge and deep neural networks to perform hypothesis testing on causal models, validated on simulated DAGs, a synthetic pendulum dataset, and real-world medical trauma data.

December 2022 · Sunay Bhat, Omead Pooladzandi, Jeffrey Jiang, Gregory Pottie