publications
Conference and workshop papers listed in reverse chronological order. * denotes equal contribution.
2024
- [ICML] Out of the Ordinary: Robust Regression by Spectral Adaptation. Benjamin Eyre, Elliot Creager, David Madras, Vardan Papyan, and Richard Zemel. In Proceedings of the 41st International Conference on Machine Learning, Jul 2024.
Designing deep neural network classifiers that perform robustly on distributions differing from the available training data is an active area of machine learning research. However, out-of-distribution generalization for regression, the analogous problem for modeling continuous targets, remains relatively unexplored. To tackle this problem, we return to first principles and analyze how the closed-form solution for Ordinary Least Squares (OLS) regression is sensitive to covariate shift. We characterize the out-of-distribution risk of the OLS model in terms of the eigenspectrum decomposition of the source and target data. We then use this insight to propose a method for adapting the weights of the last layer of a pre-trained neural regression model to perform better on input data originating from a different distribution. We demonstrate how this lightweight spectral adaptation procedure can improve out-of-distribution performance for synthetic and real-world datasets.
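The covariate-shift sensitivity that the abstract analyzes is easy to reproduce numerically. Below is a minimal numpy sketch (an illustration of the failure mode only; it does not implement the paper's spectral adaptation method): a direction that is barely excited in the source data dominates the target data, and the target risk of the closed-form OLS fit inflates accordingly.

```python
import numpy as np

rng = np.random.default_rng(0)
d, n = 5, 2000
w_true = rng.normal(size=d)

# Source covariates: one direction has very low variance during training.
scales_src = np.array([1.0, 1.0, 1.0, 1.0, 0.01])
X_src = rng.normal(size=(n, d)) * scales_src
y_src = X_src @ w_true + 0.1 * rng.normal(size=n)

# Closed-form OLS fit on the source distribution: w = (X^T X)^{-1} X^T y.
w_ols = np.linalg.solve(X_src.T @ X_src, X_src.T @ y_src)

# Target covariates: the previously low-variance direction now dominates.
scales_tgt = np.array([1.0, 1.0, 1.0, 1.0, 5.0])
X_tgt = rng.normal(size=(n, d)) * scales_tgt
y_tgt = X_tgt @ w_true + 0.1 * rng.normal(size=n)

# Risk inflates along eigendirections poorly covered at training time.
print("source covariance eigenvalues:", np.linalg.eigvalsh(X_src.T @ X_src / n))
print("target MSE:", np.mean((X_tgt @ w_ols - y_tgt) ** 2))
```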
- [ICML] Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making. Parand A. Alamdari, Toryn Q. Klassen, Elliot Creager, and Sheila A. McIlraith. In Proceedings of the 41st International Conference on Machine Learning, Jul 2024.
Fair decision making has largely been studied with respect to a single decision. Here we investigate the notion of fairness in the context of sequential decision making where multiple stakeholders can be affected by the outcomes of decisions. We observe that fairness often depends on the history of the sequential decision-making process, and in this sense it is inherently non-Markovian. We further observe that fairness often needs to be assessed at time points *within* the process, not just at the end of the process. To advance our understanding of this class of fairness problems, we explore the notion of non-Markovian fairness in the context of sequential decision making. We identify properties of non-Markovian fairness, including notions of long-term, anytime, periodic, and bounded fairness. We explore the interplay between non-Markovian fairness and memory and how memory can support construction of fair policies. Finally, we introduce the FairQCM algorithm, which can automatically augment its training data to improve sample efficiency in the synthesis of fair policies via reinforcement learning.
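To make the history-dependence concrete, here is a toy sketch of our own construction (not the paper's FairQCM algorithm): a fair allocation policy must remember past allocations, so no memoryless policy over the instantaneous state can implement it.

```python
# Toy non-Markovian fair policy: the decision depends on the history of past
# allocations, summarized here by running counts (the policy's "memory").
counts = {"alice": 0, "bob": 0, "carol": 0}

def fair_allocate(counts):
    """Allocate the next resource to the stakeholder served least so far."""
    return min(counts, key=counts.get)

for t in range(9):
    counts[fair_allocate(counts)] += 1
print(counts)  # balanced at every intermediate time point: anytime fairness
```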
2023
- [Thesis] Robust Machine Learning by Transforming and Augmenting Imperfect Training Data. Elliot Creager. Jul 2023.
Machine Learning (ML) is an expressive framework for turning data into computer programs. Across many problem domains – both in industry and policy settings – the types of computer programs needed for accurate prediction or optimal control are difficult to write by hand. On the other hand, collecting instances of desired system behavior may be relatively more feasible. This makes ML broadly appealing, but also induces data sensitivities that often manifest as unexpected failure modes during deployment. In this sense, the training data available tend to be imperfect for the task at hand. This thesis explores several data sensitivities of modern machine learning and how to address them. We begin by discussing how to prevent ML from codifying prior human discrimination measured in the training data, where we take a fair representation learning approach. We then discuss the problem of learning from data containing spurious features, which provide predictive fidelity during training but are unreliable upon deployment. Here we observe that insofar as standard training methods tend to learn such features, this propensity can be leveraged to search for partitions of training data that expose this inconsistency, ultimately promoting learning algorithms invariant to spurious features. Finally, we turn our attention to reinforcement learning from data with insufficient coverage over all possible states and actions. To address the coverage issue, we discuss how causal priors can be used to model the single-step dynamics of the setting where data are collected. This enables a new type of data augmentation where observed trajectories are stitched together to produce new but plausible counterfactual trajectories.
- [ICCV] SURFSUP: Learning Fluid Simulation for Novel Surfaces. Arjun Mani, Ishaan Preetam Chandratreya, Elliot Creager, Carl Vondrick, and Richard Zemel. In International Conference on Computer Vision, Oct 2023.
Modeling the mechanics of fluid in complex scenes is vital to applications in design, graphics, and robotics. Learning-based methods provide fast and differentiable fluid simulators; however, most prior work is unable to accurately model how fluids interact with genuinely novel surfaces not seen during training. We introduce SURFSUP, a framework that represents objects implicitly using signed distance functions (SDFs), rather than an explicit representation of meshes or particles. This continuous representation of geometry enables more accurate simulation of fluid-object interactions over long time periods while simultaneously making computation more efficient. Moreover, SURFSUP trained on simple shape primitives generalizes considerably out-of-distribution, even to complex real-world scenes and objects. Finally, we show we can invert our model to design simple objects to manipulate fluid flow.
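As a concrete illustration of the implicit geometry at the core of the approach, the sketch below defines an analytic signed distance function and recovers surface normals from its gradient by finite differences; the learned simulator itself is not reproduced here.

```python
import numpy as np

def sdf_sphere(p, center, radius):
    """Signed distance from points p (N, 3) to a sphere: negative inside."""
    return np.linalg.norm(p - center, axis=-1) - radius

def sdf_normal(sdf, p, eps=1e-4):
    """Finite-difference surface normal: the (normalized) SDF gradient."""
    grad = np.stack([
        (sdf(p + eps * np.eye(3)[i]) - sdf(p - eps * np.eye(3)[i])) / (2 * eps)
        for i in range(3)
    ], axis=-1)
    return grad / np.linalg.norm(grad, axis=-1, keepdims=True)

sphere = lambda p: sdf_sphere(p, np.zeros(3), 1.0)
pts = np.array([[0.0, 0.0, 0.0], [2.0, 0.0, 0.0]])
print(sphere(pts))            # [-1.  1.]: inside vs. outside the surface
print(sdf_normal(sphere, pts[1:]))  # ~[1, 0, 0], pointing away from the center
```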
2022
- [NeurIPS] MoCoDA: Model-based Counterfactual Data Augmentation. Silviu Pitis, Elliot Creager, Ajay Mandlekar, and Animesh Garg. In Advances in Neural Information Processing Systems, Nov 2022.
The number of states in a dynamic process is exponential in the number of objects, making reinforcement learning (RL) difficult in complex, multi-object domains. For agents to scale to the real world, they will need to react to and reason about unseen combinations of objects. We argue that the ability to recognize and use local factorization in transition dynamics is a key element in unlocking the power of multi-object reasoning. To this end, we show that (1) known local structure in the environment transitions is sufficient for an exponential reduction in the sample complexity of training a dynamics model, and (2) a locally factored dynamics model provably generalizes out-of-distribution to unseen states and actions. Knowing the local structure also allows us to predict which unseen states and actions this dynamics model will generalize to. We propose to leverage these observations in a novel Model-based Counterfactual Data Augmentation (MoCoDA) framework. MoCoDA applies a learned locally factored dynamics model to an augmented distribution of states and actions to generate counterfactual transitions for RL. MoCoDA works with a broader set of local structures than prior work and allows for direct control over the augmented training distribution. We show that MoCoDA enables RL agents to learn policies that generalize to unseen states and actions. We use MoCoDA to train an offline RL agent to solve an out-of-distribution robotics manipulation task on which standard offline RL algorithms fail.
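A stylized version of the idea, with an assumed two-object toy domain and the true local mechanism standing in for a learned factored dynamics model: the augmented state-action distribution recombines per-object data, and the factored model labels the resulting unseen joint configurations.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy domain: two objects that never interact, so the dynamics factor per
# object. Observed data covers only a narrow joint region (s1 ~ s2).
s1 = rng.uniform(0, 1, 500)
s2 = s1 + rng.normal(0, 0.01, 500)        # states seen only near the diagonal
S = np.stack([s1, s2], axis=1)
A = rng.uniform(-0.1, 0.1, (500, 2))

def factored_model(S, A):
    """Stand-in for a learned locally factored model: each object's next
    state depends only on its own (state, action) component."""
    return S + A

# Augmented distribution: pair object 1's data with a permutation of object
# 2's, yielding joint (s, a) pairs far off the diagonal, i.e. never observed
# together. The factored model generates their counterfactual next states.
perm = rng.permutation(len(S))
S_aug = np.stack([S[:, 0], S[perm, 1]], axis=1)
A_aug = np.stack([A[:, 0], A[perm, 1]], axis=1)
S_next_aug = factored_model(S_aug, A_aug)  # counterfactual transitions for RL
```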
- [ICML Workshops] Towards Environment-Invariant Representation Learning for Robust Task Transfer. Benjamin Eyre, Richard Zemel, and Elliot Creager. In ICML Workshop on Spurious Correlation, Invariance, and Stability, Jul 2022.
To train a classification model that is robust to distribution shifts upon deployment, auxiliary labels indicating the various “environments” of data collection can be leveraged to mitigate reliance on environment-specific features. In this paper we attempt to determine where in the network the environment invariance property can be located for such a model, with the hopes of adapting a single pre-trained invariant model for use in multiple tasks. We discuss how to evaluate whether a model has formed an environment-invariant internal representation—as opposed to an invariant final classifier function—and propose an objective that encourages learning such a representation. We also extend color-biased digit recognition to a transfer setting where the target task requires an invariant model, but lacks the environment labels needed to train an invariant model from scratch, thus motivating the transfer of an invariant representation trained on a source task with environment labels.
2021
- [ICML] Environment Inference for Invariant Learning. Elliot Creager, Joern-Henrik Jacobsen, and Richard Zemel. In Proceedings of the 38th International Conference on Machine Learning, Jul 2021.
Learning models that gracefully handle distribution shifts is central to research on domain generalization, robust optimization, and fairness. A promising formulation is domain-invariant learning, which identifies the key issue of learning which features are domain-specific versus domain-invariant. An important assumption in this area is that the training examples are partitioned into “domains” or “environments”. Our focus is on the more common setting where such partitions are not provided. We propose EIIL, a general framework for domain-invariant learning that incorporates Environment Inference to directly infer partitions that are maximally informative for downstream Invariant Learning. We show that EIIL outperforms invariant learning methods on the CMNIST benchmark without using environment labels, and significantly outperforms ERM on worst-group performance in the Waterbirds dataset. Finally, we establish connections between EIIL and algorithmic fairness, which enables EIIL to improve accuracy and calibration in a fair prediction problem.
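A compact PyTorch sketch in the spirit of EIIL's inference step (simplified, with a binary-classification setup and variable names of our choosing): given the logits of a fixed reference classifier, optimize soft per-example environment assignments to maximize an IRM-style invariance penalty.

```python
import torch
import torch.nn.functional as F

def weighted_irm_penalty(logits, y, w_env):
    """IRM penalty of a fixed classifier on a soft-weighted 'environment'."""
    scale = torch.tensor(1.0, requires_grad=True)
    losses = F.binary_cross_entropy_with_logits(logits * scale, y, reduction="none")
    loss = (w_env * losses).sum() / w_env.sum()
    grad = torch.autograd.grad(loss, scale, create_graph=True)[0]
    return grad ** 2

def infer_environments(logits, y, steps=500, lr=0.05):
    """Find a soft partition that maximizes the invariance penalty of a
    fixed reference classifier (a sketch of the environment-inference step)."""
    q = torch.randn(len(y), requires_grad=True)  # logit of P(env = 1 | example)
    opt = torch.optim.Adam([q], lr=lr)
    for _ in range(steps):
        p = torch.sigmoid(q)
        penalty = weighted_irm_penalty(logits, y, p) \
                + weighted_irm_penalty(logits, y, 1 - p)
        opt.zero_grad()
        (-penalty).backward()  # ascend the penalty
        opt.step()
    return torch.sigmoid(q).detach()  # soft environment assignments

# Usage with toy stand-ins for a reference model's (detached) outputs:
logits = torch.randn(256)
y = torch.randint(0, 2, (256,)).float()
envs = infer_environments(logits, y)
```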
- [ICML (Oral)] On Disentangled Representations Learned from Correlated Data. Frederik Träuble, Elliot Creager, Niki Kilbertus, Francesco Locatello, Andrea Dittadi, Anirudh Goyal, Bernhard Schölkopf, and Stefan Bauer. In Proceedings of the 38th International Conference on Machine Learning, Jul 2021.
The focus of disentanglement approaches has been on identifying independent factors of variation in data. However, the causal variables underlying real-world observations are often not statistically independent. In this work, we bridge the gap to real-world scenarios by analyzing the behavior of the most prominent disentanglement approaches on correlated data in a large-scale empirical study (including 4260 models). We show and quantify that systematically induced correlations in the dataset are being learned and reflected in the latent representations, which has implications for downstream applications of disentanglement such as fairness. We also demonstrate how to resolve these latent correlations, either using weak supervision during training or by post-hoc correcting a pre-trained model with a small number of labels.
- [ICML Workshops] Measuring User Recourse in a Dynamic Recommender System. Dilys Dickson and Elliot Creager. In ICML Workshop on Algorithmic Recourse, Jul 2021.
From online searches to suggested videos in social media, recommendation systems are heavily relied upon to mediate access to digital information. Concerns have been raised about these systems over their potential for feedback loops that can create unintended consequences such as echo chambers, filter bubbles, and polarization in the digital space. In this paper, we measure the effect of prolonged exposure to recommendation on the availability of diverse suggested content to the user. We use the definition of reachability (or user recourse) from Dean et al. (2020b): the proportion of unseen items that could be recommended to the user in the future, which can be approximated using knowledge of the embedding space geometry for linear recommenders. Whereas previous work assumed a static recommender, we study the case where the recommender can change over time, either by training for longer given a fixed dataset, or dynamically updating its training online through interactions with users. We find that dynamic changes to the recommender system do indeed affect the recourse available to users.
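The sketch below gives a crude Monte Carlo proxy for reachability in a linear recommender, of our own construction (not the geometric computation of Dean et al.): an item counts as reachable if some probed user embedding makes it the top-scoring item.

```python
import numpy as np

rng = np.random.default_rng(0)
n_items, d = 200, 8
V = rng.normal(size=(n_items, d))  # item embeddings of a linear recommender

# An item is "reachable" if some feasible user embedding makes it the
# argmax-scoring item. As a rough proxy, probe random user directions and
# record which items ever win the argmax.
probes = rng.normal(size=(20000, d))
winners = np.unique(np.argmax(probes @ V.T, axis=1))
print(f"reachable items (proxy): {len(winners)}/{n_items}")
```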
- [ICML Workshops] Online Algorithmic Recourse by Collective Action. Elliot Creager and Richard Zemel. In ICML Workshop on Algorithmic Recourse, Jul 2021.
Research on algorithmic recourse typically considers how an individual can reasonably change an unfavorable automated decision when interacting with a fixed decision-making system. This paper focuses instead on the online setting, where system parameters are updated dynamically according to interactions with data subjects. Beyond the typical individual-level recourse, the online setting opens up new ways for groups to shape system decisions by leveraging the parameter update rule. We show empirically that recourse can be improved when users coordinate by jointly computing their feature perturbations, underscoring the importance of collective action in mitigating adverse automated decisions.
2020
- [NeurIPS] Counterfactual Data Augmentation using Locally Factored Dynamics. Silviu Pitis, Elliot Creager, and Animesh Garg. In Advances in Neural Information Processing Systems, Dec 2020.
Many dynamic processes, including common scenarios in robotic control and reinforcement learning (RL), involve a set of interacting subprocesses. Though the subprocesses are not independent, their interactions are often sparse, and the dynamics at any given time step can often be decomposed into locally independent causal mechanisms. Such local causal structures can be leveraged to improve the sample efficiency of sequence prediction and off-policy reinforcement learning. We formalize this by introducing local causal models (LCMs), which are induced from a global causal model by conditioning on a subset of the state space. We propose an approach to inferring these structures given an object-oriented state representation, as well as a novel algorithm for Counterfactual Data Augmentation (CoDA). CoDA uses local structures and an experience replay to generate counterfactual experiences that are causally valid in the global model. We find that CoDA significantly improves the performance of RL agents in locally factored tasks, including the batch-constrained and goal-conditioned settings. Code available at https://github.com/spitis/mrl.
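The core counterfactual swap is simple to state in code. A minimal sketch, assuming a known local mask over two non-interacting components (a learned LCM would determine when such swaps are valid):

```python
import numpy as np

def coda_swap(t1, t2, mask):
    """CoDA-style swap sketch: given two transitions whose state (and next
    state) split into locally independent components, exchange the masked
    component to form two new, causally valid transitions."""
    (s1, s1n), (s2, s2n) = t1, t2
    new1 = (np.where(mask, s2, s1), np.where(mask, s2n, s1n))
    new2 = (np.where(mask, s1, s2), np.where(mask, s1n, s2n))
    return new1, new2

# Two objects moving independently: state = [x_obj1, x_obj2].
t1 = (np.array([0.0, 5.0]), np.array([0.1, 5.2]))
t2 = (np.array([9.0, 1.0]), np.array([9.3, 0.9]))
mask = np.array([False, True])  # swap only object 2's component
print(coda_swap(t1, t2, mask))  # two never-observed but valid transitions
```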
- [NeurIPS Workshops] Fairness and Robustness in Invariant Learning: A Case Study in Toxicity Classification. Robert Adragna, Elliot Creager, David Madras, and Richard Zemel. In NeurIPS 2020 Workshop on Fairness Through the Lens of Causality, Dec 2020.
Robustness is of central importance in machine learning and has given rise to the fields of domain generalization and invariant learning, which are concerned with improving performance on a test distribution distinct from but related to the training distribution. In light of recent work suggesting an intimate connection between fairness and robustness, we investigate whether algorithms from robust ML can be used to improve the fairness of classifiers that are trained on biased data and tested on unbiased data. We apply Invariant Risk Minimization (IRM), a domain generalization algorithm that employs a causal discovery inspired method to find robust predictors, to the task of fairly predicting the toxicity of internet comments. We show that IRM achieves better out-of-distribution accuracy and fairness than Empirical Risk Minimization (ERM) methods, and analyze both the difficulties that arise when applying IRM in practice and the conditions under which IRM will likely be effective in this scenario. We hope that this work will inspire further studies of how robust machine learning methods relate to algorithmic fairness.
- [ICML] Causal Modeling for Fairness in Dynamical Systems. Elliot Creager, David Madras, Toniann Pitassi, and Richard Zemel. In Proceedings of the 37th International Conference on Machine Learning, Jul 2020.
In many application areas (lending, education, and online recommenders, for example), fairness and equity concerns emerge when a machine learning system interacts with a dynamically changing environment to produce both immediate and long-term effects for individuals and demographic groups. We discuss causal directed acyclic graphs (DAGs) as a unifying framework for the recent literature on fairness in such dynamical systems. We show that this formulation affords several new directions of inquiry to the modeler, where sound causal assumptions can be expressed and manipulated. We emphasize the importance of computing interventional quantities in the dynamical fairness setting, and show how causal assumptions enable simulation (when environment dynamics are known) and estimation by adjustment (when dynamics are unknown) of intervention on short- and long-term outcomes, at both the group and individual levels.
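A toy numerical illustration of estimation by adjustment, with a hypothetical one-step mechanism of our own (not a model from the paper): the sensitive attribute confounds both the decision and the outcome, so the observational contrast overstates the causal effect, and backdoor adjustment recovers it.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000

# Toy DAG: group A confounds both the decision T (e.g., loan approval)
# and the outcome Y. The true causal effect of T on Y is 0.3 by construction.
A = rng.binomial(1, 0.5, n)
T = rng.binomial(1, 0.3 + 0.4 * A)
Y = rng.binomial(1, 0.2 + 0.3 * T + 0.2 * A)

# Naive observational contrast vs. E[Y | do(T=1)] - E[Y | do(T=0)].
obs = Y[T == 1].mean() - Y[T == 0].mean()
# Backdoor adjustment over A recovers the interventional quantity.
adj = sum((Y[(T == 1) & (A == a)].mean() - Y[(T == 0) & (A == a)].mean())
          * (A == a).mean() for a in (0, 1))
print(f"observational: {obs:.3f}  adjusted (causal): {adj:.3f}")
```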
- [ICML] Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach. Martin Mladenov, Elliot Creager, Omer Ben-Porat, Kevin Swersky, Richard Zemel, and Craig Boutilier. In Proceedings of the 37th International Conference on Machine Learning, Jul 2020.
Most recommender systems (RS) research assumes that a user’s utility can be maximized independently of the utility of the other agents (e.g., other users, content providers). In realistic settings, this is often not true: the dynamics of an RS ecosystem couple the long-term utility of all agents. In this work, we explore settings in which content providers cannot remain viable unless they receive a certain level of user engagement. We formulate this problem as one of equilibrium selection in the induced dynamical system, and show that it can be solved as an optimal constrained matching problem. Our model ensures the system reaches an equilibrium with maximal social welfare supported by a sufficiently diverse set of viable providers. We demonstrate that even in a simple, stylized dynamical RS model, the standard myopic approach to recommendation (always matching a user to the best provider) performs poorly. We develop several scalable techniques to solve the matching problem, and also draw connections to various notions of user regret and fairness, arguing that these outcomes are fairer in a utilitarian sense.
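The sketch below solves a tiny fractional matching LP of this flavor (our simplification, not the paper's full formulation or its scalable solvers): maximize total affinity subject to every provider receiving a minimum viable load, then contrast with the myopic best-provider policy.

```python
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(0)
n_users, n_prov = 12, 3
Aff = rng.uniform(size=(n_users, n_prov))  # user-provider affinities

# Variables x[i, j] in [0, 1], raveled user-major. Maximize total affinity
# s.t. each user is fully assigned and each provider gets >= min_load mass.
min_load = 3
c = -Aff.ravel()  # linprog minimizes, so negate
A_eq = np.zeros((n_users, n_users * n_prov))
for i in range(n_users):
    A_eq[i, i * n_prov:(i + 1) * n_prov] = 1   # each user assigned once
b_eq = np.ones(n_users)
A_ub = np.zeros((n_prov, n_users * n_prov))
for j in range(n_prov):
    A_ub[j, j::n_prov] = -1                    # -sum_i x[i, j] <= -min_load
b_ub = -min_load * np.ones(n_prov)

res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=(0, 1))
x = res.x.reshape(n_users, n_prov)
print("welfare:", -res.fun)
print("provider loads:", x.sum(axis=0))  # all >= min_load: providers stay viable

# Myopic policy: each user goes to their best provider, which can starve
# some providers below viability.
print("myopic loads:", np.bincount(Aff.argmax(axis=1), minlength=n_prov))
```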
2019
- [ICML] Flexibly Fair Representation Learning by Disentanglement. Elliot Creager, David Madras, Joern-Henrik Jacobsen, Marissa Weis, Kevin Swersky, Toniann Pitassi, and Richard Zemel. In Proceedings of the 36th International Conference on Machine Learning, Jun 2019.
We consider the problem of learning representations that achieve group and subgroup fairness with respect to multiple sensitive attributes. Taking inspiration from the disentangled representation learning literature, we propose an algorithm for learning compact representations of datasets that are useful for reconstruction and prediction, but are also flexibly fair, meaning they can be easily modified at test time to achieve subgroup demographic parity with respect to multiple sensitive attributes and their conjunctions. We show empirically that the resulting encoder—which does not require the sensitive attributes for inference—allows for the adaptation of a single representation to a variety of fair classification tasks with new target labels and subgroup definitions.
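The "flexibly fair" test-time modification can be sketched in a few lines, assuming a trained encoder whose latent dimensions align with the sensitive attributes (the encoder and dimension indices here are hypothetical):

```python
import numpy as np

def make_fair(z, sensitive_dims, prior_sampler=np.random.standard_normal):
    """Test-time scrub of a disentangled code: replace the latent dimensions
    aligned with chosen sensitive attributes with draws from the prior,
    leaving the rest of the representation untouched."""
    z_fair = z.copy()
    z_fair[:, sensitive_dims] = prior_sampler((len(z), len(sensitive_dims)))
    return z_fair

z = np.random.standard_normal((4, 10))        # codes from a trained encoder
z_fair = make_fair(z, sensitive_dims=[0, 3])  # scrub, e.g., two attributes
```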
- [ICLR] Explaining Image Classifiers by Counterfactual Generation. Chun-Hao Chang, Elliot Creager, Anna Goldenberg, and David Duvenaud. In International Conference on Learning Representations, May 2019.
When an image classifier makes a prediction, which parts of the image are relevant and why? We can rephrase this question to ask: which parts of the image, if they were not seen by the classifier, would most change its decision? Producing an answer requires marginalizing over images that could have been seen but weren’t. We can sample plausible image in-fills by conditioning a generative model on the rest of the image. We then optimize to find the image regions that most change the classifier’s decision after in-fill. Our approach contrasts with ad-hoc in-filling approaches, such as blurring or injecting noise, which generate inputs far from the data distribution, and ignore informative relationships between different parts of the image. Our method produces more compact and relevant saliency maps, with fewer artifacts compared to previous methods.
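A minimal PyTorch sketch of the optimization loop (with purely illustrative stand-ins for the classifier and the generative in-filler; the paper's actual models are not reproduced): find a small mask whose region, once replaced by in-filled content, most lowers the classifier's score for the target class.

```python
import torch

def saliency_by_infill(x, y, classifier, infill, steps=100, lam=0.05):
    """Optimize a relaxed mask over pixels; `classifier` and `infill` are
    assumed callables standing in for a trained network and a conditional
    generative in-filler."""
    m_logit = torch.zeros_like(x, requires_grad=True)
    opt = torch.optim.Adam([m_logit], lr=0.05)
    for _ in range(steps):
        m = torch.sigmoid(m_logit)          # mask in [0, 1]
        x_cf = (1 - m) * x + m * infill(x)  # composite with in-filled pixels
        score = classifier(x_cf)[:, y].mean()
        loss = score + lam * m.mean()       # drop the class score, keep mask small
        opt.zero_grad(); loss.backward(); opt.step()
    return torch.sigmoid(m_logit).detach()

# Toy stand-ins: a fixed random linear "classifier" and a channel-mean "in-fill".
x = torch.rand(1, 3, 32, 32)
W = torch.randn(3 * 32 * 32, 10) * 1e-2
classifier = lambda im: im.flatten(1) @ W
infill = lambda im: im.mean(dim=(2, 3), keepdim=True).expand_as(im)
mask = saliency_by_infill(x, 0, classifier, infill)
```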
- [ACM FAccT] Fairness Through Causal Awareness: Learning Latent-Variable Models for Biased Data. David Madras, Elliot Creager, Toniann Pitassi, and Richard Zemel. In ACM Conference on Fairness, Accountability, and Transparency, Jan 2019.
How do we learn from biased data? Historical datasets often reflect historical prejudices; sensitive or protected attributes may affect the observed treatments and outcomes. Classification algorithms tasked with predicting outcomes accurately from these datasets tend to replicate these biases. We advocate a causal modeling approach to learning from biased data, exploring the relationship between fair classification and intervention. We propose a causal model in which the sensitive attribute confounds both the treatment and the outcome. Building on prior work in deep learning and generative modeling, we describe how to learn the parameters of this causal model from observational data alone, even in the presence of unobserved confounders. We show experimentally that fairness-aware causal modeling provides better estimates of the causal effects between the sensitive attribute, the treatment, and the outcome. We further present evidence that estimating these causal effects can help learn policies that are both more accurate and fair, when presented with a historically biased dataset.
2018
- [ICML] Learning Adversarially Fair and Transferable Representations. David Madras*, Elliot Creager*, Toniann Pitassi, and Richard Zemel. In Proceedings of the 35th International Conference on Machine Learning, Jul 2018.
In this paper, we advocate for representation learning as the key to mitigating unfair prediction outcomes downstream. Motivated by a scenario where learned representations are used by third parties with unknown objectives, we propose and explore adversarial representation learning as a natural method of ensuring those parties act fairly. We connect group fairness (demographic parity, equalized odds, and equal opportunity) to different adversarial objectives. Through worst-case theoretical guarantees and experimental validation, we show that the choice of this objective is crucial to fair prediction. Furthermore, we present the first in-depth experimental demonstration of fair transfer learning and demonstrate empirically that our learned representations admit fair predictions on new tasks while maintaining utility, an essential goal of fair representation learning.
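A short PyTorch sketch of the adversarial representation-learning setup, using a DANN-style gradient-reversal simplification rather than the paper's exact alternating min-max objective or its group-fairness-specific adversarial losses:

```python
import torch
import torch.nn as nn

class GradReverse(torch.autograd.Function):
    """Identity on the forward pass; flips the gradient on the backward pass."""
    @staticmethod
    def forward(ctx, x):
        return x.view_as(x)
    @staticmethod
    def backward(ctx, g):
        return -g

enc = nn.Sequential(nn.Linear(20, 32), nn.ReLU(), nn.Linear(32, 8))
clf = nn.Linear(8, 1)  # predicts the task label from the representation z
adv = nn.Linear(8, 1)  # tries to recover the sensitive attribute from z

params = [*enc.parameters(), *clf.parameters(), *adv.parameters()]
opt = torch.optim.Adam(params, lr=1e-3)
bce = nn.BCEWithLogitsLoss()

x = torch.randn(64, 20)                    # toy batch of features
y = torch.randint(0, 2, (64, 1)).float()   # task labels
s = torch.randint(0, 2, (64, 1)).float()   # sensitive attribute

for _ in range(200):
    z = enc(x)
    # The adversary minimizes its loss on z; the reversal layer makes the
    # encoder *maximize* it, pushing s-information out of the representation.
    loss = bce(clf(z), y) + bce(adv(GradReverse.apply(z)), s)
    opt.zero_grad(); loss.backward(); opt.step()
```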
- [ICLR Workshops] Gradient-Based Optimization of Neural Network Architecture. Will Grathwohl*, Elliot Creager*, Seyed Kamyar Seyed Ghasemipour*, and Richard Zemel. In ICLR (Workshop Track), Apr 2018.
Neural networks can learn relevant features from data, but their predictive accuracy and propensity to overfit are sensitive to the values of the discrete hyperparameters that specify the network architecture (number of hidden layers, number of units per layer, etc.). Previous work optimized these hyperparameters via grid search, random search, and black box optimization techniques such as Bayesian optimization. Bolstered by recent advances in gradient-based optimization of discrete stochastic objectives, we instead propose to directly model a distribution over possible architectures and use variational optimization to jointly optimize the network architecture and weights in one training pass. We discuss an implementation of this approach that estimates gradients via the Concrete relaxation, and show that it finds compact and accurate architectures for convolutional neural networks applied to the CIFAR10 and CIFAR100 datasets.
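The flavor of the technique, in a hypothetical toy setup (a relaxed categorical choice over layer widths, trained jointly with the weights; not the paper's exact model or its convolutional experiments):

```python
import torch
import torch.nn.functional as F

widths = [16, 32, 64]
arch_logits = torch.zeros(len(widths), requires_grad=True)  # architecture dist.
W = torch.nn.Linear(10, max(widths))
head = torch.nn.Linear(max(widths), 1)
opt = torch.optim.Adam([arch_logits, *W.parameters(), *head.parameters()], lr=1e-2)

x, y = torch.randn(128, 10), torch.randn(128, 1)
for _ in range(300):
    # Differentiable sample of a one-hot width choice (Concrete/Gumbel-softmax).
    probs = F.gumbel_softmax(arch_logits, tau=0.5)
    # Soft mask: units beyond the sampled width are (softly) zeroed out.
    mask = sum(p * (torch.arange(max(widths)) < w).float()
               for p, w in zip(probs, widths))
    h = torch.relu(W(x)) * mask
    loss = F.mse_loss(head(h), y)
    opt.zero_grad(); loss.backward(); opt.step()

print("width distribution:", torch.softmax(arch_logits, dim=0).detach())
```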
2016
- [ISMIR] Nonnegative Tensor Factorization with Frequency Modulation Cues for Blind Audio Source Separation. Elliot Creager, Noah D. Stein, Roland Badeau, and Philippe Depalle. In 17th International Society for Music Information Retrieval Conference, Aug 2016.
We present Vibrato Nonnegative Tensor Factorization, an algorithm for single-channel unsupervised audio source separation with an application to separating instrumental or vocal sources with nonstationary pitch from music recordings. Our approach extends Nonnegative Matrix Factorization for audio modeling by including local estimates of frequency modulation as cues in the separation. This permits the modeling and unsupervised separation of vibrato or glissando musical sources, which is not possible with the basic matrix factorization formulation. The algorithm factorizes a sparse nonnegative tensor comprising the audio spectrogram and local frequency-slope-to-frequency ratios, which are estimated at each time-frequency bin using the Distributed Derivative Method. The use of local frequency modulations as separation cues is motivated by the principle of common fate partial grouping from Auditory Scene Analysis, which hypothesizes that each latent source in a mixture is characterized perceptually by coherent frequency and amplitude modulations shared by its component partials. We derive multiplicative factor updates by Minorization-Maximization, which guarantees convergence to a local optimum by iteration. We then compare our method to the baseline on two separation tasks: one considers synthetic vibrato notes, while the other considers vibrato string instrument recordings.
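For reference, here is the baseline multiplicative-update NMF (Lee and Seung) that the vibrato tensor factorization extends, sketched in the plain matrix (spectrogram-only) case with a random stand-in spectrogram; the tensor model and frequency-slope cues are not reproduced here.

```python
import numpy as np

def nmf(V, k, iters=200, eps=1e-9):
    """Multiplicative-update NMF: V (freq x time magnitudes) ~= W @ H.
    The updates never leave the nonnegative orthant, and each one is a
    majorization-minimization step, so the objective decreases monotonically."""
    rng = np.random.default_rng(0)
    n_freq, n_time = V.shape
    W = rng.uniform(size=(n_freq, k))   # spectral templates
    H = rng.uniform(size=(k, n_time))   # temporal activations
    for _ in range(iters):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

V = np.abs(np.random.default_rng(1).normal(size=(257, 100)))  # stand-in spectrogram
W, H = nmf(V, k=8)
print(np.linalg.norm(V - W @ H) / np.linalg.norm(V))  # relative reconstruction error
```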
2015
- [Thesis] Musical Source Separation by Coherent Frequency Modulation Cues. Elliot Creager. Aug 2015.
This thesis explores the extraction of vibrato sounds from monaural excerpts of polyphonic music using the coherent frequency modulation (CFM) of component partials as a grouping cue. Nonnegative Matrix Factorization (NMF) (Lee and Seung 1999) is currently a popular tool for musical source separation (Wang and Plumbley 2005), since it can provide a low-rank approximate factorization of the magnitude spectrogram of the analyzed sound, where the factors can be interpreted as the spectral templates and temporal activations of the notes contributing to the recording. However, NMF implicitly models each source as having a fixed spectral template and is thus ill-suited to the analysis of vibrato sounds, which are characterized by slowly varying frequency and amplitude modulations.