Contextualized Models

Models with parameters that adapt to patient, task, or environmental context to support personalized inference.

Rather than relying on a one-size-fits-all model, contextualized systems learn parameters that adapt to the local environment — patient characteristics, tasks, or domain shifts. These models improve both accuracy and generalization in heterogeneous data settings.

We’ve developed theory and methods to support this framework, including:

Contextualized Machine Learning (Lengerich et al., 2023)
Contextualiezd Policy Recovery (Deuschel et al., 2024)
Personalized treatment effects (Lengerich et al., 2022)
Personalized Gene Regulatory Networks (Ellington et al., 2025)

Our open-source toolkit,

, supports model development and application across domains.

References

2025

Learning to estimate sample-specific transcriptional networks for 7,000 tumors

Caleb N Ellington, Benjamin J Lengerich, Thomas BK Watkins, and 6 more authors

Proceedings of the National Academy of Sciences (PNAS), 2025

Abs Bib HTML

Cancers are shaped by somatic mutations, microenvironment, and patient background, each altering gene expression and regulation in complex ways, resulting in heterogeneous cellular states and dynamics. Inferring gene regulatory networks (GRNs) from expression data can help characterize this regulation-driven heterogeneity, but network inference requires many statistical samples, limiting GRNs to cluster-level analyses that ignore intracluster heterogeneity. We propose to move beyond coarse analyses of predefined subgroups by using contextualized learning, a multitask learning paradigm that uses multiview contexts including phenotypic, molecular, and environmental information to infer personalized models. With sample-specific contexts, contextualization enables sample-specific models and even generalizes at test time to predict network models for entirely unseen contexts. We unify three network model classes (Correlation, Markov, and Neighborhood Selection) and estimate context-specific GRNs for 7,997 tumors across 25 tumor types, using copy number and driver mutation profiles, tumor microenvironment, and patient demographics as model context. Our generative modeling approach allows us to predict GRNs for unseen tumor types based on a pan-cancer model of how somatic mutations affect gene regulation. Finally, contextualized networks enable GRN-based precision oncology by providing a structured view of expression dynamics at sample-specific resolution, explaining known biomarkers in terms of network-mediated effects and leading to subtypings that improve survival prognosis. We provide a SKLearn-style Python package https://contextualized.ml for learning and analyzing contextualized models, as well as interactive plotting tools for pan-cancer data exploration at https://github.com/cnellington/CancerContextualized
@article{ellington2025learning, title = {Learning to estimate sample-specific transcriptional networks for 7,000 tumors}, author = {Ellington, Caleb N and Lengerich, Benjamin J and Watkins, Thomas BK and Yang, Jiekun and Adduri, Abhinav K and Mahbub, Sazan and Xiao, Hanxi and Kellis, Manolis and Xing, Eric P}, journal = {Proceedings of the National Academy of Sciences (PNAS)}, volume = {122}, number = {21}, pages = {e2411930122}, year = {2025}, publisher = {National Academy of Sciences}, }

2024

Contextualized Policy Recovery: Modeling and Interpreting Medical Decisions with Adaptive Imitation Learning

Jannik Deuschel, Caleb Ellington, Yingtao Luo, and 3 more authors

International Conference on Machine Learning (ICML), 2024

Abs arXiv Bib HTML

Interpretable policy learning seeks to estimate intelligible decision policies from observed actions; however, existing models fall short by forcing a tradeoff between accuracy and interpretability. This tradeoff limits data-driven interpretations of human decision-making process. e.g. to audit medical decisions for biases and suboptimal practices, we require models of decision processes which provide concise descriptions of complex behaviors. Fundamentally, existing approaches are burdened by this tradeoff because they represent the underlying decision process as a universal policy, when in fact human decisions are dynamic and can change drastically with contextual information. Thus, we propose Contextualized Policy Recovery (CPR), which re-frames the problem of modeling complex decision processes as a multi-task learning problem in which complex decision policies are comprised of context-specific policies. CPR models each context-specific policy as a linear observation-to-action mapping, and generates new decision models on-demand as contexts are updated with new observations. CPR is compatible with fully offline and partially observable decision environments, and can be tailored to incorporate any recurrent black-box model or interpretable decision model. We assess CPR through studies on simulated and real data, achieving state-of-the-art performance on the canonical tasks of predicting antibiotic prescription in intensive care units (+22% AUROC vs. previous SOTA) and predicting MRI prescription for Alzheimer’s patients (+7.7% AUROC vs. previous SOTA). With this improvement in predictive performance, CPR closes the accuracy gap between interpretable and black-box methods for policy learning, allowing high-resolution exploration and analysis of context-specific decision models.
@article{deuschel2024contextualized, title = {Contextualized Policy Recovery: Modeling and Interpreting Medical Decisions with Adaptive Imitation Learning}, author = {Deuschel, Jannik and Ellington, Caleb and Luo, Yingtao and Lengerich, Ben and Friederich, Pascal and Xing, Eric}, informal_venue = {ICML}, year = {2024}, journal = {International Conference on Machine Learning (ICML)} }

2023

Contextualized Machine Learning

Ben Lengerich, Caleb N. Ellington, Andrea Rubbi, and 2 more authors

2023

Abs arXiv Bib

We examine Contextualized Machine Learning (ML), a paradigm for learning heterogeneous and context-dependent effects. Contextualized ML estimates heterogeneous functions by applying deep learning to the meta-relationship between contextual information and context-specific parametric models. This is a form of varying-coefficient modeling that unifies existing frameworks including cluster analysis and cohort modeling by introducing two reusable concepts: a context encoder which translates sample context into model parameters, and sample-specific model which operates on sample predictors. We review the process of developing contextualized models, nonparametric inference from contextualized models, and identifiability conditions of contextualized models. Finally, we present the open-source PyTorch package ContextualizedML.
@article{lengerich2023contextualized, title = {Contextualized Machine Learning}, author = {Lengerich, Ben and Ellington, Caleb N. and Rubbi, Andrea and Kellis, Manolis and Xing, Eric P.}, year = {2023}, archiveprefix = {arXiv}, }

2022

Automated interpretable discovery of heterogeneous treatment effectiveness: A COVID-19 case study

Benjamin J Lengerich, Mark E Nunnally, Yin Aphinyanaphongs, and 2 more authors

Journal of biomedical informatics, 2022

Abs HTML PDF

Testing multiple treatments for heterogeneous (varying) effectiveness with respect to many underlying risk factors requires many pairwise tests; we would like to instead automatically discover and visualize patient archetypes and predictors of treatment effectiveness using multitask machine learning. In this paper, we present a method to estimate these heterogeneous treatment effects with an interpretable hierarchical framework that uses additive models to visualize expected treatment benefits as a function of patient factors (identifying personalized treatment benefits) and concurrent treatments (identifying combinatorial treatment benefits). This method achieves state-of-the-art predictive power for COVID-19 in-hospital mortality and interpretable identification of heterogeneous treatment benefits. We first validate this method on the large public MIMIC-IV dataset of ICU patients to test recovery of heterogeneous treatment effects. Next we apply this method to a proprietary dataset of over 3000 patients hospitalized for COVID-19, and find evidence of heterogeneous treatment effectiveness predicted largely by indicators of inflammation and thrombosis risk: patients with few indicators of thrombosis risk benefit most from treatments against inflammation, while patients with few indicators of inflammation risk benefit most from treatments against thrombosis. This approach provides an automated methodology to discover heterogeneous and individualized effectiveness of treatments.