
John P. Cunningham
· ProfessorVerifiedColumbia University · Statistics
Active 1936–2025
Research topics
- Computer Science
- Artificial Intelligence
- Medicine
- Neuroscience
- Psychology
- Political Science
- Engineering
- Biology
- Algorithm
- Medical education
- Physics
- Anatomy
- Mathematics
- Pathology
Selected publications
Reverse Diffusion Sequential Monte Carlo Samplers
ArXiv.org · 2025-08-08
preprintOpen accessSenior authorWe propose a novel sequential Monte Carlo (SMC) method for sampling from unnormalized target distributions based on a reverse denoising diffusion process. While recent diffusion-based samplers simulate the reverse diffusion using approximate score functions, they can suffer from accumulating errors due to time discretization and imperfect score estimation. In this work, we introduce a principled SMC framework that formalizes diffusion-based samplers as proposals while systematically correcting for their biases. The core idea is to construct informative intermediate target distributions that progressively steer the sampling trajectory toward the final target distribution. Although ideal intermediate targets are intractable, we develop exact approximations using quantities from the score estimation-based proposal, without requiring additional model training or inference overhead. The resulting sampler, termed Reverse Diffusion Sequential Monte Carlo, enables consistent sampling and unbiased estimation of the target's normalization constant under mild conditions. We demonstrate the effectiveness of our method on a range of synthetic targets and real-world Bayesian inference problems.
Differentially Private Hyperparameter Tuning using Local Bayesian Optimization
ArXiv.org · 2025-02-09
preprintOpen accessSenior authorHyperparameter tuning is a key component of machine learning procedures, but when validation data contain sensitive user information, search mechanisms can leak private information through the selected configuration. Existing differentially private hyperparameter tuning methods often rely on near-random search, while prior differentially private Bayesian optimization approaches are typically global and, therefore, scale poorly with the hyperparameter dimensionality. We study differentially private hyperparameter tuning using local Bayesian optimization, focusing on settings where the validation objective is available only through noisy black box evaluations and gradients are unavailable or impractical to compute. We introduce DP-GIBO, a differentially private local Bayesian optimization framework that privately approximates gradients using a Gaussian Process surrogate. Under suitable conditions, we prove that DP-GIBO converges to a locally optimal hyperparameter configuration up to a privacy-dependent error, with dimensional dependence that is polynomial rather than exponential.Empirically, we show that DP-GIBO provides scalable private hyperparameter tuning across multiple tasks, substantially outperforming non-private random search and global Bayesian optimization baselines in moderate-to-high-dimensional hyperparameter spaces.
Bayesian Invariance Modeling of Multi-Environment Data
ArXiv.org · 2025-06-27
preprintOpen accessInvariant prediction [Peters et al., 2016] analyzes feature/outcome data from multiple environments to identify invariant features - those with a stable predictive relationship to the outcome. Such features support generalization to new environments and help reveal causal mechanisms. Previous methods have primarily tackled this problem through hypothesis testing or regularized optimization. Here we develop Bayesian Invariant Prediction (BIP), a probabilistic model for invariant prediction. BIP encodes the indices of invariant features as a latent variable and recover them by posterior inference. Under the assumptions of Peters et al. [2016], the BIP posterior targets the true invariant features. We prove that the posterior is consistent and that greater environment heterogeneity leads to faster posterior contraction. To handle many features, we design an efficient variational approximation called VI-BIP. In simulations and real data, we find that BIP and VI-BIP are more accurate and scalable than existing methods for invariant prediction.
An emerging view of neural geometry in motor cortex supports high-performance decoding
eLife · 2025-02-03 · 1 citations
articleOpen accessDecoders for brain-computer interfaces (BCIs) assume constraints on neural activity, chosen to reflect scientific beliefs while yielding tractable computations. Recent scientific advances suggest that the true constraints on neural activity, especially its geometry, may be quite different from those assumed by most decoders. We designed a decoder, MINT, to embrace statistical constraints that are potentially more appropriate. If those constraints are accurate, MINT should outperform standard methods that explicitly make different assumptions. Additionally, MINT should be competitive with expressive machine learning methods that can implicitly learn constraints from data. MINT performed well across tasks, suggesting its assumptions are well-matched to the data. MINT outperformed other interpretable methods in every comparison we made. MINT outperformed expressive machine learning methods in 37 of 42 comparisons. MINT’s computations are simple, scale favorably with increasing neuron counts, and yield interpretable quantities such as data likelihoods. MINT’s performance and simplicity suggest it may be a strong candidate for many BCI applications.
Variational Deep Learning via Implicit Regularization
ArXiv.org · 2025-05-26
preprintOpen accessSenior authorModern deep learning models generalize remarkably well in-distribution, despite being overparametrized and trained with little to no explicit regularization. Instead, current theory credits implicit regularization imposed by the choice of architecture, hyperparameters, and optimization procedure. However, deep neural networks can be surprisingly non-robust, resulting in overconfident predictions and poor out-of-distribution generalization. Bayesian deep learning addresses this via model averaging, but typically requires significant computational resources as well as carefully elicited priors to avoid overriding the benefits of implicit regularization. Instead, in this work, we propose to regularize variational neural networks solely by relying on the implicit bias of (stochastic) gradient descent. We theoretically characterize this inductive bias in overparametrized linear models as generalized variational inference and demonstrate the importance of the choice of parametrization. Empirically, our approach demonstrates strong in- and out-of-distribution performance without additional hyperparameter tuning and with minimal computational overhead.
2025-02-03
peer-reviewOpen accessALICE-USA Barrel Tracking Upgrade (Project Final Report)
2025-07-14
reportThe ALICE-USA Barrel Tracking Upgrade (BTU) project began in 2015 to participate in a major upgrade of the Time Projection Chamber (TPC) and the Inner Tracking System (ITS) of the ALICE Experiment at CERN in Geneva, Switzerland. This upgrade increased the maximum readout rate of the ALICE detector to accommodate the increased heavy ion interaction rates following Long Shutdown 2 of the Large Hadron Collider, and greatly improved the vertexing performance of the detector. The original scope was completed and reviewed in November 2019. The BTU project document BTU.19.v1 "ALICE Barrel Tracking Upgrade Project Closeout / Transition to Operations (2019)" from November 2019 describes the status in detail and is attached as Appendix 1 below. At that point, all delivery milestones had been met on time, all Key Performance Parameters (KPPs) of the project had been satisfied, and the project was under budget. Several subsequent No Cost Extensions (NCEs) supported integration and pre-commissioning of the BTU project deliverables at CERN, as well as study and assessment of performance with Pb-Pb collisions at 50 kHz. The BTU project was successfully completed on 30 September 2024.
Estimating the Hallucination Rate of Generative AI
2024-01-01 · 3 citations
article2024-11-27
peer-reviewOpen accessDecoders for brain-computer interfaces (BCIs) assume constraints on neural activity, chosen to reflect scientific beliefs while yielding tractable computations. Recent scientific advances suggest that the true constraints on neural activity, especially its geometry, may be quite different from those assumed by most decoders. We designed a decoder, MINT, to embrace statistical constraints that are potentially more appropriate. If those constraints are accurate, MINT should outperform standard methods that explicitly make different assumptions. Additionally, MINT should be competitive with expressive machine learning methods that can implicitly learn constraints from data. MINT performed well across tasks, suggesting its assumptions are well-matched to the data. MINT outperformed other interpretable methods in every comparison we made. MINT outperformed expressive machine learning methods in 37 of 42 comparisons. MINT’s computations are simple, scale favorably with increasing neuron counts, and yield interpretable quantities such as data likelihoods. MINT’s performance and simplicity suggest it may be a strong candidate for many BCI applications.
Identifying Interpretable Latent Factors with Sparse Component Analysis
bioRxiv (Cold Spring Harbor Laboratory) · 2024-02-06 · 7 citations
preprintOpen accessAbstract In many neural populations, the computationally relevant signals are posited to be a set of ‘latent factors’ – signals shared across many individual neurons. Understanding the relationship between neural activity and behavior requires the identification of factors that reflect distinct computational roles. Methods for identifying such factors typically require supervision, which can be suboptimal if one is unsure how (or whether) factors can be grouped into distinct, meaningful sets. Here, we introduce Sparse Component Analysis (SCA), an unsupervised method that identifies interpretable latent factors. SCA seeks factors that are sparse in time and occupy orthogonal dimensions. With these simple constraints, SCA facilitates surprisingly clear parcellations of neural activity across a range of behaviors. We applied SCA to motor cortex activity from reaching and cycling monkeys, single-trial imaging data from C. elegans , and activity from a multitask artificial network. SCA consistently identified sets of factors that were useful in describing network computations.
Recent grants
NIH · $937k · 2019
Frequent coauthors
- 60 shared
Mark M. Churchland
Columbia University
- 51 shared
Liam Paninski
Columbia University
- 34 shared
Krishna V. Shenoy
Howard Hughes Medical Institute
- 30 shared
Shreya Saxena
Yale University
- 23 shared
Byron M. Yu
Center for the Neural Basis of Cognition
- 21 shared
Gabriel Loaiza-Ganem
- 20 shared
Geoff Pleiss
- 19 shared
Sean M. Perkins
Columbia University
- Resume-aware match score
- Save to shortlist
- AI-drafted outreach
See your match with John P. Cunningham
PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.
- Free to start
- No credit card
- 30-second signup