John P. Cunningham

· ProfessorVerified

Columbia University · Statistics

Active 1936–2025

h-index47

Citations13.8k

Papers21686 last 5y

Funding$937k

Faculty page Lab page

See your match with John P. Cunningham — sign in to PhdFit.Sign in

Research topics

Computer Science
Artificial Intelligence
Medicine
Neuroscience
Psychology
Political Science
Engineering
Biology
Algorithm
Medical education
Physics
Anatomy
Mathematics
Pathology

Selected publications

Reverse Diffusion Sequential Monte Carlo Samplers
ArXiv.org · 2025-08-08
preprintOpen accessSenior author
We propose a novel sequential Monte Carlo (SMC) method for sampling from unnormalized target distributions based on a reverse denoising diffusion process. While recent diffusion-based samplers simulate the reverse diffusion using approximate score functions, they can suffer from accumulating errors due to time discretization and imperfect score estimation. In this work, we introduce a principled SMC framework that formalizes diffusion-based samplers as proposals while systematically correcting for their biases. The core idea is to construct informative intermediate target distributions that progressively steer the sampling trajectory toward the final target distribution. Although ideal intermediate targets are intractable, we develop exact approximations using quantities from the score estimation-based proposal, without requiring additional model training or inference overhead. The resulting sampler, termed Reverse Diffusion Sequential Monte Carlo, enables consistent sampling and unbiased estimation of the target's normalization constant under mild conditions. We demonstrate the effectiveness of our method on a range of synthetic targets and real-world Bayesian inference problems.
Publisher OA PDF DOI
Differentially Private Hyperparameter Tuning using Local Bayesian Optimization
ArXiv.org · 2025-02-09
preprintOpen accessSenior author
Hyperparameter tuning is a key component of machine learning procedures, but when validation data contain sensitive user information, search mechanisms can leak private information through the selected configuration. Existing differentially private hyperparameter tuning methods often rely on near-random search, while prior differentially private Bayesian optimization approaches are typically global and, therefore, scale poorly with the hyperparameter dimensionality. We study differentially private hyperparameter tuning using local Bayesian optimization, focusing on settings where the validation objective is available only through noisy black box evaluations and gradients are unavailable or impractical to compute. We introduce DP-GIBO, a differentially private local Bayesian optimization framework that privately approximates gradients using a Gaussian Process surrogate. Under suitable conditions, we prove that DP-GIBO converges to a locally optimal hyperparameter configuration up to a privacy-dependent error, with dimensional dependence that is polynomial rather than exponential.Empirically, we show that DP-GIBO provides scalable private hyperparameter tuning across multiple tasks, substantially outperforming non-private random search and global Bayesian optimization baselines in moderate-to-high-dimensional hyperparameter spaces.
Publisher OA PDF DOI
Bayesian Invariance Modeling of Multi-Environment Data
ArXiv.org · 2025-06-27
preprintOpen access
Invariant prediction [Peters et al., 2016] analyzes feature/outcome data from multiple environments to identify invariant features - those with a stable predictive relationship to the outcome. Such features support generalization to new environments and help reveal causal mechanisms. Previous methods have primarily tackled this problem through hypothesis testing or regularized optimization. Here we develop Bayesian Invariant Prediction (BIP), a probabilistic model for invariant prediction. BIP encodes the indices of invariant features as a latent variable and recover them by posterior inference. Under the assumptions of Peters et al. [2016], the BIP posterior targets the true invariant features. We prove that the posterior is consistent and that greater environment heterogeneity leads to faster posterior contraction. To handle many features, we design an efficient variational approximation called VI-BIP. In simulations and real data, we find that BIP and VI-BIP are more accurate and scalable than existing methods for invariant prediction.
Publisher OA PDF DOI
An emerging view of neural geometry in motor cortex supports high-performance decoding
eLife · 2025-02-03 · 1 citations
articleOpen access
Decoders for brain-computer interfaces (BCIs) assume constraints on neural activity, chosen to reflect scientific beliefs while yielding tractable computations. Recent scientific advances suggest that the true constraints on neural activity, especially its geometry, may be quite different from those assumed by most decoders. We designed a decoder, MINT, to embrace statistical constraints that are potentially more appropriate. If those constraints are accurate, MINT should outperform standard methods that explicitly make different assumptions. Additionally, MINT should be competitive with expressive machine learning methods that can implicitly learn constraints from data. MINT performed well across tasks, suggesting its assumptions are well-matched to the data. MINT outperformed other interpretable methods in every comparison we made. MINT outperformed expressive machine learning methods in 37 of 42 comparisons. MINT’s computations are simple, scale favorably with increasing neuron counts, and yield interpretable quantities such as data likelihoods. MINT’s performance and simplicity suggest it may be a strong candidate for many BCI applications.
Publisher DOI
Variational Deep Learning via Implicit Regularization
ArXiv.org · 2025-05-26
preprintOpen accessSenior author
Modern deep learning models generalize remarkably well in-distribution, despite being overparametrized and trained with little to no explicit regularization. Instead, current theory credits implicit regularization imposed by the choice of architecture, hyperparameters, and optimization procedure. However, deep neural networks can be surprisingly non-robust, resulting in overconfident predictions and poor out-of-distribution generalization. Bayesian deep learning addresses this via model averaging, but typically requires significant computational resources as well as carefully elicited priors to avoid overriding the benefits of implicit regularization. Instead, in this work, we propose to regularize variational neural networks solely by relying on the implicit bias of (stochastic) gradient descent. We theoretically characterize this inductive bias in overparametrized linear models as generalized variational inference and demonstrate the importance of the choice of parametrization. Empirically, our approach demonstrates strong in- and out-of-distribution performance without additional hyperparameter tuning and with minimal computational overhead.
Publisher OA PDF DOI
Author response: An emerging view of neural geometry in motor cortex supports high-performance decoding
2025-02-03
peer-reviewOpen access
Publisher DOI
ALICE-USA Barrel Tracking Upgrade (Project Final Report)
2025-07-14
report
The ALICE-USA Barrel Tracking Upgrade (BTU) project began in 2015 to participate in a major upgrade of the Time Projection Chamber (TPC) and the Inner Tracking System (ITS) of the ALICE Experiment at CERN in Geneva, Switzerland. This upgrade increased the maximum readout rate of the ALICE detector to accommodate the increased heavy ion interaction rates following Long Shutdown 2 of the Large Hadron Collider, and greatly improved the vertexing performance of the detector. The original scope was completed and reviewed in November 2019. The BTU project document BTU.19.v1 "ALICE Barrel Tracking Upgrade Project Closeout / Transition to Operations (2019)" from November 2019 describes the status in detail and is attached as Appendix 1 below. At that point, all delivery milestones had been met on time, all Key Performance Parameters (KPPs) of the project had been satisfied, and the project was under budget. Several subsequent No Cost Extensions (NCEs) supported integration and pre-commissioning of the BTU project deliverables at CERN, as well as study and assessment of performance with Pb-Pb collisions at 50 kHz. The BTU project was successfully completed on 30 September 2024.
Publisher DOI
Estimating the Hallucination Rate of Generative AI
2024-01-01 · 3 citations
article
Publisher DOI
Author response: An emerging view of neural geometry in motor cortex supports high-performance decoding
2024-11-27
peer-reviewOpen access
Decoders for brain-computer interfaces (BCIs) assume constraints on neural activity, chosen to reflect scientific beliefs while yielding tractable computations. Recent scientific advances suggest that the true constraints on neural activity, especially its geometry, may be quite different from those assumed by most decoders. We designed a decoder, MINT, to embrace statistical constraints that are potentially more appropriate. If those constraints are accurate, MINT should outperform standard methods that explicitly make different assumptions. Additionally, MINT should be competitive with expressive machine learning methods that can implicitly learn constraints from data. MINT performed well across tasks, suggesting its assumptions are well-matched to the data. MINT outperformed other interpretable methods in every comparison we made. MINT outperformed expressive machine learning methods in 37 of 42 comparisons. MINT’s computations are simple, scale favorably with increasing neuron counts, and yield interpretable quantities such as data likelihoods. MINT’s performance and simplicity suggest it may be a strong candidate for many BCI applications.
Publisher DOI
Identifying Interpretable Latent Factors with Sparse Component Analysis
bioRxiv (Cold Spring Harbor Laboratory) · 2024-02-06 · 7 citations
preprintOpen access
Abstract In many neural populations, the computationally relevant signals are posited to be a set of ‘latent factors’ – signals shared across many individual neurons. Understanding the relationship between neural activity and behavior requires the identification of factors that reflect distinct computational roles. Methods for identifying such factors typically require supervision, which can be suboptimal if one is unsure how (or whether) factors can be grouped into distinct, meaningful sets. Here, we introduce Sparse Component Analysis (SCA), an unsupervised method that identifies interpretable latent factors. SCA seeks factors that are sparse in time and occupy orthogonal dimensions. With these simple constraints, SCA facilitates surprisingly clear parcellations of neural activity across a range of behaviors. We applied SCA to motor cortex activity from reaching and cycling monkeys, single-trial imaging data from C. elegans , and activity from a multitask artificial network. SCA consistently identified sets of factors that were useful in describing network computations.
Publisher OA PDF DOI

Recent grants

NIH Grant R01NS100066
NIH · $937k · 2019

Frequent coauthors

Mark M. Churchland
Columbia University
60 shared
Liam Paninski
Columbia University
51 shared
Krishna V. Shenoy
Howard Hughes Medical Institute
34 shared
Shreya Saxena
Yale University
30 shared
Byron M. Yu
Center for the Neural Basis of Cognition
23 shared
Gabriel Loaiza-Ganem
21 shared
Geoff Pleiss
20 shared
Sean M. Perkins
Columbia University
19 shared

Resume-aware match score
Save to shortlist
AI-drafted outreach

See your match with John P. Cunningham

PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.

Join the waitlist How it works

Free to start
No credit card
30-second signup

Find professors who actually fit you