
Theodore Miller
· Tutor in Biochemical SciencesVerifiedHarvard University · Molecular and Cellular Biology
Active 1967–2024
Research topics
- Machine Learning
- Computer Science
- Artificial Intelligence
- Natural Language Processing
- Psychology
Selected publications
Does BERT need domain adaptation for clinical negation detection?
Journal of the American Medical Informatics Association · 2020 · 61 citations
Senior authorCorresponding- Computer Science
- Computer Science
- Artificial Intelligence
INTRODUCTION: Classifying whether concepts in an unstructured clinical text are negated is an important unsolved task. New domain adaptation and transfer learning methods can potentially address this issue. OBJECTIVE: We examine neural unsupervised domain adaptation methods, introducing a novel combination of domain adaptation with transformer-based transfer learning methods to improve negation detection. We also want to better understand the interaction between the widely used bidirectional encoder representations from transformers (BERT) system and domain adaptation methods. MATERIALS AND METHODS: We use 4 clinical text datasets that are annotated with negation status. We evaluate a neural unsupervised domain adaptation algorithm and BERT, a transformer-based model that is pretrained on massive general text datasets. We develop an extension to BERT that uses domain adversarial training, a neural domain adaptation method that adds an objective to the negation task, that the classifier should not be able to distinguish between instances from 2 different domains. RESULTS: The domain adaptation methods we describe show positive results, but, on average, the best performance is obtained by plain BERT (without the extension). We provide evidence that the gains from BERT are likely not additive with the gains from domain adaptation. DISCUSSION: Our results suggest that, at least for the task of clinical negation detection, BERT subsumes domain adaptation, implying that BERT is already learning very general representations of negation phenomena such that fine-tuning even on a specific corpus does not lead to much overfitting. CONCLUSION: Despite being trained on nonclinical text, the large training sets of models like BERT lead to large gains in performance for the clinical negation detection task.
Recent grants
Bone Tissue Engineering Using Mineralized Collagen-GAG Scaffolds
NIH · 2012–2015
Automated domain adaptation for clinical natural language processing
NIH · $1.2M · 2018–2023
Learning Universal Patient Representations with Hierarchical Transformers
NIH · $2.9M · 2019–2027
Improving Specialty Care Delivery in the Safety Net with Natural Language Processing
NIH · $373k · 2018–2022
NIH · $2.5M · 2022–2027
Frequent coauthors
- 134 shared
Guergana Savova
Harvard University
- 113 shared
George H. Rudkin
University of California, Los Angeles
- 105 shared
Dmitriy Dligach
- 74 shared
Dean T. Yamaguchi
East Carolina University
- 57 shared
Danielle S. Bitterman
- 52 shared
Chen Lin
Shanghai Artificial Intelligence Laboratory
- 50 shared
R.O. Meyer
United States Nuclear Regulatory Commission
- 50 shared
Donald Penner
Education
- 2010
PhD, Computer Science
University of Minnesota System
- 2003
BS, Computer Science
Marquette University
Similar researchers at Harvard University
- Resume-aware match score
- Save to shortlist
- AI-drafted outreach
See your match with Theodore Miller
PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.
- Free to start
- No credit card
- 30-second signup