Resume-aware faculty matching

Find professors who actually fit you

Upload your resume. Four AI agents analyze your background, rank the faculty who fit, inspect their recent research, and help you draft outreach — grounded in their actual work, not templates.

Free to startNo credit cardCancel anytime
Top matches Balanced preset
Dr. Sarah Chen
Stanford · Interpretability · NLP
91
Dr. Marcus Holloway
MIT · Robotics · RL
84
Dr. Aisha Okonkwo
CMU · Fairness · HCI
82
Nova · Professor Researcher · re-ranking top 20…
Kavita Bala

Kavita Bala

· Provost Professor of Computer ScienceVerified

Cornell University · Computer Science

Active 1991–2025

h-index58
Citations11.8k
Papers27851 last 5y
Funding$2.0M
See your match with Kavita Bala — sign in to PhdFit.Sign in

About

Kavita Bala is the 17th Provost of Cornell University, a position she assumed on January 1, 2025. She is a distinguished computer scientist, entrepreneur, and professor with a strong record of leadership and scholarship. Prior to becoming provost, Bala served as the inaugural dean of the Cornell Ann S. Bowers College of Computing and Information Science and as chair of Cornell's Department of Computer Science. During her tenure as dean, she secured the naming gift for the college, led a significant expansion of faculty to support rapid growth, and initiated construction of a new 135,000-square-foot building designed for robotics labs, experiential learning spaces, and faculty offices. She also played a key role as lead dean of the Cornell AI Initiative, advancing academic programs including new minors in AI and AI in Society, and helped establish the Schmidt AI in Science postdoctoral program. Additionally, she co-led a university-wide task force that developed guidelines for the responsible use of generative AI in education and learning. Bala's foundational research spans computer vision, computer graphics, and artificial intelligence, with a focus on image understanding, including the recognition of materials, styles, and object attributes; modeling of complex materials; and the use of crowdsourced training data. Her groundbreaking work on style recognition using deep learning led to the co-founding of GrokStyle, a visual recognition AI company acquired by Facebook in 2019. Her research contributions have been recognized by election to the American Academy of Arts & Sciences and by induction as a Fellow of the Association for Computing Machinery (ACM) and the SIGGRAPH Academy. She has received numerous awards including the SIGGRAPH Computer Graphics Achievement Award and the IIT Bombay Distinguished Alumnus Award, as well as multiple teaching awards at Cornell. Bala holds a Bachelor of Technology in Computer Science and Engineering from the Indian Institute of Technology, Bombay, and both a Master of Science and a Ph.D. in Electrical Engineering and Computer Science from the Massachusetts Institute of Technology. Her leadership and scholarship have positioned her as a national leader in computing and AI education and research.

Research topics

  • Computer Science
  • Artificial Intelligence
  • Computer vision
  • Algorithm
  • Mathematics
  • Medicine
  • Computer graphics (images)

Selected publications

  • MONITRS: Multimodal Observations of Natural Incidents Through Remote Sensing

    ArXiv.org · 2025-07-22

    preprintOpen access

    Natural disasters cause devastating damage to communities and infrastructure every year. Effective disaster response is hampered by the difficulty of accessing affected areas during and after events. Remote sensing has allowed us to monitor natural disasters in a remote way. More recently there have been advances in computer vision and deep learning that help automate satellite imagery analysis, However, they remain limited by their narrow focus on specific disaster types, reliance on manual expert interpretation, and lack of datasets with sufficient temporal granularity or natural language annotations for tracking disaster progression. We present MONITRS, a novel multimodal dataset of more than 10,000 FEMA disaster events with temporal satellite imagery and natural language annotations from news articles, accompanied by geotagged locations, and question-answer pairs. We demonstrate that fine-tuning existing MLLMs on our dataset yields significant performance improvements for disaster monitoring tasks, establishing a new benchmark for machine learning-assisted disaster response systems. Code can be found at: https://github.com/ShreelekhaR/MONITRS

  • DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery

    2025-06-10

    article

    Visual data is used in numerous different scientific work-flows ranging from remote sensing to ecology. As the amount of observation data increases, the challenge is not just to make accurate predictions but also to understand the underlying mechanisms for those predictions. Good interpretation is important in scientific workflows, as it allows for better decision-making by providing insights into the data. This paper introduces an automatic way of obtaining such interpretable-by-design models, by learning programs that interleave neural networks. We propose DiSciPLE (Discovering Scientific Programs using LLMs and Evolution) an evolutionary algorithm that leverages common sense and prior knowledge of large language models (LLMs) to create Python programs explaining visual data. Additionally, we propose two improvements: a program critic and a program simplifier to improve our method further to synthesize good programs. On three different real-world problems, DiSciPLE learns state-of-the-art programs on novel tasks with no prior literature. For example, we can learn programs with 35% lower error than the closest non-interpretable baseline for population density estimation.

  • Analysis of ZVS Range of Single-Phase Shift Modulation for Bidirectional Dual Active Bridge DC-DC Converter in Electric Vehicle

    Lecture notes in electrical engineering · 2025-01-01

    book-chapter
  • DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery

    ArXiv.org · 2025-02-14

    preprintOpen access

    Visual data is used in numerous different scientific workflows ranging from remote sensing to ecology. As the amount of observation data increases, the challenge is not just to make accurate predictions but also to understand the underlying mechanisms for those predictions. Good interpretation is important in scientific workflows, as it allows for better decision-making by providing insights into the data. This paper introduces an automatic way of obtaining such interpretable-by-design models, by learning programs that interleave neural networks. We propose DiSciPLE (Discovering Scientific Programs using LLMs and Evolution) an evolutionary algorithm that leverages common sense and prior knowledge of large language models (LLMs) to create Python programs explaining visual data. Additionally, we propose two improvements: a program critic and a program simplifier to improve our method further to synthesize good programs. On three different real-world problems, DiSciPLE learns state-of-the-art programs on novel tasks with no prior literature. For example, we can learn programs with 35% lower error than the closest non-interpretable baseline for population density estimation.

  • Effects of Digital Communication on Segmented and Super-Segmented Features of Pronunciation Patters among Undergraduate English Language Students in Nigeria

    Federal University Gusau Faculty of Education Journal · 2025-12-10

    articleOpen access1st authorCorresponding

    The widespread use of digital communication platforms, such as instant messaging, social media, and video conferencing, has transformed how university students interact and use language. This study investigates the effects of digital communication on pronunciation patterns, with a focus on both segmental features (vowel quality and consonant articulation) and supra segmental features (stress placement, rhythm, and intonation) among undergraduate students in Nigeria. A total of sixty participants, aged 18–25, and provided speech samples through structured reading tasks, spontaneous speech, and digital voice notes. Acoustic analyses measured vowel formants, consonant precision, stress placement, and pitch variation, while perceptual analyses involved listener ratings of intelligibility and naturalness. Results indicate that frequent engagement in digital communication is associated with reduced articulation precision, weakened stress contrasts, and a narrower pitch range, particularly in spontaneous and digital speech samples. These changes negatively impact intelligibility, suggesting that informal, rapid, and text-mediated communication encourages adaptations that deviate from standard pronunciation norms. Nevertheless, exposure to varied speech contexts can mitigate some comprehension challenges. The findings have important implications for language teaching, pronunciation training, and digital literacy programmes, highlighting the need to balance informal digital speech habits with strategies that preserve clear and intelligible communication. By understanding how digital communication influences pronunciation, educators and speech professionals can better support students in maintaining effective oral communication across both online and face-to-face contexts.

  • Smart Model for Recognizing Hydroponic Lettuce Shortages Utilizing Higher Level Efficient Algorithms and Consensus Multi-Dilated Adaptive Networks

    2024-10-08

    articleSenior author

    The evolution that never stops, especially in smart agriculture, especially hydroponic farming has bought a great change for efficiencies and productivities. Nevertheless, identification and categorization of nutrient deficiencies in hydroponic lettuce are still challenging, specifically, when training deep learning models for low power ‘Edge’ IoT devices such as smart mobile phones and Raspberry Pi. Current methods are challenging due to high computational complexity, high power consumption, and inadequate utilization of resources, which hinders real-life implementation in embedded systems. To overcome these challenges, this paper presents an intelligent, highly efficient approach for the identification of nutrient deficiencies in hydroponic lettuce using advanced algorithms and a Consensus Multi-Dilated Adaptive Network (CMDA-Net). The proposed model harnesses or combines many of the current best-known or best performing approaches to yield a proper feature extraction and classification. This approach takes advantage of deep neural networks including VGG-16, Residual Attention Network (RAN) and Convolutional Neural Networks (CNNs) with a multi-dilated convolutional network to improve spatial feature learning while at the same time reducing computational cost. These features are then passed through LSTM network that help capture temporal information control and ensure that lettuce deficiencies are correctly classified. The entire system structure is enhanced through the new Improvised Honey Badger Algorithm (I-HBA), which adjusts specific networks’ parameters for maximum effectiveness with minimal resource consumption. Many experiments were carried out on Lettuce NPK dataset which contains test images of Fully Nutritional, Nitrogen Deficient, Phosphorus Deficient and Potassium Deficient lettuce. It respectively assessed the accuracy, specificity rate, precision, NPV, FPR, and MCC of the model. We also gained compelling improvements in terms of both computational time and model accuracy with exactly terminal prediction accuracy 96% and specificity 95%.

  • Scale-Aware Recognition in Satellite Images under Resource Constraints

    arXiv (Cornell University) · 2024-10-31

    preprintOpen accessSenior author

    Recognition of features in satellite imagery (forests, swimming pools, etc.) depends strongly on the spatial scale of the concept and therefore the resolution of the images. This poses two challenges: Which resolution is best suited for recognizing a given concept, and where and when should the costlier higher-resolution (HR) imagery be acquired? We present a novel scheme to address these challenges by introducing three components: (1) A technique to distill knowledge from models trained on HR imagery to recognition models that operate on imagery of lower resolution (LR), (2) a sampling strategy for HR imagery based on model disagreement, and (3) an LLM-based approach for inferring concept "scale". With these components we present a system to efficiently perform scale-aware recognition in satellite imagery, improving accuracy over single-scale inference while following budget constraints. Our novel approach offers up to a 26.3% improvement over entirely HR baselines, using 76.3% fewer HR images.

  • Enhancing syphilis diagnosis through innovative adaptation of wet mount microscopy

    Indian Journal of Dermatology Venereology and Leprology · 2024-08-01 · 2 citations

    articleOpen access
  • AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery

    2024-01-01

    article1st authorCorresponding
  • AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery

    arXiv (Cornell University) · 2024-10-31 · 1 citations

    preprintOpen accessSenior author

    Clouds in satellite imagery pose a significant challenge for downstream applications. A major challenge in current cloud removal research is the absence of a comprehensive benchmark and a sufficiently large and diverse training dataset. To address this problem, we introduce the largest public dataset -- $\textit{AllClear}$ for cloud removal, featuring 23,742 globally distributed regions of interest (ROIs) with diverse land-use patterns, comprising 4 million images in total. Each ROI includes complete temporal captures from the year 2022, with (1) multi-spectral optical imagery from Sentinel-2 and Landsat 8/9, (2) synthetic aperture radar (SAR) imagery from Sentinel-1, and (3) auxiliary remote sensing products such as cloud masks and land cover maps. We validate the effectiveness of our dataset by benchmarking performance, demonstrating the scaling law -- the PSNR rises from $28.47$ to $33.87$ with $30\times$ more data, and conducting ablation studies on the temporal length and the importance of individual modalities. This dataset aims to provide comprehensive coverage of the Earth's surface and promote better cloud removal results.

Recent grants

Frequent coauthors

Labs

  • Kavita Bala's LabPI

    Research in computer vision, computer graphics, and human perception

Education

  • PhD, Computer Science

    Massachusetts Institute of Technology

    1999

Awards & honors

  • Fellow of the American Academy of Arts & Sciences (2025)
  • ACM Fellow (2019)
  • Fellow of the SIGGRAPH Academy (2020)
  • Computer Graphics Achievement Award (2020)
  • NSF Faculty Early Career Development Award (CAREER)
  • Resume-aware match score
  • Save to shortlist
  • AI-drafted outreach

See your match with Kavita Bala

PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.

  • Free to start
  • No credit card
  • 30-second signup