
Amey Patil
· Assistant ProfessorVerifiedRutgers University · Restorative Dentistry
Active 2012–2025
Research topics
- Medicine
- Dermatology
- Internal medicine
- Pharmacology
- Surgery
- Urology
- Intensive care medicine
- Immunology
Selected publications
LLMs as Architects and Critics for Multi-Source Opinion Summarization
ArXiv.org · 2025-07-07
articleOpen accessMulti-source Opinion Summarization (M-OS) extends beyond traditional opinion summarization by incorporating additional sources of product metadata such as descriptions, key features, specifications, and ratings, alongside reviews. This integration results in comprehensive summaries that capture both subjective opinions and objective product attributes essential for informed decision-making. While Large Language Models (LLMs) have shown significant success in various Natural Language Processing (NLP) tasks, their potential in M-OS remains largely unexplored. Additionally, the lack of evaluation datasets for this task has impeded further advancements. To bridge this gap, we introduce M-OS-EVAL, a benchmark dataset for evaluating multi-source opinion summaries across 7 key dimensions: fluency, coherence, relevance, faithfulness, aspect coverage, sentiment consistency, specificity. Our results demonstrate that M-OS significantly enhances user engagement, as evidenced by a user study in which, on average, 87% of participants preferred M-OS over opinion summaries. Our experiments demonstrate that factually enriched summaries enhance user engagement. Notably, M-OS-PROMPTS exhibit stronger alignment with human judgment, achieving an average Spearman correlation of \r{ho} = 0.74, which surpasses the performance of previous methodologies.
Why We Feel What We Feel: Joint Detection of Emotions and Their Opinion Triggers in E-commerce
2025-01-01
articleOpen accessCustomer reviews on e-commerce platforms capture critical affective signals that drive purchasing decisions.However, no existing research has explored the joint task of emotion detection and explanatory span identification in ecommerce reviews -a crucial gap in understanding what triggers customer emotional responses.To bridge this gap, we propose a novel joint task unifying EMOTION detection and OPINION TRIGGER extraction (EOT), which explicitly models the relationship between causal text spans (opinion triggers) and affective dimensions (emotion categories) grounded in Plutchik's theory of 8 primary emotions.In the absence of labeled data, we introduce EOT-X, a human-annotated collection of 2,400 reviews with fine-grained emotions and opinion triggers.We evaluate 23 Large Language Models (LLMs) and present EOT-DETECT, a structured prompting framework with systematic reasoning and self-reflection.Our framework surpasses zero-shot and chain-of-thought techniques, across e-commerce domains.
Why We Feel What We Feel: Joint Detection of Emotions and Their Opinion Triggers in E-commerce
ArXiv.org · 2025-07-07
articleOpen accessCustomer reviews on e-commerce platforms capture critical affective signals that drive purchasing decisions. However, no existing research has explored the joint task of emotion detection and explanatory span identification in e-commerce reviews - a crucial gap in understanding what triggers customer emotional responses. To bridge this gap, we propose a novel joint task unifying Emotion detection and Opinion Trigger extraction (EOT), which explicitly models the relationship between causal text spans (opinion triggers) and affective dimensions (emotion categories) grounded in Plutchik's theory of 8 primary emotions. In the absence of labeled data, we introduce EOT-X, a human-annotated collection of 2,400 reviews with fine-grained emotions and opinion triggers. We evaluate 23 Large Language Models (LLMs) and present EOT-DETECT, a structured prompting framework with systematic reasoning and self-reflection. Our framework surpasses zero-shot and chain-of-thought techniques, across e-commerce domains.
"This Suits You the Best": Query Focused Comparative Explainable Summarization
ArXiv.org · 2025-07-07
preprintOpen accessProduct recommendations inherently involve comparisons, yet traditional opinion summarization often fails to provide holistic comparative insights. We propose the novel task of generating Query-Focused Comparative Explainable Summaries (QF-CES) using Multi-Source Opinion Summarization (M-OS). To address the lack of query-focused recommendation datasets, we introduce MS-Q2P, comprising 7,500 queries mapped to 22,500 recommended products with metadata. We leverage Large Language Models (LLMs) to generate tabular comparative summaries with query-specific explanations. Our approach is personalized, privacy-preserving, recommendation engine-agnostic, and category-agnostic. M-OS as an intermediate step reduces inference latency approximately by 40% compared to the direct input approach (DIA), which processes raw data directly. We evaluate open-source and proprietary LLMs for generating and assessing QF-CES. Extensive evaluations using QF-CES-PROMPT across 5 dimensions (clarity, faithfulness, informativeness, format adherence, and query relevance) showed an average Spearman correlation of 0.74 with human judgments, indicating its potential for QF-CES evaluation.
A Teacher Is Worth A Million Instructions
arXiv (Cornell University) · 2024-06-27
preprintOpen accessLarge Language Models(LLMs) have shown exceptional abilities, yet training these models can be quite challenging. There is a strong dependence on the quality of data and finding the best instruction tuning set. Further, the inherent limitations in training methods create substantial difficulties to train relatively smaller models with 7B and 13B parameters. In our research, we suggest an improved training method for these models by utilising knowledge from larger models, such as a mixture of experts (8x7B) architectures. The scale of these larger models allows them to capture a wide range of variations from data alone, making them effective teachers for smaller models. Moreover, we implement a novel post-training domain alignment phase that employs domain-specific expert models to boost domain-specific knowledge during training while preserving the model's ability to generalise. Fine-tuning Mistral 7B and 2x7B with our method surpasses the performance of state-of-the-art language models with more than 7B and 13B parameters: achieving up to $7.9$ in MT-Bench and $93.04\%$ on AlpacaEval.
One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation
arXiv (Cornell University) · 2024-02-18
preprintOpen accessEvaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation, however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluation datasets inhibit progress. To address this, we release the SUMMEVAL-OP dataset covering 7 dimensions related to the evaluation of opinion summaries: fluency, coherence, relevance, faithfulness, aspect coverage, sentiment consistency, and specificity. We investigate Op-I-Prompt a dimension-independent prompt, and Op-Prompts, a dimension-dependent set of prompts for opinion summary evaluation. Experiments indicate that Op-I-Prompt emerges as a good alternative for evaluating opinion summaries achieving an average Spearman correlation of 0.70 with humans, outperforming all previous approaches. To the best of our knowledge, we are the first to investigate LLMs as evaluators on both closed-source and open-source models in the opinion summarization domain.
The Journal of the American Dental Association · 2024-05-06 · 8 citations
articleOpen accessSenior authorCureus · 2024-06-09 · 2 citations
articleOpen accessCorrespondingBackground Understanding root canal anatomy variations, particularly C-shaped canals, is crucial for successful endodontic treatment. This study used clinical and radiographic methods to assess the prevalence and characteristics of C-shaped canals in mandibular second molars in Western Maharashtra. Materials and methods This prospective study was conducted in the western region of Maharashtra, India. The samples included patients requiring endodontic treatment for mandibular second molars. Clinical evaluation was conducted using a surgical endodontic microscope and cone beam computed tomography (CBCT) imaging. Inclusion and exclusion criteria ensured the selection of a focused and homogeneous sample. Data analysis included assessment of unilateral/bilateral occurrence, canal distribution, and cross-sectional characteristics. Results Out of 200 mandibular second molars, 7.5% exhibited C-shaped root canals, with no significant gender differences. Canal distribution varied across coronal, middle, and apical levels, with prevalent configurations being C1, C2, C3, and C4. No significant differences were observed in canal distribution based on root levels. No significant gender differences were found in the presence of grooves on the root surfaces. Conclusion This study provides valuable insights into the prevalence and characteristics of C-shaped canals in mandibular second molars in Western Maharashtra. Further research into histological and genetic aspects can enhance our understanding, leading to improved treatment strategies for complex root canal anatomy variations.
arXiv (Cornell University) · 2024-02-23 · 1 citations
preprintOpen accessReinforcement Learning from Human Feedback (RLHF) has become a dominating strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($φ$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of tens of thousands) to train $φ$. Such a large-scale annotation is justifiable when it's a one-time effort, and the reward model is universally applicable. However, human goals are subjective and depend on the task, requiring task-specific preference annotations, which can be impractical to fulfill. To address this challenge, we propose a novel approach to infuse domain knowledge into $φ$, which reduces the amount of preference annotation required ($21\times$), omits Alignment Tax, and provides some interpretability. We validate our approach in E-Commerce Opinion Summarization, with a significant reduction in dataset size (to just $940$ samples) while advancing the SOTA ($\sim4$ point ROUGE-L improvement, $68\%$ of times preferred by humans over SOTA). Our contributions include a novel Reward Modeling technique and two new datasets: PromptOpinSumm (supervised data for Opinion Summarization) and OpinPref (a gold-standard human preference dataset). The proposed methodology opens up avenues for efficient RLHF, making it more adaptable to applications with varying human values. We release the artifacts (Code: github.com/efficient-rlhf. PromptOpinSumm: hf.co/prompt-opin-summ. OpinPref: hf.co/opin-pref) for usage under MIT License.
Product Description and QA Assisted Self-Supervised Opinion Summarization
2024-01-01 · 3 citations
articleOpen accessTejpalsingh Siledar, Rupasai Rangaraju, Sankara Muddu, Suman Banerjee, Amey Patil, Sudhanshu Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya. Findings of the Association for Computational Linguistics: NAACL 2024. 2024.
Frequent coauthors
- 25 shared
A. H. Patil
Bharati Vidyapeeth Deemed University
- 25 shared
Sabina Shaikh
Maharashtra University of Health Sciences
- 16 shared
Vedant U Kalgutkar
Maharashtra University of Health Sciences
- 16 shared
Sailee A Bhandarkar
Maharashtra University of Health Sciences
- 10 shared
Pankaj Joshi
- 10 shared
Marco Bandini
San Raffaele University of Rome
- 10 shared
Sanjay Kulkarni
- 9 shared
Shreyas Bhadranavar
Education
- 2016
Master of Science in Dentistry (MSD), Orofacial Pain & Temporomandibular Joint Disorders
Rutgers School of Dental Medicine
- 2016
Post Graduate Certificate in TMD and Orofacial Pain, Orofacial Pain & Temporomandibular Joint Disorders
Rutgers School of Dental Medicine
- 2013
Fellowship in Aesthetic and Cosmetic Dentistry (FACD), IICER
New York University College of Dentistry
- 2012
General Practice Residency (GPR), MGV's KBH Dental College & Hospital
Maharashtra University of Health Sciences
- 2010
Bachelor of Dental Surgery (BDS), MGV's KBH Dental College & Hospital
Maharashtra University of Health Sciences
- Resume-aware match score
- Save to shortlist
- AI-drafted outreach
See your match with Amey Patil
PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.
- Free to start
- No credit card
- 30-second signup