Resume-aware faculty matching

Find professors who actually fit you

Upload your resume. Four AI agents analyze your background, rank the faculty who fit, inspect their recent research, and help you draft outreach — grounded in their actual work, not templates.

Free to startNo credit cardCancel anytime
Top matches Balanced preset
Dr. Sarah Chen
Stanford · Interpretability · NLP
91
Dr. Marcus Holloway
MIT · Robotics · RL
84
Dr. Aisha Okonkwo
CMU · Fairness · HCI
82
Nova · Professor Researcher · re-ranking top 20…
Yan Chen

Yan Chen

· Professor of Computer ScienceVerified

Northwestern University · Chemical Engineering

Active 2005–2025

h-index28
Citations3.4k
Papers11833 last 5y
Funding$2.4M1 active
See your match with Yan Chen — sign in to PhdFit.Sign in

About

Yan Chen is a Professor of Computer Science at Northwestern University, affiliated with the Northwestern Lab for Internet and Security Technology. His research interests focus on security, measurement, and diagnosis for networking and large-scale distributed systems. He leads the Northwestern LIST (Lab for Internet and Security Technology) and has made significant contributions in these areas. Professor Chen has received several prestigious awards, including the DOE Early CAREER Award in 2005, the DOD Young Investigator Award in 2007, and the Most Influential Paper Award of ACM ASPLOS in 2018. His scholarly work has been widely recognized, with his papers cited approximately 17,000 times and an h-index of 63 as of January 2026. He is a Fellow of IEEE, reflecting his notable impact in the field of computer science and network security.

Research topics

  • Computer Security
  • Computer Science
  • Data Mining
  • Artificial Intelligence
  • Theoretical computer science
  • Operating system
  • Distributed computing

Selected publications

  • FaaSTracker: Efficient Cross-Layer Provenance Tracking of Serverless Applications With Multi-Source Correlation

    IEEE Transactions on Information Forensics and Security · 2025-01-01

    articleSenior author

    Serverless computing, also known as Function-as-a-Service (FaaS), has gained popularity due to its flexibility, scala bility, and transparent development. However, attacks against serverless are also increasing. Unfortunately, complex multi-layer FaaS architecture and frequently launched lightweight functions help attackers conceal their tracks. Specifically, (<italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">i</i>) fully tracking the behavior of a function requires crossing multiple layers of FaaS. (<italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">ii</i>) Intrusive auditing components in functions affect function startup latency and performance. (<italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">iii</i>) Accurately provenance cross-layer function invocations require integrating data from multiple sources. In this paper, we propose FAASTRACKER, a cross-layer, non-intrusive, efficient provenance framework for accurately tracking user function behaviors in FaaS. FAASTRACKER tracks function behaviors across layers using a non-intrusive agent without any modifications to the function. In addition, it correlates data from multiple sources to construct a provenance graph of function workflows to locate attackers. We implement FAASTRACKER on the OpenFaaS platform and evaluate its performance using real-world serverless applications. Compared with state-of-the-art serverless provenance systems, FAASTRACKER provides a more accurate and complete view of provenance graphs and reduces 54.0% CPU and 48.9% memory resources.

  • Real-Time Synchronization of Building Information Modelling (BIM) and Windows Forms Application (WFA)

    Advances in transdisciplinary engineering · 2025-05-23

    book-chapterOpen accessSenior author

    In order to promote the digital transformation of the AEC engineering industry towards data-driven high-quality development, efficient information integration and real-time synchronization of Building Information Modelling (BIM) are crucial. However, existing research, especially in the field of bridges, is not deep and mature enough. Therefore, this study aims to develop a Windows Forms Application (WFA) based system for rapid modelling and real-time synchronization in the BIM software. The four main objectives are as follows: the development of structural parameter acquisition application for steel box girder bridge; the secondary development of 3D modeling software; the development of the BIM co-design platform for steel box girder-bridge; and the study of the application of the BIM co-design platform for steel box girder bridge in the project. To achieve these objectives, a software development life cycle (SDLC) approach is used to develop a system applied to steel box girder bridges through five phases: planning, analysis, design, implementation, and testing. Fundamentally, the platform consisting of the developed WFA and BIM software saves resources, dramatically reduces repetitive work, ensures the effectiveness of model information transfer and real-time data update, improves information integration of steel box girder bridges, and realizes the construction of a digital city.

  • AutoSeg: Automatic micro-segmentation policy generation via configuration analysis

    Computers & Security · 2025-07-29

    article
  • CRUcialG: Reconstruct Integrated Attack Scenario Graphs by Cyber Threat Intelligence Reports

    IEEE Transactions on Dependable and Secure Computing · 2025-06-30 · 4 citations

    articleSenior author

    Cyber Threat Intelligence (CTI) reports are factual records compiled by security analysts through their observations of threat events or their own practical experience with attacks. In order to utilize CTI reports for attack detection, existing methods have attempted to map the content of reports onto system-level attack provenance graphs to clearly depict attack procedures. However, existing studies on constructing graphs from CTI reports suffer from problems such as weak Natural Language Processing (NLP) capabilities, discrete and fragmented graphs, and insufficient attack semantic representation. Therefore, we propose a system called CRUcialG for the automated reconstruction of Attack Scenario Graphs (ASGs) by CTI reports. First, we use NLP models to extract systematic attack knowledge from CTI reports to form preliminary ASGs. Then, we propose a four-phase attack rationality validation framework from the tactical phase with attack procedure to evaluate the reasonability of ASGs. Finally, we implement the relation repair and phase supplement of ASGs by adopting a serialized graph generation model. We collect a total of 10,607 CTI reports and generate 5,761 complete ASGs. Experimental results on CTI reports from 30 security vendors and DARPA show that the similarity of ASG reconstruction by CRUcialG can reach 84.54%. Compared with SOTA (EXTRACTOR and AttackG), the recall of CRUcialG (extraction of real attack events) can reach 88.13% and 94.46% respectively, which is 40% higher than SOTA on average. The F1-score of attack phase validation is able to reach 90.04%.

  • PentestAgent: Incorporating LLM Agents to Automated Penetration Testing

    2025-08-13 · 18 citations

    articleOpen access
  • Incorporating Gradients to Rules: Toward Online, Adaptive Provenance-Based Intrusion Detection

    IEEE Transactions on Dependable and Secure Computing · 2025-09-18 · 1 citations

    article
  • Radiation Characteristics and Optimization Design of a Flexural Ring Transducer with Single-Mode Excitation

    SSRN Electronic Journal · 2025-01-01

    preprintOpen accessSenior author
  • Poster: Obfuscating Function Activity States to Enhance Privacy in Serverless Applications

    2025-11-19

    articleOpen accessSenior author

    Serverless computing, also known as Function-as-a-Service (FaaS), is widely used in modern applications. Function instances share the underlying physical infrastructure, which makes co-location attacks possible and leads to the leakage of sensitive information such as function activity states. Existing work has respective limitations in serverless scenarios because of incomplete detection coverage, long training time, and intrusion into the function's runtime environment. In this paper, we propose FaaSGuard, an obfuscation framework to protect function activity states in network side-channels and enhance privacy in serverless applications. To be specific, we design an adaptive obfuscation strategy selection mechanism to make FaaSGuard flexible. We design a traffic camouflage method to make obfuscated traffic indistinguishable from normal traffic, making FaaSGuard invisible. In order not to affect normal traffic, we propose a tag-based obfuscation mechanism to identify obfuscated packets. The preliminary evaluation results show that FaaSGuard can conceal function activity states with negligible resource overhead.

  • TAGAPT: Toward Automatic Generation of APT Samples With Provenance-Level Granularity

    IEEE Transactions on Information Forensics and Security · 2025-01-01 · 2 citations

    articleSenior author

    Detecting advanced persistent threats (APTs) at a host via data provenance has emerged as a valuable yet challenging task. Compared with attack rule matching, machine learning approaches offer new perspectives for efficiently detecting attacks by leveraging their inherent ability to autonomously learn from data and adapt to dynamic environments. However, the scarcity of APT samples poses a significant limitation, rendering supervised learning methods that have demonstrated remarkable capabilities in other domains (e.g., malware detection) impractical. Therefore, we propose a system called TAGAPT, which is able to automatically generate numerous APT samples with provenance-level granularity. First, we introduce a deep graph generation model to generalize various graph structures that represent new attack patterns. Second, we propose an attack stage division algorithm to divide each generated graph structure into stage subgraphs. Finally, we design a genetic algorithm to find the optimal attack technique explanation for each subgraph and obtain fully instantiated APT samples. Experimental results demonstrate that TAGAPT can learn from existing attack patterns and generalize to novel attack patterns. Furthermore, the generated APT samples 1) exhibit the ability to help with efficient threat hunting and 2) provide additional assistance to the state-of-the-art (SOTA) attack detection system (Kairos) by filtering out 73% of the observed false positives. We have open-sourced the code and the generated samples to support the development of the security community.

  • Programming knowledge tracing based on heterogeneous graph representation

    Knowledge-Based Systems · 2024-06-22 · 5 citations

    article

Recent grants

Frequent coauthors

  • Yinzhi Cao

    Johns Hopkins University

    15 shared
  • Vaibhav Rastogi

    13 shared
  • Chunlin Xiong

    11 shared
  • Tiantian Zhu

    9 shared
  • Kai Chen

    Nanjing University

    9 shared
  • Yao Zhao

    8 shared
  • Xitao Wen

    7 shared
  • Tieming Chen

    Zhejiang University of Technology

    6 shared

Awards & honors

  • DOE Early CAREER Award (2005)
  • DOD (Air Force of Scientific Research) Young Investigator Aw…
  • Most Influential Paper Award of ACM ASPLOS (2018)
  • Fellow of IEEE
  • Resume-aware match score
  • Save to shortlist
  • AI-drafted outreach

See your match with Yan Chen

PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.

  • Free to start
  • No credit card
  • 30-second signup