Resume-aware faculty matching

Find professors who actually fit you

Upload your resume. Four AI agents analyze your background, rank the faculty who fit, inspect their recent research, and help you draft outreach — grounded in their actual work, not templates.

Free to startNo credit cardCancel anytime
Top matches Balanced preset
Dr. Sarah Chen
Stanford · Interpretability · NLP
91
Dr. Marcus Holloway
MIT · Robotics · RL
84
Dr. Aisha Okonkwo
CMU · Fairness · HCI
82
Nova · Professor Researcher · re-ranking top 20…
Alvaro Cardenas

Alvaro Cardenas

· ProfessorVerified

University of California, Santa Cruz · Electrical and Computer Engineering

Active 2003–2026

h-index38
Citations7.6k
Papers20769 last 5y
Funding$2.7M
See your match with Alvaro Cardenas — sign in to PhdFit.Sign in

Research signals

Five dimensions sourced from public faculty / publication signals. Sign in to compare against your own profile and see your match score.

Research topics

  • Computer Science
  • Computer Security
  • Artificial Intelligence
  • Embedded system
  • Business
  • Software engineering
  • Operating system
  • Computer network
  • Risk analysis (engineering)
  • Programming language
  • Mathematics
  • Engineering
  • Real-time computing
  • Algorithm

Selected publications

  • Stable GFlowNets with Probabilistic Guarantees

    arXiv (Cornell University) · 2026-05-03

    preprintOpen access

    Generative Flow Networks (GFlowNets) learn to sample states proportional to an unnormalized reward. Despite their theoretical promise, practical training is often unstable, exhibiting severe loss spikes and mode collapse. To tackle this, we first assess the sensitivity of GFlowNet objectives, demonstrating that a small Total Variation (TV) distance between the learned and target distributions does not preclude unbounded training loss. Motivated by this mismatch, we establish converse guarantees by deriving loss-to-TV bounds that certify global fidelity from bounded trajectory balance losses. Lastly, we propose Stable GFlowNets, an algorithm that leverages our theoretical results to stabilize training, and empirically demonstrate improved training behavior and superior distributional fidelity.

  • Stable GFlowNets with Probabilistic Guarantees

    ArXiv.org · 2026-05-03

    articleOpen access

    Generative Flow Networks (GFlowNets) learn to sample states proportional to an unnormalized reward. Despite their theoretical promise, practical training is often unstable, exhibiting severe loss spikes and mode collapse. To tackle this, we first assess the sensitivity of GFlowNet objectives, demonstrating that a small Total Variation (TV) distance between the learned and target distributions does not preclude unbounded training loss. Motivated by this mismatch, we establish converse guarantees by deriving loss-to-TV bounds that certify global fidelity from bounded trajectory balance losses. Lastly, we propose Stable GFlowNets, an algorithm that leverages our theoretical results to stabilize training, and empirically demonstrate improved training behavior and superior distributional fidelity.

  • Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows

    arXiv (Cornell University) · 2026-04-22

    preprintOpen access

    Frontier coding agents are increasingly used in workflows where users supervise progress primarily through repeated improvement of a public score, namely the reported score on a public evaluation file with labels in the workspace, rather than through direct inspection of the agent's intermediate outputs. We study whether multi-round user pressure to improve that score induces public score exploitation: behavior that raises the public score through shortcuts without improving hidden private evaluation. We begin with a preliminary single-script tabular classification task, where GPT-5.4 and Claude Opus 4.6 both exploit label information within 10 rounds of user-agent interaction. We then build AgentPressureBench, a 34-task machine-learning repository benchmark spanning three input modalities, and collect 1326 multi-round trajectories from 13 coding agents. On our benchmark, we observe 403 exploitative runs, spanning across all tasks. We also find that stronger models have higher exploitation rates, supported by a significant Spearman rank correlation of 0.77. Our ablation experiments show that higher user pressure leads to earlier exploitation, reducing the average first exploit round by 15.6 rounds (i.e., 19.67 to 4.08). As a mitigation, adding explicit anti-exploit wordings in prompt mostly eliminates exploitation (100% to 8.3%). We hope that our work can bring attention to more careful use of coding agents workflow, and developing more robust coding agents under user pressure. Our project page is at https://ucsc-vlaa.github.io/AgentPressureBench .

  • Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows

    arXiv (Cornell University) · 2026-04-22

    articleOpen access

    Frontier coding agents are increasingly used in workflows where users supervise progress primarily through repeated improvement of a public score, namely the reported score on a public evaluation file with labels in the workspace, rather than through direct inspection of the agent's intermediate outputs. We study whether multi-round user pressure to improve that score induces public score exploitation: behavior that raises the public score through shortcuts without improving hidden private evaluation. We begin with a preliminary single-script tabular classification task, where GPT-5.4 and Claude Opus 4.6 both exploit label information within 10 rounds of user-agent interaction. We then build AgentPressureBench, a 34-task machine-learning repository benchmark spanning three input modalities, and collect 1326 multi-round trajectories from 13 coding agents. On our benchmark, we observe 403 exploitative runs, spanning across all tasks. We also find that stronger models have higher exploitation rates, supported by a significant Spearman rank correlation of 0.77. Our ablation experiments show that higher user pressure leads to earlier exploitation, reducing the average first exploit round by 15.6 rounds (i.e., 19.67 to 4.08). As a mitigation, adding explicit anti-exploit wordings in prompt mostly eliminates exploitation (100% to 8.3%). We hope that our work can bring attention to more careful use of coding agents workflow, and developing more robust coding agents under user pressure. Our project page is at https://ucsc-vlaa.github.io/AgentPressureBench .

  • VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

    ArXiv.org · 2026-04-23

    articleOpen access

    Autonomous GUI agents face two fundamental challenges: early stopping, where agents prematurely declare success without verifiable evidence, and repetitive loops, where agents cycle through the same failing actions without recovery. We present VLAA-GUI, a modular GUI agentic framework built around three integrated components that guide the system on when to Stop, Recover, and Search. First, a mandatory Completeness Verifier enforces UI-observable success criteria and verification at every finish step -- with an agent-level verifier that cross-examines completion claims with decision rules, rejecting those lacking direct visual evidence. Second, a mandatory Loop Breaker provides multi-tier filtering: switching interaction mode after repeated failures, forcing strategy changes after persistent screen-state recurrence, and binding reflection signals to strategy shifts. Third, an on-demand Search Agent searches online for unfamiliar workflows by directly querying a capable LLM with search ability, returning results as plain text. We additionally integrate a Coding Agent for code-intensive actions and a Grounding Agent for precise action grounding, both invoked on demand when required. We evaluate VLAA-GUI across five top-tier backbones, including Opus 4.5, 4.6 and Gemini 3.1 Pro, on two benchmarks with Linux and Windows tasks, achieving top performance on both (77.5% on OSWorld and 61.0% on WindowsAgentArena). Notably, three of the five backbones surpass human performance (72.4%) on OSWorld in a single pass. Ablation studies show that all three proposed components consistently improve a strong backbone, while a weaker backbone benefits more from these tools when the step budget is sufficient. Further analysis also shows that the Loop Breaker nearly halves wasted steps for loop-prone models.

  • VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

    arXiv (Cornell University) · 2026-04-23

    preprintOpen access

    Autonomous GUI agents face two fundamental challenges: early stopping, where agents prematurely declare success without verifiable evidence, and repetitive loops, where agents cycle through the same failing actions without recovery. We present VLAA-GUI, a modular GUI agentic framework built around three integrated components that guide the system on when to Stop, Recover, and Search. First, a mandatory Completeness Verifier enforces UI-observable success criteria and verification at every finish step -- with an agent-level verifier that cross-examines completion claims with decision rules, rejecting those lacking direct visual evidence. Second, a mandatory Loop Breaker provides multi-tier filtering: switching interaction mode after repeated failures, forcing strategy changes after persistent screen-state recurrence, and binding reflection signals to strategy shifts. Third, an on-demand Search Agent searches online for unfamiliar workflows by directly querying a capable LLM with search ability, returning results as plain text. We additionally integrate a Coding Agent for code-intensive actions and a Grounding Agent for precise action grounding, both invoked on demand when required. We evaluate VLAA-GUI across five top-tier backbones, including Opus 4.5, 4.6 and Gemini 3.1 Pro, on two benchmarks with Linux and Windows tasks, achieving top performance on both (77.5% on OSWorld and 61.0% on WindowsAgentArena). Notably, three of the five backbones surpass human performance (72.4%) on OSWorld in a single pass. Ablation studies show that all three proposed components consistently improve a strong backbone, while a weaker backbone benefits more from these tools when the step budget is sufficient. Further analysis also shows that the Loop Breaker nearly halves wasted steps for loop-prone models.

  • Control Barrier Function based Attack-Recovery with Provable Guarantees

    IEEE Transactions on Automatic Control · 2026-01-01

    preprintOpen accessSenior author

    This paper investigates security guarantees for cyber-physical systems (CPS) against actuator attacks. We in troduce a new attack detection mechanism based on zeroing control barrier function (ZCBF) conditions. We propose an adaptive recovery mechanism that responds based on the system's proximity to safety violations. Our attack-detection mechanism has been proven to be sound, meaning it consistently detects adversarial attacks without any false negatives. Additionally, we propose a novel hybrid control law that addresses delays in attack detection and prevents Zeno behavior. We also propose a sampling-based method to verify whether a set is a viability domain for CPS. Finally, we employ a Quadratic Programming (QP) approach for synthesizing control laws for the hybrid control policy, utilizing the viability domain to ensure safety in the presence of adversarial attacks on system actuators. The efficacy of the proposed method is demonstrated in a simulation case study involving a quadrotor system.

  • D4: Dynamic Data-Driven Discovery of Adversarial Vehicle Maneuvers

    Lecture notes in computer science · 2025-08-25

    book-chapterSenior author
  • Large Language Models are Autonomous Cyber Defenders

    2025-05-05 · 2 citations

    preprintOpen accessSenior author

    Fast and effective incident response is essential to prevent adversarial cyberattacks. Autonomous Cyber Defense (ACD) aims to automate incident response through Artificial Intelligence (AI) agents that plan and execute actions. Most ACD approaches focus on single-agent scenarios and leverage Reinforcement Learning (RL). However, ACD RL-trained agents depend on costly training, and their reasoning is not always explainable or transferable. Large Language Models (LLMs) can address these concerns by providing explainable actions in general security contexts. Researchers have explored LLM agents for ACD but have not evaluated them on multi-agent scenarios or interacting with other ACD agents. In this paper, we show the first study on how LLMs perform in multi-agent ACD environments by proposing a new integration to the CybORG CAGE 4 environment. We examine how ACD teams of LLM and RL agents can interact by proposing a novel communication protocol. Our results highlight the strengths and weaknesses of LLMs and RL and help us identify promising research directions to create, train, and deploy future teams of ACD agents.

  • Cybersecurity for Next-Generation Road Transportation: A Review

    ACM Journal on Autonomous Transportation Systems · 2025-06-14 · 4 citations

    reviewOpen access

    Emerging transportation technologies, including electric vehicles (EVs), autonomous vehicles (AVs), and connected vehicles (CVs), are poised to revolutionize mobility and logistics through advancements in connectivity, automation, and electrification. However, the convergence of these systems introduces substantial cybersecurity challenges, including data breaches, spoofing, and infrastructure attacks, due to increasingly complex and interconnected attack surfaces. This article presents a comprehensive survey of cybersecurity threats, vulnerabilities, and evaluation practices within the AV, CV, and EV landscape. We analyze 227 peer-reviewed studies published between 2021 and 2025 and introduce two novel taxonomies: a three-stage cyber-attack lifecycle framework—Stream to Information (S2I), Information to Decision (I2D), and Decision to Actuation (D2A)—and 11 representative attack paths. Our findings reveal that the stream and data processing stages are most frequently exploited, with limited standardization in evaluation metrics and inadequate emphasis on real-world operational consequences. We highlight the growing importance of integrating quantum-safe cryptography and AI-driven anomaly detection as proactive defense mechanisms. Finally, we offer stage-wise design recommendations and identify future research directions, including the need for cross-domain evaluation frameworks, cyber-social risk assessments, and secure integration of emerging multi-modal systems. This survey aims to support policymakers, researchers, and industry stakeholders in developing resilient, secure, and trustworthy next-generation road transportation ecosystems.

Recent grants

Frequent coauthors

  • Jairo Giraldo

    Massachusetts Institute of Technology

    52 shared
  • Nicanor Quijano

    52 shared
  • Luis Francisco Cómbita

    District University of Bogotá

    44 shared
  • Xenofon Koutsoukos

    Vanderbilt University

    36 shared
  • Erik Yoon

    Massachusetts Institute of Technology

    36 shared
  • Girish V. Chowdhary

    36 shared
  • Shashank Shekhar

    36 shared
  • Peter W. Sauer

    University of Illinois Urbana-Champaign

    36 shared
  • Resume-aware match score
  • Save to shortlist
  • AI-drafted outreach

See your match with Alvaro Cardenas

PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.

  • Free to start
  • No credit card
  • 30-second signup