publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. AgentXploit: End-to-End Redteaming of Black-Box AI Agents
    Zhun Wang, Vincent Siu, Zhe Ye, and 6 more authors
    2025
  2. COSMIC: Generalized Refusal Direction Identification in LLM Activations
    Vincent Siu, Nicholas Crispino, Zihao Yu, and 5 more authors
    2025