王俊杰

Junjie Wang

Research Scientist / Doctoral Supervisor

Institute of Software, Chinese Academy of Sciences

📍 Beijing

📧 junjie [at] iscas [dot] ac [dot] cn 🎓 Google Scholar 📚 DBLP

👤 About Me

I am a research scientist and doctoral supervisor at the Institute of Software, Chinese Academy of Sciences (ISCAS), and deputy director (acting head) of the Key Laboratory of Intelligent Gaming. My research focuses on intelligent software engineering. Our team's work covers AI-driven software development and testing, testing and security for AI systems, and AIGC content security.

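🎓 Admissions & Recruitment
The lab has roughly 2 direct-entry PhD positions and 10 master's positions each year. Interested students are welcome to get in touch by email!
The lab has standing openings for associate research scientists, special research assistants, and postdocs. Please get in touch by email!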

📖 Experience

🔥 News

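[2026.05] Lab PhD student 沐方文 passed the review defense and was recommended for the President's Award of the Chinese Academy of Sciences; PhD student 陈孟卓 passed the review defense and was named a Model Merit Student (三好学生标兵)!
[2026.05] The theses of 2 PhD students and 6 master's students in the lab passed their defenses!
[2026.05] Our paper on the safety of content generated by LLM agents was accepted to ICML 2026 (CCF A)! Prior work has mainly focused on the content safety of LLMs themselves; ours is the first to explore the mechanisms by which agents generate risky content, along with the corresponding attack and defense techniques!
[2026.04] 8 papers were accepted to ACL 2026 (CCF A), covering LLM agent failure attribution, content safety detection, agent security, and agent explainability!
[2026.04] All blind-review reports came back for the theses of the lab's 2 PhD students and 6 master's students, and every one was rated excellent or good!
[2026.04] The lab held a "zero-power" seminar on "Neuro-Symbolic Collaborative Reasoning: Current State and Reflections"! In a new format this year, no one except the speaker (who could use a computer) was allowed to bring electronic devices: "unplug the power, open your mind"!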

💡 Research Interests

🤖 AI-Driven Software Engineering
  • LLM-assisted Software Testing
  • LLM-based Mobile App Testing
  • Intelligent Code Generation & Bug Fixing
  • Intelligent Software Process Services
🛡️ Testing and Security for AI Systems
  • LLM Agent Failure Attribution & Enhancement
  • Multi-Agent RL Testing
  • LLM/Agent Security
  • AIGC Content Security

📷 Gallery

🤖 Intelligent Software Testing Techniques

From Suspicious Signals to Crashes: Guiding Bug-driven GUI Testing via Code-inspired Tracing
FSE 2026
Think Outside the Box: Automating Inter-App Functionality Testing via Memory Implanting and Reasoning
ICSE 2026
Seeing is Believing: Vision-Driven Non-Crash Functional Bug Detection for Mobile Apps
TSE 2025
Beyond Static GUI: Agent Evolving LLM-based GUI Testing via Dynamic Memory
ASE 2025
Standing on the Shoulders of Giants: Bug-Aware Automated GUI Testing via Retrieval Augmentation
FSE 2025
Unblind Text Inputs: Predicting Hint-text of Text Input in Mobile Apps with LLM
CHI 2024
CrashTranslator: Automatically Reproducing Mobile Application Crashes Directly from Stack Trace
ICSE 2024
Make LLM a Testing Expert: Bringing Human-like Interaction to Mobile GUI Testing
ICSE 2024
Testing the Limits: Unusual Text Inputs Generation for Mobile App Crash Detection
ICSE 2024
A Roadmap for Software Testing in Open-Collaborative and AI-Powered Era
TOSEM 2024
Software Testing with Large Language Models: Survey, Landscape, and Vision
TSE 2024 📖 Survey paper
Fill in the Blank: Context-aware Automated Text Input Generation for Mobile GUI Testing
ICSE 2023
Ex pede Herculem: Augmenting Activity Transition Graph for Apps via Graph Convolution Network
ICSE 2023
Context-aware Bug Reproduction for Mobile Apps
ICSE 2023
Nighthawk: Fully Automated Localizing UI Display Issues via Visual Understanding
TSE 2023
The Metamorphosis: Automatic Detection of Scaling Issues for Mobile Apps
ASE 2022
Guided Bug Crush: Assist Manual GUI Testing of Android Apps via Hint Moves
CHI 2022
NaviDroid: A Tool for Guiding Manual Android Testing via Hint Moves
ICSE (Demo) 2022
Context- and Fairness-aware In-process Crowdworker Recommendation
TOSEM 2022
Context-Aware Personalized Crowdtesting Task Recommendation
TSE 2022
Characterizing Crowds to Better Optimize Worker Recommendation in Crowdsourced Testing
TSE 2021
OwlEyes-Online: A Fully Automated Platform for Detecting and Localizing UI Display Issues
FSE (Demo) 2021
Owl Eyes: Spotting UI Display Issues via Visual Understanding
ASE 2020
Context-aware In-process Crowdworker Recommendation
ICSE 2020
iSENSE2.0: Improving Completion-aware Crowdtesting Management
TOSEM 2020
Quest for the Golden Approach: Duplicate Crowdtesting Reports Detection
ESEM 2020
iSENSE: Completion-Aware Crowdtesting Management
ICSE 2019
Images Don't Lie: Duplicate Crowdtesting Reports Detection with Screenshot Information
IST 2019
Method-level Test Selection for Continuous Integration
QRS 2019
Domain Adaptation for Test Report Classification in Crowdsourced Testing
ICSE 2017
Local-Based Active Classification of Test Report to Assist Crowdsourced Testing
ASE 2016

🛡️ Testing and Security for Intelligent Algorithms/Agents

Seeing the Whole Elephant: A Benchmark for Failure Attribution in LLM-based Multi-Agent Systems
ACL Main 2026
DEFT: Demystifying VLN Failures via a Unified Dual-View Explainability
ACL Main 2026
Adversarial Attack on Black-Box Multi-Agent by Adaptive Perturbation
AAAI 2026
OntoGuard: Enforcing Action Admissibility for LLM Agents in Complex Interactive Environments
ACL Findings 2026
Where Did It Go Wrong: Capability-Oriented Failure Attribution for Vision-and-Language Navigation Agents
ACL Findings 2026
Understanding Individual Agent Importance in Multi-Agent Reinforcement Learning via Counterfactual Reasoning
AAAI 2025
Demo2Test: Transfer Testing of Agent in Competitive Environment with Failure Demonstrations
TOSEM 2024
Diversity-Oriented Testing for Competitive Game Agent via Constraint-Guided Adversarial Agent Training
TSE 2024
Enhancing Multi-agent System Testing with Diversity-Guided Exploration and Adaptive Critical State Exploitation
ISSTA 2024
Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues
ACL Findings 2024
Automatic Static Vulnerability Detection for Machine Learning Libraries: Are We There Yet?
ISSRE 2023
Fuzzing with Sequence Diversity Inference for Sequential Decision-making Model Testing
ISSRE 2023
The Good, the Bad, and the Missing: Neural Code Generation for Machine Learning Tasks
TOSEM 2023
Automatic Unit Test Generation for Machine Learning Libraries: How Far Are We
ICSE 2021
API Recommendation for Machine Learning Libraries: How Far Are We
FSE 2022
Find Bugs in Static Bug Finders
ICPC 2022 🏆 Distinguished Paper Award

🔒 AIGC Content Security

SAGE: Synergistic Adaptive Gating of Experts for Hateful Video Detection
ACL Main 2026
All Changes May Have Invariant Principles: Improving Ever-Shifting Harmful Meme Detection
ACL Main 2026
Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
AAAI 2026
Generative Text-to-Image Retrieval via Hierarchical Identifiers and Semantic Internalization
ACL Findings 2026
Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis
ACL Findings 2026
Mimicking the Familiar: Dynamic Command Generation for Information Theft Attacks in LLM Tool-Learning System
ACL Main 2025
Vulnerability of Text-to-Image Models to Prompt Template Stealing
ACL Findings 2025
One Shot Dominance: Knowledge Poisoning Attack on Retrieval-Augmented Generation Systems
EMNLP Findings 2025
From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection
NAACL 2025
Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement
EMNLP Findings 2024
Defeating Regeneration Attacks by Embedding Watermark into Predicted Noise of Diffusion Models
ICASSP 2024

💻 Intelligent Code Generation, Requirements Enhancement, and More

ExpeRepair: Dual-Memory Enhanced LLM-based Repository-Level Program Repair
FSE 2026
Deep API Sequence Generation via Golden Solution Samples and API Seeds
TOSEM 2025
Vehicle Domain-Specific Language: Unifying Modeling and Code Generation for Low-Code Automotive Development
ASE Industry 2024
Cross-Domain Requirements Linking via Adversarial-based Domain Adaptation
ICSE 2023
An Empirical Study on the Stability of Explainable Software Defect Prediction
APSEC 2023
Characterizing and Understanding Software Security Vulnerabilities in Machine Learning Libraries
MSR 2023
一种语义感知的细粒度 App 评论缺陷挖掘方法 [A Semantic-aware, Fine-grained Approach to Mining Defects from App Reviews]
Journal of Software (软件学报) 2023
CoCoFuzzing: Testing Neural Code Models with Coverage-Guided Fuzzing
IEEE TR 2023
Where is Your App Frustrating Users?
ICSE 2022
Putting Them under Microscope: A Fine-Grained Approach for Detecting Redundant Test Cases
FSE 2022
Are We Building on the Rock? On the Importance of Data Preprocessing for Code Summarization
FSE 2022
Automated Data Function Extraction from Textual Requirements
IST 2022
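响应时间约束的代码评审人推荐方法 [Code Reviewer Recommendation under Response-Time Constraints]
Journal of Software (软件学报) 2020
信息产品及科技服务集成化众测服务研究 [Research on Integrated Crowdtesting Services for Information Products and Technology Services]
China Basic Science (中国基础科学) 2020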
Enhancing Unsupervised Requirements Traceability with Sequential Semantics
APSEC 2019

Current PhD Students

Current Master's Students

PhD Alumni

Master's Alumni

🏆 Honors & Awards

2025 ACL 2025 SAC Highlights
2024 CHI 2024 Best Paper Honorable Mention
2023 APSEC 2023 Distinguished Paper Award
2022 ICPC 2022 ACM SIGSOFT Distinguished Paper Award
2020 ICSE 2020 ACM SIGSOFT Distinguished Paper Award
2019 ICSE 2019 ACM SIGSOFT Distinguished Paper Award
2019 QRS 2019 IEEE Best Paper Award

🎓 Academic Service

Editorial Positions

Associate Editor — IEEE Transactions on Software Engineering (current)
Review Board — Automated Software Engineering Journal (current)

Program Committee Member

ICSE 2027, FSE 2026, ISSTA 2026, ASE 2026, ICSME 2026 — 2026
ICSE 2025, ASE 2025, ICSME 2025, ISSRE 2025, ESEM 2025, SANER 2025 — 2025
FSE 2024, ISSRE 2024, ESEM 2024 — 2024
FSE 2023, ICST 2023 — 2023

Conference Organization Roles

Program Chair — QRS 2024, AI Reliability and Security, London, UK
Publicity Chair — ICSSP 2024, ESEM 2024
Diversity Chair — ICSSP 2023

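Forum Chair

ChinaSoft (中国软件大会) - Forum on Intelligent Software Development, Testing, and Maintenance — 2024, Xi'an 🏆 Outstanding Forum Award
ChinaSoft (中国软件大会) - Forum on Large Models and Software Testing — 2023 🏆 Outstanding Forum Award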

🎤 Invited Talks

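2025.11 ChinaSoft (中国软件大会), Wuhan
"Applying LLM Agent Memory Mechanisms to Software Testing and Program Repair"
2024.11 ChinaSoft (中国软件大会), Xi'an
"Reliability Assurance Techniques for Intelligent Algorithms in Sequential Decision-Making Scenarios"
2024.11 ChinaSoft (中国软件大会), Xi'an
"From PhD Student to Researcher: Reflections on Growth Across Changing Roles"
2024.09 Royal Society and Chinese Academy of Sciences AI Ethics Forum, London
"Backdoor Attacks and Defenses for Neural Code Models"
2024.08 AiDD AI+ R&D Digital Summit, Beijing
"User Interface Interaction and Testing Based on Multimodal Large Models"
2024.08 First Huawei Intelligent Testing Forum, Beijing
"LLM-based GUI Testing"
2024.07 Information and Management Branch, Youth Innovation Promotion Association of the Chinese Academy of Sciences, Urumqi
"Intelligent Algorithms and Software Testing"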