王俊杰 - Junjie Wang

👤 About Me

我是中国科学院软件研究所的研究员，博士生导师，智能博弈重点实验室副主任（主持工作），主要从事智能软件工程研究。我们团队研究工作涵盖AI驱动的软件开发与测试智能化、面向AI系统的测试与安全、AIGC内容安全等。

I am a research scientist and doctoral supervisor at the Institute of Software, Chinese Academy of Sciences (ISCAS). I am also the deputy director (acting) of the Key Laboratory of Intelligent Gaming. My research focuses on Intelligent Software Engineering, including AI-driven software development and testing, testing and security for AI systems, and AIGC content security.

🎓 招生与招聘
实验室每年约有2个直博、10个硕士名额，欢迎感兴趣同学邮件联系！
实验室长期开放副研究员、特别研究助理、博士后等岗位，欢迎邮件联系！

📖 Experience

2025.02 - 至今 智能博弈重点实验室副主任（主持工作），中国科学院软件研究所
Deputy Director (Acting), Key Laboratory of Intelligent Gaming, ISCAS
2022.10 - 至今 研究员，中国科学院软件研究所
Research Scientist, ISCAS
2017.10 - 2022.10 副研究员，中国科学院软件研究所
Associate Research Scientist, ISCAS
2017.09 - 2018.09 访问学者，美国北卡罗来纳州立大学
Visiting Scholar, NC State University, USA
2015.07 - 2017.10 助理研究员，中国科学院软件研究所
Assistant Research Scientist, ISCAS

🔥 News

[2026.05] 实验室博士生沐方文通过评审答辩，推荐中国科学院院长奖，博士生陈孟卓通过评审答辩，获三好学生标兵！

[2026.05] 实验室2名博士、6名硕士毕业论文顺利通过答辩！

[2026.05] 大模型智能体生成内容安全论文被ICML 2026（CCF A）录用！之前工作主要关注大模型的内容安全，我们工作首次探索了智能体生成风险内容的机理以及相应的攻击防御手段！

[2026.04] 8篇论文被ACL 2026（CCF A）录用，涉及大模型智能体故障归因、内容安全检测、智能体安全、智能体可解释等方面！

[2026.04] 实验室2名博士、6名硕士毕业论文全盲审意见返回，全部优良！

[2026.04] 实验室组织《神经符号协同推理现状及思考》"零电力"研讨会！本年度创新形式，除报告人电脑，参会人员不能携带电子设备，“拔掉电源，打开脑洞”！

💡 Research Interests

🤖 AI 驱动的软件工程

大模型辅助软件测试 / LLM-assisted Software Testing
基于大模型的移动应用测试 / LLM-based Mobile App Testing
智能化代码生成与缺陷修复 / Intelligent Code Generation & Bug Fixing
智能化软件过程服务 / Intelligent Software Process Services

🛡️ 面向AI系统的测试与安全

大模型智能体故障归因与增强 / LLM Agent Failure Attribution & Enhancement
多智能体强化学习测试 / Multi-Agent RL Testing
大模型/智能体安全 / LLM/Agent Security
生成式人工智能内容安全 / AIGC Content Security

📷 Gallery

2024年9月教师节

2025年9月教师节

ChinaSoft 2025 武汉

零电力研讨会202604

午餐会 20260430

2026博士答辩-常志远

2026博士答辩-沐方文

2026硕士答辩

🤖 智能化的软件测试技术

From Suspicious Signals to Crashes: Guiding Bug-driven GUI Testing via Code-inspired Tracing

FSE 2026

Think Outside the Box: Automating Inter-App Functionality Testing via Memory Implanting and Reasoning

ICSE 2026

Seeing is Believing: Vision-Driven Non-Crash Functional Bug Detection for Mobile Apps

TSE 2025

Beyond Static GUI: Agent Evolving LLM-based GUI Testing via Dynamic Memory

ASE 2025

Standing on the Shoulders of Giants: Bug-Aware Automated GUI Testing via Retrieval Augmentation

FSE 2025

Unblind Text Inputs: Predicting Hint-text of Text Input in Mobile Apps with LLM

CHI 2024

CrashTranslator: Automatically Reproducing Mobile Application Crashes Directly from Stack Trace

ICSE 2024

Make LLM a Testing Expert: Bringing Human-like Interaction to Mobile GUI Testing

ICSE 2024

Testing the Limits: Unusual Text Inputs Generation for Mobile App Crash Detection

ICSE 2024

A Roadmap for Software Testing in Open-Collaborative and AI-Powered Era

TOSEM 2024

Software Testing with Large Language Models: Survey, Landscape, and Vision

TSE 2024 📖 综述论文

Fill in the Blank: Context-aware Automated Text Input Generation for Mobile GUI Testing

ICSE 2023

Ex pede Herculem: Augmenting Activity Transition Graph for Apps via Graph Convolution Network

ICSE 2023

Context-aware Bug Reproduction for Mobile Apps

ICSE 2023

Nighthawk: Fully Automated Localizing UI Display Issues via Visual Understanding

TSE 2023

The Metamorphosis: Automatic Detection of Scaling Issues for Mobile Apps

ASE 2022

Guided Bug Crush: Assist Manual GUI Testing of Android Apps via Hint Moves

CHI 2022

NaviDroid: A Tool for Guiding Manual Android Testing via Hint Moves

ICSE (Demo) 2022

Context- and Fairness-aware In-process Crowdworker Recommendation

TOSEM 2022

Context-Aware Personalized Crowdtesting Task Recommendation

TSE 2022

Characterizing Crowds to Better Optimize Worker Recommendation in Crowdsourced Testing

TSE 2021

OwlEyes-Online: A Fully Automated Platform for Detecting and Localizing UI Display Issues

FSE (Demo) 2021

Owl Eyes: Spotting UI Display Issues via Visual Understanding

ASE 2020

Context-aware In-process Crowdworker Recommendation

ICSE 2020

iSENSE2.0: Improving Completion-aware Crowdtesting Management

TOSEM 2020

Quest for the Golden Approach: Duplicate Crowdtesting Reports Detection

ESEM 2020

iSENSE: Completion-Aware Crowdtesting Management

ICSE 2019

Images Don't Lie: Duplicate Crowdtesting Reports Detection with Screenshot Information

IST 2019

Method-level Test Selection for Continuous Integration

QRS 2019

Domain Adaptation for Test Report Classification in Crowdsourced Testing

ICSE 2017

Local-Based Active Classification of Test Report to Assist Crowdsourced Testing

ASE 2016

🛡️ 面向智能算法/智能体的测试和安全

Seeing the Whole Elephant: A Benchmark for Failure Attribution in LLM-based Multi-Agent Systems

ACL Main 2026

DEFT: Demystifying VLN Failures via a Unified Dual-View Explainability

ACL Main 2026

Adversarial Attack on Black-Box Multi-Agent by Adaptive Perturbation

AAAI 2026

OntoGuard: Enforcing Action Admissibility for LLM Agents in Complex Interactive Environments

ACL Findings 2026

Where Did It Go Wrong: Capability-Oriented Failure Attribution for Vision-and-Language Navigation Agents

ACL Findings 2026

Understanding Individual Agent Importance in Multi-Agent Reinforcement Learning via Counterfactual Reasoning

AAAI 2025

Demo2Test: Transfer Testing of Agent in Competitive Environment with Failure Demonstrations

TOSEM 2024

Diversity-Oriented Testing for Competitive Game Agent via Constraint-Guided Adversarial Agent Training

TSE 2024

Enhancing Multi-agent System Testing with Diversity-Guided Exploration and Adaptive Critical State Exploitation

ISSTA 2024

Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues

ACL Findings 2024

Automatic Static Vulnerability Detection for Machine Learning Libraries: Are We There Yet?

ISSRE 2023

Fuzzing with Sequence Diversity Inference for Sequential Decision-making Model Testing

ISSRE 2023

The Good, the Bad, and the Missing: Neural Code Generation for Machine Learning Tasks

TOSEM 2023

Automatic Unit Test Generation for Machine Learning Libraries: How Far Are We

ICSE 2021

API Recommendation for Machine Learning Libraries: How Far Are We

FSE 2022

Find Bugs in Static Bug Finders

ICPC 2022 🏆 Distinguished Paper Award

🔒 生成式人工智能内容安全

SAGE: Synergistic Adaptive Gating of Experts for Hateful Video Detection

ACL Main 2026

All Changes May Have Invariant Principles: Improving Ever-Shifting Harmful Meme Detection

ACL Main 2026

Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems

AAAI 2026

Generative Text-to-Image Retrieval via Hierarchical Identifiers and Semantic Internalization

ACL Findings 2026

Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis

ACL Findings 2026

Mimicking the Familiar: Dynamic Command Generation for Information Theft Attacks in LLM Tool-Learning System

ACL Main 2025

Vulnerability of Text-to-Image Models to Prompt Template Stealing

ACL Findings 2025

One Shot Dominance: Knowledge Poisoning Attack on Retrieval-Augmented Generation Systems

EMNLP Findings 2025

From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection

NAACL 2025

Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement

EMNLP Findings 2024

Defeating Regeneration Attacks by Embedding Watermark into Predicted Noise of Diffusion Models

ICASSP 2024

💻 智能化代码生成、需求增强等

ExpeRepair: Dual-Memory Enhanced LLM-based Repository-Level Program Repair

FSE 2026

Deep API Sequence Generation via Golden Solution Samples and API Seeds

TOSEM 2025

Vehicle Domain-Specific Language: Unifying Modeling and Code Generation for Low-Code Automotive Development

ASE Industry 2024

Cross-Domain Requirements Linking via Adversarial-based Domain Adaptation

ICSE 2023

An Empirical Study on the Stability of Explainable Software Defect Prediction

APSEC 2023

Characterizing and Understanding Software Security Vulnerabilities in Machine Learning Libraries

MSR 2023

一种语义感知的细粒度 App 评论缺陷挖掘方法

软件学报 2023

CoCoFuzzing: Testing Neural Code Models with Coverage-Guided Fuzzing

ACM TR 2023

Where is Your App Frustrating Users?

ICSE 2022

Putting Them under Microscope: A Fine-Grained Approach for Detecting Redundant Test Cases

FSE 2022

Are We Building on the Rock? On the Importance of Data Preprocessing for Code Summarization

FSE 2022

Automated Data Function Extraction from Textual Requirements

IST 2022

响应时间约束的代码评审人推荐方法

软件学报 2020

信息产品及科技服务集成化众测服务研究

中国基础科学 2020

Enhancing Unsupervised Requirements Traceability with Sequential Semantics

APSEC 2019

在读博士研究生

马序言博士研究生（硕博连读） 2021.9 入学
陈建明博士研究生 2023.9 入学
陈孟卓博士研究生（硕博连读） 2023.9 入学
王文硕博士研究生 2024.9 入学
冯宦翔博士研究生（硕博连读） 2025.9 入学

在读硕士研究生

薛阳光 (2024.9 入学)
代易涵 (2025.9 入学)

已毕业博士

刘哲 (2018-2023) → first job: 中科院软件所特别研究助理
🏆 中科院院长奖、北京市三好学生、ACM SRC全球总冠军、6篇CCF A（一作）
黄悦凯 (2018-2024) → first job: 中科院软件所特别研究助理
1篇CCF A（一作）、3篇CCF B
江子攸 (2019-2025) → first job: 中科院软件所特别研究助理
🏆 中科院院长奖、6篇CCF A（一作）、ASE杰出论文奖
黄芋超 (2019-2025) → first job: 德国慕尼黑工业大学博后
3篇CCF A（一作）、ICPC（CCF B）杰出论文奖
沐方文 (2020-2026) → first job: 腾讯 AI Coding
🏆 国家奖学金、三好学生标兵、7篇CCF A（一作/共一作）
常志远 (2020-2026) → first job: 蚂蚁 AI Security
5篇CCF A（一作）、1篇CCF B

已毕业硕士

苏宇辉 (2020-2023) → first job: 字节跳动质量保障
明旭冉 (2020-2023) → first job: 中科院软件所工程师
王凯锐 (2021-2024) → first job: 阿里巴巴高德地图
闫熠光 (2021-2024) → first job: 字节跳动国际电商
车行 (2022-2025) → first job: 字节跳动广告
李澄 (2022-2025) → first job: 美国华盛顿大学读博
秦昊 (2022-2025) → first job: 江苏省选调生
王浩伟 (2023-2026) → first job: 阿里广告
张濡芃 (2023-2026) → first job: 腾讯云

🏆 Honors & Awards

2025 ACL 2025 SAC Highlights

2024 CHI 2024 Best Paper Honorable Mention

2023 APSEC 2023 Distinguished Paper Award

2022 ICPC 2022 ACM SIGSOFT Distinguished Paper Award

2020 ICSE 2020 ACM SIGSOFT Distinguished Paper Award

2019 ICSE 2019 ACM SIGSOFT Distinguished Paper Award

2019 QRS 2019 IEEE Best Paper Award

🎓 Academic Service

期刊任职

Associate Editor — IEEE Transactions on Software Engineering（现任）

Review Board — Automated Software Engineering Journal（现任）

程序委员会成员

ICSE 2027, FSE 2026, ISSTA 2026, ASE 2025, ICSME 2026 — 2026

ICSE 2025, ASE 2025, ICSME 2025, ISSRE 2025, ESEM 2025, SANER 2025 — 2025

FSE 2024, ISSRE 2024, ESEM 2024 — 2024

FSE 2023, ICST 2023 — 2023

会议组织职务

Program Chair — QRS 2024, AI Reliability and Security, UK London

Publicity Chair — ISCCP 2024, ESEM 2024

Diversity Chair — ICSSP 2023

论坛主席

中国软件大会 - 智能化软件开发、测试和维护论坛 — 2024 西安 🏆 优秀论坛奖

中国软件大会 - 大模型与软件测试论坛 — 2023 🏆 优秀论坛奖

🎤 Invited Talks

2025.11 ChinaSoft 中国软件大会，武汉

《软件测试和程序修复中的大模型智能体记忆机制应用》

2024.11 ChinaSoft 中国软件大会，西安

《面向连续决策场景智能算法的可靠性保障技术》

2024.11 ChinaSoft 中国软件大会，西安

《从博士生到科研人：在不同身份转换间体会成长感悟》

2024.09 英国皇家学会-中国科学院 AI Ethics 论坛，伦敦

《Backdoor Attacks and Defenses for Neural Code Models》

2024.08 AiDD AI+研发数字峰会，北京

《基于多模态大模型的用户界面交互和测试》

2024.08 首届华为智能化测试论坛，北京

《基于大模型的界面测试》

2024.07 中国科学院青年创新促进会信息与管理分会，乌鲁木齐

《智能算法与软件测试》