🤖 Intelligent Software Testing Techniques
From Suspicious Signals to Crashes: Guiding Bug-driven GUI Testing via Code-inspired Tracing
FSE 2026
Think Outside the Box: Automating Inter-App Functionality Testing via Memory Implanting and Reasoning
ICSE 2026
Seeing is Believing: Vision-Driven Non-Crash Functional Bug Detection for Mobile Apps
TSE 2025
Beyond Static GUI: Agent Evolving LLM-based GUI Testing via Dynamic Memory
ASE 2025
Standing on the Shoulders of Giants: Bug-Aware Automated GUI Testing via Retrieval Augmentation
FSE 2025
Unblind Text Inputs: Predicting Hint-text of Text Input in Mobile Apps with LLM
CHI 2024
CrashTranslator: Automatically Reproducing Mobile Application Crashes Directly from Stack Trace
ICSE 2024
Make LLM a Testing Expert: Bringing Human-like Interaction to Mobile GUI Testing
ICSE 2024
Testing the Limits: Unusual Text Inputs Generation for Mobile App Crash Detection
ICSE 2024
A Roadmap for Software Testing in Open-Collaborative and AI-Powered Era
TOSEM 2024
Software Testing with Large Language Models: Survey, Landscape, and Vision
TSE 2024 📖 Survey Paper
Fill in the Blank: Context-aware Automated Text Input Generation for Mobile GUI Testing
ICSE 2023
Ex pede Herculem: Augmenting Activity Transition Graph for Apps via Graph Convolution Network
ICSE 2023
Context-aware Bug Reproduction for Mobile Apps
ICSE 2023
Nighthawk: Fully Automated Localizing UI Display Issues via Visual Understanding
TSE 2023
The Metamorphosis: Automatic Detection of Scaling Issues for Mobile Apps
ASE 2022
Guided Bug Crush: Assist Manual GUI Testing of Android Apps via Hint Moves
CHI 2022
NaviDroid: A Tool for Guiding Manual Android Testing via Hint Moves
ICSE (Demo) 2022
Context- and Fairness-aware In-process Crowdworker Recommendation
TOSEM 2022
Context-Aware Personalized Crowdtesting Task Recommendation
TSE 2022
Characterizing Crowds to Better Optimize Worker Recommendation in Crowdsourced Testing
TSE 2021
OwlEyes-Online: A Fully Automated Platform for Detecting and Localizing UI Display Issues
FSE (Demo) 2021
Owl Eyes: Spotting UI Display Issues via Visual Understanding
ASE 2020
Context-aware In-process Crowdworker Recommendation
ICSE 2020
iSENSE2.0: Improving Completion-aware Crowdtesting Management
TOSEM 2020
Quest for the Golden Approach: Duplicate Crowdtesting Reports Detection
ESEM 2020
iSENSE: Completion-Aware Crowdtesting Management
ICSE 2019
Images Don't Lie: Duplicate Crowdtesting Reports Detection with Screenshot Information
IST 2019
Method-level Test Selection for Continuous Integration
QRS 2019
Domain Adaptation for Test Report Classification in Crowdsourced Testing
ICSE 2017
Local-Based Active Classification of Test Report to Assist Crowdsourced Testing
ASE 2016
🛡️ Testing and Security for Intelligent Algorithms and Agents
Seeing the Whole Elephant: A Benchmark for Failure Attribution in LLM-based Multi-Agent Systems
ACL Main 2026
DEFT: Demystifying VLN Failures via a Unified Dual-View Explainability
ACL Main 2026
Adversarial Attack on Black-Box Multi-Agent by Adaptive Perturbation
AAAI 2026
OntoGuard: Enforcing Action Admissibility for LLM Agents in Complex Interactive Environments
ACL Findings 2026
Where Did It Go Wrong: Capability-Oriented Failure Attribution for Vision-and-Language Navigation Agents
ACL Findings 2026
Understanding Individual Agent Importance in Multi-Agent Reinforcement Learning via Counterfactual Reasoning
AAAI 2025
Demo2Test: Transfer Testing of Agent in Competitive Environment with Failure Demonstrations
TOSEM 2024
Diversity-Oriented Testing for Competitive Game Agent via Constraint-Guided Adversarial Agent Training
TSE 2024
Enhancing Multi-agent System Testing with Diversity-Guided Exploration and Adaptive Critical State Exploitation
ISSTA 2024
Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues
ACL Findings 2024
Automatic Static Vulnerability Detection for Machine Learning Libraries: Are We There Yet?
ISSRE 2023
Fuzzing with Sequence Diversity Inference for Sequential Decision-making Model Testing
ISSRE 2023
The Good, the Bad, and the Missing: Neural Code Generation for Machine Learning Tasks
TOSEM 2023
Automatic Unit Test Generation for Machine Learning Libraries: How Far Are We?
ICSE 2021
API Recommendation for Machine Learning Libraries: How Far Are We?
FSE 2022
Find Bugs in Static Bug Finders
ICPC 2022 🏆 Distinguished Paper Award
🔒 Generative AI Content Safety
SAGE: Synergistic Adaptive Gating of Experts for Hateful Video Detection
ACL Main 2026
All Changes May Have Invariant Principles: Improving Ever-Shifting Harmful Meme Detection
ACL Main 2026
Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
AAAI 2026
Generative Text-to-Image Retrieval via Hierarchical Identifiers and Semantic Internalization
ACL Findings 2026
Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis
ACL Findings 2026
Mimicking the Familiar: Dynamic Command Generation for Information Theft Attacks in LLM Tool-Learning System
ACL Main 2025
Vulnerability of Text-to-Image Models to Prompt Template Stealing
ACL Findings 2025
One Shot Dominance: Knowledge Poisoning Attack on Retrieval-Augmented Generation Systems
EMNLP Findings 2025
From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection
NAACL 2025
Repairing Catastrophic-Neglect in Text-to-Image Diffusion Models via Attention-Guided Feature Enhancement
EMNLP Findings 2024
Defeating Regeneration Attacks by Embedding Watermark into Predicted Noise of Diffusion Models
ICASSP 2024
💻 Intelligent Code Generation, Requirements Enhancement, and More
ExpeRepair: Dual-Memory Enhanced LLM-based Repository-Level Program Repair
FSE 2026
Deep API Sequence Generation via Golden Solution Samples and API Seeds
TOSEM 2025
Vehicle Domain-Specific Language: Unifying Modeling and Code Generation for Low-Code Automotive Development
ASE Industry 2024
Cross-Domain Requirements Linking via Adversarial-based Domain Adaptation
ICSE 2023
An Empirical Study on the Stability of Explainable Software Defect Prediction
APSEC 2023
Characterizing and Understanding Software Security Vulnerabilities in Machine Learning Libraries
MSR 2023
A Semantic-Aware, Fine-Grained Approach to Mining Defects from App Reviews (一种语义感知的细粒度 App 评论缺陷挖掘方法)
Journal of Software (软件学报) 2023
CoCoFuzzing: Testing Neural Code Models with Coverage-Guided Fuzzing
IEEE TR 2023
Where is Your App Frustrating Users?
ICSE 2022
Putting Them under Microscope: A Fine-Grained Approach for Detecting Redundant Test Cases
FSE 2022
Are We Building on the Rock? On the Importance of Data Preprocessing for Code Summarization
FSE 2022
Automated Data Function Extraction from Textual Requirements
IST 2022
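Response-Time-Constrained Code Reviewer Recommendation (响应时间约束的代码评审人推荐方法)
Journal of Software (软件学报) 2020
Research on Integrated Crowdtesting Services for Information Products and Science and Technology Services (信息产品及科技服务集成化众测服务研究)
China Basic Science (中国基础科学) 2020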
Enhancing Unsupervised Requirements Traceability with Sequential Semantics
APSEC 2019