Skip to content

Latest commit

 

History

History
214 lines (159 loc) · 34.1 KB

README.md

File metadata and controls

214 lines (159 loc) · 34.1 KB

⚔🛡 Security of Language Models for Code

This repository provides a summary of recent advancements in the security landscape surrounding Language Models for Code (also known as Neural Code Models), including backdoor, adversarial attacks, corresponding defenses and so on.

NOTE: We collect the original code from the papers and the code we have reproduced. While, our reproduced code is not guaranteed to be fully accurate and is for reference only. For specific issues, please consult the original authors.

arXiv GitHub stars Awesome

Overview

Language Models for Code (CodeLMs) have significantly advanced code-related tasks, they are excel in programming language understanding and generation. Despite their success, CodeLMs are prone to security vulnerabilities, which have become a growing concern. While existing research has explored various attacks and defenses for CodeLMs. To address a systematic review of CodeLM security, this repository organizes the current knowledge on Security Threats and Defense Strategies.

Table of Contents

NOTE: Our paper is labeled with 🚩.

📃Survey

The survey analyzes security threats to CodeLMs, categorizing existing attack types such as backdoor and adversarial attacks, and explores their implications for code intelligence tasks.

Year Conf./Jour. Paper
2024 CoRR Security of Language Models for Code: A Systematic Literature Review 🚩
2024 《软件学报》 深度代码模型安全综述 🚩
2024 CoRR Robustness, Security, Privacy, Explainability, Efficiency, and Usability of Large Language Models for Code.
2023 CoRR A Survey of Trojans in Neural Models of Source Code: Taxonomy and Techniques.

⚔Security Threats

According to the document, security threats in CodeLMs are mainly classified into two categories: backdoor attacks and adversarial attacks. Backdoor attacks occur during the training phase, where attackers implant hidden backdoors in the model, allowing it to function normally on benign inputs but behave maliciously when triggered by specific patterns. In contrast, adversarial attacks happen during the testing phase, where carefully crafted perturbations are added to the input, causing the model to make incorrect predictions with high confidence while remaining undetectable to humans.

An overview of attacks in CodeLMs.

Backdoor Attacks

Backdoor attacks inject malicious behavior into the model during training, allowing the attacker to trigger it at inference time using specific triggers:

  • Data poisoning attacks: Slight changes to the training data that cause backdoor behavior.
Year Conf./Jour. Paper Code Repository Reproduced Repository
2024 ISSTA FDI: Attack Neural Code Generation Systems through User Feedback Channel. Octocat
2024 TSE Stealthy Backdoor Attack for Code Models. Octocat
2024 SP Trojanpuzzle: Covertly Poisoning Code-Suggestion Models. Octocat
2024 TOSEM Poison Attack and Poison Detection on Deep Source Code Processing Models. Octocat
2023 ICPC Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks. Octocat
2023 EMNLP TrojanSQL: SQL Injection against Natural Language Interface to Database. Octocat
2023 ACL Backdooring Neural Code Search. 🚩 Octocat
2022 ICPR Backdoors in Neural Models of Source Code. Octocat
2022 FSE You See What I Want You to See: Poisoning Vulnerabilities in Neural Code Search. Octocat
2021 USENIX Security Explanation-Guided Backdoor Poisoning Attacks Against Malware Classifiers. Octocat
2021 USENIX Security You Autocomplete Me: Poisoning Vulnerabilities in Neural Code Completion.
  • Model poisoning attacks: Changes that do not alter the functionality of the code but trick the model.
Year Conf./Jour. Paper Code Repository Reproduced Repository
2024 Internetware LateBA: Latent Backdoor Attack on Deep Bug Search via Infrequent Execution Codes. Octocat
2023 CoRR BadCS: A Backdoor Attack Framework for Code search.
2023 ACL Multi-target Backdoor Attacks for Code Pre-trained Models. Octocat
2023 USENIX Security PELICAN: Exploiting Backdoors of Naturally Trained Deep Learning Models In Binary Code Analysis. Octocat
2021 USENIX Security You Autocomplete Me: Poisoning Vulnerabilities in Neural Code Completion.

Adversarial Attacks

These attacks manipulate the input data to deceive the model into making incorrect predictions. Including two categories:

  • White-box attacks: Attackers have complete knowledge of the target model, including model structure, weight parameters, and training data.
Year Conf./Jour. Paper Code Repository Reproduced Repository
2023 BdCloud AdvBinSD: Poisoning the Binary Code Similarity Detector via Isolated Instruction Sequences.
2023 CoRR Adversarial Attacks against Binary Similarity Systems.
2023 SANER How Robust Is a Large Pre-trained Language Model for Code Generation𝑓 A Case on Attacking GPT2.
2022 TOSEM Towards Robustness of Deep Program Processing Models - Detection, Estimation, and Enhancement. Octocat
2022 ICECCS Generating Adversarial Source Programs Using Important Tokens-based Structural Transformations.
2021 ICLR Generating Adversarial Computer Programs using Optimized Obfuscations. Octocat
2020 OOPSLA Adversarial Examples for Models of Code. Octocat
2020 ICML Adversarial Robustness for Code. Octocat
2018 CoRR Adversarial Binaries for Authorship Identification.
  • Black-box attacks: Adversaries attackers can only generate adversarial examples by obtaining limited model outputs through model queries.
Year Conf./Jour. Paper Code Repository Reproduced Repository
2024 TOSEM CARL: Unsupervised Code-Based Adversarial Attacks for Programming Language Models via Reinforcement Learning.
2024 FSE Evolutionary Multi-objective Optimization for Contextual Adversarial Example Generation. Octocat
2024 CoRR Investigating Adversarial Attacks in Software Analytics via Machine Learning Explainability.
2024 ICSE MalwareTotal: Multi-Faceted and Sequence-Aware Bypass Tactics against Static Malware Detection. Octocat
2024 TIFS Exploiting the Adversarial Example Vulnerability of Transfer Learning of Source Code. Octocat
2024 CollaborateCom Structural Adversarial Attack for Code Representation Models. Octocat
2024 CoRR Demonstration Attack against In-Context Learning for Code Intelligence. 🚩
2024 JSEP CodeBERT‐Attack: Adversarial Attack against Source Code Deep Learning Models via Pre‐trained Model.
2024 TOSEM How Important Are Good Method Namesin NeuralCode Generation? A Model Robustness Perspective. Octocat
2023 CoRR Deceptprompt: Exploiting llm-driven code generation via adversarial natural language instructions.
2023 AAAI CodeAttack: Code-Based Adversarial Attacks for Pre-trained Programming Language Models. Octocat
2023 PACM PL Discrete Adversarial Attack to Models of Code.
2023 CoRR Adversarial Attacks on Code Models with Discriminative Graph Patterns.
2023 Electronics AdVulCode: Generating Adversarial Vulnerable Code against Deep Learning-Based Vulnerability Detectors.
2023 ACL DIP: Dead code Insertion based Black-box Attack for Programming Language Model.
2023 CoRR A Black-Box Attack on Code Models via Representation Nearest Neighbor Search. Octocat
2023 CoRR SHIELD: Thwarting Code Authorship Attribution.
2023 ASE Code Difference Guided Adversarial Example Generation for Deep Code Models. Octocat
2022 CoRR FuncFooler: A Practical Black-box Attack Against Learning-based Binary Code Similarity Detection Methods.
2022 TOSEM Adversarial Robustness of Deep Code Comment Generation. Octocat
2022 ICSE Natural Attack for Pre-trained Models of Code. Octocat Localcat
2022 ICSE RoPGen: Towards Robust Code Authorship Attribution via Automatic Coding Style Transformation. Octocat
2022 EMNLP TABS: Efficient Textual Adversarial Attack for Pre-trained NL Code Model Using Semantic Beam Search.
2021 TIFS A Practical Black-box Attack on Source Code Authorship Identification Classifiers.
2021 ICST A Search-Based Testing Framework for Deep Neural Networks of Source Code Embedding. Octocat
2021 QRS Generating Adversarial Examples of Source Code Classification Models via Q-Learning-Based Markov Decision Process.
2021 GECCO Deceiving Neural Source Code Classifiers: Finding Adversarial Examples with Grammatical Evolution. Octocat
2021 CoRR On Adversarial Robustness of Synthetic Code Generation. Octocat
2020 CoRR STRATA: Simple, Gradient-free Attacks for Models of Code.
2020 AAAI Generating Adversarial Examples for Holding Robustness of Source Code Processing Models. Octocat
2019 USENIX Security Misleading Authorship Attribution of Source Code using Adversarial Learning.
2019 CODASPY Adversarial Authorship Attribution in Open-source Projects.

🛡Defensive Strategies

In response to the growing security threats, researchers have proposed various defense mechanisms:

Backdoor Defense

Methods for defending against backdoor attacks include:

Year Conf./Jour. Paper Code Reporisty Reproduced Reporisty
2024 TOSEM Poison Attack and Poison Detection on Deep Source Code Processing Models.
2024 CoRR Eliminating Backdoors in Neural Code Models via Trigger Inversion. 🚩
2024 CoRR Defending Code Language Models against Backdoor Attacks with Deceptive Cross-Entropy Loss. Octocat
2023 CoRR Occlusion-based Detection of Trojan-triggering Inputs in Large Language Models of Code.
2022 ICPR Backdoors in Neural Models of Source Code. Octocat

Adversarial Defense

Approaches to counter adversarial attacks include:

Year Conf./Jour. Paper Code Reporisty Reproduced Reporisty
2024 TOSEM How Important Are Good Method Namesin NeuralCode Generation? A Model Robustness Perspective. Octocat
2023 ICSE RoPGen: Towards Robust Code Authorship Attribution via Automatic Coding Style Transformation. Octocat
2023 PACM PL Discrete Adversarial Attack to Models of Code.
2023 CCS Large Language Models for Code: Security Hardening and Adversarial Testing. Octocat
2023 CoRR Enhancing Robustness of AI Offensive Code Generators via Data Augmentation.
2022 SANER Semantic Robustness of Models of Source Code. Octocat
2022 COLING Semantic-Preserving Adversarial Code Comprehension. Octocat

Citation

If you find this repository useful for your work, please include the following citation:

@article{2024-Security-of-CodeLMs,
  title={Security of Language Models for Code: A Systematic Literature Review},
  author={Yuchen Chen and Weisong Sun and Chunrong Fang and Zhenpeng Chen and Yifei Ge and Tingxu Han and Quanjun Zhang and Yang Liu and Zhenyu Chen and Baowen Xu},
  journal={arXiv preprint arXiv:2410.15631},
  year={2024}
}