Generating API Parameter Security Rules with LLM for API Misuse Detection

Liu, Jinghua; Yang, Yi; Chen, Kai; Lin, Miaoqian

doi:10.14722/ndss.2025.23465

Computer Science > Cryptography and Security

arXiv:2409.09288 (cs)

[Submitted on 14 Sep 2024 (v1), last revised 19 Sep 2024 (this version, v2)]

Title:Generating API Parameter Security Rules with LLM for API Misuse Detection

Authors:Jinghua Liu, Yi Yang, Kai Chen, Miaoqian Lin

View PDF HTML (experimental)

Abstract:In this paper, we present a new framework, named GPTAid, for automatic APSRs generation by analyzing API source code with LLM and detecting API misuse caused by incorrect parameter use. To validate the correctness of the LLM-generated APSRs, we propose an execution feedback-checking approach based on the observation that security-critical API misuse is often caused by APSRs violations, and most of them result in runtime errors. Specifically, GPTAid first uses LLM to generate raw APSRs and the Right calling code, and then generates Violation code for each raw APSR by modifying the Right calling code using LLM. Subsequently, GPTAid performs dynamic execution on each piece of Violation code and further filters out the incorrect APSRs based on runtime errors. To further generate concrete APSRs, GPTAid employs a code differential analysis to refine the filtered ones. Particularly, as the programming language is more precise than natural language, GPTAid identifies the key operations within Violation code by differential analysis, and then generates the corresponding concrete APSR based on the aforementioned operations. These concrete APSRs could be precisely interpreted into applicable detection code, which proven to be effective in API misuse detection. Implementing on the dataset containing 200 randomly selected APIs from eight popular libraries, GPTAid achieves a precision of 92.3%. Moreover, it generates 6 times more APSRs than state-of-the-art detectors on a comparison dataset of previously reported bugs and APSRs. We further evaluated GPTAid on 47 applications, 210 unknown security bugs were found potentially resulting in severe security issues (e.g., system crashes), 150 of which have been confirmed by developers after our reports.

Comments:	Accepted by NDSS Symposium 2025. Please cite this paper as "Jinghua Liu, Yi Yang, Kai Chen, and Miaoqian Lin. Generating API Parameter Security Rules with LLM for API Misuse Detection. In the 32nd Annual Network and Distributed System Security Symposium (NDSS 2025)
Subjects:	Cryptography and Security (cs.CR); Software Engineering (cs.SE)
Cite as:	arXiv:2409.09288 [cs.CR]
	(or arXiv:2409.09288v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2409.09288
Related DOI:	https://doi.org/10.14722/ndss.2025.23465

Submission history

From: Jinghua Liu [view email]
[v1] Sat, 14 Sep 2024 03:34:43 UTC (2,698 KB)
[v2] Thu, 19 Sep 2024 04:15:40 UTC (2,698 KB)

Computer Science > Cryptography and Security

Title:Generating API Parameter Security Rules with LLM for API Misuse Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Generating API Parameter Security Rules with LLM for API Misuse Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators