4.6 Article

Blindspots in Python and Java APIs Result in Vulnerable Code

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.1145/3571850

Keywords

Software vulnerabilities; Java; Python; APIs; API blindspots

Ask authors/readers for more resources

Blindspots in APIs can lead to vulnerabilities and are common. Using APIs with blindspots reduces developers' ability to reason about them, especially in Python. Interestingly, Python developers are less likely to notice vulnerabilities in complex code, while Java developers are more careful with complex code but more careless with simple code. Long-term memory recall improves developers' ability to reason about APIs with blindspots, but other memory factors have no effect. Experience and expertise do not help in recognizing blindspots, and personality traits only affect the ability to reason about Java APIs with blindspots. Overall, tools are needed to help developers recognize blindspots in APIs as they write code.
Blindspots in APIs can cause software engineers to introduce vulnerabilities, but such blindspots are, unfortunately, common. We study the effect APIs with blindspots have on developers in two languages by replicating a 109-developer, 24-Java-API controlled experiment. Our replication applies to Python and involves 129 new developers and 22 new APIs. We find that using APIs with blindspots statistically significantly reduces the developers' ability to correctly reason about the APIs in both languages, but that the effect is more pronounced for Python. Interestingly, for Java, the effect increased with complexity of the code relying on the API, whereas for Python, the opposite was true. This suggests that Python developers are less likely to notice potential for vulnerabilities in complex code than in simple code, whereas Java developers are more likely to recognize the extra complexity and apply more care, but are more careless with simple code. Whether the developers considered API uses to be more difficult, less clear, and less familiar did not have an effect on their ability to correctly reason about them. Developers with better long-term memory recall were more likely to correctly reason about APIs with blindspots, but short-term memory, processing speed, episodic memory, and memory span had no effect. Surprisingly, professional experience and expertise did not improve the developers' ability to reason about APIs with blindspots across both languages, with long-term professionals with many years of experience making mistakes as often as relative novices. Finally, personality traits did not significantly affect the Python developers' ability to reason about APIs with blindspots, but less extroverted and more open developers were better at reasoning about Java APIs with blindspots. Overall, our findings suggest that blindspots in APIs are a serious problem across languages, and that experience and education alone do not overcome that problem, suggesting that tools are needed to help developers recognize blindspots in APIs as they write code that uses those APIs.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available