4.6 Article

LiDetector: License Incompatibility Detection for Open Source Software

出版社

ASSOC COMPUTING MACHINERY
DOI: 10.1145/3518994

关键词

Open source software; license; incompatibility detection

向作者/读者索取更多资源

Open-source software licenses determine the conditions for reusing, distributing, and modifying software. Custom licenses allow developers to create their own licenses with more flexible descriptions. To avoid financial and legal risks, it is crucial to ensure license compatibility when using third-party packages or code with licenses. LiDetector, a proposed tool, can extract and interpret OSS licenses, including custom licenses, and detect license incompatibility. It outperforms existing methods in terms of precision and accuracy, and reveals a high percentage of projects suffering from license incompatibility.
Open-source software (OSS) licenses dictate the conditions, which should be followed to reuse, distribute, and modify software. Apart from widely-used licenses such as the MIT License, developers are also allowed to customize their own licenses (called custom license), whose descriptions are more flexible. The presence of such various licenses imposes challenges to understand licenses and their compatibility. To avoid financial and legal risks, it is essential to ensure license compatibility when integrating third-party packages or reusing code accompanied with licenses. In this work, we propose LiDetector, an effective tool that extracts and interprets OSS licenses (including both official licenses and custom licenses), and detects license incompatibility among these licenses. Specifically, LiDetector introduces a learning-based method to automatically identify meaningful license terms from an arbitrary license, and employs Probabilistic Context-Free Grammar (PCFG) to infer rights and obligations for incompatibility detection. Experiments demonstrate that LiDetector outperforms existing methods with 93.28% precision for term identification, and 91.09% accuracy for right and obligation inference, and can effectively detect incompatibility with 10.06% FP rate and 2.56% FN rate. Furthermore, with LiDetector, our large-scale empirical study on 1,846 projects reveals that 72.91% of the projects are suffering from license incompatibility, including popular ones such as the MIT License and the Apache License. We highlighted lessons learned from perspectives of different stakeholders and made all related data and the replication package publicly available to facilitate follow-up research.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据