-
Notifications
You must be signed in to change notification settings - Fork 114
Project Ideas Improve RPM package license detection
The goal of this project is to improve RPM package license detection across the board.
scancode-toolkit's RPM package license detection is not great. Usually
this is because a declared_license
value has not been collected correctly, and has not
be made sense of.
This project would be a mix of adding new license detection rules to scancode, adding new and improved code to handle the specific patterns of license, creating new license mappings and possibly working with upstream maintainers to improve their license declarations. The approach should be to start with a complete data set of all package manifests and find patterns of license issues and establish the baseline, possibly with classifiers and ML. The end results should be a significant improvement to the license detection quality for the RPM packages.
See this ticket for additional details: https://github.com/nexB/scancode-toolkit/issues/2412
-
Level
- Intermediate
-
Tech
- Python
- URLS
-
Mentors
- @pombredanne https://github.com/pombredanne
- @AyanSinhaMahapatra https://github.com/AyanSinhaMahapatra