Fetch data for all installers, not only pip #39
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Helps #36.
pypinfo --all
tells it to fetch data for all installers, and not to filter to only pip (the default).(
--all
was added to pypinfo in ofek/pypinfo#51.)This results in a cheaper BigQuery call: the pip-only query costs an extra 25% in bytes processed, in bytes billed, and in dollars; see #36 (comment).
pip is still by far the most popular installer, and unsurprising uv is up there too, with about 10% of pip's downloads.
The others are 10% of uv or less. A lot of them are mirroring services, that we wanted to exclude before.
I think given uv's importance, and my expectation that it will continue to take a bigger share of the pie, plus especially the extra cost for filtering by just pip, means that we should switch to fetching data for all downloaders. Plus the others don't account for that much of the pie.