Struggling to use github_my_repository with a lot of repos (50k+) #461

Closed
chris-garrett opened this issue Nov 2, 2024 · 2 comments
Labels
bug Something isn't working

Comments

@chris-garrett

Describe the bug
Running select * from github.github_my_repository results in timeouts and rate-limit errors:

SQL Error [HV000]: ERROR: rpc error: code = Unknown desc = github: non-200 OK status code: 403 Forbidden body: "{\n  \"documentation_url\": \"https://docs.github.com/free-pro-team@latest/rest/overview/rate-limits-for-the-rest-api#about-secondary-rate-limits\",\n  \"message\": \"You have exceeded a secondary rate limit. Please wait a few minutes before you try again. If you reach out to

Steampipe version (steampipe -v)
Steampipe v1.0.0

Plugin version (steampipe plugin list)

+-----------------------------------------------+---------+-------------+
| Installed                                     | Version | Connections |
+-----------------------------------------------+---------+-------------+
| hub.steampipe.io/plugins/turbot/github@latest | 1.0.0   | github      |
| hub.steampipe.io/plugins/turbot/jira@latest   | 1.0.0   | jira        |
+-----------------------------------------------+---------+-------------+

To reproduce
Run the query above.

Expected behavior
The query returns results.

Additional context
I did a bit of poking around and I think that the plugin is pulling a large number of forks that I have nothing to do with. I'm attached to the EpicGames repo and it looks like it's trying to pull 50k+ forks from there?

I'm also surprised that I'm hitting rate limits at all. There is a well-known limit of 5k requests per hour, and I would have expected this plugin to respect that limit as a sane default.
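As a rough illustration of the arithmetic behind that expectation, here is a minimal Python sketch. It is not the plugin's code: the function names are hypothetical, and the page size of 100 items per request is an assumption about how a client might paginate. Note also that the error quoted above mentions a secondary rate limit, which GitHub documents separately from the hourly quota, so pacing against the hourly budget alone may not be enough to avoid it.

```python
import math


def pages_needed(item_count: int, page_size: int = 100) -> int:
    """Number of paginated requests needed to list item_count items.

    page_size=100 is an assumed value, not something taken from the plugin.
    """
    return math.ceil(item_count / page_size)


def min_interval_seconds(requests_per_hour: int = 5000) -> float:
    """Even spacing between requests that stays within an hourly budget."""
    return 3600 / requests_per_hour


if __name__ == "__main__":
    pages = pages_needed(50_000)
    interval = min_interval_seconds()
    print(f"requests needed: {pages}")
    print(f"seconds between requests: {interval:.2f}")
```

Under these assumptions, listing 50k repositories takes far fewer than 5,000 requests, which is consistent with the 403 coming from the secondary limit rather than the hourly quota.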

@chris-garrett chris-garrett added the bug Something isn't working label Nov 2, 2024
@misraved misraved self-assigned this Nov 18, 2024
@misraved
Contributor

Apologies for the delayed response on this issue @chris-garrett!!

GitHub's rate limits are very strict, and the API often returns errors once they are exceeded.

We do have a GitHub issue where we have some discussions/suggestions on how one might be able to control the rate limits - #398

Unfortunately, we haven't been able to get to a meaningful solution yet, but as a workaround, you could try out Pipes which should be able to handle large datasets.

@misraved
Contributor

Please feel free to add your observations in #398. Your input would be greatly appreciated and will help us address the matter more effectively. Thank you!


2 participants