-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve discoverability and sharing of notebooks created in VEDA Hub #4
Comments
Doesn't exist. Currently relies on users self sharing often via github. MAAP has this same issue around algorithms which are more formal. Some prior art to consider: |
All we have is veda-docs mounted into JupyterHub containers by default, but that is very very bare-bones. We do not even link to the docs anywhere. So, the connectivity between docs and JupyterHub could definitely be improved - even without new convenience features for adding or editing notebooks, just by integrating the two better, like
I think once these tutorial / docs repos are more prominent, people will want to contribute more to them, and then we may need to make that more convenient. |
@j08lue I don't think this ticket was talking about Docs, I think it's talking about a vibrant community library where notebooks of any kind can be shared. It's more akin to public gists, or putting notebooks in a reproducibility archive like Zenodo or OpenScienceFramework where others might search and discover. We also don't want contributions to the VEDA Docs beyond VEDAHub specific nuances and references to external docs like Openscapes, unless we plan to follow the Planetary Computer model 🤔 . |
Interesting! 2i2c has an award from NASA that might be a nice way to explore some of these ideas. We could:
The ephemeral hub could be set up with small compute resource and made available to the public. Alternatively, the hub could perhaps be set up with large compute resources behind a log in gate controlled by VEDA. |
1 similar comment
Interesting! 2i2c has an award from NASA that might be a nice way to explore some of these ideas. We could:
The ephemeral hub could be set up with small compute resource and made available to the public. Alternatively, the hub could perhaps be set up with large compute resources behind a log in gate controlled by VEDA. |
That's still a curated set of notebooks, I think the original intent of this ticket was around user to user sharing that does not require platform based curation. |
Just noting that I think this use-case might be well-served by functionality like what we're discussing in this MyST issue: This is currently at the level of the MyST document engine (more at https://mystmd.org) but if it could be brought into JupyterLab via JupyterLab-MyST maybe (?) it could be rendered in JupyterLab as well. Other thoughts for inspiration: It's similar to the Jupyter Book gallery (if you could add a "filter by dataset" feature) https://executablebooks.org/en/latest/gallery This one is very simple, you just add an entry to this YML file and it pops up there. Very low-complexity :-) Or the Pangeo gallery (if you removed the hard dependence on BinderHub): You could give users a metadata specification to follow (e.g., Or for a "hacky but possible right now" approach, you could follow an example like this dynamic "list of running hubs" in the 2i2c infrastructure docs: https://infrastructure.2i2c.org/reference/hubs/ Which renders a CSV file that is generated at build time from a Python script, and then uses a little JavaScript tool called |
We should have a good way for users to publish and share notebooks that they create in VEDA Hub. Users should have a good way to find existing notebooks, searching by datasets that they use, or themes.
Creating this as a broad ticket for now - there's likely a few different implementation pathways here, that would depend on the exact use-cases we want to fulfill, as well as the level of "publicness" we want for these shared notebooks.
@wildintellect will be great to understand from you how this is being done currently (or not), and what we should hope to accomplish from this feature that can inform the approach we take to building it out.
Likely what we want is some UI integration into Jupyter Lab that would let users easily push notebooks to a git repository, and add some metadata, and then build some UI that can consume metadata from the git repository and provide an interface to search and run the notebooks.
This will take a bit of research and thought to spec more precisely, but ticketing to capture the broad sentiment / idea that we do want an easy way for users to share and discover notebooks related to datasets on VEDA.
The text was updated successfully, but these errors were encountered: