ESGF to Google Cloud CMIP6 workflow

This is a simplification of my old workflow. It still handles requests, but the collection is now primarily updated by cycling through popular searches. This is done by simply specifying the keyword values of the type of data desired. As before, the ESGF repo is searched for relevant data and a comparison with the GC catalog is made to see what new data is available.

The notebook GetSpecified.ipynb is now what I use day-to-day to update our GC collection.

The notebook GetRequested.ipynb is now what I use day-to-day to handle data requests (may be be phased out in the future)

The esm-collection-spec CSV file for the Pangeo Google Cloud datasets is pangeo-cmip6.csv.

My working request tracker for updating the Google Cloud CMIP6 zarr collection is here.

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
CVs		CVs
ESDOC		ESDOC
csv		csv
csv1		csv1
csv2		csv2
.gitignore		.gitignore
GetRequested.ipynb		GetRequested.ipynb
GetSpecified.ipynb		GetSpecified.ipynb
Hints.ipynb		Hints.ipynb
Instructions.txt		Instructions.txt
Intake-esm.ipynb		Intake-esm.ipynb
ListReplacedVersions.ipynb		ListReplacedVersions.ipynb
MakeCloudCat_GC.ipynb		MakeCloudCat_GC.ipynb
MakeCloudCat_S3.ipynb		MakeCloudCat_S3.ipynb
NZS-testing.ipynb		NZS-testing.ipynb
README.md		README.md
TestDataNodes.ipynb		TestDataNodes.ipynb
myconfig.py		myconfig.py
mydataset.py		mydataset.py
myidentify.py		myidentify.py
myrequest.py		myrequest.py
myresponse.py		myresponse.py
mysearch.py		mysearch.py
mysets.py		mysets.py
mytasks.py		mytasks.py
myutilities.py		myutilities.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ESGF to Google Cloud CMIP6 workflow

About

Releases

Packages

Languages

naomi-henderson/cmip6collect2

Folders and files

Latest commit

History

Repository files navigation

ESGF to Google Cloud CMIP6 workflow

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages