We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
I am exploring to use esgf-pyclient to get a list of all retracted CMIP6 datasets (for our automated maintenance of Pangeo CMIP6 cloud data.
I am trying the following:
from pyesgf.search import SearchConnection conn = SearchConnection( 'https://esgf-node.llnl.gov/esg-search', distrib=True, ) ctx = conn.new_context(mip_era='CMIP6', retracted=True, replica=False, fields='id', facets=['doi']) ctx.hit_count
And I get back a hit count of 691984
691984
But when I try to extract a list of instance_ids
results = ctx.search(batch_size=10000) retracted = [ds.dataset_id for ds in results] len(retracted)
The list only has 240000 elements. That very even number makes me think that there is some internal limit I am hitting here?
240000
Or did I miss something in the above code?
Any help on this would be greatly appreciated.
The text was updated successfully, but these errors were encountered:
No branches or pull requests
I am exploring to use esgf-pyclient to get a list of all retracted CMIP6 datasets (for our automated maintenance of Pangeo CMIP6 cloud data.
I am trying the following:
And I get back a hit count of
691984
But when I try to extract a list of instance_ids
The list only has
240000
elements. That very even number makes me think that there is some internal limit I am hitting here?Or did I miss something in the above code?
Any help on this would be greatly appreciated.
The text was updated successfully, but these errors were encountered: