Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug - rake geocombine:clone errs out #144

Closed
ewlarson opened this issue Mar 9, 2023 · 5 comments
Closed

Bug - rake geocombine:clone errs out #144

ewlarson opened this issue Mar 9, 2023 · 5 comments
Milestone

Comments

@ewlarson
Copy link
Contributor

ewlarson commented Mar 9, 2023

I've been playing around with GeoCombine for Aardvark metadata harvesting.

Harvesting individual institutions has been working well:

bundle exec rake geocombine:clone\[edu.umn\]
=> 5480 docs

But cloning all the repos, fails:

First run

ewlarson@beanburrito GeoDiscovery % bundle exec rake geocombine:clone
rake aborted!
SystemStackError: stack level too deep
/Users/ewlarson/.rbenv/versions/3.2.1/bin/bundle:25:in `load'
/Users/ewlarson/.rbenv/versions/3.2.1/bin/bundle:25:in `<main>'
Tasks: TOP => geocombine:clone
(See full trace by running task with --trace)
ewlarson@beanburrito GeoDiscovery % cd tmp/opengeometadata 
ewlarson@beanburrito opengeometadata % ls -la
total 0
drwxr-xr-x  10 ewlarson  staff  320 Mar  9 08:19 .
drwxr-xr-x  15 ewlarson  staff  480 Mar  9 08:18 ..
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:19 edu.harvard
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:19 edu.nyu
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:19 edu.princeton.arks
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:18 edu.stanford.purl
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:19 edu.tufts
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:19 edu.umn
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:19 edu.virginia
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:18 shared-repository

Second run

ewlarson@beanburrito GeoDiscovery % bundle exec rake geocombine:clone
rake aborted!
SystemStackError: stack level too deep
/Users/ewlarson/.rbenv/versions/3.2.1/bin/bundle:25:in `load'
/Users/ewlarson/.rbenv/versions/3.2.1/bin/bundle:25:in `<main>'
Tasks: TOP => geocombine:clone
(See full trace by running task with --trace)
ewlarson@beanburrito GeoDiscovery % cd tmp/opengeometadata 
ewlarson@beanburrito opengeometadata % ls -la
total 0
drwxr-xr-x  10 ewlarson  staff  320 Mar  9 08:43 .
drwxr-xr-x  15 ewlarson  staff  480 Mar  9 08:41 ..
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:43 edu.harvard
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:43 edu.nyu
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:42 edu.princeton.arks
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:41 edu.stanford.purl
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:43 edu.tufts
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:43 edu.umn
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:43 edu.virginia
drwxr-xr-x   3 ewlarson  staff   96 Mar  9 08:41 shared-repository

Can anyone else confirm? Seems to likely StackError the same place each clone run...

@thatbudakguy
Copy link
Member

Can you do a full trace? I wonder if this is being thrown from the ruby git client or something. I think I might've seen it once or twice when testing but I haven't managed to replicate it recently.

@kaloyan13
Copy link

kaloyan13 commented Mar 23, 2023

Saw the same issue, when trying to execute bundle exec rake geocombine:clone --trace
trace.txt

Cloging only edu.nyu worked bundle exec rake geocombine:clone[edu.nyu]

Working on Ubuntu 22.04 with these versions:

$ ruby -v
ruby 3.2.0 (2022-12-25 revision a528908271) [x86_64-linux]
$ gem -v
3.4.9
$ bundle -v
Bundler version 2.4.9

thatbudakguy added a commit that referenced this issue Mar 27, 2023
Providing both `directory` and `path` to Git.clone results in
the repo being cloned into a directory nested inside itself.

This may be a fix for #144.
thatbudakguy added a commit that referenced this issue Mar 27, 2023
Providing both `directory` and `path` to Git.clone results in
the repo being cloned into a directory nested inside itself.

This may be a fix for #144.
@thatbudakguy
Copy link
Member

Thanks for the trace — I think this might've been related to arguments I was passing to Git.clone. Hopefully #148 will resolve that.

It could also be related to the behavior that happens when we try to clone a Git repository with no contents (of which there are a few in OGM); #139 has better behavior to handle that.

Unfortunately PRs are failing because a new rubocop is angry about one of our tests being skipped without a reason; #147 fixes that issue.

@thatbudakguy thatbudakguy added this to the v0.8 milestone Mar 29, 2023
thatbudakguy added a commit that referenced this issue Mar 30, 2023
Providing both `directory` and `path` to Git.clone results in
the repo being cloned into a directory nested inside itself.

This may be a fix for #144.
@thatbudakguy
Copy link
Member

3f4389e removes the recursion which I think might've been the cause of this.

@ewlarson
Copy link
Contributor Author

This is fixed! I was able to clone all the repos...

ewlarson@beanburrito .internal_test_app % bundle exec rake geocombine:clone --trace
** Invoke geocombine:clone (first_time)
** Execute geocombine:clone
Cloned https://github.com/OpenGeoMetadata/shared-repository.git
Cloned https://github.com/OpenGeoMetadata/edu.stanford.purl.git
Cloned https://github.com/OpenGeoMetadata/edu.princeton.arks.git
Cloned https://github.com/OpenGeoMetadata/edu.virginia.git
Cloned https://github.com/OpenGeoMetadata/edu.nyu.git
Cloned https://github.com/OpenGeoMetadata/edu.harvard.git
Cloned https://github.com/OpenGeoMetadata/edu.umn.git
Cloned https://github.com/OpenGeoMetadata/edu.tufts.git
Cloned https://github.com/OpenGeoMetadata/edu.columbia.git
Cloned https://github.com/OpenGeoMetadata/edu.lclark.git
Cloned https://github.com/OpenGeoMetadata/gov.data.git
Cloned https://github.com/OpenGeoMetadata/geobtaa.git
Cloned https://github.com/OpenGeoMetadata/edu.uarizona.git
Cloned https://github.com/OpenGeoMetadata/edu.berkeley.git
Cloned https://github.com/OpenGeoMetadata/edu.cornell.git
Cloned https://github.com/OpenGeoMetadata/edu.vt.git
Cloned https://github.com/OpenGeoMetadata/edu.upenn.git
Cloned https://github.com/OpenGeoMetadata/edu.mit.git
Cloned https://github.com/OpenGeoMetadata/ca.frdr.geodisy.git
Cloned https://github.com/OpenGeoMetadata/edu.wisc.git
Cloned 20 repositories

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants