-
-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Package data misses information about the user who released versions of packages on npm #838
Comments
Agreed, this should be collected and stored on each version record in the |
@andrew, no intention for this to be a rude ask and I empathically respect your time; I would like to ask if this and #839 would be part of any roadmap action items to land into the ecosystem.ms API and database? edit: I don't know Ruby too well but if implementing these is relatively straight-forward and you'd be interested to give me some pointers, I am happy to attempt this myself. |
@lirantal not a problem, happy to recieve nudges, there's way more work to do than time I have, so prioritizing things in nessesary. I've implemented code for both this and #839 in 7e48822 and new versions will start picking up that data automatically. For existing versions I'm going to need to resync all versions of all packages on npm, which will take some time (50,726,808 versions for npm in the db at the moment). |
That's awesome, thank you! |
Based on the current rate of processing I'd guess it will take a few weeks to totally complete a resync of all 50 million records, I'm going to prioritize resyncing the most popular npm packages first, which shouldn't take too long, maybe a couple days. |
Sound like a good plan! Thank you Andrew. I am planning to rebuild my npq tool in and around the ecosyste.ms API for a holistic data set foundation instead of the many different registries and their endpoints. I'll keep you posted in how this progresses. |
Exciting that you will also start using this more @lirantal 👏 Also: A reminder of the existence of this one: https://opencollective.com/ecosystems |
So far I've processed around 6% of all versions, primarily from the top 10% of packages |
Up to 17% now |
Andrew, I noticed at least a few days lag for some random package I picked: tldts 6.1.59 published on npm 2 days ago vs [tlds 6.1.58 showing up on ecosystems]https://packages.ecosyste.ms/registries/npmjs.org/packages/tldts/versions from 8 days ago Similarly, this package is also behind (0.68.0 on ecosystems) from Aug vs 0.70.0 release on github from 2 weeks ago. Is this expected? wondering what would be the expectation I should set to users given that they'd use it for ad-hoc package install like |
I had some issues with the background queue skipping jobs it shouldn't have a couple weeks ago, I'm still working to catch up on all of the lagging projects. The goal is that it should discover new versions within an hour of them being published where the package managers have a feed of recent releases, and within a few days to a week for package managers that don't have a good way of finding new versions (i.e. need to manually sync everthing on a regular basis to check) There is also a /ping endpoint you can hit to request an update be checked for any package, for example: https://packages.ecosyste.ms/api/v1/registries/npmjs.org/packages/tldts/ping |
Thanks, that's useful to know we can ping it if necessary :-) |
Up to 25% now |
Hi folks! 👋
I would expect that the Packages API for the versions data point will also include information about the user who released the version, on npm.
For example, the npm registry API at
https://registry.npmjs.org/safe-regex2
would return that information under theversions
key (look for the_npmUser
key below):However, in the existing Packages API today for versions (or any other endpoints), I couldn't find this information exist. I thought it should appear in this endpoint: https://packages.ecosyste.ms/api/v1/registries/npmjs.org/packages/safe-regex2
The text was updated successfully, but these errors were encountered: