
SPEC 11: API telemetry #1

Open
guenp opened this issue May 3, 2024 · 8 comments


guenp commented May 3, 2024

Brainstorm and discuss a SPEC to establish how to add instrumentation and telemetry to Scientific Python projects to gain insights into usage patterns. Currently, Python projects typically don't have any direct insight into how users interact with their library, what common errors they run into, or which APIs are used most (or least) frequently. The goal of this SPEC is to design a way to collect usage logs from users in a transparent, ethical, and efficient way, with minimal impact on the user experience, in order to provide project maintainers with useful metrics and insights into the performance, usability, and common user errors of the components (modules, functions, etc.) in their library.

@jarrodmillman jarrodmillman transferred this issue from scientific-python/scientific-python-hugo-theme May 3, 2024
@jarrodmillman jarrodmillman transferred this issue from scientific-python/specs May 3, 2024

drammock commented May 3, 2024

I'll drop a link here to popylar, which I think only gives stats on overall module usage (i.e., number of times imported), not granular detail about which classes or functions are called. But @arokem is going to be at the summit and might have some thoughts on more granular metrics, so it's probably worth cornering him for a chat about it!

@guenp guenp self-assigned this May 3, 2024
@betatim betatim self-assigned this May 6, 2024
@lagru lagru self-assigned this May 6, 2024

Carreau commented May 7, 2024

I'll link to scientific-python/summit-2023#17; we might want to pull out the notes from SciPy about 10 years ago. See also https://github.com/Carreau/consent_broker, which I started working on some time ago, and a related discussion in pyOpenSci/software-peer-review#183.


betatim commented May 13, 2024

Different but related: https://github.com/betatim/kamal - a tool you run locally over a code base to get statistics on which parts of a particular library's API are being used. The idea is that people can run it themselves and report the stats they get to somewhere (central?). The collected stats are fairly easy to look at, the goal being that you can convince yourself that no unwanted information is being shared. The original use case I had in mind was organisations that have private code bases but want to help a project learn which parts of its API are being used. Another idea could be running this in your project's CI and reporting back to some central place (e.g. scikit-learn and pandas run it in their CI and report back to matplotlib or numpy which parts of their API they use).
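
As a rough illustration of what this kind of static analysis can look like (this is not kamal's code; the target library name and output format are assumptions for the example), Python's ast module is enough to resolve import aliases and count attribute references across a code base:

```python
"""Minimal sketch of static API-usage counting over a code base (illustrative only).

Counts references such as ``np.linspace`` for an assumed target library.
"""
import ast
import collections
import pathlib
import sys

TARGET = "numpy"  # assumed target library for this example


def count_api_usage(root: pathlib.Path) -> collections.Counter:
    counts = collections.Counter()
    for path in root.rglob("*.py"):
        try:
            tree = ast.parse(path.read_text(encoding="utf-8"))
        except (SyntaxError, UnicodeDecodeError):
            continue  # skip files that can't be parsed
        aliases = set()  # local names bound to the target library, e.g. ``np``
        for node in ast.walk(tree):
            if isinstance(node, ast.Import):
                for alias in node.names:
                    if alias.name.split(".")[0] == TARGET:
                        aliases.add(alias.asname or alias.name.split(".")[0])
            elif isinstance(node, ast.ImportFrom) and node.module and node.module.split(".")[0] == TARGET:
                for alias in node.names:
                    counts[f"{node.module}.{alias.name}"] += 1
        for node in ast.walk(tree):
            # Count attribute accesses on an alias, e.g. ``np.linspace``.
            if (
                isinstance(node, ast.Attribute)
                and isinstance(node.value, ast.Name)
                and node.value.id in aliases
            ):
                counts[f"{TARGET}.{node.attr}"] += 1
    return counts


if __name__ == "__main__":
    for name, n in count_api_usage(pathlib.Path(sys.argv[1])).most_common(20):
        print(f"{n:6d}  {name}")
```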


guenp commented Jun 4, 2024

Summarizing today's discussions during the summit and dinner with @drammock @seberg @betatim @Carreau @stefanv et al:

  • To get user consent for tracking data, a nice solution would be to require users to explicitly run a command like python -c "import tracer; tracer.enable()" or similar. This command can be described in the installation instructions in the scipy or scikit-learn docs, and users are encouraged to turn it on to support the developers. We discussed various other options as well (a message on stdout during first import, a pop-up in a Jupyter notebook or Spyder, etc.).
  • To gather the telemetry data, our main options are to (1) track usage at runtime (e.g. with a decorator) or (2) analyze code statically, e.g. with kamal or pyinstrument. The downside of (1) is that it slows down the actual code or can cause confusing bugs at runtime; the downside of (2) is that someone (or some process) needs to trigger the analysis. An idea from @betatim is to use sampling (à la py-spy) to gather the info intermittently and minimize the performance impact. @seberg suggests writing a very simple "counter" decorator that doesn't inspect any arguments; this would be very fast to do in C. We're not yet 100% decided whether (1) or (2) is the best solution, but we are leaning towards (2).
  • We have a quick-and-dirty prototype of runtime-based telemetry that uses the importlib machinery to dynamically add a decorator to a given API, as suggested by @eriknw during SciPy last year. I added the code to this gist. It's not performance-optimized - the wrapper uses inspect and serializes input args - but it's a basic demo of what this could look like. (A rough sketch of the same idea follows after this list.)
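
As an illustration of the runtime approach (this is not the gist's code: the function names and output file are made up for the example, and it deliberately skips the inspect-based argument serialization), an argument-free counter decorator applied via the importlib machinery could look roughly like this:

```python
"""Minimal sketch of runtime telemetry via an argument-free counting decorator.

Illustrative only: it wraps the public module-level functions of a target
module and records call counts locally at interpreter exit.
"""
import atexit
import collections
import functools
import importlib
import inspect
import json

CALL_COUNTS = collections.Counter()


def counted(qualname, func):
    """Wrap ``func`` so each call increments a counter; no arguments are inspected."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        CALL_COUNTS[qualname] += 1
        return func(*args, **kwargs)
    return wrapper


def instrument(module_name):
    """Import ``module_name`` and replace its public functions with counted wrappers."""
    module = importlib.import_module(module_name)
    for name, obj in list(vars(module).items()):
        if not name.startswith("_") and inspect.isfunction(obj):
            setattr(module, name, counted(f"{module_name}.{name}", obj))
    return module


def _dump_counts():
    # Counts are only written locally; any upload would be a separate, opt-in step.
    with open("telemetry_counts.json", "w", encoding="utf-8") as f:
        json.dump(dict(CALL_COUNTS), f, indent=2)


atexit.register(_dump_counts)

if __name__ == "__main__":
    stats = instrument("statistics")  # instrument a stdlib module for demonstration
    stats.mean([1, 2, 3])
    stats.median([1, 2, 3])
    print(dict(CALL_COUNTS))
```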


drammock commented Jun 4, 2024

Great summary @guenp. Also briefly discussed was how to incentivize opt-in; some ideas were:

  • bundle it with something useful folks already use (like a linter) --- only works for static analysis
  • detect anti-patterns / common inefficiencies in the code and suggest refactorings (I think it was @lagru who had this idea)
  • output a list of DOIs/citations for the packages you use in that script (could work for static or dynamic analysis)


lagru commented Jun 4, 2024

The idea about combining the tool with linting was initially brought up by @rossbar. :D


guenp commented Jun 5, 2024

Summary from chat with @crazy4pi314

  • Important to get explicit consent for tracking
  • Like the idea of making a standalone telemetry tool that is useful on its own
  • For air-gapped/non-networked usage/devices, have some way to collect locally and upload later (see the sketch after this list)
  • Command line tool can take care of everything else (telemetry locally)
  • GDPR issues if we track session IDs
  • Create an extension for IDEs that runs the tracker locally and has a checkbox for uploading to the cloud. The extension itself handles the networking for uploading. It also installs the telemetry tool if it's not already installed, and creates an environment that is kept isolated from the project itself.
  • This could go in a .devcontainer
  • Gamification! Make extension show some silly stuff. "Your code is in the top 100 for usage of this function"
  • Another idea for incentive: web hosted dashboard with stats (per user? per package)
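
As a rough sketch of the "collect locally, upload later" pattern (the spool path and endpoint URL below are placeholders, not an agreed-on design), events could be appended to a local JSONL file and only sent when the user explicitly opts in to an upload:

```python
"""Minimal sketch of local collection with deferred, opt-in upload (illustrative only)."""
import json
import pathlib
import time
import urllib.request

SPOOL = pathlib.Path.home() / ".cache" / "sp-telemetry" / "events.jsonl"
UPLOAD_URL = "https://telemetry.example.invalid/v1/events"  # placeholder endpoint


def record(event: dict) -> None:
    """Append one event to the local spool; works offline and on air-gapped machines."""
    SPOOL.parent.mkdir(parents=True, exist_ok=True)
    with SPOOL.open("a", encoding="utf-8") as f:
        f.write(json.dumps({"ts": time.time(), **event}) + "\n")


def upload() -> None:
    """Send the spooled events only when explicitly invoked; clear the spool on success."""
    if not SPOOL.exists():
        return
    payload = SPOOL.read_text(encoding="utf-8").encode("utf-8")
    req = urllib.request.Request(
        UPLOAD_URL, data=payload, headers={"Content-Type": "application/x-ndjson"}
    )
    with urllib.request.urlopen(req) as resp:
        if resp.status == 200:
            SPOOL.unlink()


if __name__ == "__main__":
    record({"api": "numpy.linspace", "count": 3})
    # upload() is never called automatically; an IDE extension checkbox or a CLI
    # command could invoke it once the user has opted in and is back online.
```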


guenp commented Jun 6, 2024

Chat with @eriknw and @betatim:

@guenp guenp changed the title SPEC: API observability → SPEC 11: API observability Jun 6, 2024
@guenp guenp changed the title SPEC 11: API observability → SPEC 11: API telemetry Jun 6, 2024