✨ Update cron's JSON format #1001
Conversation
"score": 0, | ||
"reason": "min result reason", | ||
"name": "Check-Name", | ||
"documentation": { |
These will be very repetitive in the BQ table. Is there any benefit to including this in every single row, as opposed to asking users to get this info from elsewhere?
Yep, I agree and thought about this too. BQ tables support column compression (run-length encoding, etc.), so this should not take much space on disk. However, I just looked up pricing and we're charged based on data size before compression, so that's not ideal.
The main reason I included it is to make it easy for users (in particular tools that script the results) to retrieve this info, and for us to keep scorecard changes in sync with where the URL lives. Deps.dev already broke because the doc location changed.
Alternatives:
- Create a URL per JSON result instead of per check. That still requires us to keep url#check-name non-breaking, and it would break if we separate checks into their own doc files (as suggested in Include URL for more information #998).
- A URL that redirects to the doc...
What are better options?
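To make the discussion concrete, here is a minimal Go sketch of a per-check entry carrying its own documentation object. The field and type names are illustrative guesses based on the JSON snippet above, not the actual scorecard definitions:

```go
// Hypothetical per-check result that embeds its own documentation block,
// mirroring the JSON fields shown in the diff above. Names are illustrative.
package format

// Documentation points users at the check's docs without requiring them
// to track where the docs live.
type Documentation struct {
	URL   string `json:"url"`   // stable link to the check's documentation
	Short string `json:"short"` // one-line description of the check
}

// CheckResult is one row per check in the cron JSON output.
type CheckResult struct {
	Name          string        `json:"name"`   // e.g. "Check-Name"
	Score         int           `json:"score"`  // e.g. 0
	Reason        string        `json:"reason"` // e.g. "min result reason"
	Documentation Documentation `json:"documentation"`
}
```

With a shape like this, tools scripting the results read the URL straight from the row instead of hard-coding a doc location that can move.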
@azeemsgoogle any thoughts on this?
Do we want to launch Scorecard V2 (the version with scores)? Given our conversations with ProjectsByIf, it is likely that our result formats will undergo a massive overhaul soon. Do we gain anything by launching this version, then?
It helps users of deps.dev get a better format. I don't know how much traffic they get, but I think they are a source of discovery for scorecard that we should not neglect. @kimsterv @inferno-chromium do you know?
David also expressed interest in having a score-based format in the OSSF dashboard.
As for ProjectsByIf, if we need to change the scorecard format a lot, it will take quite some time to land those changes. In the meantime it's useful to have deps.dev use a score-based version.
Follow-up about pricing: https://cloud.google.com/bigquery/pricing says, under the storage section, $0.020 per GB. If we reach 1M repos, assuming 300 bytes per doc (URL + description), 10 checks per run, and 4 cron runs per month, we get:
$0.020 * 300 * 10 * 1000000 * 4 / 1000000000 = $0.24/month.
Please chime in if I forgot something in the calculation above. If the calculation is correct, pricing is not an issue/blocker.
Given the use of compression by BQ, latency should not be an issue either.
So I think we can keep the doc fields per check in JSON.
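For anyone who wants to double-check the arithmetic, a quick back-of-the-envelope sketch in Go; all inputs are the assumptions above (300 bytes per doc, 10 checks per run, 4 runs/month, 1M repos), not measured values:

```go
// Back-of-the-envelope estimate of the extra BigQuery storage cost from
// repeating the documentation fields in every row. Inputs are the
// assumptions from the comment above, not measurements.
package main

import "fmt"

func main() {
	const (
		pricePerGB   = 0.020           // USD per GB per month (BQ storage pricing)
		bytesPerDoc  = 300.0           // assumed size of url + description per check
		checksPerRun = 10.0            // checks per repo per cron run
		repos        = 1_000_000.0     // target number of repos
		runsPerMonth = 4.0             // cron runs per month
		bytesPerGB   = 1_000_000_000.0 // GB conversion used in the estimate
	)

	gb := bytesPerDoc * checksPerRun * repos * runsPerMonth / bytesPerGB
	fmt.Printf("extra storage: %.1f GB, cost: $%.2f/month\n", gb, gb*pricePerGB)
	// Prints: extra storage: 12.0 GB, cost: $0.24/month
}
```

At roughly 12 GB/month of extra pre-compression storage, the cost stays well under a dollar, which matches the conclusion above.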
@oliverchang waiting for your feedback.
My concern is making the deps.dev and OSSF metrics teams go through two different migrations. There might also be other teams (like Project Thoth - #978 (comment)) who are relying on the BQ data without us knowing about it. It might be worth figuring out a plan that minimizes the overhead of these migrations for our clients.
Re: pricing/storage - IMO it's fine to have this data, even though it may be redundant. For teams relying on our BQ data, it'll be useful to have it present in BQ itself.
Thanks for explaining. Ease of use + link stability seems like a good enough reason to keep this in the table.
Re: other teams relying on the BQ dataset. I think we need to announce breaking changes/new versions on a dedicated channel - maybe a scorecard-announce@ mailing list or a dedicated section on pkg.go.dev, probably both. We cannot, ourselves, keep an inventory of who is using scorecard. I've added an item for the next scorecard sync.
Note that updating the cron JSON does not force our users to use it. We can still support the old format for 6 months or more. That should be part of an SLO we write for each release, e.g., that we will keep supporting the old format for at least X months after releasing a breaking change. This will allow teams to make an informed decision about whether they want to upgrade. Teams should decide if they want to upgrade; we cannot make that decision for them, especially as the number of teams increases and their requirements differ.
Let's discuss this more during the next scorecard sync. It's really important.
https://github.com/ossf/scorecard/blob/main/.github/workflows/main.yml#L23 - this was fixed by @azeemshaikh38 and I was hoping it would fix the issue.
For some reason this is now working. The problem seems to happen only sometimes.
What kind of change does this PR introduce? (Bug fix, feature, docs update, ...)
feature
What is the current behavior? (You can also link to an open issue here)
The older JSON format.
What is the new behavior (if this is a feature change)?
The same JSON format as scorecard itself; it's a mere copy of the files.
Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)
yes
Other information: