NIP-95 Revisit #1145

arthurfranca · 2024-03-28T22:43:08Z

Read here

I was reading #345 new comments and came up with this spec.

Differences:

a random pubkey (not the uploader's one) used just once as author of the file event(s) is what identifies the file
file can be made of multiple chunks, all with the same above pubkey
new NIP-65 flag to configure user's "file relays" cause most relays won't accept NIP-95 events
nfile entity

vitorpamplona · 2024-03-28T22:52:01Z

File relays and nfile are good ideas, but why the random pubkey? I don't get it.

arthurfranca · 2024-03-29T13:31:42Z

but why the random pubkey?

The NIP is missing a section that I'm gonna add soon. The idea is that the same chunk event set may have multiple "owners/uploaders" to avoid duplication. Because of that, one reason for using a random pubkey is that the chunk event author does not need to be the main pubkey of the user who first uploaded the file.

The random pubkey would be used as author of all chunk events of a single file and then should never be reused as author of anything else. To get all chunks, client filter by { authors: ["<the-random-pubkey>"], kind: [1064] }. This way the pubkey would identify this "version" of a file (the same file can have many "versions", e.g. it can be split into 3 chunks or 1 or 10 which would be 3 versions).

But some user may misbehave and reuse the key so maybe it is not a good way to group all chunks of a file.

vitorpamplona · 2024-03-29T13:48:40Z

Interesting, so the point is to find a way to query the chunks of a single file without knowing all the event ids beforehand and without allowing other people to add a malicious chunk in the middle of your file:

{ authors: ["<key-per-file>"], kind: [1064] }

Meaning: If the event header has all the ids in a list, no one can add a malicious chunk, but you have to create a filter with all the ids from that event and that filter can be huge.

{ ids: [<huge list of chunk event ids>] }

Alternatively, chunk events can tag an unbound list. But, since it's very easy for anyone else out there to create a new malicious chunk and also point to the same list, the filter must include the author of the header:

{ #n: ["<list name>"], authors: ["owner"], kind: [1064] }

vitorpamplona · 2024-03-29T14:05:08Z

n could be the file hash that is already computed for the header event.

{ #n: ["<full file hash>"], authors: ["owner"], kind: [1064] }

vitorpamplona

Frankly this random pubkey business seems like a lot of work for what could be achieved with unbound lists.

vitorpamplona · 2024-03-29T14:36:03Z

95.md

+
+## Upload
+
+To upload a file, first client must convert its bytes to base64. It may do it in chunks made of multiples of 3 bytes or in one go.


multiples of 3 cause base64 needs atleast 3 bytes to encode. it could be 255000 bytes per chunk for example

Bad english, tried to improve text

vitorpamplona · 2024-03-29T14:38:10Z

95.md

+Client should upload to user's "file relays", which use the [NIP-65](65.md) `f` flag.
+When downloading a file uploaded with this NIP, it should search on the uploader's "file relays".
+
+**Relays must NOT honor `kind:5` deletion events referencing file chunk events.** Deletion


No need for this. I might want to get a separate chunk of the file in each relay. Deleting chunks should be possible and it should be fine.

I thought of the following flow, maybe it is bogus:

An userA uploads a file.

Another userB sees it on his client and asks client to copy and upload it too (to make it his file) and both users happen to use the same file relay.

UserB client instead of re-uploading it, just registers userB as an owner/uploader of the file that is already on the relay.

Now that userA and userB both own the file, we can't let userA delete the chunks.. userA can just unregister himself as not an uploader/owner anymore. The file is deleted if there is no registered uploader.

Are you trying to register many owners for shared chunks and only allow deletion when all owners request or stop using the chunk?

Yes, anyone the file relay authorizes can become an owner ("uploader) of a file chunk set.

There is an "uploader event" for that. When no "uploader event" is present on the relay anymore (deleted with kind:5), the file relay is free to automatically delete the chunk set (not using kind:5 here, just auto-deletes).

vitorpamplona · 2024-03-29T14:39:40Z

95.md

+- `["OK", "<kind:1065-event-id>", true, "uploaded: ..."]`: The corresponding `kind:1064` file chunks are already uploaded, trying to re-upload them will fail;
+- `["OK", "<kind:1065-event-id>", true, "upload: Missing chunks 1, 2, 7, 10"]`: File isn't uploaded yet or incomplete, user is allowed to upload it on this ws connection;
+
+Trying to send a `kind:1064` event before a `kind:1065` one should fail.


I don't like these custom behaviors for relays.

The reason for this is that a file relay can't let a client send a big event (a file chunk) just to later reply that the client/user had no authorization to do it. It wastes relay resources so I imagine file relay will start with a capped max ws message size for every new ws connection until it sees a kind:1065 event "asking" for authorization to upload a kind:1064 event.

95.md

arthurfranca · 2024-03-29T18:21:37Z

@NfNitLoop I get your point but authors aren't listed anymore on NIPs. I did steal many ideas while adding some of my own and glued them together here cause there were many changes that needed to be placed together to make it cohesive and would be hard to explain and ask them to be considered separately there at #345.

My goal is solely to help come up with the best version of a NIP we could. This one was my vision of how it could look like.

I can change the NIP number and the kinds.. doesn't matter, I just put the text here for whoever may be insterested to discuss if it is better, worse or if could be improved further or ditched in favor of a better version.

arthurfranca · 2024-03-29T18:37:39Z

@vitorpamplona I think I may have confused you by reusing the kind:1065 to mean something other than file metadata.

On this NIP, kind:1065 means "uploader" and would look like this:

{
  kind: 1065,
 pubkey: "<uploader-main-pubkey>",
  ...,
  tags: [
    ["f", "<key-per-file>"]
  ]
}

There could have a NIP-94 event or a copy with another number like you suggested that would have the metadata tags like:

{
  kind: 10xx,
  pubkey: "<user-main-pubkey>",
  ...,
  tags: [
    ["f", "<key-per-file>"],
    ["nip95u", "<uploader-main-pubkey>"], // most times it is the same as "<user-main-pubkey>"
    ["size", "..."],
    ["dim", "..."],
    ["blurhash", "..."]
  ]
}

NIP-94 event isn't required cause nfile with or without NIP-54 inline metadata could be used instead inside a kind:1 for example.

I will change all the kind numbers.

vitorpamplona · 2024-03-29T19:27:00Z

The gains of using random pubkeys to represent files are still not clear to me though.

arthurfranca · 2024-03-29T19:31:59Z

@vitorpamplona It is true that a file chunk set fits well into the unbound list spec, that addresses a set with the owner pubkey + n tag (that could be set to the sha256 as you said).

But it may not be a perfect fit. Because a pubkey may (don't know why it would want to do it but it is possible) upload the same file with the same hash twice but with a different set of chunks.

Example:
First time it sends 3 chunks of 9 bytes
Second time it sends 1 chunk of 27 bytes

Now we got two versions of the same file. 4 chunks that shouldn't belong on the same unbound list.

vitorpamplona · 2024-03-29T19:37:53Z

Yeah, that would require not using the hash of the file as the name of the unbound list, but we could do hash+blocksize as a name and then have 2 tag entries in the header event pointing to each unbound list. The receiver can choose which one to download.

arthurfranca · 2024-03-29T19:51:27Z

Right, I will edit it.

arthurfranca · 2024-03-30T18:26:24Z

@vitorpamplona now it is using unbound list. One thing left is adding nfile to NIP-19 but not sure yet how to do it.

@NfNitLoop @frbitten what do you think of this version of NIP-95?

…es without the need to do it all at once

arthurfranca · 2024-04-02T00:44:18Z

Reviewing this I think it has some problems:

on upload: serializing a somewhat big payload (chunk event) to sign it is bad;
on download: need to put file chunk content in memory at once to send it as a nostr event, though small chunks wouldn't be that of a problem;
storage: it feels wrong to possibly store many versions (sets of chunks of variable chunk sizes) of the same file (same sha256). somehow the identifier should be just the sha256 hash instead of it plus chunk size;
of course the base64 encoding/decoding step isn't ideal too;

NIP-96 is better. #719 may be good too

Add NIP-95

0275b58

Add pre-upload step

3110ff0

arthurfranca force-pushed the nip-95-revisit branch from 4e5989f to 3110ff0 Compare March 29, 2024 14:17

vitorpamplona reviewed Mar 29, 2024

View reviewed changes

This comment was marked as off-topic.

Sign in to view

arthurfranca added 5 commits March 29, 2024 15:42

Fix chunk size text

ee7ef4b

Use new kind numbers

2f33b91

Add uploader event example

e7a0135

Add file download filter example

33e2ca2

Minor text improvement on deletes

538940c

arthurfranca added 2 commits April 1, 2024 15:05

Use unbound list

dfa0bee

Use the byte size before encoding for the n tag

f75c076

arthurfranca force-pushed the nip-95-revisit branch from 0e6b06d to f75c076 Compare April 1, 2024 18:33

Make it clear it is possible to convert individual chunks ba^C to byt…

a1d080d

…es without the need to do it all at once

arthurfranca closed this Apr 2, 2024

arthurfranca deleted the nip-95-revisit branch May 9, 2024 15:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NIP-95 Revisit #1145

NIP-95 Revisit #1145

arthurfranca commented Mar 28, 2024

vitorpamplona commented Mar 28, 2024

arthurfranca commented Mar 29, 2024

vitorpamplona commented Mar 29, 2024

vitorpamplona commented Mar 29, 2024 •

edited

Loading

vitorpamplona left a comment

vitorpamplona Mar 29, 2024

arthurfranca Mar 29, 2024

arthurfranca Mar 29, 2024

vitorpamplona Mar 29, 2024

arthurfranca Mar 29, 2024

vitorpamplona Mar 29, 2024

arthurfranca Mar 29, 2024

vitorpamplona Mar 29, 2024

arthurfranca Mar 29, 2024

This comment was marked as off-topic.

arthurfranca commented Mar 29, 2024

arthurfranca commented Mar 29, 2024

vitorpamplona commented Mar 29, 2024

arthurfranca commented Mar 29, 2024

vitorpamplona commented Mar 29, 2024 •

edited

Loading

arthurfranca commented Mar 29, 2024

arthurfranca commented Mar 30, 2024

arthurfranca commented Apr 2, 2024


		## Upload

		To upload a file, first client must convert its bytes to base64. It may do it in chunks made of multiples of 3 bytes or in one go.

NIP-95 Revisit #1145

NIP-95 Revisit #1145

Conversation

arthurfranca commented Mar 28, 2024

vitorpamplona commented Mar 28, 2024

arthurfranca commented Mar 29, 2024

vitorpamplona commented Mar 29, 2024

vitorpamplona commented Mar 29, 2024 • edited Loading

vitorpamplona left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

This comment was marked as off-topic.

arthurfranca commented Mar 29, 2024

arthurfranca commented Mar 29, 2024

vitorpamplona commented Mar 29, 2024

arthurfranca commented Mar 29, 2024

vitorpamplona commented Mar 29, 2024 • edited Loading

arthurfranca commented Mar 29, 2024

arthurfranca commented Mar 30, 2024

arthurfranca commented Apr 2, 2024

vitorpamplona commented Mar 29, 2024 •

edited

Loading

vitorpamplona commented Mar 29, 2024 •

edited

Loading