
Support I/O with text and token ids #79

Closed
JoeZijunZhou opened this issue May 8, 2024 · 2 comments
@JoeZijunZhou
Collaborator

  • Customer request: Our clients are written in multiple languages, and we cannot implement detokenization in each one. We need server-side detokenization support.
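To illustrate the request: with server-side detokenization, the server keeps the tokenizer and can return plain text, so clients never need their own tokenizer. The sketch below uses a toy vocabulary purely for illustration; a real server would decode with the model's actual tokenizer (e.g. SentencePiece), and the function and field names here are hypothetical.

```python
# Hypothetical sketch of server-side detokenization. TOY_VOCAB stands in
# for the model's real tokenizer, which the server (not the client) owns.
TOY_VOCAB = {0: "Hello", 1: ",", 2: " world", 3: "!"}

def detokenize(token_ids):
    """Map generated token ids back to text on the server side."""
    return "".join(TOY_VOCAB[i] for i in token_ids)

def build_response(token_ids, return_text=True):
    """Return text when the client asks for it, else the raw token ids.

    Supporting both outputs lets existing clients that already
    detokenize locally keep receiving ids, while clients without a
    tokenizer get text directly.
    """
    if return_text:
        return {"text": detokenize(token_ids)}
    return {"token_ids": token_ids}

print(build_response([0, 1, 2, 3]))   # text response for thin clients
print(build_response([0, 1, 2, 3], return_text=False))  # ids as before
```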
@kiratp

kiratp commented May 12, 2024

Just chiming in here that the customer quoted is us :). The main challenge is that we have clients in multiple languages that don't always have tokenizer implementations readily available. Every other prominent model server does detokenization, hence the request.

Doesn’t hurt that there are so many CPU cores on the TPU VMs that are mostly idle during inference anyway.

Thanks @JoeZijunZhou !

@JoeZijunZhou
Collaborator Author

Resolved this issue in #78. It's available in main.
