19 Jul 14:01

Plexcalibur

4c98beb

1.1.0 Latest

What's Changed

increase gpt-3.5-turbo maxContextLength to 16k by @dafriz in #92
add gpt-4-turbo model by @dafriz in #94
feat: Implement o200k_base encoding and support gpt-4o by @chatanywhere in #99
add o200k_base encoding to docs by @dafriz in #101
add gpt-4o-mini model by @dafriz in #102

New Contributors

@dafriz made their first contribution in #92
@imsosleepy made their first contribution in #97
@chatanywhere made their first contribution in #99

Full Changelog: 1.0.0...1.1.0

Contributors

imsosleepy, dafriz, and chatanywhere

10 Feb 13:46

tox-p

1.0.0

6a54eb2

1.0.0

Features

Improved performance of the CL100k encoding by 5x
- Thanks @paplorinc for the great work!
Added text-embedding-3-small and text-embedding-3-large to the ModelType enum

Breaking Changes

Due to the performance optimization, we now return a custom IntArrayList instead of a List<Integer> to prevent unnecessary boxing. IntArrayList does not implement List, so this is a breaking change. If you are missing any critical functionality from IntArrayList, please raise an issue.
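
To illustrate why a primitive-backed list avoids boxing and why it cannot simply be passed to code that expects a List<Integer>, here is a minimal sketch. PrimitiveIntList and its methods are hypothetical stand-ins written for this note; they are not the library's IntArrayList, whose exact API should be checked in the project documentation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for a primitive-backed token list.
// It is NOT the library's IntArrayList; names and methods are illustrative.
final class PrimitiveIntList {
    private int[] values = new int[4];
    private int size = 0;

    // Appends a token id, growing the backing array when it is full.
    void add(int value) {
        if (size == values.length) {
            values = Arrays.copyOf(values, size * 2);
        }
        values[size++] = value;
    }

    // Returns the primitive value at the given position without boxing.
    int get(int index) {
        if (index < 0 || index >= size) {
            throw new IndexOutOfBoundsException("index " + index + ", size " + size);
        }
        return values[index];
    }

    int size() {
        return size;
    }

    // Explicit conversion for callers that still need a List<Integer>.
    // Boxing happens only here, when the caller asks for it.
    List<Integer> toBoxedList() {
        List<Integer> boxed = new ArrayList<>(size);
        for (int i = 0; i < size; i++) {
            boxed.add(values[i]);
        }
        return boxed;
    }

    public static void main(String[] args) {
        PrimitiveIntList tokens = new PrimitiveIntList();
        for (int id = 0; id < 6; id++) {
            tokens.add(id * 10);
        }
        // Code written against List<Integer> needs an explicit conversion step.
        List<Integer> legacyView = tokens.toBoxedList();
        System.out.println(legacyView.size() + " tokens, last = " + tokens.get(tokens.size() - 1));
    }
}
```

When upgrading, code that relied on List methods would need a comparable explicit conversion or a switch to index-based access.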

Full Changelog: 0.6.1...1.0.0

Contributors

l0rinc

03 Jul 08:37

tox-p

0.6.1

21f8ba1

0.6.1

Fixes

Added a workaround to prevent an issue with regex compilation on Android devices

Full Changelog: 0.6.0...0.6.1

30 Jun 14:45

tox-p

0.6.0

e646c5c

0.6.0

Features

Added GPT_3_5_TURBO_16k to the ModelType enum

Full Changelog: 0.5.1...0.6.0

26 Jun 08:42

tox-p

0.5.1

0e8b8ca

0.5.1

Fixes

Fixed an issue resulting in wrong encodings for Unicode input. Thanks @VoidIsVoid for raising and fixing this issue 🙂

New Contributors

@VoidIsVoid made their first contribution in #34

Full Changelog: 0.5.0...0.5.1

Contributors

VoidIsVoid

16 May 12:11

tox-p

0.5.0

87789cd

0.5.0

Features

Added a new EncodingRegistry that loads only the requested vocabularies lazily instead of loading all vocabularies eagerly at initialization. Thanks @blackdiz for raising this feature request and implementing it 😊
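
The lazy-loading idea can be sketched as a cache that invokes a loader only on first request. LazyRegistry and its loader function are hypothetical names used for illustration; how the library's own registry is implemented is not described in these notes.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

// Hypothetical sketch of lazy, per-name loading; not the library's registry class.
final class LazyRegistry<V> {
    private final Map<String, V> cache = new ConcurrentHashMap<>();
    private final Function<String, V> loader;

    LazyRegistry(Function<String, V> loader) {
        this.loader = loader;
    }

    // Loads the value on first access and reuses the cached value afterwards.
    V get(String name) {
        return cache.computeIfAbsent(name, loader);
    }

    // Number of entries that have actually been loaded so far.
    int loadedCount() {
        return cache.size();
    }

    public static void main(String[] args) {
        // The loader here just builds a string; a real loader would read a vocabulary.
        LazyRegistry<String> registry = new LazyRegistry<>(name -> "vocabulary for " + name);
        System.out.println("loaded before first request: " + registry.loadedCount());
        registry.get("cl100k_base");
        registry.get("cl100k_base");
        System.out.println("loaded after two requests for one name: " + registry.loadedCount());
    }
}
```

An eager registry would instead run the loader for every known name in its constructor, paying the full startup cost even when only one vocabulary is used.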

New Contributors

@blackdiz made their first contribution in #24

Full Changelog: 0.4.0...0.5.0

Contributors

blackdiz

17 Apr 08:30

tox-p

0.4.0

e67fb1c

0.4.0

Features

Added two new methods to Encoding: encode(String, int) and encodeOrdinary(String, int). Both methods allow you to pass a maxTokens integer parameter that stops encoding once the given maximum number of tokens is reached. Thanks @radosdesign for raising this feature request and implementing it 😊
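
As a rough illustration of a maxTokens cutoff, the sketch below stops producing tokens once the limit is reached. It uses a toy whitespace tokenizer; TruncatingTokenizer and tokenize are hypothetical names, and the library's real encoders work on byte-pair tokens rather than words.

```java
import java.util.ArrayList;
import java.util.List;

// Toy illustration of stopping at a token limit; not the library's encoder.
final class TruncatingTokenizer {

    // Splits text on whitespace and returns at most maxTokens tokens.
    static List<String> tokenize(String text, int maxTokens) {
        if (maxTokens < 0) {
            throw new IllegalArgumentException("maxTokens must not be negative");
        }
        List<String> tokens = new ArrayList<>();
        for (String word : text.trim().split("\\s+")) {
            if (tokens.size() >= maxTokens) {
                break; // limit reached, ignore the rest of the input
            }
            if (!word.isEmpty()) {
                tokens.add(word);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("This is a sample sentence.", 3));
    }
}
```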

Breaking Changes

The Encoding interface got two new methods: encode(String, int) and encodeOrdinary(String, int). If you implemented this interface yourself, you have to update your implementations when upgrading.

New Contributors

@radosdesign made their first contribution in #12

Full Changelog: 0.3.0...0.4.0

Contributors

radoslavdodek

15 Apr 10:11

tox-p

0.3.0

cfc5105

0.3.0

Features

Added gpt-4-32k to ModelType
Added ModelType#getMaxContextLength which returns the maximum context length the model allows. Note that this context length includes prompt tokens and, where applicable, completion tokens.
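
Because the context length covers prompt and completion tokens together, the room left for a completion is the maximum minus the prompt size. A minimal sketch of that arithmetic follows; ContextBudget is a hypothetical helper, and the numbers in main are illustrative inputs rather than values taken from the library.

```java
// Hypothetical helper showing the budget arithmetic; not part of the library.
final class ContextBudget {

    // Returns how many tokens remain for the completion after the prompt is counted.
    static int remainingCompletionTokens(int maxContextLength, int promptTokens) {
        if (promptTokens < 0 || promptTokens > maxContextLength) {
            throw new IllegalArgumentException(
                    "prompt of " + promptTokens + " tokens does not fit into " + maxContextLength);
        }
        return maxContextLength - promptTokens;
    }

    public static void main(String[] args) {
        int maxContextLength = 8192; // illustrative value, e.g. as returned by getMaxContextLength
        int promptTokens = 1200;     // illustrative value, e.g. a token count of the prompt
        System.out.println(remainingCompletionTokens(maxContextLength, promptTokens)); // prints 6992
    }
}
```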

Breaking Changes

The name and encodingType properties of ModelType were changed from public access to private. Migrate to modelType.getName() and modelType.getEncodingType() if you were previously using direct property access.
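
The migration amounts to replacing field reads with getter calls. The sketch below uses a hypothetical SampleModel enum to show the pattern; it mirrors the getter names mentioned above, but its constants and the String type of encodingType are assumptions made for illustration.

```java
// Hypothetical enum mirroring the accessor pattern; not the library's ModelType.
enum SampleModel {
    EXAMPLE("example-model", "example_encoding");

    // Private fields: outside code can no longer read them directly.
    private final String name;
    private final String encodingType;

    SampleModel(String name, String encodingType) {
        this.name = name;
        this.encodingType = encodingType;
    }

    String getName() {
        return name;
    }

    String getEncodingType() {
        return encodingType;
    }

    public static void main(String[] args) {
        SampleModel model = SampleModel.EXAMPLE;
        // Old style from outside the enum: model.name, model.encodingType
        // New style: go through the getters.
        System.out.println(model.getName() + " uses " + model.getEncodingType());
    }
}
```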

Full Changelog: 0.2.0...0.3.0

06 Apr 07:25

tox-p

0.2.0

df17669

0.2.0

Features

Add encodeOrdinary and countTokensOrdinary methods to Encoding.
- The existing encode and countTokens methods currently throw an exception if a special token is encountered. This change introduces encodeOrdinary, which simply encodes special tokens as if they were normal text.
Add getEncodingForModel(String) to EncodingRegistry to allow retrieving encodings for models by their string name.
It is now possible to call EncodingRegistry#getEncodingForModel(String) with a snapshot of a model, for example "gpt-4-0314" and receive the correct encoding.
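
One way a snapshot name such as "gpt-4-0314" could be resolved is by matching it against known base model names and picking the longest match. The sketch below assumes that strategy; SnapshotResolver, its lookup table, and encodingNameFor are hypothetical, and the release notes do not state how the library performs the lookup.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical prefix-based resolver; the library's actual lookup may differ.
final class SnapshotResolver {
    // Illustrative table of base model names and encoding names.
    private static final Map<String, String> BASE_MODELS = new LinkedHashMap<>();

    static {
        BASE_MODELS.put("gpt-4", "cl100k_base");
        BASE_MODELS.put("gpt-4-32k", "cl100k_base");
        BASE_MODELS.put("text-davinci-003", "p50k_base");
    }

    // Resolves an exact name or a "<base>-<suffix>" snapshot to an encoding name.
    static Optional<String> encodingNameFor(String modelName) {
        String bestMatch = null;
        for (String base : BASE_MODELS.keySet()) {
            boolean matches = modelName.equals(base) || modelName.startsWith(base + "-");
            if (matches && (bestMatch == null || base.length() > bestMatch.length())) {
                bestMatch = base; // prefer the most specific base name
            }
        }
        return Optional.ofNullable(bestMatch).map(BASE_MODELS::get);
    }

    public static void main(String[] args) {
        System.out.println(encodingNameFor("gpt-4-0314").orElse("no encoding found"));
        System.out.println(encodingNameFor("unknown-model").orElse("no encoding found"));
    }
}
```

Requiring a "-" after the base name keeps unrelated names that merely share leading characters from matching by accident.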

Full Changelog: 0.1.0...0.2.0

20 Mar 21:47

tox-p

0.1.0

d60150c

0.1.0

⭐ Initial Release

Implementations for cl100k_base, p50k_base, p50k_edit, r50k_base

Releases: knuddelsgmbh/jtokkit
