Skip to content

How to Enable Fully Greedy Decoding Sample Parameters? #3005

Answered by ggerganov
PenutChen asked this question in Q&A
Discussion options

You must be logged in to vote

Setting temp = 0 will no longer be equivalent to greedy decoding (see #9897). To enable it, configure a single top_k sampler and set k = 1. For example, with llama-cli this can be done with the following CLI args:

--sampling-seq k --top-k 1

Replies: 4 comments 9 replies

Comment options

You must be logged in to vote
3 replies
@PenutChen
Comment options

@KerfuffleV2
Comment options

@PenutChen
Comment options

Comment options

You must be logged in to vote
6 replies
@jackfsuia
Comment options

@PenutChen
Comment options

@jackfsuia
Comment options

@PenutChen
Comment options

@jackfsuia
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by ggerganov
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
5 participants