Reference: https://www.github.com/ggerganov/llama.cpp/pull/2135 - [ ] https://github.com/rendezqueue/rendezllama/issues/36 - Wait for sampling refactor. - [ ] Implement.