How to prototype a token-level confidence-weighted LLM ensemble
A step-by-step prototype that runs multiple LLMs in parallel, uses token-level confidence (logprobs/entropy) to weight and stitch their outputs, and aims to reproduce Sup AI's HLE gain (52.15% vs. 44.74%).
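The core idea in the summary above can be sketched in a few lines: each model's answer is scored by a confidence derived from its token logprobs (the mean token probability), and answers are aggregated by summed confidence. This is a minimal sketch, not Sup AI's actual method; the model names and stubbed outputs below are hypothetical stand-ins for parallel API calls that return logprobs.

```python
import math

# Hypothetical per-model outputs: (answer_text, per-token logprobs).
# In a real prototype these would come from parallel API calls to
# different providers, each requesting logprobs alongside the text.
candidates = {
    "model_a": ("Paris", [-0.05, -0.10]),
    "model_b": ("Lyon", [-1.20, -0.90]),
    "model_c": ("Paris", [-0.30, -0.20]),
}

def confidence(logprobs):
    """Mean token probability: exp of the average token logprob."""
    return math.exp(sum(logprobs) / len(logprobs))

# Weight each distinct answer by the summed confidence of the models
# that produced it, then select the highest-weighted answer.
weights = {}
for model, (answer, lps) in candidates.items():
    weights[answer] = weights.get(answer, 0.0) + confidence(lps)

best = max(weights, key=weights.get)
print(best)  # → Paris
```

Token-level stitching (splicing spans from different models mid-answer) adds alignment complexity on top of this; whole-answer voting weighted by confidence, as above, is the simplest starting point.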