Thaw adds Git-style branching to running LLMs, enabling mid-inference agent forks
A new open-source tool lets developers branch LLM inference mid-generation, skip redundant prefill computation, and merge agent outputs—addressing a core bottleneck in multi-agent reasoning systems.