InferTehk

Browser-native local GGUF inference.

Settings

Backend placement stays binary: GPU uses full offload, CPU uses none.

Session defaults
RuntimeNot initialized
ModelNot loaded
IsolationChecking
Context 0 / 2048 estimated tokens

Model Setup

Generation