In research

Weft

Tenant-fair LLM inference on Apple Silicon.

In research

Read the full project

Weft is an early thread on local inference scheduling. No public artifact yet. The shape is to keep tenants honest under load and make measurements easy to reproduce, on a class of hardware that is increasingly shared between agents on the same machine.