SIDE CHANNEL

1 hack.

Prompt theft by timing: prefix-cache side channels in multi-tenant LLMs

Shared prefix caching makes LLM APIs faster, but it also leaks prompts. By timing the first token, an attacker can reconstruct another tenant's prompt. A March 2026 paper shows how to defend against the attack without killing performance.

2026-06-01 // 7 min