Prompt caching
PairPilot supports prompt caching for cost savings and faster coding. Currently Anthropic provides caching for Sonnet and Haiku, and DeepSeek provides caching for Chat.
PairPilot organizes the chat history to try and cache:
- The system prompt.
- Read only files added with
--read
or/read-only
. - The repository map.
- The editable files that have been added to the chat.
Usage
Run PairPilot with --cache-prompts
or add that setting to your
configuration files.
Due to limitations in the provider APIs, caching statistics and costs
are not available when streaming responses.
To turn off streaming, use --no-stream
.
When caching is enabled, it will be noted for the main model when PairPilot launches:
Main model: claude-3-5-sonnet-20240620 with diff edit format, prompt cache, infinite output
Preventing cache expiration
PairPilot can ping the provider to keep your prompt cache warm and prevent
it from expiring.
By default, Anthropic keeps your cache for 5 minutes.
Use --cache-keepalive-pings N
to tell PairPilot to ping
every 5 minutes to keep the cache warm.
PairPilot will ping up to N
times over a period of N*5
minutes
after each message you send.