Use /compact before short follow-up edits. "b로 바꿔줘" (36.5% of tokens) was a tiny edit that paid for 9 prior turns of context. Running /compact after large document edits compresses history before making small tweaks, cutting per-prompt cost significantly.
Stabilize CLAUDE.md & skills files to improve cache reuse. Cache hit rate is 87.1% — each time skill files or rules are modified mid-session the cache invalidates. Batch skill documentation changes into a single commit so subsequent sessions reuse the warmed cache.
Consider subagents for large document analysis. The session processed large HTML and MD files (fe-ai-workflow.html, WDG00 AI 활용.md) in the main context. Spawning an Explore or general-purpose subagent to read/analyze big documents protects the main context window and reduces token accumulation across turns.
Break iterative doc-editing sessions into shorter conversations. The longest session (183bbac6) ran 23 API calls and 1.14M tokens over ~9 minutes. Starting a new session after major milestones (e.g., after adding skills) resets context cost to zero and keeps individual sessions cheap.