Curate: Decide which documents, memories, or APIs matter for each specific task
Structure: Layer system messages → tools → retrieved data → user prompt in optimal order
Compress: Summarize or chunk information to stay under token limits while preserving what matters
Evaluate: Measure accuracy and watch for "context dilution" where irrelevant info distracts the model