Most prompt engineering advice assumes you've already picked a model.
Every quarter, someone on the team asks: "Do we really need this Spark cluster?" For most of the jobs running on it, the answer in 2026 is no.
Someone analyzed 3,007 Claude Code sessions and found a ratio that broke my brain: for every fresh token sent to the API, 525 tokens were served from cache.
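To make the arithmetic behind that ratio concrete, here is a minimal sketch of how you might compute it yourself from per-request usage data. The field names follow the usage object returned by Anthropic's Messages API (`input_tokens`, `cache_read_input_tokens`); the session records here are hypothetical, not the dataset from the analysis above.

```python
# Sketch: cached tokens served per fresh (uncached) input token,
# aggregated across a set of per-request usage records.

def cache_ratio(usage_records: list[dict]) -> float:
    """Return cached input tokens per fresh input token."""
    fresh = sum(r.get("input_tokens", 0) for r in usage_records)
    cached = sum(r.get("cache_read_input_tokens", 0) for r in usage_records)
    return cached / fresh if fresh else 0.0

# Hypothetical example: 525,000 cached tokens against 1,000 fresh
# tokens reproduces the 525:1 ratio quoted above.
records = [{"input_tokens": 1_000, "cache_read_input_tokens": 525_000}]
print(cache_ratio(records))  # 525.0
```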
Everyone picks their vector database based on latency benchmarks and API ergonomics.