Skip to content

kksKishore K Sharma

Work Writing About Uses Contact

Get in touch→

/tag

#redis

← all writing

/footer · still here

If you're building something hard, let's talk.

Start a conversation→

Direct

via the contact form →
Noida, UP

Elsewhere

LinkedIn ↗
GitHub ↗
X ↗
Hashnode ↗
dev.to ↗
Bluesky ↗
Mastodon ↗
Instagram ↗
About
RSS feed ↗

Views are my own and do not represent any current or past employer. All work shown was completed under appropriate confidentiality and IP terms.

© 2026 Kishore · Built with restraint.·Privacy Termsv2 · System online

3 pieces

Jun 4, 202610 min read
Semantic Caching for LLMs: Cache on Meaning, Not on Strings
A normal cache keyed on the exact request string is almost useless for LLM calls, because every paraphrase is a miss. Semantic caching keys on meaning instead — embed the query, search for a near-identical past question, and return its answer with no model call. Here's the architecture, the threshold problem that makes or breaks it, and real pgvector code.
- #llm
- #caching
- #pgvector
- #redis
- #embeddings
- #cost-optimization
- #typescript
- #backend
May 31, 202611 min read
Just Use Postgres: One Database Until It Actually Hurts
A modest app somehow grew Postgres, Redis, RabbitMQ, Elasticsearch and a vector DB — five things to back up, secure and pay for. Most of that is now one Postgres. Here's the queue, vector, search and pub/sub SQL, and the honest signals for when to graduate.
- #postgres
- #backend
- #infrastructure
- #sql
- #architecture
- #redis
- #vector-search
May 8, 20267 min read
Idempotency Without a Database: The Redis Pattern That Survived 10× Traffic
Click 'pay' twice when the page hangs and you shouldn't get charged twice. Here's how to make that promise — with one Redis key, a TTL, and a small race-condition guard.
- #backend
- #patterns
- #redis
- #idempotency
- #production