Debugging CUDA Out-of-Memory Errors in Ollama Multi-Model Deployments: Memory Pooling Strategies for 24GB VRAM Limits
Why I Started Debugging CUDA Memory Errors

I run a Proxmox home server with an RTX 4090 passed through to a dedicated VM for local AI workloads. When I first...