Building a Local LLM Response Cache with Redis: Reducing Inference Costs and Latency for Repeated Queries
Why I Built a Local LLM Response Cache

I run multiple LLMs locally: Mistral, Llama variants, and sometimes Qwen for specific tasks. These models live on my...
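The core idea the title describes can be sketched in a few lines: derive a deterministic key from the model name, prompt, and sampling parameters, then check the store before running inference. This is a minimal sketch, not the article's actual implementation; an in-memory dict stands in for Redis here (in practice you would swap in a `redis.Redis` client and use `SETEX` for TTL-based expiry), and the names `LLMCache` and `get_or_generate` are illustrative.

```python
import hashlib
import json


def cache_key(model: str, prompt: str, params: dict) -> str:
    """Deterministic cache key: hash of model name, prompt, and params.

    Sorting keys makes the JSON serialization stable, so identical
    requests always map to the same key.
    """
    payload = json.dumps(
        {"model": model, "prompt": prompt, "params": params},
        sort_keys=True,
    )
    return "llm:" + hashlib.sha256(payload.encode("utf-8")).hexdigest()


class LLMCache:
    """Minimal response cache. `store` is anything with get/__setitem__;
    a plain dict here, a Redis client (with SETEX for TTLs) in practice."""

    def __init__(self, store=None):
        self.store = store if store is not None else {}

    def get_or_generate(self, model, prompt, params, generate):
        key = cache_key(model, prompt, params)
        cached = self.store.get(key)
        if cached is not None:
            return cached  # cache hit: skip inference entirely
        response = generate(prompt)  # cache miss: run the model once
        self.store[key] = response
        return response
```

The key point is that repeated identical queries pay the inference cost only once; every subsequent call is a single key lookup.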