Recipe Categories

LLM Serving & Inference

Production configurations for llama.cpp, Ollama, vLLM, and DeepSeek R1. Optimized settings for different hardware.

4 recipes
View recipes

RAG Systems

Hybrid search with BM25, optimal chunking strategies, reranking with cross-encoders, and ColBERT late interaction.

4 recipes
View recipes

Vector Databases

Production configurations for Qdrant and ChromaDB with HNSW index tuning and performance optimization.

2 recipes
View recipes

Quantization

GGUF quantization selection guide and EXL2 vs GGUF decision matrix. Memory requirements by VRAM.

2 recipes
View recipes

Fine-Tuning with LoRA

Unsloth LoRA configurations with QLoRA, optimal rank selection, and training hyperparameters.

1 recipe
View recipes

AI Agents & Tool Calling

LangGraph agent workflows and CrewAI multi-agent configurations with state management and tool integration.

2 recipes
View recipes

Prompt Engineering

System prompt best practices and few-shot chain-of-thought templates for improved reasoning.

2 recipes
View recipes

AI Coding Assistants

Cursor .cursorrules configuration and Claude Code context management with CLAUDE.md patterns.

2 recipes
View recipes

Troubleshooting

Common pitfalls and solutions for flash attention, context windows, RAG failures, OOM errors, and more.

6 solutions
View solutions

Implementation Guides

Self-RAG architecture, production deployment checklist, and embedding model selection matrix.

3 guides
View guides