2024-2026
KARL
RAG cloud-intelligence chatbot in production at Orange Business. Local multi-LLM integration (Llama 3.3 70B, DeepSeek R1, QwQ 32B) served via vLLM on H100 NVL and L40S GPUs, with LangChain orchestration and ChromaDB as the vector store. Built for auditable answers, not for the demo.
LangChain · ChromaDB · vLLM · H100 NVL · Llama 3.3 70B · DeepSeek R1 · RAG · Python