About Kog Kog builds the fastest LLM inference engine on standard datacenter GPUs. Our Kog Inference Engine generates 3,000 output tokens per second per request on a single 8× AMD MI300X node and 2,100 on an
KOG: Kog is a European VC-funded startup and real-time AI frontier lab building the world’s fastest AI execution layer. As part of the 2030 French Tech cohort, we are on a mission to redefine the boundaries