DeepSeek-V3.2 on GB300: Performance Breakthrough Feb 13, 2026 · 12 min read DeepSeek-V3.2 (NVFP4 + TP2) has been run successfully on GB300 (SM103 - Blackwell Ultra). Leveraging FP4 quantization, it achieves a single-GPU throughput of 7360 TGS (tokens / GPU /...
Retrieval RAG Evaluation Rita Fernandes Neves Senior Solution Architect - AI at NVIDIA Bilge Yücel DevRel Engineer Optimize RAG Applications with Document Reranking Using Haystack With NVIDIA NeMo Retriever March 20, 2025
Optimize RAG Applications with Document Reranking Using Haystack With NVIDIA NeMo Retriever In retrieval-augmented gener…
User Story Bilge Yücel DevRel Engineer Nils Hilgers Lead AI Engineer @LHIND Lufthansa Industry Solutions Uses Haystack to Power Enterprise RAG Learn how Lufthansa Industry Solutions (LHIND) built an enterprise-grade, compliant AI knowledge assistant October 24, 2025
LLM RAG Daniel Fleischer Research Engineer at Intel Labs Summarize Hacker News Posts with Haystack & OPEA Build a RAG pipeline to fetch live Hacker News posts and summarize them with a local LLM endpoint June 10, 2025
Product 11/3/2025 40X Faster, and Smarter Outputs: How Vercel Turbocharged their Code Fixing Model with Open Models, Speculative Decoding and Reinforcement Fine Tuning on Fireworks
Vercel, a leading platform provider for full-stack web applications, partnered with Fireworks to solve a critical challe…