$ timeahead.in
← back
$ articles --tag rag

#rag

100 articles

01
Samsung’s memory chip employees negotiated $340,000 bonuses this year
Details have emerged about a tentative deal struck between Samsung and semiconductor employees who had threatened to str…
The Verge AIHardware#rag#multimodal
24d
02
The Enhanced Games fit right in with the rest of 2026’s longevity vibes
The Enhanced Games fit right in with the rest of 2026’s longevity vibes We’re evidently in our enhancement era. This Sun…
MIT Technology Review#rag
24d
03
Build AI agents for business intelligence with Amazon Bedrock AgentCore
Artificial Intelligence Build AI agents for business intelligence with Amazon Bedrock AgentCore OPLOG, a technology-driv…
AWS Machine Learning BlogAPI#claude#rag
25d
04
Simulate real-world places with Project Genie and Street View
Simulate real-world places with Project Genie and Street View Genie is our general-purpose world model capable of genera…
Google DeepMind BlogTutorial#rag
27d
05
GDS weighs in on the NHS's decision to retreat from Open Source
17th May 2026 - Link Blog GDS weighs in on the NHS's decision to retreat from Open Source. Terence Eden continues his co…
Simon Willison BlogOpen Source#rag#coding#open-source
29d
06
The Download: China’s AI drama factory and the WHO’s missing health targets
The Download: China’s AI drama factory and the WHO’s missing health targets Plus: as their trial goes to the jury, Musk …
MIT Technology ReviewRelease#rag
31d
07
How Chinese short dramas became AI content machines
How Chinese short dramas became AI content machines The viral short dramas are increasingly being created entirely with …
MIT Technology Review#rag
31d
08
Trump’s Tech Posse in China, Who’s Winning in Musk v. Altman, and Hantavirus Conspiracy Theories
This week on Uncanny Valley, the team dives into Trump’s selected entourage for his high-stakes visit to China, ranging …
Wired AIHardware#rag
32d
09
Gen Z Is Pioneering a New Understanding of Truth
The polar bear video has millions of views. Set to a haunting piano score that's become ubiquitous on TikTok, it shows a…
Wired AIResearch#rag#multimodal
32d
10
Desperate Trump taps "Tim Apple," Jensen Huang, Elon Musk to attend Xi summit
Donald Trump has very little leverage heading into two days of meetings with China’s leader, Xi Jinping, in Beijing this…
Ars Technica AI#rag
32d
11
Everyone at the Musk v. Altman Trial Is Using Fancy Butt Cushions
The final stragglers testified on Wednesday in the Musk v. Altman trial. The witnesses generated few waves, aside from t…
Wired AI#rag
33d
12
mimalloc: A new, high-performance, scalable memory allocator for the modern era
At a glance - Today’s critical services and applications are often highly concurrent, using hundreds of threads. They al…
Microsoft Research BlogOpen Source#rag#open-source
33d
13
Parents say ChatGPT got their son killed with bad advice on party drugs
The family of a 19-year-old college student is suing OpenAI over claims that his conversations with ChatGPT led to an ac…
The Verge AIModel#gpt#rag
34d
14
Using LLM in the shebang line of a script
11th May 2026 Kim_Bruning on Hacker News: But seriously, you can put a shebang on an english text file now (if you're su…
Simon Willison BlogModel#rag
35d
15
Building Blocks for Foundation Model Training and Inference on AWS
Building Blocks for Foundation Model Training and Inference on AWS Figure: Adapted from "AI's Three Scaling Laws, Explai…
Hugging Face BlogHardware#rag#inference#observability
35d
16
Fostering breakthrough AI innovation through customer-back engineering
Sponsored Fostering breakthrough AI innovation through customer-back engineering Agentic AI is helping organizations com…
MIT Technology ReviewResearch#rag
35d
17
Chrome's 4GB AI model isn't new, but you're not wrong for being confused
All of Google’s products have been getting more AI features, including Chrome, which now offers split-screen Gemini chat…
Ars Technica AIModel#gemini#rag#local
38d
18
Advanced RAG: Data Cleaning and Retrieval Techniques
Retrieval-augmented generation (RAG) makes queries smarter, arming them with proprietary data and contextualized knowled…
n8n BlogTutorial#rag#agents
39d
19
Vibe coding and agentic engineering are getting closer than I'd like
Vibe coding and agentic engineering are getting closer than I’d like 6th May 2026 I recently talked with Joseph Ruscio a…
Simon Willison BlogAgents#rag#agents#coding
40d
20
Google's Gemma 4 AI models get 3x speed boost by predicting future tokens
Google launched its Gemma 4 open models this spring, promising a new level of power and performance for local AI. Google…
Ars Technica AIResearch#gemini#rag#coding
40d
21
Chrome’s AI features may be hogging 4GB of your computer storage
Google Chrome may be taking up more of your storage than expected thanks to a large on-device AI model file that, in som…
The Verge AIInfra#rag#local
40d
22
A blueprint for using AI to strengthen democracy
A blueprint for using AI to strengthen democracy AI is changing what it means to be a democratic citizen. Here’s how we …
MIT Technology Review#rag
41d
23
The Zig project's rationale for their firm anti-AI contribution policy
30th April 2026 Zig has one of the most stringent anti-LLM policies of any major open source project: No LLMs for issues…
Simon Willison BlogOpen Source#rag#open-source
46d
24
Unleashing Agentic AI Analytics on Amazon SageMaker with Amazon Athena and Amazon Quick
Artificial Intelligence Unleashing Agentic AI Analytics on Amazon SageMaker with Amazon Athena and Amazon Quick Modern e…
AWS Machine Learning BlogInfra#rag#agents
46d
25
The Download: storing nuclear waste and orchestrating agents
The Download: storing nuclear waste and orchestrating agents Plus: Elon Musk says Sam Altman “stole a charity” at the Op…
MIT Technology Review#rag
47d
26
Elon Musk appeared more petty than prepared
Today the first witness was sworn in in Musk v. Altman: Elon Musk. I was surprised by how flat he seemed. Elon Musk appe…
The Verge AI#rag
48d
27
Choco automates food distribution with AI agents
Choco automates food distribution with AI agents Using OpenAI APIs, Choco processes millions of orders, reducing manual …
OpenAI BlogInfra#rag#inference
49d
28
OpenAI available at FedRAMP Moderate
OpenAI has achieved FedRAMP 20x Moderate authorization(opens in a new window) for ChatGPT Enterprise and API Platform, m…
OpenAI BlogResearch#gpt#rag#observability
49d
29
The AI-designed car is taking shape
The auto design world is full of advanced 3D visualization tools and VR sculpting platforms, but your average new car st…
The Verge AI#rag
49d
30
How Popsa used Amazon Nova to inspire customers with personalised title suggestions
Artificial Intelligence How Popsa used Amazon Nova to inspire customers with personalised title suggestions This post wa…
AWS Machine Learning BlogInfra#claude#rag#multimodal
49d
31
Build and deploy an automatic sync solution for Amazon Bedrock Knowledge Bases
Artificial Intelligence Build and deploy an automatic sync solution for Amazon Bedrock Knowledge Bases With Amazon Bedro…
AWS Machine Learning BlogInfra#rag#observability
49d
32
AutoAdapt: Automated domain adaptation for large language models
At a glance - Problem: Adapting large language models to specialized, high-stakes domains is slow, expensive, and hard t…
Microsoft Research BlogInfra#rag#agents#fine-tuning
54d
33
Cost-effective multilingual audio transcription at scale with Parakeet-TDT and AWS Batch
Artificial Intelligence Cost-effective multilingual audio transcription at scale with Parakeet-TDT and AWS Batch Many or…
AWS Machine Learning BlogTutorial#rag#inference#multimodal
54d
34
CyberAgent moves faster with ChatGPT Enterprise and Codex
CyberAgent moves faster with ChatGPT Enterprise and Codex CyberAgent uses ChatGPT Enterprise and Codex to help teams wor…
OpenAI BlogAgents#gpt#rag#agents
67d
35
Cut Checkpoint Costs with About 30 Lines of Python and NVIDIA nvCOMP
Training LLMs requires periodic checkpoints. These full snapshots of model weights, optimizer states, and gradients are …
NVIDIA Developer BlogModel#rag#training#gpu
67d
36
Integrate Physical AI Capabilities into Existing Apps with NVIDIA Omniverse Libraries
Physical AI—AI systems that perceive, reason, and act in physically grounded simulated environments—is changing how team…
NVIDIA Developer Blog#rag#coding#gpu
68d
37
RAG System Architecture: Components, How To Implement, Challenges, and Best Practices
A simple retrieval augmented generation architecture (RAG) setup usually works fine with a few documents and a basic ret…
n8n BlogTutorial#rag
70d
38
Any Custom Frontend with Gradio's Backend
gradio.Server: Any Custom Frontend with Gradio's Backend gr.HTML : building rich, interactive frontends entirely inside …
Hugging Face BlogTutorial#rag
75d
39
Building NVIDIA Nemotron 3 Agents for Reasoning, Multimodal RAG, Voice, and Safety
Agentic AI is an ecosystem where specialized models work together to handle planning, reasoning, retrieval, and safety g…
NVIDIA Developer BlogInfra#rag#agents#multimodal
83d
40
Design, Simulate, and Scale AI Factory Infrastructure with NVIDIA DSX Air
Building AI factories is complex and requires efficient integration across compute, networking, security, and storage sy…
NVIDIA Developer BlogInfra#rag#gpu
91d
41
Introducing NVIDIA BlueField-4-Powered CMX Context Memory Storage Platform for the Next Frontier of AI
AI‑native organizations increasingly face scaling challenges as agentic AI workflows drive context windows to millions o…
NVIDIA Developer BlogInfra#rag#agents#gpu
91d
42
Introducing Storage Buckets on the Hugging Face Hub
Introducing Storage Buckets on the Hugging Face Hub Storage Buckets are built exactly for this: mutable, S3-like object …
Hugging Face BlogRelease#rag
97d
43
Build Multi-Domain RAG Systems with Specialized Knowledge Bases
This Verified Node Spotlight was written by Jenna Pederson, Staff Developer Advocate for Pinecone. Imagine you manage mu…
n8n BlogTutorial#rag#embeddings
98d
44
How Axios uses AI to help deliver high-impact local journalism
How Axios uses AI to help deliver high-impact local journalism A conversation with Allison Murphy, Chief Operating Offic…
OpenAI BlogResearch#rag#inference#local
103d
45
Building Telco Reasoning Models for Autonomous Networks with NVIDIA NeMo
Autonomous networks are quickly becoming one of the top priorities in telecommunications. According to the latest NVIDIA…
NVIDIA Developer Blog#rag#agents#gpu
106d
46
Awakening Sleeping Beauties at The Met
Collaborating with The Met to Awaken “Sleeping Beauties” with AI At OpenAI, we believe AI can enrich our lives by making…
OpenAI BlogTutorial#rag
110d
47
Genmab launches “AI Everywhere”
Genmab launches “AI Everywhere” Genmab(opens in a new window), a leading global biotechnology company, is pioneering nex…
OpenAI BlogResearch#gpt#rag#multimodal
110d
48
IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST
IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST ITBench HF Space ITBench HF Dataset MAST…
Hugging Face BlogTutorial#gemini#rag#agents
117d
49
Build AI-Ready Knowledge Systems Using 5 Essential Multimodal RAG Capabilities
Enterprise data is inherently complex: real-world documents are multimodal, spanning text, tables, charts and graphs, im…
NVIDIA Developer BlogInfra#rag#multimodal
118d
50
DeepSeek-V3.2 on GB300: Performance Breakthrough Feb 13, 2026 · 12 min read DeepSeek-V3.2 (NVFP4 + TP2)has been successfully and smoothly run on GB300 (SM103 - Blackwell Ultra). Leveraging FP4 quantization, it achieves a single-GPU throughput of 7360 TGS (tokens / GPU /...
DeepSeek-V3.2 on GB300: Performance Breakthrough Summary DeepSeek-V3.2 (NVFP4 + TP2)has been successfully and smoothly r…
vLLM BlogHardware#rag#inference#gpu
122d
51
Harness engineering: leveraging Codex in an agent-first world
Harness engineering: leveraging Codex in an agent-first world By Ryan Lopopolo, Member of the Technical Staff Over the p…
OpenAI BlogResearch#rag#observability#coding
124d
52
Testing ads in ChatGPT
Update on March 26, 2026: Our ads pilot is focused on supporting broader access to ChatGPT while preserving consumer tru…
OpenAI BlogTutorial#gpt#rag#inference
126d
53
Transformers.js v4: Now Available on NPM!
Transformers.js v4: Now Available on NPM! npm i @huggingface/transformers Performance & Runtime Improvements The biggest…
Hugging Face BlogHardware#rag#coding
126d
54
How to Build a Document Processing Pipeline for RAG with Nemotron
What if your AI agent could instantly parse complex PDFs, extract nested tables, and “see” data within charts as easily …
NVIDIA Developer BlogTutorial#rag#agents#gpu
131d
55
Establishing a Scalable Sparse Ecosystem with the Universal Sparse Tensor
Sparse tensors are vectors, matrices, and higher-dimensional generalizations with many zeros. They are crucial in variou…
NVIDIA Developer BlogResearch#rag#embeddings
136d
56
Import AI 442: Winners and losers in the AI economy; math proof automation; and industrialization of cyber espionage
Import AI 442: Winners and losers in the AI economy; math proof automation; and industrialization of cyber espionage Is …
Import AI (Jack Clark)Research#rag#agents#coding
140d
57
How countries can end the capability overhang
How countries can end the capability overhang By George Osborne, Head of OpenAI for Countries AI is advancing at extraor…
OpenAI BlogResearch#rag
145d
58
NVIDIA DLSS 4.5 Delivers Super Resolution Upgrades and New Dynamic Multi Frame Generation
NVIDIA DLSS 4 with Multi Frame Generation has become the fastest-adopted NVIDIA gaming technology ever. Over 250 games a…
NVIDIA Developer BlogHardware#rag#observability#coding
152d
59
Zenken boosts a lean sales team with ChatGPT Enterprise
Zenken boosts a lean sales team with ChatGPT Enterprise Zenken is rethinking sales with AI—cutting preparation time, imp…
153d
60
OpenAI and SoftBank Group partner with SB Energy
OpenAI and SoftBank Group partner with SB Energy - SoftBank Group and OpenAI invest $1 billion in SB Energy to support i…
OpenAI BlogInfra#gpt#rag
157d
61
Retrieval RAG Evaluation Rita Fernandes Neves Senior Solution Architect - AI at NVIDIA Bilge Yücel DevRel Engineer Optimize RAG Applications with Document Reranking Using Haystack With NVIDIA NeMo Retriever March 20, 2025
Optimize RAG Applications with Document Reranking Using Haystack With NVIDIA NeMo Retriever In retrieval-augmented gener…
Haystack (deepset) BlogResearch#rag#agents#gpu
165d
62
12/15/2025 NVIDIA Nemotron 3 Nano on Fireworks: The Engine for Next-Generation AI Agents
We're excited to launch Day-0 support on Fireworks for the latest model in the NVIDIA Nemotron family, NVIDIA Nemotron 3…
Fireworks AI BlogInfra#rag#inference#gpu
182d
63
Our approach to mental health-related litigation
Our approach to mental health-related litigation Cases involving mental health are tragic and complex, and they involve …
OpenAI Blog#rag
202d
64
OpenAI and Target team up on new AI-powered experiences
OpenAI and Target partner to bring new AI-powered experiences across retail Key takeaways: - With the Target app in Chat…
OpenAI Blog#gpt#rag
208d
65
Intuit and OpenAI join forces on new AI-powered experiences
Intuit and OpenAI join forces on new AI-powered experiences Key takeaways: - Multi-year strategic partnership will soon …
OpenAI Blog#gpt#rag
209d
66
Neuro drives national retail wins with ChatGPT Business
Neuro drives national retail wins with ChatGPT Business Neuro uses ChatGPT Business to move faster across sales and oper…
OpenAI BlogResearch#gpt#rag
215d
67
Build to Last
Note from Jeremy: We’re teaching a course starting Nov 3rd on how to build towards software mastery and craftsmanship wh…
fast.ai BlogTutorial#rag
228d
68
User Story Bilge Yücel DevRel Engineer Nils Hilgers Lead AI Engineer @LHIND Lufthansa Industry Solutions Uses Haystack to Power Enterprise RAG Learn how Lufthansa Industry Solutions (LHIND) built an enterprise-grade, compliant AI knowledge assistant October 24, 2025
Lufthansa Industry Solutions Uses Haystack to Power Enterprise RAG Learn how Lufthansa Industry Solutions (LHIND) built …
Haystack (deepset) BlogTutorial#rag
234d
69
Unlock the power of images with AI Sheets
Unlock the power of images with AI Sheets 🧭TL;DR: Hugging Face AI Sheets is an open-source tool for supercharging datas…
Hugging Face BlogOpen Source#rag#inference#multimodal
237d
70
7/10/2025 Using Model-as-a-Judge for Reward in Reinforcement Fine Tuning
In domains that are inherently challenging to quantify, such as creative writing, we demonstrate that leveraging a super…
Fireworks AI Blog#rag#training
251d
71
Building OpenAI with OpenAI
Chief Commercial Officer, Giancarlo ‘GC’ Lionetti, kicks off our series sharing internal examples of how OpenAI is using…
OpenAI BlogResearch#rag
259d
72
Which image editing model should I use?
Which image editing model should I use? Replicate Playground In the past few weeks, nearly every major AI lab has releas…
Replicate BlogInfra#rag#agents#inference
265d
73
SafetyKit scales risk agents with OpenAI’s most capable models
SafetyKit scales risk agents with OpenAI’s most capable models From prototyping with early vision model previews to scal…
OpenAI BlogModel#rag#safety
279d
74
mmBERT: ModernBERT goes Multilingual
mmBERT: ModernBERT goes Multilingual TL;DR This blog post introduces mmBERT, a state-of-the-art massively multilingual e…
Hugging Face BlogOpen Source#rag#training#open-source
279d
75
Welcome EmbeddingGemma, Google's new efficient embedding model
Welcome EmbeddingGemma, Google's new efficient embedding model TL;DR Today, Google releases EmbeddingGemma, a state-of-t…
Hugging Face BlogResearch#rag#local#benchmark
284d
76
4/9/2025 Building Enterprise-Scale RAG Systems with Fireworks AI and MongoDB Atlas
In the fast-paced world of enterprise data, extracting actionable insights from vast amounts of unstructured information…
Fireworks AI BlogInfra#rag#inference
284d
77
Scaling domain expertise in complex, regulated domains
Blue J’s approach for scaling fast in complex, regulated domains Blue J scaled its AI-powered tax research system to thr…
OpenAI BlogResearch#gpt#rag
298d
78
Day Zero Support for OpenAI Open Models
Day Zero Support for OpenAI Open Models Fast and Affordable AI Inference For the World’s Most Popular Models We're excit…
Groq BlogInfra#rag#inference
314d
79
Parquet Content-Defined Chunking
Parquet Content-Defined Chunking TL;DR: Parquet Content-Defined Chunking (CDC) is now available in PyArrow and Pandas, e…
Hugging Face BlogOpen Source#rag
325d
80
AI as the greatest source of empowerment for all
AI as the greatest source of empowerment for all Fidji Simo In a few weeks, I’ll be joining OpenAI as CEO of Application…
OpenAI Blog#rag
329d
81
Accelerate a World of LLMs on Hugging Face with NVIDIA NIM
Accelerate a World of LLMs on Hugging Face with NVIDIA NIM NVIDIA AI customers and ecosystem partners leverage NVIDIA NI…
Hugging Face BlogInfra#mistral#rag#inference
329d
82
Working with 400,000 teachers to shape the future of AI in schools
Working with 400,000 teachers to shape the future of AI in schools OpenAI joins the American Federation of Teachers to l…
OpenAI BlogTutorial#rag
342d
83
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows AuthorsJiatao Gu†, Ying Shen‡**, Tianrong Chen, …
Apple Machine Learning Research#rag#multimodal
350d
84
LLM RAG Daniel Fleischer Research Engineer at Intel Labs Summarize Hacker News Posts with Haystack & OPEA Build a RAG pipeline to fetch live Hacker News posts and summarize them with a local LLM endpoint June 10, 2025
Summarize Hacker News Posts with Haystack & OPEA Build a RAG pipeline to fetch live Hacker News posts and summarize them…
Haystack (deepset) BlogResearch#rag#fine-tuning#observability
370d
85
5/28/2025 FireAttention V4: Industry-Leading Latency and Cost Efficiency with FP4
Today, we’re announcing we've achieved industry-leading speeds of >250 tokens/second on NVIDIA B200 GPUs using our lates…
Fireworks AI BlogResearch#rag#inference#benchmark
383d
86
The Transformers Library: standardizing model definitions
The Transformers Library: standardizing model definitions Transformers was created in 2019, shortly following the releas…
Hugging Face BlogInfra#llama#qwen#rag
396d
87
Vision Language Models (Better, faster, stronger)
Vision Language Models (Better, faster, stronger) Motivation Vision Language Models (VLMs) are the talk of the town. In …
Hugging Face BlogTutorial#rag#fine-tuning#multimodal
399d
88
Lowe’s leverages AI to power home improvement retail
Lowe’s leverages AI to power home improvement retail A conversation with Chandhu Nair, Senior Vice President of Data, AI…
OpenAI BlogInfra#rag#inference
406d
89
Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More
Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More As part of our ongoing efforts,…
Hugging Face BlogResearch#rag#benchmark
433d
90
Canva enables creativity with AI
Canva enables creativity with AI A conversation with Cameron Adams, Chief Product Officer and Co-founder of Canva. Our E…
OpenAI BlogRelease#rag#agents
434d
91
AI Policy @🤗: Response to the White House AI Action Plan RFI
AI Policy @🤗: Response to the White House AI Action Plan RFI Context: Don't Sleep on (Strongly) Open Models' Capabiliti…
Hugging Face BlogOpen Source#claude#rag#coding
453d
92
Xet is on the Hub
Xet is on the Hub Over the past few weeks, Hugging Face’s Xet Team took a major step forward by migrating the first Mode…
Hugging Face Blog#rag#multimodal
454d
93
Product 11/3/2025 40X Faster, and Smarter Outputs: How Vercel Turbocharged their Code Fixing Model with Open Models, Speculative Decoding and Reinforcement Fine Tuning on Fireworks
Vercel, a leading platform provider for full-stack web applications, partnered with Fireworks to solve a critical challe…
Fireworks AI BlogHardware#rag#fine-tuning#inference
461d
94
Nubank elevates customer experiences with OpenAI
Nubank elevates customer experiences with OpenAI Since its founding in 2013, Nubank—one of the world’s largest digital f…
466d
95
HuggingFace, IISc partner to supercharge model building on India's diverse languages
HuggingFace, IISc partner to supercharge model building on India's diverse languages Partnership The partnership between…
Hugging Face BlogInfra#rag#open-source
473d
96
Build awesome datasets for video generation
Build awesome datasets for video generation hlky and Sayak) Tooling for image generation datasets is well established, w…
Hugging Face BlogTutorial#rag#fine-tuning#multimodal
488d
97
Understanding Reasoning LLMs
Understanding Reasoning LLMs Methods and Strategies for Building and Refining Reasoning Models This article describes th…
Ahead of AI (Sebastian Raschka)Model#rag#fine-tuning#coding
495d
98
Strengthening America’s AI leadership with the U.S. National Laboratories
Strengthening America’s AI leadership with the U.S. National Laboratories OpenAI’s latest line of reasoning models will …
OpenAI BlogResearch#rag#coding
501d
99
Welcome to Inference Providers on the Hub 🔥
Welcome to Inference Providers on the Hub 🔥 We’ve been hosting a serverless Inference API on the Hub for a long time (w…
Hugging Face BlogInfra#rag#fine-tuning#inference
503d
100
Supporting sellers with enhanced product listings
Mercari enhances product listings and better supports sellers with GPT‑4o mini Mercari, one of Japan’s leading online ma…
OpenAI BlogModel#gpt#rag
551d