$ timeahead.in

$ articles --tag fine-tuning

#fine-tuning

100 articles

01

Announcing OpenAI-compatible API support for Amazon SageMaker AI endpoints

Artificial Intelligence Announcing OpenAI-compatible API support for Amazon SageMaker AI endpoints Today, Amazon SageMak…

AWS Machine Learning Blog · Infra · #fine-tuning #inference #langchain

65d

02

Unlocking asynchronicity in continuous batching

Unlocking asynchronicity in continuous batching TL;DR: we explain how to separate CPU and GPU workloads to get a massive…

Hugging Face Blog · Tutorial · #fine-tuning #inference

71d

03

Fine-tune LLM with Databricks Unity Catalog and Amazon SageMaker AI

Artificial Intelligence Fine-tune LLM with Databricks Unity Catalog and Amazon SageMaker AI When you fine-tune large lan…

AWS Machine Learning Blog · Tutorial · #agents #fine-tuning #inference

72d

04

Build financial document processing with Pulse AI and Amazon Bedrock

Artificial Intelligence Build financial document processing with Pulse AI and Amazon Bedrock Financial institutions proc…

AWS Machine Learning Blog · Tutorial · #fine-tuning

72d

05

llm 0.32a2

12th May 2026 A bunch of useful stuff in this LLM alpha, but the most important detail is this one: Most reasoning-capab…

Simon Willison Blog · Frameworks · #fine-tuning #observability

73d

06

How to Eliminate Pipeline Friction in AI Model Serving

The path from a trained AI model to production should be smooth, but rarely is. Many teams invest weeks fine-tuning mode…

NVIDIA Developer Blog · Tutorial · #fine-tuning #inference

73d

07

Navigating EU AI Act requirements for LLM fine-tuning on Amazon SageMaker AI

Artificial Intelligence Navigating EU AI Act requirements for LLM fine-tuning on Amazon SageMaker AI The EU AI Act requi…

AWS Machine Learning Blog · Tutorial · #fine-tuning #open-source

73d

08

Android is getting a big AI overhaul in 2026

Google’s I/O conference is next week, and we expect to hear a lot about the company’s AI endeavors. The company says the…

Ars Technica AI · Model · #gemini #fine-tuning

73d

09

Course correction: Google to link more sources in AI Overviews

The top of a Google search page is prime real estate, but it has primarily been the domain of AI Overviews for the past …

Ars Technica AI · Research · #fine-tuning

77d

10

EMO: Pretraining mixture of experts for emergent modularity

EMO: Pretraining mixture of experts for emergent modularity Today we're releasing EMO, a new mixture-of-experts (MoE) mo…

Hugging Face Blog · Model · #fine-tuning #coding #training

77d

11

MedQA: Fine-Tuning a Clinical AI on AMD ROCm — No CUDA Required

MedQA: Fine-Tuning a Clinical AI on AMD ROCm — No CUDA Required The Idea Medical question answering is one of those task…

Hugging Face Blog · Hardware · #fine-tuning #gpu

77d

12

Introducing Multi-LoRA on Cerebras Inference May 06, 2026

Today, we are launching Multi-LoRA—multi-adapter support for Low-Rank Adaptation—on Cerebras Inference in private previe…

Cerebras Blog · Tutorial · #fine-tuning #inference #training

78d

13

Quoting Andy Masley

4th May 2026 [...] Between 2000 and 2024, farmers sold in total a Colorado-sized chunk of land all on their own, 77 time…

Simon Willison Blog · Model · #fine-tuning

81d

14

Capacity-aware inference: Automatic instance fallback for SageMaker AI endpoints

Artificial Intelligence Capacity-aware inference: Automatic instance fallback for SageMaker AI endpoints As organization…

AWS Machine Learning Blog · Infra · #fine-tuning #inference #multimodal

81d

15

Configuring Amazon Bedrock AgentCore Gateway for secure access to private resources

Artificial Intelligence Configuring Amazon Bedrock AgentCore Gateway for secure access to private resources AI agents in…

AWS Machine Learning Blog · Infra · #fine-tuning #multimodal

85d

16

Reinforcement fine-tuning with LLM-as-a-judge

Artificial Intelligence Reinforcement fine-tuning with LLM-as-a-judge Large language models (LLMs) now drive the most ad…

AWS Machine Learning Blog · Model · #fine-tuning

85d

17

4/27/2026 DeepSeek V4 Pro: Validating Frontier Models For Production

Why we chose correctness over a Day-0 launch DeepSeek V4 Pro is one of the most important open-model releases this year,…

Fireworks AI Blog · Infra · #fine-tuning #inference

88d

18

Build Strands Agents with SageMaker AI models and MLflow

Artificial Intelligence Build Strands Agents with SageMaker AI models and MLflow Enterprises building AI agents often re…

AWS Machine Learning Blog · Tutorial · #agents #fine-tuning #observability

88d

19

Build with DeepSeek V4 Using NVIDIA Blackwell and GPU-Accelerated Endpoints

DeepSeek just launched its fourth generation of flagship models with DeepSeek-V4-Pro and DeepSeek-V4-Flash, both targete…

NVIDIA Developer Blog · Tutorial · #fine-tuning #gpu

91d · 1 view

20

AutoAdapt: Automated domain adaptation for large language models

At a glance - Problem: Adapting large language models to specialized, high-stakes domains is slow, expensive, and hard t…

Microsoft Research Blog · Infra · #rag #agents #fine-tuning

93d

21

Nova Forge SDK series part 2: Practical guide to fine-tune Nova models using data mixing capabilities

Artificial Intelligence Nova Forge SDK series part 2: Practical guide to fine-tune Nova models using data mixing capabil…

AWS Machine Learning Blog · Tutorial · #fine-tuning #training

98d

22

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers As a practical example, I'll w…

Hugging Face Blog · Infra · #fine-tuning #multimodal #training

99d

23

Cost-efficient custom text-to-SQL using Amazon Nova Micro and Amazon Bedrock on-demand inference

Artificial Intelligence Cost-efficient custom text-to-SQL using Amazon Nova Micro and Amazon Bedrock on-demand inference…

AWS Machine Learning Blog · Model · #fine-tuning #inference

99d

24

4/6/2026 Own Your AI: Fireworks Training Preview

Fireworks Training is now in preview: an end-to-end platform for training and deploying frontier models at scale. Three …

Fireworks AI Blog · Infra · #fine-tuning #inference #training

109d

25

Industrial policy for the Intelligence Age

Industrial policy for the Intelligence Age Ideas to keep people first. As we move toward superintelligence, incremental …

OpenAI Blog · Research · #fine-tuning

109d

26

4/3/2026 Scaling and Optimizing Frontier Model Training

How Fireworks scales frontier model training and offers the broadest set of fine-tunable MoE models on any …

Fireworks AI Blog · Hardware · #fine-tuning #inference #training

112d

27

Training mRNA Language Models Across 25 Species for $165

Training mRNA Language Models Across 25 Species for $165 Part II: Building the Pipeline, From Structure Prediction to Co…

Hugging Face Blog · Hardware · #agents #fine-tuning #coding

115d

28

3/28/2026 The Fine-Tuning Bottleneck Isn't the Algorithm

TL;DR: Integration friction and slow iteration cycles are the bottlenecks that actually stall fine-tuning — not the algo…

Fireworks AI Blog · Model · #fine-tuning #training

118d

29

Build a Domain-Specific Embedding Model in Under a Day

Build a Domain-Specific Embedding Model in Under a Day With a single GPU and less than a day of training time, you can t…

Hugging Face Blog · Research · #fine-tuning #training #embeddings

126d

30

Production AI Playbook: Human Oversight

This post is part of a series that explores strategies, shares best practices, and provides practical examples for build…

n8n Blog · Tutorial · #fine-tuning

137d

31

Ulysses Sequence Parallelism: Training with Million-Token Contexts

Ulysses Sequence Parallelism: Training with Million-Token Contexts Ulysses Sequence Parallelism (part of the Arctic Long…

Hugging Face Blog · Research · #fine-tuning #benchmark #training

137d

32

Develop Native Multimodal Agents with Qwen3.5 VLM Using NVIDIA GPU-Accelerated Endpoints

Alibaba has introduced the new open source Qwen3.5 series built for native multimodal agents. The first model in this se…

NVIDIA Developer Blog · Hardware · #qwen #fine-tuning #multimodal

147d

33

Efficiently serve dozens of fine-tuned models with vLLM on Amazon SageMaker AI and Amazon Bedrock

Feb 26, 2026 · 11 min read. Organizations and individuals running multiple custom AI models, especially recent Mixture of Experts (MoE) model families, can face the challenge of paying for idle GPU capacity when the...

vLLM Blog · Hardware · #fine-tuning #inference

148d

34

Delivering contextual job matching for millions with OpenAI

Indeed uses OpenAI to deliver contextual job matching to millions of job seekers Indeed, whose mi…

OpenAI Blog · Model · #fine-tuning

149d

35

OpenAI o1 and new tools for developers

OpenAI o1 and new tools for developers Introducing OpenAI o1, Realtime API improvements, a new fine-tuning method and mo…

OpenAI Blog · Model · #fine-tuning #coding

149d

36

Train AI models with Unsloth and Hugging Face Jobs for FREE

Train AI models with Unsloth and Hugging Face Jobs for FREE LiquidAI/LFM2.5-1.2B-Instruct ) through coding agents like C…

Hugging Face Blog · Infra · #claude #fine-tuning #coding

154d

37

How to Build License-Compliant Synthetic Data Pipelines for AI Model Distillation

Specialized AI models are built to perform specific tasks or solve particular problems. But if you’ve ever tried to fine…

NVIDIA Developer Blog · Tutorial · #fine-tuning

169d

38

Build with Kimi K2.5 Multimodal VLM Using NVIDIA GPU-Accelerated Endpoints

Kimi K2.5 is the newest open vision language model (VLM) from the Kimi family of models. Kimi K2.5 is a general-purpose …

NVIDIA Developer Blog · Tutorial · #fine-tuning #multimodal #gpu

170d

39

1/30/2026 The Missing Piece of the OpenClaw Mania: Truly ‘Own Your AI’ with Fireworks AI

Building a "Personal Operating System" means nothing if you don't control the brain. Move your OpenClaw agent onto secur…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

175d

40

1/26/2026 Kimi K2.5 is Live on Fireworks: Vibe Coding, Agents, and Full-Parameter RFT

Kimi K2.5 is Moonshot AI’s flagship agentic model and a new SOTA open model. It unifies vision and text, thinking and no…

Fireworks AI Blog · Infra · #agents #fine-tuning #inference

179d

41

1/23/2026 Turning Production Logs into Evaluation Datasets: A Data-Driven Approach

If you are running an LLM in production, you have access to the most valuable resource for improving your model: your ac…

Fireworks AI Blog · Research · #fine-tuning #inference #open-source

182d

42

12/31/2025 DPO, your simplest RL pipeline with two rollouts

A recent research paper, "IT TAKES TWO: YOUR GRPO IS SECRETLY DPO", bridged DPO and GRPO by framing both DPO and GRPO un…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

205d

43

Codex is Open Sourcing AI models

Codex is Open Sourcing AI models Building on our work to get Claude Code to train open source models, we are now getting…

Hugging Face Blog · Tutorial · #claude #agents #fine-tuning

225d

44

9/12/2025 Understanding Embeddings and Reranking at Scale

Retrieval-Augmented Generation has emerged as the dominant paradigm for grounding large language models with external kn…

Fireworks AI Blog · Infra · #fine-tuning #inference #embeddings

227d

45

8/12/2025 Quality first: how Fireworks.ai is the go-to place for gpt-oss

It’s been an incredible week for the open-source AI community. The release of GPT-OSS marked a significant milestone, op…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

228d

46

5/12/2025 Supervised Fine-Tuning (SFT) with LoRA on Fireworks AI: Tutorial

Supervised Fine-Tuning (SFT) is critical for adapting general-purpose Large Language Models (LLMs) to domain-specific ta…

Fireworks AI Blog · Tutorial · #fine-tuning #inference

231d

47

We Got Claude to Fine-Tune an Open Source LLM

We Got Claude to Fine-Tune an Open Source LLM We gave Claude the ability to fine-tune language models using a new tool c…

Hugging Face Blog · Open Source · #claude #fine-tuning #open-source

232d

48

3/12/2025 Fine-Tuning DeepSeek v3 & R1 to optimize quality, latency, & cost

At Fireworks, we’re happy to announce customization of DeepSeek R1 & V3, through Quantization Aware Fine Tuning, is now …

Fireworks AI Blog · Infra · #fine-tuning #inference

233d

49

OVHcloud on Hugging Face Inference Providers 🔥

OVHcloud on Hugging Face Inference Providers 🔥 We're thrilled to share that OVHcloud is now a supported Inference Provi…

Hugging Face Blog · Research · #llama #qwen #fine-tuning

242d

50

20x Faster TRL Fine-tuning with RapidFire AI

20x Faster TRL Fine-tuning with RapidFire AI Why this matters When fine-tuning or post-training LLMs, teams often do not…

Hugging Face Blog · Model · #fine-tuning

245d

51

11/20/2025 Eval Protocol: RL on your agents, in any environment

Eval Protocol (EP) is an open-source, language-agnostic framework that makes it easy to do reinforcement fine-tuning on …

Fireworks AI Blog · Research · #agents #fine-tuning #open-source

246d

52

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models When MiniMax published their M2 post-mortem in Oc…

Hugging Face Blog · Infra · #fine-tuning #inference #training

247d

53

11/19/2025 Fireworks Achieves Triple ISO Certification, giving Enterprises Full Control and Trust in AI at Scale

AI adoption is accelerating, but enterprises face risk without verifiable security, privacy, and governance. Fireworks d…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

247d

54

Join the AMD Open Robotics Hackathon

Join the AMD Open Robotics Hackathon Looking to show off your robotics aptitude? The AMD Open Robotics Hackathon hosted …

Hugging Face Blog · Tutorial · #fine-tuning

253d

55

7/11/2025 Understanding Function Calling: The Bridge to Agentic AI

Large language models (LLMs) have revolutionized natural language processing by generating impressive text based on mass…

Fireworks AI Blog · Infra · #agents #fine-tuning #inference

259d

56

Doppel’s AI defense system stops attacks before they spread

Doppel’s AI defense system stops attacks before they spread With GPT‑5 and reinforcement fine-tuning (RFT), Doppel cut a…

OpenAI Blog · Model · #fine-tuning

269d

57

From Monolithic to Modular: Scaling Semantic Routing with Extensible LoRA

Oct 27, 2025 · 8 min read. Semantic routing systems face a scaling challenge. When each classification request requires running multiple fine-tuned models independently, the computational cost grows linearly with the number...

vLLM Blog · Infra · #fine-tuning

270d

58

No More Retokenization Drift: Returning Token IDs via the OpenAI Compatible API Matters in Agent RL

Oct 22, 2025 · 9 min read. TL;DR. Agents often call LLMs via OpenAI‑compatible endpoints, which previously returned only string-based inputs and outputs. In agent RL, this can lead to inconsistencies between training and...

vLLM Blog · Hardware · #agents #fine-tuning #training

275d

59

Supercharge your OCR Pipelines with Open Models

Supercharge your OCR Pipelines with Open Models Chandra and OlmOCR-2 to this blog, as well as OlmOCR Scores of the model…

Hugging Face Blog · Tutorial · #fine-tuning #multimodal #local

276d

60

10/15/2025 LLM on the edge: Model picking with Fireworks Eval Protocol + Ollama

Modern AI apps rarely run on a single model forever. Teams iterate, swap providers, and increasingly run open-source mod…

Fireworks AI Blog · Infra · #llama #fine-tuning #inference

282d

61

12/10/2025 Best Practices for Multi-Turn RL

How to train LLM agents that can reliably plan, call tools, and recover from their own mistakes. Introduction In the evo…

Fireworks AI Blog · Tutorial · #agents #fine-tuning #training

285d

62

11/10/2025 Fireworks RFT: Build AI agents with fine-tuned open models that outperform frontier closed models

Fireworks RFT enables you to fine-tune frontier open models like DeepSeek V3 and Kimi K2 for your agentic product. Gensp…

Fireworks AI Blog · Research · #agents #fine-tuning #inference

286d

63

Introducing AgentKit, new Evals, and RFT for agents

Today we’re launching AgentKit, a complete set of tools for developers and enterprises to build, deploy, and optimize ag…

OpenAI Blog · Model · #fine-tuning #coding #benchmark

291d

64

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs Complex Scenarios Missing You start with a simple data…

Hugging Face Blog · Model · #fine-tuning #training

305d

65

9/22/2025 Traces Are All You Need (to rank LLMs)

From your existing observability platform logs to a data-driven model leaderboard in minutes – quickly compare candidate…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

305d

66

11/9/2025 Modernizing Healthcare with AI: How RADPAIR and Fireworks Unlock Smarter Radiology Workflows

Executive Summary RADPAIR is transforming radiology workflows with its intent to create an open-source SDK standard – an…

Fireworks AI Blog · Infra · #agents #fine-tuning #inference

316d

67

Fine-tune Any LLM from the Hugging Face Hub with Together AI

Fine-tune Any LLM from the Hugging Face Hub with Together AI But here's the challenge: finding an amazing model is just …

Hugging Face Blog · Infra · #fine-tuning #inference

317d

68

10/9/2025 Announcing Embeddings and Reranking On Fireworks AI

Today, we're announcing a major upgrade to Fireworks for RAG workloads – we’re bringing the state-of-the-art Qwen3 8B Em…

Fireworks AI Blog · Infra · #fine-tuning #inference #embeddings

317d

69

Torch compile caching for inference speed

Torch compile caching for inference speed We now cache torch.compile artifacts to reduce boot times for models that use …

Replicate Blog · Tutorial · #fine-tuning #inference #coding

319d

70

6/9/2025 Reinforcement Fine Tuning (Beta): Train expert open models to surpass closed frontier models

Today, we’re excited to announce the beta release of Reinforcement Fine-Tuning (RFT), a powerful new technique to create…

Fireworks AI Blog · Agents · #agents #fine-tuning #coding

321d

71

8/26/2025 DeepSeek V3.1 now on Fireworks AI!

TL;DR DeepSeek V3.1 is a major leap forward in open‑source LLMs. It introduces hybrid reasoning modes (“thinking” vs. “n…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

332d

72

8/25/2025 LLM Eval Driven Development with Claude Code

In our previous blog, we showed how to go from one test to many tests with Eval Protocol with Cursor. But what if you're…

Fireworks AI Blog · Infra · #claude #fine-tuning #inference

333d

73

8/15/2025 Your AI Benchmark is Lying to You. Here's How We Caught It

Your AI Benchmark is Lying to You. Here's How We Caught It Would you give GPT-4.1 an A grade for this image? We sure wou…

Fireworks AI Blog · Research · #fine-tuning #inference #benchmark

343d

74

8/14/2025 Test-Driven Agent Development with Eval Protocol

Building AI agents is exciting, but let's be honest: they can be unpredictable. How do you add new features without secr…

Fireworks AI Blog · Infra · #agents #fine-tuning #inference

344d

75

TextQuests: How Good are LLMs at Text-Based Video Games?

TextQuests: How Good are LLMs at Text-Based Video Games? Two core avenues exist to evaluate autonomous agents: either us…

Hugging Face Blog · Research · #claude #gemini #agents

346d

76

Vision Language Model Alignment in TRL ⚡️

Vision Language Model Alignment in TRL ⚡️ Introduction Vision Language Models (VLMs) are getting stronger, but aligning …

Hugging Face Blog · Tutorial · #fine-tuning #multimodal #training

351d

77

Estimating worst case frontier risks of open weight LLMs

Estimating worst case frontier risks of open weight LLMs In this paper, we study the worst-case frontier risks of releas…

OpenAI Blog · Research · #fine-tuning #open-source

353d

78

Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio

Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio For Python developers, Gradio makes implementin…

Hugging Face Blog · API · #fine-tuning #coding

358d

79

7/31/2025 Run bulk async workloads with Fireworks Batch API

With Fireworks’ Batch API, you can asynchronously run large volumes of requests on 1000+ open or finetuned models with n…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

358d

80

7/30/2025 Fireworks Real-World Benchmarks: Find the Best OSS Model for the Job

The open-source model landscape is exploding, making it hard to choose the right model. To help you cut through the nois…

Fireworks AI Blog · Research · #fine-tuning #inference #benchmark

359d

81

7/29/2025 Introducing Vision-Language Model Fine-tuning: Tailor VLMs to Your Domain

Fireworks AI now offers supervised fine-tuning for Vision-Language Models (Qwen 2.5 VL family), letting you adapt state-…

Fireworks AI Blog · Infra · #fine-tuning #inference #multimodal

360d

82

7/25/2025 How Notion Cuts Latency 4x and Scales Enterprise AI Workflows with Fireworks AI

Notion’s journey from individual users to enterprise powerhouse showcases how Fireworks AI enables scalable, reliable, a…

Fireworks AI Blog · Infra · #agents #fine-tuning #inference

364d

83

Fast LoRA inference for Flux with Diffusers and PEFT

Fast LoRA inference for Flux with Diffusers and PEFT In this post, we take the Flux.1-Dev model for text-to-image genera…

Hugging Face Blog · Model · #fine-tuning #inference

366d

84

7/22/2025 VibeRL: When AI Trains AI

Reinforcement Learning (RL) isn't new. Think about it like training a pet - you give a command, your pet performs an act…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

367d

85

7/22/2025 A Deep Dive into MLA training/inference difference and why QK-Clip from Kimi is such an elegant idea

Today, we're unpacking a clever insight from the researchers behind Kimi K2, a powerful LLM from Moonshot AI. This all s…

Fireworks AI Blog · Infra · #fine-tuning #inference #training

367d

86

Generate consistent characters

Generate consistent characters Until recently, the best way to generate images of a consistent character was from a trai…

Replicate Blog · Infra · #agents #fine-tuning

368d

87

7/17/2025 Sentient & Fireworks Powers Decentralized AI At Viral Scale

Backed by $85 million from Founders Fund, Pantera, Framework Ventures, and Polygon Labs, Sentient unites Sandeep Nailwal…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

372d

88

7/15/2025 Deep-dive into MuonClip: Fixing Attention Score Explosions in Transformer Training

Interactive visualization for MuonClip, brought to you from Fireworks.ai With the release of Kimi-K2, a state of the art…

Fireworks AI Blog · Infra · #fine-tuning #inference #training

374d

89

7/15/2025 Fireworks AI Now Supports Amazon SageMaker

We’re thrilled to announce Fireworks AI has now made Amazon SageMaker available as a Bring Your Own Compute (BYOC) deplo…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

374d

90

Training and Finetuning Sparse Embedding Models with Sentence Transformers

Training and Finetuning Sparse Embedding Models with Sentence Transformers Finetuning sparse embedding models involves s…

Hugging Face Blog · Model · #fine-tuning #training #embeddings

388d

91

6/22/2025 Unlock Your Tools: Fireworks Adds OpenAI-Response API with MCP Support (Beta)

TL;DR: Fireworks now supports an OpenAI-response API endpoint that allows you to connect our library of leading open mod…

Fireworks AI Blog · Infra · #fine-tuning #inference #open-source

397d

92

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware In our previous post, Exploring Quantization Backends in Diffusers, w…

Hugging Face Blog · Hardware · #fine-tuning

400d

93

Toward understanding and preventing misalignment generalization

Toward understanding and preventing misalignment generalization A misaligned persona feature controls emergent misalignm…

OpenAI Blog · Research · #fine-tuning #training #safety

401d

94

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm Introduction NVIDIA Isaac GR00T (Generalist Robot 00 Technology) i…

Hugging Face Blog · Tutorial · #fine-tuning #multimodal #coding

408d

95

Summarize Hacker News Posts with Haystack & OPEA

Daniel Fleischer, Research Engineer at Intel Labs · June 10, 2025. Build a RAG pipeline to fetch live Hacker News posts and summarize them with a local LLM endpoint

Haystack (deepset) Blog · Research · #rag #fine-tuning #observability

409d

96

10/6/2025 Deep-Dive into LLM Fine-Tuning

Fine-tuning large language models (LLMs) has become one of the most critical levers for adapting general-purpose models …

Fireworks AI Blog · Infra · #fine-tuning #inference

409d

97

LoRA Fine-Tune Support Now Live on GroqCloud

LoRA Fine-Tune Support Now Live on GroqCloud GroqCloud now supports Low-Rank Adaptation (LoRA) fine-tunes, exclusively b…

Groq Blog · Infra · #fine-tuning #inference

416d

98

Run 30,000+ LoRAs on Hugging Face with Replicate

Run 30,000+ LoRAs on Hugging Face with Replicate LoRAs have become the leading way to train image models to express spec…

Replicate Blog · Infra · #fine-tuning #inference

435d

99

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. In this blogpost, we present the key…

Hugging Face Blog · Model · #fine-tuning

435d

100

Blazingly fast whisper transcriptions with Inference Endpoints

Blazingly fast whisper transcriptions with Inference Endpoints Through this release, we would like to make Inference End…

Hugging Face Blog · API · #fine-tuning #inference

437d