$ timeahead.in
$ articles --tag fine-tuning

#fine-tuning

100 articles

01
Announcing OpenAI-compatible API support for Amazon SageMaker AI endpoints
Today, Amazon SageMak…
AWS Machine Learning Blog · Infra · #fine-tuning #inference #langchain
26d
02
Unlocking asynchronicity in continuous batching
TL;DR: we explain how to separate CPU and GPU workloads to get a massive…
Hugging Face Blog · Tutorial · #fine-tuning #inference
32d
03
Fine-tune LLM with Databricks Unity Catalog and Amazon SageMaker AI
When you fine-tune large lan…
AWS Machine Learning Blog · Tutorial · #agents #fine-tuning #inference
33d
04
Build financial document processing with Pulse AI and Amazon Bedrock
Financial institutions proc…
AWS Machine Learning Blog · Tutorial · #fine-tuning
33d
05
llm 0.32a2
12th May 2026 A bunch of useful stuff in this LLM alpha, but the most important detail is this one: Most reasoning-capab…
Simon Willison Blog · Frameworks · #fine-tuning #observability
34d
06
How to Eliminate Pipeline Friction in AI Model Serving
The path from a trained AI model to production should be smooth, but rarely is. Many teams invest weeks fine-tuning mode…
NVIDIA Developer Blog · Tutorial · #fine-tuning #inference
34d
07
Navigating EU AI Act requirements for LLM fine-tuning on Amazon SageMaker AI
The EU AI Act requi…
AWS Machine Learning Blog · Tutorial · #fine-tuning #open-source
34d
08
Android is getting a big AI overhaul in 2026
Google’s I/O conference is next week, and we expect to hear a lot about the company’s AI endeavors. The company says the…
Ars Technica AI · Model · #gemini #fine-tuning
34d
09
Course correction: Google to link more sources in AI Overviews
The top of a Google search page is prime real estate, but it has primarily been the domain of AI Overviews for the past …
Ars Technica AI · Research · #fine-tuning
38d
10
EMO: Pretraining mixture of experts for emergent modularity
Today we're releasing EMO, a new mixture-of-experts (MoE) mo…
Hugging Face Blog · Model · #fine-tuning #coding #training
38d
11
MedQA: Fine-Tuning a Clinical AI on AMD ROCm — No CUDA Required
The Idea: Medical question answering is one of those task…
Hugging Face Blog · Hardware · #fine-tuning #gpu
38d
12
Introducing Multi-LoRA on Cerebras Inference May 06, 2026
Today, we are launching Multi-LoRA—multi-adapter support for Low-Rank Adaptation—on Cerebras Inference in private previe…
Cerebras Blog · Tutorial · #fine-tuning #inference #training
39d
13
Quoting Andy Masley
4th May 2026 [...] Between 2000 and 2024, farmers sold in total a Colorado-sized chunk of land all on their own, 77 time…
Simon Willison Blog · Model · #fine-tuning
42d
14
Capacity-aware inference: Automatic instance fallback for SageMaker AI endpoints
As organization…
AWS Machine Learning Blog · Infra · #fine-tuning #inference #multimodal
42d
15
Configuring Amazon Bedrock AgentCore Gateway for secure access to private resources
AI agents in…
AWS Machine Learning Blog · Infra · #fine-tuning #multimodal
46d
16
Reinforcement fine-tuning with LLM-as-a-judge
Large language models (LLMs) now drive the most ad…
AWS Machine Learning Blog · Model · #fine-tuning
46d
17
4/27/2026 DeepSeek V4 Pro: Validating Frontier Models For Production
Why we chose correctness over a Day-0 launch DeepSeek V4 Pro is one of the most important open-model releases this year,…
Fireworks AI Blog · Infra · #fine-tuning #inference
49d
18
Build Strands Agents with SageMaker AI models and MLflow
Enterprises building AI agents often re…
AWS Machine Learning Blog · Tutorial · #agents #fine-tuning #observability
49d
19
Build with DeepSeek V4 Using NVIDIA Blackwell and GPU-Accelerated Endpoints
DeepSeek just launched its fourth generation of flagship models with DeepSeek-V4-Pro and DeepSeek-V4-Flash, both targete…
NVIDIA Developer Blog · Tutorial · #fine-tuning #gpu
52d · 1 view
20
AutoAdapt: Automated domain adaptation for large language models
At a glance - Problem: Adapting large language models to specialized, high-stakes domains is slow, expensive, and hard t…
Microsoft Research Blog · Infra · #rag #agents #fine-tuning
54d
21
Nova Forge SDK series part 2: Practical guide to fine-tune Nova models using data mixing capabilities
Artificial Intelligence Nova Forge SDK series part 2: Practical guide to fine-tune Nova models using data mixing capabil…
AWS Machine Learning Blog · Tutorial · #fine-tuning #training
59d
22
Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers
As a practical example, I'll w…
Hugging Face Blog · Infra · #fine-tuning #multimodal #training
60d
23
Cost-efficient custom text-to-SQL using Amazon Nova Micro and Amazon Bedrock on-demand inference
Artificial Intelligence Cost-efficient custom text-to-SQL using Amazon Nova Micro and Amazon Bedrock on-demand inference…
AWS Machine Learning Blog · Model · #fine-tuning #inference
60d
24
4/6/2026 Own Your AI: Fireworks Training Preview
Fireworks Training is now in preview: an end-to-end platform for training and deploying frontier models at scale. Three …
Fireworks AI Blog · Infra · #fine-tuning #inference #training
70d
25
Industrial policy for the Intelligence Age
Ideas to keep people first. As we move toward superintelligence, incremental …
OpenAI Blog · Research · #fine-tuning
70d
26
4/3/2026 Scaling and Optimizing Frontier Model Training
How Fireworks scales frontier model training and offers the broadest set of fine-tunable MoE models on any …
Fireworks AI Blog · Hardware · #fine-tuning #inference #training
73d
27
Training mRNA Language Models Across 25 Species for $165
Part II: Building the Pipeline, From Structure Prediction to Co…
Hugging Face Blog · Hardware · #agents #fine-tuning #coding
76d
28
3/28/2026 The Fine-Tuning Bottleneck Isn't the Algorithm
TL;DR: Integration friction and slow iteration cycles are the bottlenecks that actually stall fine-tuning — not the algo…
Fireworks AI Blog · Model · #fine-tuning #training
79d
29
Build a Domain-Specific Embedding Model in Under a Day
With a single GPU and less than a day of training time, you can t…
Hugging Face Blog · Research · #fine-tuning #training #embeddings
87d
30
Production AI Playbook: Human Oversight
This post is part of a series that explores strategies, shares best practices, and provides practical examples for build…
n8n Blog · Tutorial · #fine-tuning
98d
31
Ulysses Sequence Parallelism: Training with Million-Token Contexts
Ulysses Sequence Parallelism (part of the Arctic Long…
Hugging Face Blog · Research · #fine-tuning #benchmark #training
98d
32
Develop Native Multimodal Agents with Qwen3.5 VLM Using NVIDIA GPU-Accelerated Endpoints
Alibaba has introduced the new open source Qwen3.5 series built for native multimodal agents. The first model in this se…
NVIDIA Developer Blog · Hardware · #qwen #fine-tuning #multimodal
108d
33
Efficiently serve dozens of fine-tuned models with vLLM on Amazon SageMaker AI and Amazon Bedrock
Feb 26, 2026 · 11 min read. Organizations and individuals running multiple custom AI models, especially recent Mixture of Experts (MoE) model families, can face the challenge of paying for idle GPU capacity when the...
vLLM Blog · Hardware · #fine-tuning #inference
109d
34
Delivering contextual job matching for millions with OpenAI
Indeed uses OpenAI to deliver contextual job matching to millions of job seekers. Indeed, whose mi…
OpenAI Blog · Model · #fine-tuning
110d
35
OpenAI o1 and new tools for developers
Introducing OpenAI o1, Realtime API improvements, a new fine-tuning method and mo…
OpenAI Blog · Model · #fine-tuning #coding
110d
36
Train AI models with Unsloth and Hugging Face Jobs for FREE
[...] LiquidAI/LFM2.5-1.2B-Instruct) through coding agents like C…
Hugging Face Blog · Infra · #claude #fine-tuning #coding
115d
37
How to Build License-Compliant Synthetic Data Pipelines for AI Model Distillation
Specialized AI models are built to perform specific tasks or solve particular problems. But if you’ve ever tried to fine…
NVIDIA Developer Blog · Tutorial · #fine-tuning
130d
38
Build with Kimi K2.5 Multimodal VLM Using NVIDIA GPU-Accelerated Endpoints
Kimi K2.5 is the newest open vision language model (VLM) from the Kimi family of models. Kimi K2.5 is a general-purpose …
NVIDIA Developer Blog · Tutorial · #fine-tuning #multimodal #gpu
131d
39
1/30/2026 The Missing Piece of the OpenClaw Mania: Truly ‘Own Your AI’ with Fireworks AI
Building a "Personal Operating System" means nothing if you don't control the brain. Move your OpenClaw agent onto secur…
136d
40
1/26/2026 Kimi K2.5 is Live on Fireworks: Vibe Coding, Agents, and Full-Parameter RFT
Kimi K2.5 is Moonshot AI’s flagship agentic model and a new SOTA open model. It unifies vision and text, thinking and no…
Fireworks AI Blog · Infra · #agents #fine-tuning #inference
140d
41
1/23/2026 Turning Production Logs into Evaluation Datasets: A Data-Driven Approach
If you are running an LLM in production, you have access to the most valuable resource for improving your model: your ac…
Fireworks AI Blog · Research · #fine-tuning #inference #open-source
143d
42
12/31/2025 DPO, your simplest RL pipeline with two rollouts
A recent research paper, "IT TAKES TWO: YOUR GRPO IS SECRETLY DPO", bridged DPO and GRPO by framing both DPO and GRPO un…
166d
43
Codex is Open Sourcing AI models
Building on our work to get Claude Code to train open source models, we are now getting…
Hugging Face Blog · Tutorial · #claude #agents #fine-tuning
186d
44
9/12/2025 Understanding Embeddings and Reranking at Scale
Retrieval-Augmented Generation has emerged as the dominant paradigm for grounding large language models with external kn…
Fireworks AI Blog · Infra · #fine-tuning #inference #embeddings
188d
45
8/12/2025 Quality first: how Fireworks.ai is the go-to place for gpt-oss
It’s been an incredible week for the open-source AI community. The release of GPT-OSS marked a significant milestone, op…
189d
46
5/12/2025 Supervised Fine-Tuning (SFT) with LoRA on Fireworks AI: Tutorial
Supervised Fine-Tuning (SFT) is critical for adapting general-purpose Large Language Models (LLMs) to domain-specific ta…
Fireworks AI Blog · Tutorial · #fine-tuning #inference
192d
47
We Got Claude to Fine-Tune an Open Source LLM
We gave Claude the ability to fine-tune language models using a new tool c…
Hugging Face Blog · Open Source · #claude #fine-tuning #open-source
193d
48
3/12/2025 Fine-Tuning DeepSeek v3 & R1 to optimize quality, latency, & cost
At Fireworks, we’re happy to announce customization of DeepSeek R1 & V3, through Quantization Aware Fine Tuning, is now …
Fireworks AI Blog · Infra · #fine-tuning #inference
194d
49
OVHcloud on Hugging Face Inference Providers 🔥
We're thrilled to share that OVHcloud is now a supported Inference Provi…
Hugging Face Blog · Research · #llama #qwen #fine-tuning
203d
50
20x Faster TRL Fine-tuning with RapidFire AI
Why this matters: When fine-tuning or post-training LLMs, teams often do not…
Hugging Face Blog · Model · #fine-tuning
206d
51
11/20/2025 Eval Protocol: RL on your agents, in any environment
Eval Protocol (EP) is an open-source, language-agnostic framework that makes it easy to do reinforcement fine-tuning on …
Fireworks AI Blog · Research · #agents #fine-tuning #open-source
207d
52
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
When MiniMax published their M2 post-mortem in Oc…
Hugging Face Blog · Infra · #fine-tuning #inference #training
208d
53
11/19/2025 Fireworks Achieves Triple ISO Certification, giving Enterprises Full Control and Trust in AI at Scale
AI adoption is accelerating, but enterprises face risk without verifiable security, privacy, and governance. Fireworks d…
208d
54
Join the AMD Open Robotics Hackathon
Looking to show off your robotics aptitude? The AMD Open Robotics Hackathon hosted …
Hugging Face Blog · Tutorial · #fine-tuning
214d
55
7/11/2025 Understanding Function Calling: The Bridge to Agentic AI
Large language models (LLMs) have revolutionized natural language processing by generating impressive text based on mass…
Fireworks AI Blog · Infra · #agents #fine-tuning #inference
220d
56
Doppel’s AI defense system stops attacks before they spread
With GPT‑5 and reinforcement fine-tuning (RFT), Doppel cut a…
OpenAI Blog · Model · #fine-tuning
230d
57
From Monolithic to Modular: Scaling Semantic Routing with Extensible LoRA
Oct 27, 2025 · 8 min read. Semantic routing systems face a scaling challenge. When each classification request requires running multiple fine-tuned models independently, the computational cost grows linearly with the number...
vLLM Blog · Infra · #fine-tuning
231d
58
No More Retokenization Drift: Returning Token IDs via the OpenAI Compatible API Matters in Agent RL
Oct 22, 2025 · 9 min read. TL;DR. Agents often call LLMs via OpenAI‑compatible endpoints, which previously returned only string-based inputs and outputs. In agent RL, this can lead to inconsistencies between training and...
236d
59
Supercharge your OCR Pipelines with Open Models
[...] Chandra and OlmOCR-2 to this blog, as well as OlmOCR Scores of the model…
Hugging Face Blog · Tutorial · #fine-tuning #multimodal #local
237d
60
10/15/2025 LLM on the edge: Model picking with Fireworks Eval Protocol + Ollama
Modern AI apps rarely run on a single model forever. Teams iterate, swap providers, and increasingly run open-source mod…
Fireworks AI Blog · Infra · #llama #fine-tuning #inference
243d
61
12/10/2025 Best Practices for Multi-Turn RL
How to train LLM agents that can reliably plan, call tools, and recover from their own mistakes. Introduction In the evo…
Fireworks AI Blog · Tutorial · #agents #fine-tuning #training
246d
62
11/10/2025 Fireworks RFT: Build AI agents with fine-tuned open models that outperform frontier closed models
Fireworks RFT enables you to fine-tune frontier open models like DeepSeek V3 and Kimi K2 for your agentic product. Gensp…
Fireworks AI Blog · Research · #agents #fine-tuning #inference
247d
63
Introducing AgentKit, new Evals, and RFT for agents
Today we’re launching AgentKit, a complete set of tools for developers and enterprises to build, deploy, and optimize ag…
252d
64
SyGra: The One-Stop Framework for Building Data for LLMs and SLMs
Complex Scenarios Missing You start with a simple data…
Hugging Face Blog · Model · #fine-tuning #training
266d
65
9/22/2025 Traces Are All You Need (to rank LLMs)
From your existing observability platform logs to a data-driven model leaderboard in minutes – quickly compare candidate…
266d
66
11/9/2025 Modernizing Healthcare with AI: How RADPAIR and Fireworks Unlock Smarter Radiology Workflows
Executive Summary RADPAIR is transforming radiology workflows with its intent to create an open-source SDK standard – an…
Fireworks AI Blog · Infra · #agents #fine-tuning #inference
277d
67
Fine-tune Any LLM from the Hugging Face Hub with Together AI
But here's the challenge: finding an amazing model is just …
Hugging Face Blog · Infra · #fine-tuning #inference
278d
68
10/9/2025 Announcing Embeddings and Reranking On Fireworks AI
Today, we're announcing a major upgrade to Fireworks for RAG workloads – we’re bringing the state-of-the-art Qwen3 8B Em…
Fireworks AI Blog · Infra · #fine-tuning #inference #embeddings
278d
69
Torch compile caching for inference speed
We now cache torch.compile artifacts to reduce boot times for models that use …
Replicate Blog · Tutorial · #fine-tuning #inference #coding
280d
70
6/9/2025 Reinforcement Fine Tuning (Beta): Train expert open models to surpass closed frontier models
Today, we’re excited to announce the beta release of Reinforcement Fine-Tuning (RFT), a powerful new technique to create…
Fireworks AI Blog · Agents · #agents #fine-tuning #coding
282d
71
8/26/2025 DeepSeek V3.1 now on Fireworks AI!
TL;DR DeepSeek V3.1 is a major leap forward in open‑source LLMs. It introduces hybrid reasoning modes (“thinking” vs. “n…
293d
72
8/25/2025 LLM Eval Driven Development with Claude Code
In our previous blog, we showed how to go from one test to many tests with Eval Protocol with Cursor. But what if you're…
Fireworks AI Blog · Infra · #claude #fine-tuning #inference
294d
73
8/15/2025 Your AI Benchmark is Lying to You. Here's How We Caught It
Would you give GPT-4.1 an A grade for this image? We sure wou…
Fireworks AI Blog · Research · #fine-tuning #inference #benchmark
304d
74
8/14/2025 Test-Driven Agent Development with Eval Protocol
Building AI agents is exciting, but let's be honest: they can be unpredictable. How do you add new features without secr…
Fireworks AI Blog · Infra · #agents #fine-tuning #inference
305d
75
TextQuests: How Good are LLMs at Text-Based Video Games?
Two core avenues exist to evaluate autonomous agents: either us…
Hugging Face Blog · Research · #claude #gemini #agents
307d
76
Vision Language Model Alignment in TRL ⚡️
Introduction: Vision Language Models (VLMs) are getting stronger, but aligning …
Hugging Face Blog · Tutorial · #fine-tuning #multimodal #training
312d
77
Estimating worst case frontier risks of open weight LLMs
In this paper, we study the worst-case frontier risks of releas…
OpenAI Blog · Research · #fine-tuning #open-source
314d
78
Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio
For Python developers, Gradio makes implementin…
Hugging Face Blog · API · #fine-tuning #coding
319d
79
7/31/2025 Run bulk async workloads with Fireworks Batch API
With Fireworks’ Batch API, you can asynchronously run large volumes of requests on 1000+ open or finetuned models with n…
319d
80
7/30/2025 Fireworks Real-World Benchmarks: Find the Best OSS Model for the Job
The open-source model landscape is exploding, making it hard to choose the right model. To help you cut through the nois…
Fireworks AI Blog · Research · #fine-tuning #inference #benchmark
320d
81
7/29/2025 Introducing Vision-Language Model Fine-tuning: Tailor VLMs to Your Domain
Fireworks AI now offers supervised fine-tuning for Vision-Language Models (Qwen 2.5 VL family), letting you adapt state-…
Fireworks AI Blog · Infra · #fine-tuning #inference #multimodal
321d
82
7/25/2025 How Notion Cuts Latency 4x and Scales Enterprise AI Workflows with Fireworks AI
Notion’s journey from individual users to enterprise powerhouse showcases how Fireworks AI enables scalable, reliable, a…
Fireworks AI Blog · Infra · #agents #fine-tuning #inference
325d
83
Fast LoRA inference for Flux with Diffusers and PEFT
In this post, we take the Flux.1-Dev model for text-to-image genera…
Hugging Face Blog · Model · #fine-tuning #inference
327d
84
7/22/2025 VibeRL: When AI Trains AI
Reinforcement Learning (RL) isn't new. Think about it like training a pet - you give a command, your pet performs an act…
328d
85
7/22/2025 A Deep Dive into MLA training/inference difference and why QK-Clip from Kimi is such an elegant idea
Today, we're unpacking a clever insight from the researchers behind Kimi K2, a powerful LLM from Moonshot AI. This all s…
Fireworks AI Blog · Infra · #fine-tuning #inference #training
328d
86
Generate consistent characters
Until recently, the best way to generate images of a consistent character was from a trai…
Replicate Blog · Infra · #agents #fine-tuning
329d
87
7/17/2025 Sentient & Fireworks Powers Decentralized AI At Viral Scale
Backed by $85 million from Founders Fund, Pantera, Framework Ventures, and Polygon Labs, Sentient unites Sandeep Nailwal…
333d
88
7/15/2025 Deep-dive into MuonClip: Fixing Attention Score Explosions in Transformer Training
Interactive visualization for MuonClip, brought to you from Fireworks.ai With the release of Kimi-K2, a state of the art…
Fireworks AI Blog · Infra · #fine-tuning #inference #training
335d
89
7/15/2025 Fireworks AI Now Supports Amazon SageMaker
We’re thrilled to announce Fireworks AI has now made Amazon SageMaker available as a Bring Your Own Compute (BYOC) deplo…
335d
90
Training and Finetuning Sparse Embedding Models with Sentence Transformers
Finetuning sparse embedding models involves s…
Hugging Face Blog · Model · #fine-tuning #training #embeddings
349d
91
6/22/2025 Unlock Your Tools: Fireworks Adds OpenAI-Response API with MCP Support (Beta)
TL;DR: Fireworks now supports an OpenAI-response API endpoint that allows you to connect our library of leading open mod…
358d
92
(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware
In our previous post, Exploring Quantization Backends in Diffusers, w…
Hugging Face Blog · Hardware · #fine-tuning
361d
93
Toward understanding and preventing misalignment generalization
A misaligned persona feature controls emergent misalignm…
OpenAI Blog · Research · #fine-tuning #training #safety
362d
94
Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm
Introduction: NVIDIA Isaac GR00T (Generalist Robot 00 Technology) i…
Hugging Face Blog · Tutorial · #fine-tuning #multimodal #coding
369d
95
Summarize Hacker News Posts with Haystack & OPEA
Daniel Fleischer, Research Engineer at Intel Labs · June 10, 2025 · LLM, RAG. Build a RAG pipeline to fetch live Hacker News posts and summarize them with a local LLM endpoint
Haystack (deepset) Blog · Research · #rag #fine-tuning #observability
370d
96
10/6/2025 Deep-Dive into LLM Fine-Tuning
Fine-tuning large language models (LLMs) has become one of the most critical levers for adapting general-purpose models …
Fireworks AI Blog · Infra · #fine-tuning #inference
370d
97
LoRA Fine-Tune Support Now Live on GroqCloud
GroqCloud now supports Low-Rank Adaptation (LoRA) fine-tunes, exclusively b…
377d
98
Run 30,000+ LoRAs on Hugging Face with Replicate
LoRAs have become the leading way to train image models to express spec…
Replicate Blog · Infra · #fine-tuning #inference
396d
99
Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.
In this blogpost, we present the key…
Hugging Face Blog · Model · #fine-tuning
396d
100
Blazingly fast whisper transcriptions with Inference Endpoints
Through this release, we would like to make Inference End…
Hugging Face Blog · API · #fine-tuning #inference
398d