Dev Tools Synthesized from 3 sources

AWS Unveils Nova Forge SDK for Seamless Model Customization

Key Points

  • Nova Forge SDK lowers LLM customization barriers for enterprises
  • Supports SFT, DPO, and RFT to combat catastrophic forgetting
  • Stack Overflow case study uses 60,000-question dataset
  • Strands Evals evaluates non-deterministic AI agents
  • Full customization spectrum from Bedrock to SageMaker AI
References (3)
  1. [1] AWS Nova Forge SDK Simplifies Model Customization — AWS Machine Learning Blog
  2. [2] AWS Nova Forge SDK Enables Enterprise AI Customization — AWS Machine Learning Blog
  3. [3] Strands Evals: A Framework for Evaluating AI Agents — AWS Machine Learning Blog

AWS Aims to Democratize AI Model Customization

AWS has launched the Nova Forge SDK, a unified development kit designed to make customizing Amazon Nova models more accessible for enterprise teams. Announced on March 18, 2026, the SDK addresses one of the most persistent challenges in deploying large language models: tailoring general-purpose AI systems to specialized business requirements without requiring deep technical expertise.

The complexity of LLM customization has traditionally created a significant barrier for enterprises. Teams had to navigate intricate dependency management, select appropriate training images, configure recipes, and set up infrastructure—all of which demanded specialized knowledge and considerable time investment. Nova Forge SDK aims to eliminate these friction points by providing a streamlined, developer-friendly interface.

Addressing Catastrophic Forgetting

A critical issue in model fine-tuning is catastrophic forgetting, where models lose core capabilities—such as instruction-following, reasoning skills, and broad knowledge—as they are trained on specialized datasets. Nova Forge tackles this challenge through multiple customization approaches: Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Fine-Tuning (RFT).

The SDK supports the full spectrum of customization options, ranging from Amazon Bedrock-based adaptations to deep customization using Amazon SageMaker AI. Importantly, Nova Forge enables customers to start their development from early model checkpoints and blend proprietary datasets with Amazon-curated data, providing flexibility while maintaining model quality.

Practical Application: Stack Overflow Classification

AWS demonstrated Nova Forge's capabilities through a real-world case study involving Stack Overflow question quality classification. The company used a dataset of 60,000 questions from 2016-2020, classifying them into three categories: High Quality (HQ) posts requiring no edits, Low Quality requiring edits (LQ_EDIT), and Low Quality posts that should be closed (LQ_CLOSE).

The workflow involved evaluating baseline model performance, applying supervised fine-tuning to improve accuracy, then deploying reinforcement fine-tuning to further enhance response quality. After each customization phase, the team measured improvements across the training process before deploying the final model to an Amazon SageMaker AI Inference endpoint.

Strands Evals: Systematic Agent Evaluation

Complementing Nova Forge, AWS introduced Strands Evals, a framework specifically designed to evaluate AI agents in production environments. Unlike traditional software testing—which relies on deterministic outputs—AI agents generate varied responses even from identical inputs, making systematic evaluation challenging.

Strands Evals provides built-in evaluators, multi-turn simulation capabilities, and reporting tools for assessing agent quality dimensions that resist mechanical checking: helpfulness, coherence, and faithfulness to source materials. The framework addresses the complexity of multi-turn conversations where earlier responses affect subsequent interactions, enabling developers to measure whether agents take appropriate steps to reach their conclusions, not just whether the final output appears correct.

Industry Impact

These releases position AWS as a comprehensive platform for enterprise AI development, from initial model customization through production evaluation. By lowering technical barriers and providing systematic tooling, AWS is betting that the next wave of AI adoption will come from organizations customizing foundation models for their specific domains rather than building from scratch.

The SDK is available on GitHub at `aws/nova-customization-sdk`, with documentation covering prerequisites, setup instructions, and integration with Amazon SageMaker AI Training Jobs.

0:00