Red Hat OpenShift AI
Integrations
- InstructLab (LAB)
- vLLM
- Model Context Protocol (MCP)
- Llama Stack
- Docling
- NVIDIA NIXL
- Tekton Pipelines
Pricing Details
- Standard Red Hat subscription model, extended with 'AI Units' for flexible tracking of compute and MaaS consumption.
Features
- llm-d Distributed Inference Engine
- InstructLab Taxonomy-based Model Customization
- Model Context Protocol (MCP) native integration
- Model-as-a-Service (MaaS) with AI Gateway
- TrustyAI Governance and Bias Monitoring
Description
Red Hat OpenShift AI 3.0: Agentic & Distributed Architecture
As of January 2026, RHOAI has transitioned to version 3.0, with a focus on distributed inference and support for autonomous agents. The platform integrates llm-d, a high-performance engine that optimizes LLM serving on Kubernetes via the Gateway API Inference Extension.
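For illustration, a minimal client-side sketch of calling a model served by an llm-d deployment through the inference gateway, assuming the OpenAI-compatible completion API that vLLM-based servers expose; the gateway route, model name, and token below are hypothetical placeholders.

```python
import requests

# Hypothetical route exposed by the inference gateway in front of an llm-d deployment.
GATEWAY_URL = "https://llm-gateway.apps.example.com/v1/chat/completions"
MODEL_NAME = "granite-3-8b-instruct"  # placeholder model identifier

def chat(prompt: str, token: str) -> str:
    """Send a single chat completion request through the inference gateway."""
    response = requests.post(
        GATEWAY_URL,
        headers={"Authorization": f"Bearer {token}"},
        json={
            "model": MODEL_NAME,
            "messages": [{"role": "user", "content": prompt}],
            "max_tokens": 256,
        },
        timeout=60,
    )
    response.raise_for_status()
    return response.json()["choices"][0]["message"]["content"]

if __name__ == "__main__":
    print(chat("Summarize the benefits of disaggregated prefill.", token="<api-token>"))
```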
Orchestration & Model Alignment Layer
The core stack now features an integrated InstructLab toolkit, enabling taxonomy-based model customization without catastrophic forgetting.
- llm-d Distributed Serving: Uses NVIDIA NIXL and DeepEP for low-latency Mixture-of-Experts (MoE) communication, enabling large models to scale across multi-node GPU clusters.
- Agentic Infrastructure: Native support for the Model Context Protocol (MCP) and Llama Stack APIs, enabling AI agents that interact with enterprise data through standardized connectors (a minimal client sketch follows this list).
- Hardware Profiles: Replaces legacy accelerator profiles, providing granular control over NPU, GPU (H200/B200), and IBM Z/Power resource allocation.
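To make the agentic layer concrete, here is a minimal sketch using the reference MCP Python SDK to connect an agent to a data connector; the server command and the `search_documents` tool are hypothetical stand-ins for an enterprise MCP server.

```python
import asyncio

from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

# Hypothetical MCP server wrapping an enterprise data source.
server = StdioServerParameters(command="python", args=["enterprise_data_server.py"])

async def main() -> None:
    async with stdio_client(server) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()

            # Discover the tools the connector exposes, then invoke one of them.
            tools = await session.list_tools()
            print("Available tools:", [t.name for t in tools.tools])

            result = await session.call_tool(
                "search_documents",  # hypothetical tool name
                arguments={"query": "Q4 revenue by region"},
            )
            print(result.content)

if __name__ == "__main__":
    asyncio.run(main())
```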
Data Ingestion & TrustyAI Governance
RHOAI 3.0 incorporates the Docling project for advanced unstructured data ingestion, converting complex documents into AI-ready formats for RAG and fine-tuning.
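As a minimal sketch of the ingestion step, the upstream Docling Python API can convert a source document into Markdown suitable for a RAG index or fine-tuning data preparation; the input file name is a placeholder.

```python
from docling.document_converter import DocumentConverter

# Convert a source document (PDF, DOCX, HTML, ...) into Docling's unified representation.
converter = DocumentConverter()
result = converter.convert("quarterly_report.pdf")  # placeholder path or URL

# Export to Markdown, an AI-ready format for downstream chunking and indexing.
markdown = result.document.export_to_markdown()
print(markdown[:500])
```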
- TrustyAI Bias Monitoring: Automated detection of model drift and bias is now GA, utilizing the Kubernetes-native TrustyAI operator for real-time inference auditing.
- Models-as-a-Service (MaaS): A centralized model catalog and AI Gateway provide secure, metered access to internal models with built-in quota management (see the client sketch below).
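A sketch of what metered access through the AI Gateway might look like from the consumer side, assuming the gateway exposes an OpenAI-compatible endpoint; the base URL, API key, and model identifier are hypothetical values issued via the platform's catalog and quota management.

```python
from openai import OpenAI

# Hypothetical MaaS endpoint and per-team API key issued by the AI Gateway;
# requests made with this key count against the team's quota.
client = OpenAI(
    base_url="https://maas.apps.example.com/v1",
    api_key="team-finance-xxxxx",
)

completion = client.chat.completions.create(
    model="granite-3-8b-instruct",  # placeholder entry from the model catalog
    messages=[{"role": "user", "content": "Classify this invoice line item."}],
)
print(completion.choices[0].message.content)
```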
Evaluation Guidance
Technical architects should prioritize the llm-d engine for MoE model deployments to optimize TCO. Organizations migrating from 2.x must update their pipeline definitions to the 3.0 spec before they can use Hardware Profiles. Verify that your vector store integration uses the new Llama Stack compatibility layer to keep agent orchestration future-proof.
Tool Pros and Cons
Pros
- Streamlined AI/ML lifecycle
- Scalable Kubernetes platform
- Enhanced team collaboration
- Automated MLOps
- Simplified deployment
Cons
- Kubernetes expertise needed
- Learning curve
- OpenShift costs