Keras
Integrations
- JAX / XLA
- PyTorch / TorchInductor
- TensorFlow / LiteRT
- Hugging Face Hub
- Google Vertex AI
- OpenVINO (Intel)
Pricing Details
- Keras is free to use under the Apache License 2.0.
- Enterprise costs are associated with the infrastructure of the chosen backend (GCP for JAX/TPUs, AWS/Azure for PyTorch/GPUs).
Features
- Multi-backend engine (JAX, PyTorch, TensorFlow, OpenVINO)
- Native Quantization API (int8, int4, FP8, GPTQ)
- Agentic AI Integration via KerasHub
- Unified API for Custom Layers & Training Loops
- LiteRT (TFLite) Deployment Pathway
- Distributed Training via JIT/XLA Compilation
Description
Keras: Multi-Backend Orchestration & Agentic Review
As of early 2026, Keras functions as the definitive abstraction layer for deep learning, effectively decoupling high-level model semantics from backend-specific execution kernels. This architecture enables a 'write once, run anywhere' paradigm, allowing developers to target JAX for massive-scale research, PyTorch for ecosystem breadth, or TensorFlow for mobile/edge production.
Multi-Backend Execution & Interoperability
The core of the Keras 2026 stack is its backend-agnostic engine, which maps standardized Keras ops to native backend primitives.
- Multi-Backend Handover: Input: Keras model code (Functional or Sequential) → Process: Dynamic mapping to backend-specific graphs (XLA for JAX, TorchInductor for PyTorch) → Output: Hardware-optimized inference or training steps.
- Model Quantization: Input: High-precision weights (FP32) → Process: Native .quantize("int8") or .quantize("float8") call via the Keras Quantization API → Output: Compressed model with up to 4x VRAM savings.
- Adaptive Layers: 2026 updates include AdaptiveAveragePooling and ReversibleEmbedding layers, which dynamically adjust their logic based on input tensor rank and backend constraints.
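The handover described above can be sketched in a few lines: a minimal Keras 3 Functional model whose definition contains nothing backend-specific, so the execution engine is chosen purely by the KERAS_BACKEND environment variable (set before keras is imported). Defaulting to "tensorflow" here is only for the sketch; "jax" and "torch" are equally valid.

```python
# Minimal sketch of backend-agnostic execution in Keras 3.
# The engine is selected via KERAS_BACKEND *before* importing keras;
# "jax", "torch", and "tensorflow" are the supported engine names.
import os
os.environ.setdefault("KERAS_BACKEND", "tensorflow")  # swap for "jax"/"torch"

import numpy as np
import keras
from keras import layers

# A small Functional model -- nothing in this definition is backend-specific.
inputs = keras.Input(shape=(16,))
x = layers.Dense(32, activation="relu")(inputs)
outputs = layers.Dense(4, activation="softmax")(x)
model = keras.Model(inputs, outputs)

# The same Python call runs on whichever engine was selected above;
# the model code itself never changes.
probs = model.predict(np.random.rand(8, 16).astype("float32"), verbose=0)
print(probs.shape)
```

Because the model only uses standard Keras layers and ops, re-running the script with a different KERAS_BACKEND value exercises a different execution engine with no code changes.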
Agentic AI & Ecosystem Integration
Keras has expanded its modularity to include native support for Agentic AI through KerasHub and unified tool-calling protocols.
- Tool-Calling Integration: Input: Natural language business goal and tool definitions → Process: Intent-to-action mapping using specialized KerasHub agent presets → Output: Autonomous multi-step task execution with API grounding.
- LiteRT Export: Provides the standard pathway for deploying Keras models to edge NPUs and mobile devices, ensuring sub-100ms latency for on-device generative tasks.
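In current Keras releases, the LiteRT (TFLite) pathway mentioned above is typically reached by exporting a SavedModel and handing it to the TFLite converter. The sketch below assumes the TensorFlow backend and a throwaway model; it is an illustration of the export route, not a production recipe.

```python
# Sketch of the Keras -> LiteRT (TFLite) deployment pathway, assuming the
# TensorFlow backend: export a SavedModel, then convert it to a .tflite
# flatbuffer suitable for mobile/edge runtimes.
import os
import tempfile
os.environ.setdefault("KERAS_BACKEND", "tensorflow")

import tensorflow as tf
import keras
from keras import layers

model = keras.Sequential([
    keras.Input(shape=(28, 28, 1)),
    layers.Conv2D(8, 3, activation="relu"),
    layers.GlobalAveragePooling2D(),
    layers.Dense(10, activation="softmax"),
])

with tempfile.TemporaryDirectory() as tmp:
    saved = os.path.join(tmp, "saved_model")
    model.export(saved)  # writes a TF SavedModel with a serving signature
    converter = tf.lite.TFLiteConverter.from_saved_model(saved)
    tflite_bytes = converter.convert()  # flatbuffer a LiteRT runtime can load

print(len(tflite_bytes))
```

The resulting flatbuffer is what gets bundled into a mobile or edge application; on-device latency then depends on the delegate (NPU/GPU/CPU) the LiteRT runtime selects.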
Evaluation Guidance
Technical evaluators should verify the following architectural characteristics for 2026 deployments:
- Numerical Parity: Validate that custom-written operations produce consistent results across JAX and PyTorch backends to avoid gradient divergence during training.
- Quantization Accuracy: Benchmark the accuracy trade-off when using the new int4 and FP8 quantization modes for domain-specific LLMs (e.g., Gemma 2).
- JIT Compilation Overhead: Measure the 'warm-up' latency of XLA (JAX) versus TorchInductor (PyTorch) when initializing a Keras 3 model in a cold-start production environment.
Release History
Year-end update: Release of the Unified Training Framework. Synchronous training across hybrid clusters (e.g., JAX for compute, PyTorch for data loading).
Preview of Keras 4.0. Introduced 'Agentic Layers' that allow models to autonomously call external tools and APIs during inference.
Enhanced support for FP8 and int8 quantization across all backends. Improved performance for large-scale Transformer training on TPUs.
Native support for Google's Gemma models. Optimized KerasNLP workflows for fine-tuning open-weights models across different backends.
General availability of specialized libraries. Native support for complex Computer Vision and NLP tasks like Object Detection and LLM fine-tuning.
Revolutionary shift: Keras 3.0. Reintroduced multi-backend support (JAX, PyTorch, TensorFlow). Ability to run the same model on any engine.
Major update as Keras was integrated into the TensorFlow core (tf.keras). Became the official high-level API for TensorFlow 2.0.
Initial release by François Chollet. A high-level library supporting Theano and later TensorFlow. Focus on 'Deep Learning for humans'.
Tool Pros and Cons
Pros
- Rapid prototyping
- Simple & intuitive
- Multi-backend support
- Easy model building
- Flexible design
Cons
- Hides implementation details
- Limited low-level access
- Requires performance tuning