Simple pricing for complex AI systems.
Monthly credits. Predictable costs. No surprises.
Team
$950 / mo + usage
*$950 credited to usage monthly
For teams building AI from experiment to production.
1,000 concurrent actions
Workflows can make on-the-fly decisions at runtime with real-time logic, conditions, and retries.
30-day data retention
Workflows can automatically recover from interruptions and continue where they left off.
1 cluster
Test and debug tasks in your local environment using the same Python SDK that runs in production on Kubernetes.
Enterprise
Custom
Significant volume discounts
For organizations building mission-critical AI, ML, and agents.
Build and scale dynamic AI/ML workflows using Flyte’s open-source platform and community.
Custom concurrent actions
Workflows can make on-the-fly decisions at runtime with real-time logic, conditions, and retries.
Custom retention options
Workflows can automatically recover from interruptions and continue where they left off.
3+ clusters
Test and debug tasks in your local environment using the same Python SDK that runs in production on Kubernetes.
Enterprise-grade security
White-glove support
Compute costs (Team plan)
You’re billed only for the container-allocated resources your workflow actually uses, never for idle capacity.
Pay less with an Enterprise plan. Contact us for custom rates.
GPU ($ / Hr)
T4g: $0.0758
T4: $0.1427
L4: $0.1699
A10G: $0.1985
A100: $0.3088
V100: $0.3270
L40S: $0.3671
H100: $0.6880
H200: $0.7912
B200: $1.4242
CPU ($ / Hr)
vCPU: $0.0260
Memory ($ / Hr)
1 GB: $0.0032

“Union.ai has been critical at the time that we needed to significantly scale up... Union.ai’s wealth of expertise has enabled us to move fast and deliver at scale.”
Alborz Alavian
Senior Engineering Manager, Woven by Toyota
Frequently asked questions
Do you offer discounts for startups?
Yes, we do! Talk to our Startup Team to learn more.
Do you offer discounts for NGOs, universities, or non-profits?
Yes, we do! Talk to our Public Sector Team to learn more.
How do monthly credits work for the Team plan?
Your monthly plan fee is issued back to you as usage credits. In practice, this means your monthly plan cost becomes your minimum monthly spend, and you can use that same amount in usage at no additional charge. Any usage beyond that amount is billed separately.
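As a rough illustration of the credit arithmetic described above, here is a minimal Python sketch. It uses the $950 Team plan fee from this page; the function name `monthly_invoice` and the returned field names are hypothetical, and the sketch assumes credits apply only within the month they are issued, which this page does not specify.

```python
PLAN_FEE = 950.00  # Team plan monthly fee in USD, issued back as usage credits


def monthly_invoice(usage: float, plan_fee: float = PLAN_FEE) -> dict:
    """Split one month's charges into credits consumed and overage.

    Hypothetical helper for illustration only, not part of any Union SDK.
    """
    if usage < 0:
        raise ValueError("usage must be non-negative")
    credits_used = min(usage, plan_fee)   # usage covered by the plan fee
    overage = max(0.0, usage - plan_fee)  # usage billed separately
    return {
        "credits_used": credits_used,
        "overage": overage,
        "total": plan_fee + overage,      # the plan fee is the minimum spend
    }
```

Under these assumptions, $400 of usage in a month still totals $950, while $1,200 of usage totals $1,200: $950 covered by credits plus $250 billed separately.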
What is an action?
An action is an individual execution of a task. It represents a specific invocation of a task with particular inputs. If a task runs multiple times (such as inside a loop) you’ll see multiple actions, one for each invocation.
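The idea of one action per invocation can be sketched in plain Python. This is not the Union or Flyte SDK: the `task` decorator and the `actions` list below are hypothetical stand-ins that simply record each call.

```python
import functools

actions = []  # one record per task invocation (hypothetical bookkeeping)


def task(fn):
    """Hypothetical decorator that records every call as a separate action."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        actions.append({"task": fn.__name__, "inputs": args})
        return fn(*args, **kwargs)
    return wrapper


@task
def square(x: int) -> int:
    return x * x


# Calling the same task three times in a loop yields three actions,
# each tied to its own inputs.
results = [square(i) for i in range(3)]
```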
Can my team have a forward-deployed engineer (FDE) from Union.ai to help us build?
Yes, this is available as part of a Union Enterprise plan. Book a consultation with an engineer to learn more.
Do you offer a self-hosted control plane as a deployment option?
Yes, this is one of the deployment options available as part of a Union Enterprise plan. Self-hosted deployment supports on-premises, hybrid, and air-gapped configurations. Book a consultation with an engineer to learn more.
How do you calculate GPU, CPU, and memory hours of usage?
We report the allocated resources (CPU, memory, and GPU accelerator) from each container running the actions within your workflows and apply usage-based pricing down to the second. We do not include the resources consumed by any other services. Therefore, if you run Union on a shared Kubernetes cluster, you pay only for the resources used by your Union tasks and workflows.
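To make the per-second calculation concrete, here is a minimal Python sketch that prices a single container using the Team plan list rates from this page. The function `action_cost` and its parameters are hypothetical, and the sketch assumes each GPU rate applies per accelerator and that cost scales linearly with allocated seconds.

```python
# Team plan list rates in USD per hour, copied from the pricing table on this page.
GPU_RATES = {
    "T4g": 0.0758, "T4": 0.1427, "L4": 0.1699, "A10G": 0.1985, "A100": 0.3088,
    "V100": 0.3270, "L40S": 0.3671, "H100": 0.6880, "H200": 0.7912, "B200": 1.4242,
}
VCPU_RATE = 0.0260       # per vCPU
MEMORY_GB_RATE = 0.0032  # per GB of memory


def action_cost(seconds, vcpus=0.0, memory_gb=0.0, gpu_type=None, gpu_count=0):
    """Estimate the cost of one container from its allocated resources.

    Hypothetical helper: assumes GPU rates are per accelerator and that
    billing is prorated to the second.
    """
    hourly = vcpus * VCPU_RATE + memory_gb * MEMORY_GB_RATE
    if gpu_type is not None:
        hourly += gpu_count * GPU_RATES[gpu_type]
    return hourly * seconds / 3600  # prorate the hourly rate to seconds
```

For example, a container allocated 4 vCPUs, 16 GB of memory, and one A100 for 30 minutes would come to about $0.23 under these assumptions.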
Is Union.ai a SaaS service?
No. You deploy the Union operator into a Kubernetes cluster you manage, which securely communicates with the Union control plane to poll for work. Your workflow executions, code, images, data, logs, and secrets all remain in your VPC/cloud, and are inaccessible to Union.
What’s the difference between action concurrency and actions/run (i.e., task fanout)?
Fanout is the total number of actions a run creates, while concurrency is how many of those actions are running at the same time. For example, a run might fan out to 50,000 actions but only execute around 100 of them concurrently.
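The distinction can be sketched with Python's standard `asyncio` library. This is an analogy rather than how Union schedules work: `run_with_limit` is a hypothetical function in which the number of coroutines launched plays the role of fanout and a semaphore plays the role of the concurrency limit.

```python
import asyncio


async def run_with_limit(total_actions: int, max_concurrency: int) -> tuple[int, int]:
    """Launch total_actions coroutines, at most max_concurrency at a time.

    Returns (completed, peak_running). Illustrative only.
    """
    limit = asyncio.Semaphore(max_concurrency)
    running = 0
    peak = 0
    completed = 0

    async def action(i: int) -> None:
        nonlocal running, peak, completed
        async with limit:            # wait here if the limit is reached
            running += 1
            peak = max(peak, running)
            await asyncio.sleep(0)   # stand-in for real work
            running -= 1
            completed += 1

    # Fanout: every action is created up front.
    await asyncio.gather(*(action(i) for i in range(total_actions)))
    return completed, peak
```

Calling `asyncio.run(run_with_limit(50, 5))` completes all 50 actions while never running more than 5 at once.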
Can I run Union.ai in my own cloud environment?
Yes, Union.ai supports bring-your-own-cloud (BYOC) deployments. You can run it in your own AWS, GCP, Azure, or neo-cloud environment while maintaining full control over your data, security, and infrastructure.