Simple pricing for complex AI systems.

Monthly credits. Predictable costs. No surprises.

Team

$950 / mo + usage
*$950 credited to usage monthly

Chat with us

For teams building AI from experiment to production.

1,000 concurrent actions

Workflows can make on-the-fly decisions at runtime with real-time logic, conditions, and retries.

30-day data retention

Workflows can automatically recover from interruptions and continue where they left off.

1 cluster

Test and debug tasks in your local environment using the same Python SDK that runs in production on Kubernetes.

Enterprise

Custom
Significant volume discounts

For organizations building mission-critical AI, ML, and agents.

Significant volume discounts

Build and scale dynamic AI/ML workflows using Flyte’s open-source platform and community.

Custom concurrent actions

Workflows can make on-the-fly decisions at runtime with real-time logic, conditions, and retries.

Custom retention options

Workflows can automatically recover from interruptions and continue where they left off.

3+ clusters

Test and debug tasks in your local environment using the same Python SDK that runs in production on Kubernetes.

Enterprise-grade security

White-glove support

Compute costs (Team plan)

You’re billed only for the container-allocated resources your workflow actually uses, never for idle capacity.

Pay less with an Enterprise plan. Contact us for custom rates.

GPU     $ / Hr
T4g     $0.0758
T4      $0.1427
L4      $0.1699
A10G    $0.1985
A100    $0.3088
V100    $0.3270
L40S    $0.3671
H100    $0.6880
H200    $0.7912
B200    $1.4242

Resource   Unit    $ / Hr
CPU        vCPU    $0.0260
Memory     1 GB    $0.0032

“Union.ai has been critical at the time that we needed to significantly scale up... Union.ai’s wealth of expertise has enabled us to move fast and deliver at scale.”

Alborz Alavian

Senior Engineering Manager, Woven by Toyota

Compare plans & features

Feature                          Team                                   Enterprise
Volume discounts
Monthly usage credit             $950                                   Custom
Data retention                   30 days                                1 year
Infra-aware orchestration
Dynamic, Python-based workflows
Real-time inference              Ultra-low latency                      Ultra-low latency
Integrated cost tracking
Reusable containers
Price/action                     $0.0075                                Custom
Debugging & error handling       Advanced debugging, automatic retries  Advanced debugging, automatic retries
Output caching
Task-level overrides
Live remote debugging
Actions/run (fanout)             25,000                                 50,000+
Concurrent actions               1,000                                  Custom
Clusters                         1                                      3+
Multi-cloud
Multi-region
SSO                              Standard (OIDC)                        Custom (OIDC, SAML/p)
Role-based access control        Standard                               Fine-grained, custom
Managed secrets                  Securely stored in your cloud          Securely stored in your cloud
VPC                              Union-managed                          Union-managed or self-managed
White-glove onboarding
Dedicated support                1 business day Sev 1                   Custom white-glove support

Frequently asked questions

Yes, we do! Talk to our Startup Team to learn more.

Yes, we do! Talk to our Public Sector Team to learn more.

Your monthly plan fee is issued back to you as usage credits. In practice, this means your monthly plan cost becomes your minimum monthly spend, and you can use that same amount in usage at no additional charge. Any usage beyond that amount is billed separately.
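
The credit arithmetic can be sketched in a few lines of Python. This is an illustration only: the function name `monthly_bill` is hypothetical, and the sketch assumes each month is settled on its own, with no credit carried over.

```python
def monthly_bill(plan_fee: float, usage: float) -> float:
    """Total charge for one month (USD).

    The plan fee is returned as usage credit, so usage up to the fee
    costs nothing extra; only usage beyond the fee is billed on top.
    """
    overage = max(0.0, usage - plan_fee)
    return plan_fee + overage


# Team plan fee of $950 with two different usage levels.
print(monthly_bill(950.0, 600.0))   # usage covered by the credit
print(monthly_bill(950.0, 1200.0))  # $250 of usage beyond the credit
```

Under these assumptions, $600 of usage still costs $950 (the minimum monthly spend), while $1,200 of usage costs $1,200 in total.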

An action is an individual execution of a task. It represents a specific invocation of a task with particular inputs. If a task runs multiple times (such as inside a loop) you’ll see multiple actions, one for each invocation.
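
As a rough illustration in plain Python (not the Union or Flyte SDK), the sketch below counts one action per task invocation. The `task` decorator and the `actions` counter are hypothetical stand-ins, and applying the Team plan price/action of $0.0075 to every action is an assumption made for the example.

```python
from collections import Counter

PRICE_PER_ACTION = 0.0075  # Team plan price/action listed on this page (USD)

actions = Counter()  # hypothetical bookkeeping: task name -> number of actions


def task(fn):
    """Hypothetical decorator that records one action per invocation."""
    def wrapper(*args, **kwargs):
        actions[fn.__name__] += 1
        return fn(*args, **kwargs)
    return wrapper


@task
def square(x: int) -> int:
    return x * x


# Calling the task inside a loop produces one action per iteration.
results = [square(i) for i in range(4)]

total_actions = sum(actions.values())
action_cost = total_actions * PRICE_PER_ACTION
print(actions["square"], action_cost)
```

Here one task produces four actions because it is invoked four times with different inputs.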

Yes, this is available as part of a Union Enterprise plan. Book a consultation with an engineer to learn more.

Yes, this is one of the deployment options available as part of a Union Enterprise plan. Self-hosted deployment supports on-prem, hybrid, and airgapped configurations. Book a consultation with an engineer to learn more.

We measure the resources (CPU, memory, and GPU accelerator) allocated to each container that runs actions in your workflows and bill that usage by the second. Resources consumed by any other services are not included, so if you run Union on a shared Kubernetes cluster, you pay only for the resources used by your Union tasks and workflows.
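
A back-of-the-envelope estimate of one container's compute charge might look like the sketch below. It uses the Team plan rates listed on this page. The function `container_cost` is hypothetical, and the sketch assumes the CPU, memory, and GPU rates simply add up and that the GPU rate applies per device.

```python
# Team plan hourly rates from this page (USD).
VCPU_PER_HOUR = 0.0260       # per vCPU
MEMORY_GB_PER_HOUR = 0.0032  # per GB of memory
GPU_PER_HOUR = {             # per GPU (per-device pricing is an assumption)
    "T4": 0.1427,
    "A100": 0.3088,
    "H100": 0.6880,
}


def container_cost(seconds, vcpus, memory_gb, gpu=None, gpu_count=0):
    """Estimate the charge for one container, prorated to the second."""
    hourly = vcpus * VCPU_PER_HOUR + memory_gb * MEMORY_GB_PER_HOUR
    if gpu is not None:
        hourly += gpu_count * GPU_PER_HOUR[gpu]
    return hourly * seconds / 3600


# A container with 4 vCPUs, 16 GB of memory, and one A100, running for 30 minutes.
cost = container_cost(seconds=1800, vcpus=4, memory_gb=16, gpu="A100", gpu_count=1)
print(f"${cost:.4f}")
```

The hourly rate in this example is 4 × $0.0260 + 16 × $0.0032 + $0.3088 = $0.4640, so half an hour comes to about $0.2320.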

No. You deploy the Union operator into a Kubernetes cluster you manage, which securely communicates with the Union control plane to poll for work. Your workflow executions, code, images, data, logs, and secrets all remain in your VPC/cloud, and are inaccessible to Union.

Fanout is the total number of actions a run creates, while concurrency is how many of those actions are running at the same time. For example, a run might fan out to 50,000 actions but only execute around 100 of them concurrently.
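
The distinction can be illustrated with plain `asyncio` (not the Union SDK). In this sketch, `run` is a hypothetical function that creates `fanout` dummy actions and uses a semaphore so that at most `concurrency` of them are in flight at once. The numbers are scaled down for the example.

```python
import asyncio


async def run(fanout: int, concurrency: int) -> tuple[int, int]:
    """Execute `fanout` dummy actions, at most `concurrency` at a time.

    Returns (actions completed, peak number of actions running together).
    """
    limit = asyncio.Semaphore(concurrency)
    running = 0
    peak = 0
    completed = 0

    async def action(i: int) -> None:
        nonlocal running, peak, completed
        async with limit:  # wait here if the concurrency cap is reached
            running += 1
            peak = max(peak, running)
            await asyncio.sleep(0)  # stand-in for real work
            running -= 1
            completed += 1

    await asyncio.gather(*(action(i) for i in range(fanout)))
    return completed, peak


completed, peak = asyncio.run(run(fanout=500, concurrency=100))
print(f"fanout={completed}, peak concurrency={peak}")
```

All 500 actions complete, but the semaphore ensures that no more than 100 are running at any moment.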

Yes, Union.ai supports bring-your-own-cloud (BYOC) deployments. You can run it in your own AWS, GCP, Azure, or neo-cloud environment while maintaining full control over your data, security, and infrastructure.

Need a sanity check? Walk us through what you’re building.