Skip to main content

Pricing

Flexible Pricing
for Teams and Innovators of Every Scale

Our pricing is designed for clarity: easy to start and scalable for growth.
The Enterprise plan is fully customizable to align with your specific use cases.

Starter

Simple way to get started. Limited access to the platform to discover its potential value.

$0/month

Create an account
  • $25 Free Credit
  • 1 Workspace
  • 1 Data Vault
  • 1 User
  • 1 API Key
  • Access To Lite AI models
  • No credit card required

Pro

Recommended

Tailored for businesses and collaborative teams. Experience limitless capabilities.

$45/month+ Usage Fees

Create an account
  • Unlimited Workspaces
  • Unlimited Data Vaults
  • Unlimited Users
  • Unlimited API Keys
  • Access To All services
  • Access To Premium AI Models
  • Access To Speech Models
  • Access To Tools
  • Usage Based Pricing
  • Chat and Email Customer Support

Enterprise

For Teams Exceeding 10 Members. Optimized for Critical Missions. Billed Yearly.

Starts at$350/month+ Usage Fees

Contact Us

All Standard Plan Features, plus:

  • Audit Capabilities
  • Model Fine Tuning
  • Dedicated Account Manager
  • Service Level Agreement (SLA)
  • Single Sign-On (SSO) Integration
  • Onboarding Support
  • PO & Invoicing Options
  • High Priority Support
  • Custom Domain
  • Volume Pricing Discounts

Usage Fees

Some fees are presented in micro-dollars (µ$) for clearer representation. One µ$ is equivalent to one-millionth of a dollar.

All-in-one stateful LLM Services

The Stateful LLM Service, an evolution of the Stateless LLM, delivers a harmonized integration of cache, memory, and context, elevating LLM operations and enhancing interaction quality. With this service, concerns like memory management, entity extractions, prompt optimizations, archival, audits, and history — common cross-cutting aspects of AI applications — are effortlessly addressed.

Lite

Leveraging sophisticated models like GPT-3.5 Turbo and Claude Instant, the service can handle context sizes of up to 100k tokens.

  • Input Tokens

    µ$16 per token

  • Output Tokens

    µ$18 per token

Premium

Leveraging sophisticated models like GPT-4 and Claude 2, the service can handle context sizes of up to 200k tokens.

  • Input Tokens

    µ$52 per token

  • Output Tokens

    µ$92 per token

Custom

Harnessing the power of your preferred models or fine-tuned models, it promises unparalleled performance and adaptability.

LLM Services

The Stateless LLM emphasizes streamlined efficiency, focusing purely on immediate interactions without relying on cache, memory, or context.

Lite

Delivering top-notch interaction quality, it utilizes cutting-edge models like GPT-3.5 Turbo and Claude Instant and supports context sizes of up to 100k tokens.

  • Input Tokens

    µ$4 per token

  • Output Tokens

    µ$6 per token

Premium

Delivering top-notch interaction quality, it utilizes cutting-edge models like GPT-4 Turbo and Claude 2 and supports context sizes of up to 200k tokens.

  • Input Tokens

    µ$40 per token

  • Output Tokens

    µ$80 per token

Memory Services

The Memory Services suite provides specialized solutions for data storage and retrieval, focusing on entity-specific contextual information and conversation histories. With performance optimizations and competitive pricing, these services ensure efficient and cost-effective data operations.

Entity Contextual Memory

This service is designed for the efficient storage and retrieval of contextual details related to different entities. By leveraging an integrated cache layer, we ensure optimized data access, thereby boosting performance. Entity Contextual Memory is stored for a period of one year, unless manually deleted.

  • Read Operations

    µ$2 per request

  • Write Operations

    µ$5 per request

Conversations Memory

The Conversations Memory Service allows for the preservation of conversation histories, serving both as a reference and to enhance contextual understanding. Conversations are stored for a period of one year, unless manually deleted.

  • Read Operations

    µ$0.01 per token

  • Write Operations

    µ$0.1 per token

Data Services

The Data Service comprises a suite of tools designed for the efficient ingestion, indexing, and semantic extraction of information.

Cognitive Search Service (RAG)

Leveraging content extraction and analysis methods, vector databases, embedding technologies, hybrid search techniques, and knowledge graphs, it ensures precise and meaningful data retrieval for Retrieval Augmented Generation (RAG).

  • Storage

    $0.50 per GB-month

  • Indexing (Semantic and Full-Text)

    µ$0.2 per token of embedded text

  • Hybrid Search

    µ$10 per query

Frequency Asked Questions

How long is the Starter plan available to me?+

The Starter plan does not have a set expiration, but it lasts until the $25 credit is used up.

Are the current prices subject to change?+

Absolutely. We are still in the process of finalizing our pricing decisions. The rates you see now are preliminary and subject to adjustments.

If I upgrade to the Pro plan, will I lose my remaining credit?+

No, any remaining free credits will carry over and be added to your account when you upgrade.

Do you charge sales or consumption taxes?+

Certain states are subject to a required sales or consumption tax. If your order includes sales tax, VAT, GST, or comparable consumption taxes, you will see the tax amount along with the rate of tax applied in your invoices.

My team needs a custom plan. Can you provide one?+

Sure, that's why we have a custom Enterprise plan. The Enterprise plan is fully customizable to align with your specific use cases. Please write to us at sales@centragate.com with your requirement and we would get back to you asap.