Pricing and billing guide for on-demand, usage-based customers
Mode | Description | Good For |
---|---|---|
On-Demand: Get started immediately by creating a Contextual Workspace. New workspaces receive $25 in free credits. | Pay-as-you-go with no commitments or minimums. Purchase credits upfront, which are then drawn down based on your usage at the prices below. Easily graduate to Provisioned Throughput as your needs evolve. | Customers who: are experimenting with, or evaluating, the platform; need the flexibility to scale usage and costs up or down quickly; don’t require SLAs or QPS guarantees. |
Provisioned Throughput: Contact sales (sales@contextual.ai) or your account team. | Purchase model units (MUs) to reserve guaranteed capacity and predictable throughput. | Customers who: need guaranteed and predictable QPS levels to satisfy expected production volumes; require service level agreements (SLAs) to meet production standards. |

/query is the endpoint called when you interact with an Agent. It invokes a multi-step RAG pipeline, and the cost of a given query is the sum of the costs of each step in the pipeline. These steps can be configured via Agent Settings. The average cost per query is ~$0.05.

Step | Price |
---|---|
Query Optimization (reformulation and decomposition) | $1/1M tokens |
Encode (encode the user query for search) | $0.03/1M tokens |
Rerank (rerank the retrieved chunks) | $0.12/1M tokens |
Filter (filter chunks prior to generation) | $1/1M tokens |
Generate (final response generation) | Input: $3/1M tokens; Output: $15/1M tokens |
Groundedness & Safety (post-generation groundedness checks) | $1/1M tokens |
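
Because a query's cost is the sum of its step costs, it can be estimated from per-step token counts. The following is a minimal sketch, not an official calculator: the function name, dictionary keys, and token counts are hypothetical, and this page does not specify how tokens are counted for each step. A step that is disabled in Agent Settings is assumed to contribute nothing and is simply left out of the input.

```python
# Prices in USD per 1M tokens, copied from the table above.
# The dictionary keys are illustrative names, not API identifiers.
PRICE_PER_1M_TOKENS = {
    "query_optimization": 1.00,
    "encode": 0.03,
    "rerank": 0.12,
    "filter": 1.00,
    "generate_input": 3.00,
    "generate_output": 15.00,
    "groundedness_safety": 1.00,
}


def query_cost(tokens_by_step: dict[str, int]) -> float:
    """Estimate the USD cost of one query as the sum of its step costs."""
    total = 0.0
    for step, tokens in tokens_by_step.items():
        total += tokens / 1_000_000 * PRICE_PER_1M_TOKENS[step]
    return total


# Hypothetical token counts for a single query (assumed, not measured).
example_tokens = {
    "query_optimization": 500,
    "encode": 50,
    "rerank": 20_000,
    "filter": 10_000,
    "generate_input": 8_000,
    "generate_output": 500,
    "groundedness_safety": 9_000,
}

example_cost = query_cost(example_tokens)
print(f"Estimated cost: ${example_cost:.4f}")
```

With these assumed counts the estimate lands close to the ~$0.05 average quoted above; real queries will vary with retrieval size and response length.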

Component | Price |
---|---|
Parse: Parse unstructured documents into markdown and/or JSON (Parse API Docs) | Basic: $3/1K pages; Standard: $40/1K pages |
Rerank: Rerank based on natural language instructions (Rerank API Docs) | $0.12/1M tokens |
Generate: Generate using the most grounded language model in the world (Generate API Docs) | Input: $3/1M tokens; Output: $15/1M tokens |
LMUnit: Evaluate LLM output using natural language unit tests (LMUnit API Docs) | $3/1M tokens |
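
The component prices mix two units, pages for Parse and tokens for the other APIs. The sketch below compares the two Parse tiers for the same document set. It assumes cost scales linearly per page (this page does not say whether usage is rounded up to whole blocks of 1K pages), and the function and key names are hypothetical.

```python
# Parse prices in USD per 1K pages, copied from the table above.
PARSE_PRICE_PER_1K_PAGES = {"basic": 3.00, "standard": 40.00}


def parse_cost(pages: int, tier: str) -> float:
    """Estimate the USD cost of parsing `pages` pages at the given tier.

    Assumes linear, per-page pro-rating (an assumption, not confirmed here).
    """
    return pages / 1_000 * PARSE_PRICE_PER_1K_PAGES[tier]


# Hypothetical workload: 250 pages parsed under each tier.
basic_cost = parse_cost(250, "basic")
standard_cost = parse_cost(250, "standard")
print(f"Basic: ${basic_cost:.2f}, Standard: ${standard_cost:.2f}")
```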

Step | Price |
---|---|
Ingestion | $48.50/1K pages |
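
To relate the ingestion price to the credit model described earlier, the sketch below estimates how far a credit balance goes. It assumes per-page pro-rating and that the credits are spent on ingestion only; both are simplifying assumptions, and the function names are hypothetical.

```python
import math

INGESTION_PRICE_PER_1K_PAGES = 48.50  # USD, from the table above
FREE_CREDITS_USD = 25.00              # free credits for new workspaces


def ingestion_cost(pages: int) -> float:
    """Estimate the USD cost of ingesting `pages` pages (assumes linear pricing)."""
    return pages / 1_000 * INGESTION_PRICE_PER_1K_PAGES


def pages_covered(credits_usd: float) -> int:
    """Whole pages that `credits_usd` would cover if spent only on ingestion."""
    return math.floor(credits_usd / INGESTION_PRICE_PER_1K_PAGES * 1_000)


def remaining_credits(credits_usd: float, pages: int) -> float:
    """Credit balance left after ingesting `pages` pages (may go negative)."""
    return credits_usd - ingestion_cost(pages)


free_pages = pages_covered(FREE_CREDITS_USD)
balance_after_200 = remaining_credits(FREE_CREDITS_USD, 200)
print(f"Free credits cover about {free_pages} pages of ingestion")
print(f"Balance after 200 pages: ${balance_after_200:.2f}")
```

In practice queries and component API calls draw from the same credits, so the real page count covered will be lower.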