DISCOUNT WAVE

25% discount on GEMINI neural networks

And price reductions on other models

Available neural network models

Client type
Individual
Business
Tariff
ELITE
Product
APIDashboard
Currency
RUBCAPS
Cost in rubles
ModelContext size (in tokens)Output size (in tokens)Prompt (per 1M tokens)Image prompt (per 1k tokens)Response (per 1M tokens)
1 050 000128 0003 182,14019 092,86
1 050 000128 000265,1801 591,07
1 050 000128 000530,3603 182,14
4 095100 0001 060,7104 242,86
400 000128 000185,6301 485
400 000128 00079,550477,32
272 000128 000848,5701 591,07
400 000128 000132,5901 060,71
* Our markup on these prices is 5%, which is included in the cost of packages except Basic (Premium and higher)

LLM Request

Cost of a single request in the dashboard
All tariffs
Used tokens + 0.01 $per 1 request
Special attention: The use of Easy Writer is charged differently. For each text generation, Easy Writer charges an additional 0.1 $ per request + the token cost as specified above for a regular LLM request.

Tools Tariffing

A tool is a function on our server side that a model can call upon request. The execution result is sent to the model along with all previous context. Such resends can occur multiple times, which increases the request cost proportionally. If a tool is free for us — it is free for the user; if it is paid — the cost is also passed on to the user.
Link Analysis (URL, YouTube)
Extracts content from URL and embeds it in the prompt.0.01 Caps per 1 character
GitHub
Search for information in GitHub repositories.Can be called multiple times, cost depends on repository size.
Maximum number of searches
Called up to specified number of times.Each search returns up to 10 results.
Web Search, Legal Search
Limited by «Max. number of searches» value.Cost depends on number of searches.
Image Generation
Generates images from text description.Uses «Nano Banana 2». Cost according to tariffing.
Document Creation
Creates documents in various formats.Free, but consumes significant amount of context.
Scientific Articles Search
Limited by «Max. number of searches» value.Cost depends on number of searches. Summarization and bibliography incur additional costs.

Image Generation

Cost of a single generation by models
MidJourney — Relax
26 000 CAPS / 4,09 ₽ For 1 generation
MidJourney — Fast
52 000 CAPS / 8,17 ₽ For 1 generation
MidJourney — Turbo
104 000 CAPS / 16,34 ₽ For 1 generation
GPT Image 2 - Square
272 CAPS / 0,04 ₽ For 1 generation
GPT Image 2 - Portrait
408 CAPS / 0,06 ₽ For 1 generation
GPT Image 2 - Landscape
400 CAPS / 0,06 ₽ For 1 generation
Nano Banana — Pro
100 800 CAPS / 15,84 ₽ For 1 generation
Nano Banana — 2
112 000 CAPS / 17,6 ₽ For 1 generation
Nano Banana
29 025 CAPS / 4,56 ₽ For 1 generation
Flux
16 666 CAPS / 2,62 ₽ For 1 generation
Stable Diffusion
39 375 CAPS / 6,19 ₽ For 1 generation

Video Generation

Cost of creating one second of video
GoogleVeo
168 750 Caps / 26.52 ₽per 1 second
Runway
30 000 Caps / 4.71 ₽per 1 second
Sora
337 500 Caps / 53.04 ₽per 1 second
Kwaivgi
189 000 Caps / 29.70 ₽per 1 second
For video generation in 1080p quality using veo-3, an additional charge of +20% is added

Speech Synthesis

Cost of one speech synthesis
TTS
11 250 Caps / 1.77 ₽per 1 000 characters
TTS HD
27 225 Caps / 4.28 ₽per 1 000 characters

Transcription

The cost of one transcription
AssemblyAI — nano
2 000 Caps / 0.314 ₽Per 1 minute
AssemblyAI — best
5 500 Caps / 0.864 ₽Per 1 minute
A fixed surcharge on all requests: $0.05 per request, $0.10 for files over 50 MB, $0.50 for files over 500 MB

Embeddings

Model embeddings available through our API.
Cost in CapsCost in dollars
ModelEmbedding dimensionPrompt cost (per 1 token)Prompt cost (per 100,000 tokens)
text-embedding-3-largeThe most efficient embedding model
3 0720,120,16
text-embedding-3-smallIncreased performance compared to the 2nd generation ada embedding model
1 5360,020,02
text-embedding-ada-002-2models-page.additional-costs.embedding.text-embedding-ada-002-2
15 00020 000
text-embedding-3-largeThe most efficient embedding model
3 072Embedding dimension
0,12Prompt cost (per 100,000 tokens)
0,16Prompt cost (per 100,000 tokens)
text-embedding-3-smallIncreased performance compared to the 2nd generation ada embedding model
1 536Embedding dimension
0,02Prompt cost (per 100,000 tokens)
0,02Prompt cost (per 100,000 tokens)
text-embedding-ada-002-2models-page.additional-costs.embedding.text-embedding-ada-002-2
Embedding dimension
15 000Prompt cost (per 100,000 tokens)
20 000Prompt cost (per 100,000 tokens)

What are Caps?

Caps is the internal currency of the service, used to measure the cost of requests and responses of neural networks. It is fixed and depends on the model complexity: number of parameters, multimodality, and overall power.

    For example:
  • ChatGPT-3.5 — ~1 Caps per token
  • ChatGPT o1-Pro — ~400+ Caps per token
The higher your tariff, the better the price: 1 million Caps is cheaper on Elite than on Basic.

Still have questions?

What are tokens?

Tokens are units of text processing by the neural network, representing parts of words, entire words, or punctuation marks that determine the cost of requests.

How long will 1 million tokens last?

One million tokens of the GPT-4o model are enough to rewrite “The Brothers Karamazov” by F. M. Dostoevsky.

What to do if I run out of tokens?

Purchase additional Caps in your personal account — https://bothub.chat/profile

Why does the neural network pretend to be another?

The neural network does not know what model it is if it is not specified in the system prompt. The “self-identification” of the model without instruction is influenced by many factors, one of them being the model's data training set.

What is context in a neural network?

Context is the amount of information that the neural network retains in memory during a dialogue, affecting the coherence of responses and understanding of previous requests.

What is the context of different neural network models?

GPT o1 Pro and Claude 3.7 Sonnet support up to 200K tokens, Gemini 2.5 Pro works with 1KK, while Gemini 2.0 Pro supports up to 2KK tokens.

What file formats do models read?

Neural networks process TXT, PDF, DOCX, XLSX, CSV, JSON, XML, HTML, as well as images JPG, PNG, and audio files MP3, MP4.

Can neural networks be used for free?

There are free models with the postfix “:free” and “-exp” that can be used for free through a mini-window on the main page, as well as the model page.

How do neural network models differ from each other?

Models differ in the volume of training data, context size, processing speed, specialization in specific tasks, and ability to work with multimodal content.

How to use models via API?

To integrate models into your applications, you need to obtain an API key in your personal account. More details can be found here: https://bothub.chat/api/documentation/ru.

Can neural networks be used to automate business processes?

Neural networks effectively automate routine tasks of document management, data processing, customer support, and analytics, integrating with existing business systems via API.

Support ServiceOpen from 10:00 to 18:00 MSK
Available neural network models :: BotHub in Russia