Question 1

How much does Vikasit 3 Flash cost?

Accepted Answer

Vikasit 3 Flash is served through the Vikasit AI API on usage-based, pay-as-you-go pricing billed per million input and output tokens — see the Vikasit AI pricing page for current rates. Because it is built on the open-weight Qwen3.5-9B (Apache 2.0), you can also self-host the weights for free under the Apache 2.0 licence and pay only for your own compute.

Question 2

Is Vikasit 3 Flash open weight?

Accepted Answer

Yes. Vikasit 3 Flash is built on Qwen3.5-9B (Apache 2.0) and distributed under the Apache 2.0 licence, so the weights are openly available for self-hosting, fine-tuning, and commercial use, subject to the upstream licence terms.

Question 3

How do I use Vikasit 3 Flash with the OpenAI SDK?

Accepted Answer

The Vikasit AI API is OpenAI-compatible. Point the OpenAI client's base URL at https://api.vikasit.ai/v1, set your Vikasit API key, and pass "vikasit-3-flash" as the model. The quick-start snippet above shows the exact Python call.

Question 4

What context window does Vikasit 3 Flash support?

Accepted Answer

Vikasit 3 Flash supports a 262K native, ~1M via YaRN context window. It is a 9B Hybrid MoE (Gated DeltaNet + Gated Attention + sparse MoE) model — full specifications are listed in the table above.

Benchmark	Score	Notes
MMLU-Pro	82.5
GPQA-Diamond	81.7
LiveCodeBench v6	65.6
IFEval	91.5
HMMT Feb 2025	83.2	AIME 2025 not published; HMMT reported instead
MATH-500	N/A
SWE-bench Verified	N/A

Precision	Memory	Notes
bf16	~18 GB	single GPU
INT4	~5.5 GB	consumer GPU

Vikasit 3 Flash

Overview

Specifications

Capabilities

Benchmarks

Hardware & deployment

Quick start

Limitations

Vikasit 3 Flash FAQ

How much does Vikasit 3 Flash cost?

Is Vikasit 3 Flash open weight?

How do I use Vikasit 3 Flash with the OpenAI SDK?

What context window does Vikasit 3 Flash support?

License & attribution