Question 1

How much does Vikasit 32B cost?

Accepted Answer

Vikasit 32B is an open-weight model built on Qwen3-32B (Apache 2.0). Self-hosting the weights is free under the Apache 2.0 licence — you pay only for the hardware or cloud GPUs you run it on. Typical deployment fits the memory profiles listed in the hardware section above.

Question 2

Is Vikasit 32B open weight?

Accepted Answer

Yes. Vikasit 32B is built on Qwen3-32B (Apache 2.0) and distributed under the Apache 2.0 licence, so the weights are openly available for self-hosting, fine-tuning, and commercial use, subject to the upstream licence terms.

Question 3

How do I run Vikasit 32B?

Accepted Answer

Because Vikasit 32B is open weight, you self-host it with any OpenAI-compatible inference server (such as vLLM or SGLang) loaded with the Qwen3-32B (Apache 2.0) weights, then call it with the OpenAI SDK by setting the base URL to your own endpoint.

Question 4

What context window does Vikasit 32B support?

Accepted Answer

Vikasit 32B supports a 32K native, 131K via YaRN context window. It is a 32.8B Dense transformer model — full specifications are listed in the table above.

Benchmark	Score	Notes
MMLU-Pro	65.5	base model
GPQA-Diamond	68.4	thinking mode
AIME 2025	72.9	thinking mode
MATH-500	97.2	thinking mode
LiveCodeBench v5	65.7	thinking mode
BFCL v3	70.3	tool-use
IFEval	85.0	strict prompt
HumanEval	N/A

Precision	Memory	Notes
bf16	~66 GB	2× A100 / H100
INT4	~20 GB	RTX 4090

Vikasit 32B

Overview

Specifications

Capabilities

Benchmarks

Hardware & deployment

Quick start

Limitations

Vikasit 32B FAQ

How much does Vikasit 32B cost?

Is Vikasit 32B open weight?

How do I run Vikasit 32B?

What context window does Vikasit 32B support?

License & attribution