Question 1

How much does Vikasit 35B MoE cost?

Accepted Answer

Vikasit 35B MoE is an open-weight model built on Qwen3.6-35B-A3B (Apache 2.0). Self-hosting the weights is free under the Apache 2.0 licence — you pay only for the hardware or cloud GPUs you run it on. Typical deployment fits the memory profiles listed in the hardware section above.

Question 2

Is Vikasit 35B MoE open weight?

Accepted Answer

Yes. Vikasit 35B MoE is built on Qwen3.6-35B-A3B (Apache 2.0) and distributed under the Apache 2.0 licence, so the weights are openly available for self-hosting, fine-tuning, and commercial use, subject to the upstream licence terms.

Question 3

How do I run Vikasit 35B MoE?

Accepted Answer

Because Vikasit 35B MoE is open weight, you self-host it with any OpenAI-compatible inference server (such as vLLM or SGLang) loaded with the Qwen3.6-35B-A3B (Apache 2.0) weights, then call it with the OpenAI SDK by setting the base URL to your own endpoint.

Question 4

What context window does Vikasit 35B MoE support?

Accepted Answer

Vikasit 35B MoE supports a 262K native, ~1M via YaRN context window. It is a 35B total (3B active) Mixture-of-Experts (hybrid Gated DeltaNet + Gated Attention) model — full specifications are listed in the table above.

Benchmark	Score	Notes
MMLU-Pro	85.2
GPQA-Diamond	86.0
LiveCodeBench v6	80.4
SWE-bench Verified	73.4
AIME 2026	92.7	AIME 2025 not published
MATH-500	N/A
IFEval	N/A

Precision	Memory	Notes
bf16	~70 GB	all experts resident
INT4	~20 GB	RTX 4090 / single GPU

Vikasit 35B MoE

Overview

Specifications

Capabilities

Benchmarks

Hardware & deployment

Quick start

Limitations

Vikasit 35B MoE FAQ

How much does Vikasit 35B MoE cost?

Is Vikasit 35B MoE open weight?

How do I run Vikasit 35B MoE?

What context window does Vikasit 35B MoE support?

License & attribution