Question 1

How much does Vikasit Vision 4B cost?

Accepted Answer

Vikasit Vision 4B is an open-weight model built on Qwen3-VL-4B (Apache 2.0). Self-hosting the weights is free under the Apache 2.0 licence — you pay only for the hardware or cloud GPUs you run it on. Typical deployment fits the memory profiles listed in the hardware section above.

Question 2

Is Vikasit Vision 4B open weight?

Accepted Answer

Yes. Vikasit Vision 4B is built on Qwen3-VL-4B (Apache 2.0) and distributed under the Apache 2.0 licence, so the weights are openly available for self-hosting, fine-tuning, and commercial use, subject to the upstream licence terms.

Question 3

How do I run Vikasit Vision 4B?

Accepted Answer

Because Vikasit Vision 4B is open weight, you self-host it with any OpenAI-compatible inference server (such as vLLM or SGLang) loaded with the Qwen3-VL-4B (Apache 2.0) weights, then call it with the OpenAI SDK by setting the base URL to your own endpoint.

Question 4

What context window does Vikasit Vision 4B support?

Accepted Answer

Vikasit Vision 4B supports a 256K native, ~1M expandable context window. It is a 4B Dense ViT + LLM, interleaved-MRoPE, DeepStack multi-level features model — full specifications are listed in the table above.

Benchmark	Score	Notes
MMMU (val)	70.8	thinking
DocVQA (test)	94.2
ChartQA (test)	88.8
MathVista (mini)	79.5
AI2D	84.9
OCRBench	808
RealWorldQA	73.2
Video-MME	68.9	w/o subtitles
TextVQA	N/A	not reported for this series

Precision	Memory	Notes
bf16	~9 GB	single GPU
INT4	~3 GB	laptop GPU

Vikasit Vision 4B

Overview

Specifications

Capabilities

Benchmarks

Hardware & deployment

Quick start

Limitations