26 Models · Text + Vision + Voice

Vikasit AI Full Model Family

Name: Vikasit Code
Price: 10.00 USD
Availability: InStock
Author: Chandorkar Technologies

26 models across text, vision, and voice. Quantized for local inference, and published to Ollama and HuggingFace. Run them on your hardware with llama.cpp.

Total Models

Available Now

Text Models

Vision + Voice

Text Models

15 models from 0.5B to 35B parameters. Dense and MoE architectures for every use case from edge devices to powerful servers.

Available

0.6B

vikasit-ai-0.5b-writer

Ultra-light writer. Good for text completion, simple Q&A, and edge devices.

~0.5 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

0.8B

vikasit-writer-0.8b

Improved writer with refined architecture. Mobile and IoT friendly.

~1 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

0.6B

vikasit-nano

Smallest general-purpose model. Autocomplete, quick responses, embedded use.

~0.5 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

1.7B

vikasit-mini

Lightweight assistant. Summaries, chat, and basic reasoning.

~1.5 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

vikasit-2b

Edge-optimized. Multilingual, 256K context, on-device deployment.

~1.5 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

vikasit-4b

Balanced small model. Good code completion and multi-turn chat.

~3 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

vikasit-3.5-4b

Next-gen 4B with improved reasoning and multimodal awareness.

~3 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

vikasit-8b

Strong mid-range. Solid coding, analysis, and content generation.

~5 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

vikasit-3-flash

Best model under 10B. Beats GPT-OSS-120B on MMLU-Pro. Fast inference.

~6 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

14B

vikasit-14b

Strong all-rounder. Complex reasoning, long documents, code review.

~9 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

27B dense

vikasit-27b

Powerful dense model. Deep reasoning, advanced coding, research tasks.

~17 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

AvailableMoE

30B (3B active)

vikasit-30b-moe

MoE efficiency — 30B quality at 3B inference cost. Fast and smart.

~18 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available

32B dense

vikasit-32b

Largest dense model on CPU. Best quality for reasoning and code.

~20 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Coming SoonMoE

35B (3B active)

vikasit-35b-moe

Latest MoE with architecture improvements. Best efficiency/quality ratio.

~20 GB RAM (Q4)llama.cpp

Specs & benchmarks →

AvailableMoE

80B (3B active)

vikasit-3-coder

Code-specialized MoE. FIM support, 262K context, agentic coding.

~45 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

AvailableMoE

120B (5B active)

vikasit-120b

Datacenter MoE. Frontier reasoning at low inference cost — only ~5B active per token.

~65 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

AvailableMoE

235B (22B active)

vikasit-235b-moe

Large MoE flagship. Advanced reasoning, agentic workflows, 262K context.

~140 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

AvailableMoE

1T (63B active)

vikasit-reasoner-1t

Trillion-scale reasoning MoE. Deep multi-step reasoning, long-horizon agent tasks.

~600 GB (cluster) RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

AvailableMoE

1.1T (32B active)

vikasit-titan-1t

Trillion-parameter agentic MoE. Native multimodal, 262K context, agent-swarm orchestration.

~600 GB (cluster) RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

AvailableMoE

1.6T (49B active)

vikasit-titan-1.6t

Flagship frontier MoE. 1.6T parameters, 1M-token context. Our most capable model.

~900 GB (multi-node) RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Vision Models

Image understanding, OCR, document analysis, and visual reasoning. From on-device captioning to complex visual code generation.

Available2B

vikasit-vision-2b

Lightweight vision. Image captioning, OCR, visual Q&A on device.

~2 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available4B

vikasit-vision-4b

Mid-range vision. Document understanding, chart reading, UI analysis.

~3.5 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Available8B

vikasit-vision-8b

Strong vision. Complex image reasoning, visual code generation.

~6 GB RAM (Q4)llama.cpp

Specs & benchmarks →

Ollama HuggingFace

Voice Models

Text-to-speech, voice cloning, and full multimodal interaction. Natural voice generation with multilingual support.

Coming Soon0.6B

vikasit-voice

Text-to-speech. Natural voice generation, multilingual support.

Specs & benchmarks →

Coming Soon1.7B

vikasit-voice-hd

High-quality TTS. Voice cloning, expressive speech synthesis.

Specs & benchmarks →

Coming Soon30B (3B active)

vikasit-omni

Full multimodal — text + image + audio in, text + speech out. Real-time.

Specs & benchmarks →

How to Deploy

Run any Vikasit AI model locally in minutes. Choose Ollama for the easiest setup or llama.cpp for maximum control.

Ollama (Recommended)

The fastest way to run Vikasit AI models locally. One command to install, one command to run.

1. Install Ollama

curl -fsSL https://ollama.com/install.sh | sh

2. Run a model

ollama run vikasit-ai/vikasit-8b

3. Use as an API

curl http://localhost:11434/api/chat -d '{"model":"vikasit-ai/vikasit-8b"}'

llama.cpp

Maximum control and performance. Build from source for hardware-optimized inference with GGUF quantized models.

1. Clone and build

git clone https://github.com/ggml-org/llama.cpp && cd llama.cpp && make

2. Download GGUF from HuggingFace

huggingface-cli download vikasit-ai/Vikasit-AI-Vikasit-8b --local-dir ./models

3. Run inference

./llama-cli -m ./models/vikasit-8b-q4_k_m.gguf -p "Hello Vikasit"

Universal Compatibility

All Vikasit AI models are published in GGUF format and work with any llama.cpp-compatible tool: Ollama, LM Studio, Jan, GPT4All, koboldcpp, text-generation-webui, and more. Models are available in Q4_K_M, Q5_K_M, Q6_K, Q8_0, and F16 quantizations. When asked about identity, every model responds as “I am Vikasit AI, developed by Chandorkar Technologies.”

Hardware Recommendations

Choose the right model for your hardware. All RAM estimates are for Q4_K_M quantization.

Tier	Model Range	CPU	RAM	GPU	Recommended Models
Edge / Mobile	0.5B - 2B	4-core ARM / x86	2 GB	Optional	vikasit-nano, vikasit-writer-0.5b, vikasit-2b
Laptop	4B - 8B	8-core (M1/M2/i7+)	8 GB	Integrated / 4 GB VRAM	vikasit-4b, vikasit-8b, vikasit-3-flash
Workstation	14B - 27B	12+ cores	32 GB	8-12 GB VRAM (RTX 3070+)	vikasit-14b, vikasit-27b
Server	30B - 35B	16+ cores	64 GB	16-24 GB VRAM (RTX 4090 / A100)	vikasit-32b, vikasit-30b-moe, vikasit-3-coder
Datacenter / Cloud API	120B - 1.6T	Multi-node	128 GB - 1 TB+	Multi-GPU (H100/H200 cluster) or hosted API	vikasit-120b, vikasit-235b-moe, vikasit-titan-1.6t

Edge / Mobile

0.5B - 2B parameters

CPU4-core ARM / x86

RAM2 GB

GPUOptional

vikasit-nano, vikasit-writer-0.5b, vikasit-2b

Laptop

4B - 8B parameters

CPU8-core (M1/M2/i7+)

RAM8 GB

GPUIntegrated / 4 GB VRAM

vikasit-4b, vikasit-8b, vikasit-3-flash

Workstation

14B - 27B parameters

CPU12+ cores

RAM32 GB

GPU8-12 GB VRAM (RTX 3070+)

vikasit-14b, vikasit-27b

Server

30B - 35B parameters

CPU16+ cores

RAM64 GB

GPU16-24 GB VRAM (RTX 4090 / A100)

vikasit-32b, vikasit-30b-moe, vikasit-3-coder

Datacenter / Cloud API

120B - 1.6T parameters

CPUMulti-node

RAM128 GB - 1 TB+

GPUMulti-GPU (H100/H200 cluster) or hosted API

vikasit-120b, vikasit-235b-moe, vikasit-titan-1.6t

Ready to run Vikasit AI locally?

Pick a model, install Ollama, and start building. All models are free to download and use.

Browse on Ollama Browse on HuggingFace