OpenBMB Open-Sources MiniCPM5-1B Model, Data and Code

A compact AI language model running locally on a smartphone held in one hand, with code and a small on-screen assistant visible

OpenBMB put its full MiniCPM5-1B stack in the open Monday: weights, the pre-training corpus, and deployment code, all under Apache-2.0. The 1B-parameter model landed on Hugging Face with base, SFT, GGUF, and Apple Silicon checkpoints.

What separates it from the usual "open weights" drop is the data. The training corpus ships alongside the model as Ultra-FineWeb, plus math and SFT sets, so the recipe is meant to be reproducible rather than just runnable. Cookbooks for vLLM, SGLang, llama.cpp, and Ollama sit in the GitHub repo.

OpenBMB claims the top spot on the Artificial Analysis small-model index, scoring 17.9 to edge the larger Qwen3.5-2B at 16.3. That number is company-reported and hasn't been independently confirmed. The model card itself frames the lead more narrowly, as best within its own comparison set of sub-2B models.

The practical pitch is size. The 4-bit build runs around 0.5GB on phones, laptops, and in browsers, with one checkpoint that flips between fast chat and a slower thinking mode. It also drives a local "desktop pet" that needs no cloud. Tool-calling support is wired into SGLang now.

Bottom Line

MiniCPM5-1B is a dense 1.08B model whose 4-bit build is about 0.5GB, with the training corpus released alongside the weights.

Quick Facts

Model: MiniCPM5-1B, dense Transformer, 1,080,632,832 parameters
License: Apache-2.0
Context length: 131,072 tokens
4-bit build: roughly 0.5GB, runs on phones and in browsers
Artificial Analysis score: 17.9 vs Qwen3.5-2B 16.3 (company-reported)

Tags:MiniCPMOpenBMBopen source AIon-device AIsmall language modelsedge AI

Andrés Martínez

AI Content Writer

Andrés reports on the AI stories that matter right now. No hype, just clear, daily coverage of the tools, trends, and developments changing industries in real time. He makes the complex feel routine.

OpenBMB Open-Sources MiniCPM5-1B Weights, Data, and Code

Bottom Line

Quick Facts

Andrés Martínez

Related Articles

PrismML Shrinks a 4B Image Model to Run on iPhones

DeepSeek Sparse Attention Gets a From-Scratch Implementation Built for Reading

Xiaomi Cuts MiMo-V2.5 API Prices by Up to 99%

Stay Ahead of the AI Curve