Follow instructions.

A 350M-parameter model that listens to your prompts.
Private. In your browser. No data leaves your device.

Scroll to explore
Philosophy

Small models can
follow instructions well.

Dolly Assistant is a fine-tuned language model built on LiquidAI's LFM2.5-350M architecture, trained on 15K hand-written instruction-response pairs from the Databricks Dolly dataset. It classifies, summarizes, extracts, brainstorms — small footprint, broad use.

Everything runs locally via ONNX Runtime and WebGPU. Your prompts never leave your browser. No servers. No logs. No surveillance. Just inference.

Method
01

Client-side
inference

The model runs entirely in your browser using WebGPU acceleration. Zero network calls after the initial load.

02

Quantized
precision

4-bit quantization keeps the model under 200 MB for browser delivery while preserving instruction-following quality.

03

Instruction
fine-tuning

Trained on 15K hand-written instruction-response pairs from Databricks Dolly, covering 8 task categories: QA, classification, summarization, extraction, and more.

"Simplicity is the ultimate sophistication."
Leonardo da Vinci