Overview

Plain English

Universal LLM Deployment Engine with ML Compilation.

Technical

Universal LLM Deployment Engine with ML Compilation.

Technical scorecard

License Apache 2.0

Commercial use Yes

OpenAI-compatible API No

REST API No

Fine-tuning support No

Quantization support No

Docker available No

GUI / no-code available No

Telemetry None

Offline after setup Yes

Data & Privacy

After setup, this listing is marked as usable offline. Confirm network behavior against the upstream project before regulated deployment.

Not verified in this directory yet. Review the upstream docs for persistence, logs, and workspace storage.

Commercial use is marked as allowed or likely allowed by the listed license.

None

Last verified: May 16, 2026. Maintainer verification should be treated as directory guidance, not legal advice.

Setup & Installation

Low

GUI, low-resource, or simple install path likely available.

Python, Bare Metal, Embedded / Edge

# Start with the official project documentation
# https://github.com/mlc-ai/mlc-llm

Hardware Requirements

RAM8 GB minimum / 16 GB recommended

Hardware tagsNVIDIA GPU (CUDA), Apple Silicon (Metal), Low-resource (< 8GB RAM)

Model formatsNot specified

Primary languagePython

Works Well With

You might also evaluate

Ollama Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other mode... llama.cpp LLM inference in C/C++.... vLLM A high-throughput and memory-efficient inference and serving engine for LLMs.... LocalAI LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any...