Skip to main content
⌘K

Nomic Embed

Open-source 8192-token embedding model with 768 dimensions, trained on diverse datasets for robust semantic search.

View on GitHub Official site
Embedding Models Apache 2.0 Medium setup 477 stars

Overview

Plain English

Open-source 8192-token embedding model with 768 dimensions, trained on diverse datasets for robust semantic search.

Technical

Open-source 8192-token embedding model with 768 dimensions, trained on diverse datasets for robust semantic search.

Technical scorecard

License Apache 2.0
Commercial use Yes
OpenAI-compatible API No
REST API No
Fine-tuning support No
Quantization support No
Docker available No
GUI / no-code available No
Telemetry None
Offline after setup Yes

Data & Privacy

Does it send data online?

After setup, this listing is marked as usable offline. Confirm network behavior against the upstream project before regulated deployment.

Does it store history?

Not verified in this directory yet. Review the upstream docs for persistence, logs, and workspace storage.

License checks?

Commercial use is marked as allowed or likely allowed by the listed license.

Telemetry?

None

Last verified: Apr 1, 2025. Maintainer verification should be treated as directory guidance, not legal advice.

Setup & Installation

Medium

A developer can usually get this running with standard docs.

Prerequisites

Python, Docker, Bare Metal

# Start with the official project documentation
# https://huggingface.co/nomic-ai/nomic-embed-text-v2-moe

Hardware Requirements

RAM8 GB minimum / 16 GB recommended
Hardware tagsCPU Only, NVIDIA GPU (CUDA)
Model formatsSafetensors
Primary languagePython

Works Well With