6 results
- Apache Spark (Featured) · data-utilities · Distributed data processing engine with MLlib for large-scale data transformation and feature engineering pipelines. · Medium setup · CPU Only · Commercial use · Apache 2.0 · ★ 40k · Updated 28d ago
- Pandas (Featured) · data-utilities · Essential Python data manipulation library for structured data cleaning, transformation, and analysis in offline pipelines. · Medium setup · CPU Only · Commercial use · BSD · ★ 44k · Updated 1mo ago
- Polars (Featured) · data-utilities · Lightning-fast DataFrame library in Rust with lazy evaluation for efficient data preprocessing before model training. · Medium setup · CPU Only · Commercial use · MIT · ★ 30k · Updated 26d ago
- DataCleaner · data-utilities · Open-source data cleaning and preprocessing tool with automated detection of missing values, outliers, and duplicates. · Medium setup · CPU Only · Commercial use · MIT · ★ 1.2k · Updated 5mo ago
- DuckDB · data-utilities · In-process analytical database for fast SQL queries on local data files, ideal for offline data preparation pipelines. · Low setup · CPU Only · Commercial use · MIT · ★ 7.2k · Updated 27d ago
- DVC (Data Version Control) · data-utilities · Version control system for ML datasets and models enabling reproducible data pipelines in air-gapped environments. · Medium setup · CPU Only · Commercial use · Apache 2.0 · ★ 13k · Updated 28d ago
Offgrid AI tools · Updated daily