Skip to content

Core

The open-source Deeplake library provides a multi-modal AI database for dataset management, vector search, and deep learning training. Store datasets locally or on any cloud, stream data to PyTorch/TensorFlow, and run semantic search queries.

Installation

pip install deeplake

Key Features

  • Multi-modal storage: images, embeddings, tensors, text, and more
  • Cloud-native: S3, GCS, Azure, or local filesystem
  • Deep learning integration: native PyTorch and TensorFlow data loaders
  • Vector search: cosine similarity, BM25 text search
  • Version control: built-in data versioning and lineage

Get Started

  • API Reference

    Python API documentation (Dataset, Column, Types, Query).

  • Advanced

    TQL syntax, visualization, best practices.