Core¶
The open-source Deeplake library provides a multi-modal AI database for dataset management, vector search, and deep learning training. Store datasets locally or on any cloud, stream data to PyTorch/TensorFlow, and run semantic search queries.
Installation¶
Key Features¶
- Multi-modal storage: images, embeddings, tensors, text, and more
- Cloud-native: S3, GCS, Azure, or local filesystem
- Deep learning integration: native PyTorch and TensorFlow data loaders
- Vector search: cosine similarity, BM25 text search
- Version control: built-in data versioning and lineage
Get Started¶
-
Python API documentation (Dataset, Column, Types, Query).
-
TQL syntax, visualization, best practices.