Image Search

Image search (also called reverse image search) lets you find visually similar images or retrieve images using natural-language descriptions. It is powered by vector embeddings and high-performance indexing.

Objective

Build an image catalog, store visual embeddings, and perform semantic search to find the most relevant images.

Prerequisites

  • Deeplake SDK (pip install deeplake) and the AI stack (pip install torch transformers pillow accelerate) — Python SDK tab
  • curl and a terminal (REST API tab)
  • A Deeplake API token.

Set credentials first

export DEEPLAKE_API_KEY="your-token-here"
export DEEPLAKE_WORKSPACE="your-workspace"  # optional, defaults to "default"

Complete Code

import os
import uuid
import torch
from PIL import Image
from deeplake import Client
from transformers import AutoModel, AutoProcessor

# 1. Setup ColQwen3 Visual Encoder
MODEL_ID = "TomoroAI/tomoro-colqwen3-embed-4b"
device = "cuda" if torch.cuda.is_available() else "cpu"

processor = AutoProcessor.from_pretrained(MODEL_ID, trust_remote_code=True)
model = AutoModel.from_pretrained(
    MODEL_ID, trust_remote_code=True, torch_dtype=torch.bfloat16, device_map=device
).eval()

def get_image_embedding(image_path):
    img = Image.open(image_path).convert("RGB")
    inputs = processor.process_images(images=[img])
    inputs = {k: v.to(device) for k, v in inputs.items()}
    with torch.inference_mode():
        out = model(**inputs)
    return out.embeddings[0].cpu().float().numpy()

# 2. Connect to Deeplake (credentials are read from the environment variables set above)
client = Client()

# 3. Create table and ingest images with multi-vector embeddings
client.query("""
    CREATE TABLE IF NOT EXISTS "image_catalog" (
        _id UUID PRIMARY KEY,
        image IMAGE,
        filename TEXT,
        embedding FLOAT4[][]
    ) USING deeplake
""")

ds = client.open_table("image_catalog")

image_paths = ["beach.png", "city.png"]
ds.append({
    "_id": [str(uuid.uuid4()) for _ in image_paths],
    "image": [open(p, "rb").read() for p in image_paths],
    "filename": [os.path.basename(p) for p in image_paths],
    "embedding": [get_image_embedding(p) for p in image_paths],
})
ds.commit()

client.create_index("image_catalog", "embedding")

# 4. Semantic Visual Retrieval
query_text = "sunset over the ocean"
query_inputs = processor.process_texts(texts=[query_text])
query_inputs = {k: v.to(device) for k, v in query_inputs.items()}
with torch.inference_mode():
    query_emb = model(**query_inputs).embeddings[0].cpu().float().numpy().tolist()

# Format multi-vector as PG array literal: {{v1},{v2},...}
emb_pg = "{" + ",".join(
    "{" + ",".join(str(v) for v in row) + "}" for row in query_emb
) + "}"

results = client.query(f"""
    SELECT filename, embedding <#> '{emb_pg}'::float4[][] AS score
    FROM image_catalog ORDER BY score DESC LIMIT 5
""")

for r in results:
    print(f"Match: {r['filename']} (Score: {r['score']:.4f})")
REST API

# Requires: export DEEPLAKE_API_KEY="..." (see quickstart)
# Requires: export DEEPLAKE_ORG_ID="your-org-id"
API_URL="https://api.deeplake.ai"
TABLE="image_catalog"

# 1. Create table
curl -s -X POST "$API_URL/workspaces/$DEEPLAKE_WORKSPACE/tables/query" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $DEEPLAKE_API_KEY" \
  -H "X-Activeloop-Org-Id: $DEEPLAKE_ORG_ID" \
  -d '{
    "query": "CREATE TABLE IF NOT EXISTS \"'$DEEPLAKE_WORKSPACE'\".\"'$TABLE'\" (id SERIAL PRIMARY KEY, filename TEXT, embedding FLOAT4[][]) USING deeplake"
  }'

# 2. Insert metadata + embeddings (placeholder embeddings)
curl -s -X POST "$API_URL/workspaces/$DEEPLAKE_WORKSPACE/tables/query" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $DEEPLAKE_API_KEY" \
  -H "X-Activeloop-Org-Id: $DEEPLAKE_ORG_ID" \
  -d '{
    "query": "INSERT INTO \"'$DEEPLAKE_WORKSPACE'\".\"'$TABLE'\" (filename, embedding) VALUES ($1, $2::float4[][])",
    "params": ["beach.png", "{0.1,0.2,0.3}"]
  }'

curl -s -X POST "$API_URL/workspaces/$DEEPLAKE_WORKSPACE/tables/query" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $DEEPLAKE_API_KEY" \
  -H "X-Activeloop-Org-Id: $DEEPLAKE_ORG_ID" \
  -d '{
    "query": "INSERT INTO \"'$DEEPLAKE_WORKSPACE'\".\"'$TABLE'\" (filename, embedding) VALUES ($1, $2::float4[][])",
    "params": ["city.png", "{0.1,0.2,0.3}"]
  }'

# 3. Search
curl -s -X POST "$API_URL/workspaces/$DEEPLAKE_WORKSPACE/tables/query" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $DEEPLAKE_API_KEY" \
  -H "X-Activeloop-Org-Id: $DEEPLAKE_ORG_ID" \
  -d '{
    "query": "SELECT filename, embedding <#> $1::float4[][] AS score FROM \"'$DEEPLAKE_WORKSPACE'\".\"'$TABLE'\" ORDER BY score DESC LIMIT 5",
    "params": ["{0.1,0.2,0.3}"]
  }'

Step-by-Step Breakdown

1. Multi-Vector Embeddings

ColQwen3 produces one embedding vector per visual token, giving a float4[][] per image (ColBERT-style late interaction). This captures more visual detail than a single vector, improving retrieval quality for complex queries.
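Late-interaction scoring can be illustrated outside the database. The sketch below is plain NumPy, not the engine's own kernels: it computes a ColBERT-style MaxSim score between a query multi-vector and a document multi-vector, where each query token vector finds its best-matching document token vector and the maxima are summed. The toy vectors are made up for illustration.

```python
import numpy as np

def maxsim_score(query_emb: np.ndarray, doc_emb: np.ndarray) -> float:
    """ColBERT-style late interaction: for each query token vector,
    take the max dot product over all document token vectors, then sum."""
    # (num_query_tokens, dim) @ (dim, num_doc_tokens) -> similarity matrix
    sims = query_emb @ doc_emb.T
    return float(sims.max(axis=1).sum())

# Toy multi-vectors: 2 query tokens, 3 document tokens, dim 4
q = np.array([[1.0, 0.0, 0.0, 0.0],
              [0.0, 1.0, 0.0, 0.0]])
d = np.array([[0.5, 0.1, 0.0, 0.0],
              [0.0, 0.25, 0.0, 0.0],
              [0.0, 0.0, 1.0, 0.0]])

print(maxsim_score(q, d))  # 0.5 (best match for q[0]) + 0.25 (for q[1]) = 0.75
```

Because each query token is matched independently, a query like "sunset over the ocean" can match one image region for "sunset" and another for "ocean", which a single pooled vector cannot do.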

2. The Vector Index

client.create_index() builds a deeplake_index on the embedding column. The index uses optimized C++ kernels to compute multi-vector similarity directly where the data resides.

3. Retrieval with <#>

The <#> operator computes similarity between query and stored multi-vectors. Higher scores represent closer matches. Use ORDER BY score DESC to get the top results.
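The operator composes with ordinary SQL, so metadata filters and vector scoring can run in a single query. A minimal sketch, assuming the image_catalog table from the complete code above; the WHERE clause and the short stand-in literal are illustrative, and the string would be passed to client.query() exactly as in the complete code:

```python
# Stand-in for the nested-brace literal built by the formatting helper
# in the complete code (a real query embedding has many more values).
emb_pg = "{{0.1,0.2},{0.3,0.4}}"

# Combine a metadata filter with multi-vector scoring in one statement.
query = (
    "SELECT filename, embedding <#> '" + emb_pg + "'::float4[][] AS score "
    "FROM image_catalog "
    "WHERE filename LIKE '%.png' "
    "ORDER BY score DESC LIMIT 5"
)
print(query)
```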

4. Multi-Vector PG Format

Multi-vector embeddings use nested braces in PostgreSQL: {{0.1,0.2},{0.3,0.4}}. The formatting helper converts the 2D array returned by the model into this literal.
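The formatting helper from the complete code can be factored into a small function and checked on a toy input; a minimal sketch:

```python
def to_pg_multivector(rows) -> str:
    """Format a 2-D array (one vector per token) as a PostgreSQL
    nested-array literal: {{v11,v12},{v21,v22},...}."""
    return "{" + ",".join(
        "{" + ",".join(str(v) for v in row) + "}" for row in rows
    ) + "}"

print(to_pg_multivector([[0.1, 0.2], [0.3, 0.4]]))  # → {{0.1,0.2},{0.3,0.4}}
```

The outer braces delimit the list of token vectors and each inner pair delimits one vector, matching the float4[][] column type.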

What to try next