Build a repeatable pipeline that turns foldered images in cloud storage into searchable vectors. Teams use it to prepare image libraries for fast search and basic anomaly tests by class. It is ideal for product images, farm photos, or satellite scenes.
The flow starts on click, sets cluster details, then checks if a collection exists in Qdrant. If not, it creates a new collection with a named vector and a cosine distance, and builds an index on the label field for quick counts. Next it lists images from Google Cloud Storage using a folder prefix, builds public links, and pulls the class name from each folder. A filter removes one class for testing. The data is split into batches, UUIDs are generated, and the payload is shaped for two APIs. Voyage AI creates embeddings for each batch, and the vectors with payloads are uploaded to Qdrant in one call.
Setup needs accounts for Qdrant, Voyage AI, and Google Cloud Storage. You set the cluster URL, collection name, embedding size, batch size, bucket name, and prefix. Expect a clean, scalable index that cuts manual prep and supports thousands of images. Common uses include visual search, catalog checks, and anomaly testing by class.