With Marqo Cloud you can build, deploy and scale end-to-end vector search applications in minutes, without the need to create and maintain complex infrastructure. Just choose your preferred storage and inference types, and how many instances you require. Vector generation, storage and retrieval are handled for you.
Plug and play functionality
End-to-end vector creation and storage
Horizontally scalable
Model customisation
Apache 2.0 licence
Community support
All the functionality of open source
Fully managed
CPU and GPU instances
24/7/365 dedicated support
Scale at the click of a button
Access control
High availability
Low latency
When you use Marqo, you are billed on an hourly basis for the resources you use. Usage is rounded up to 15-minute increments.
Vector search allows you to search documents, images and other data by converting items into a collection of vectors. This collection of vectors summarises the data in semantic form and allows us not only to match documents against queries through analysis of the semantic content, but also to understand where and how the document matched the query. With Marqo, inference to create the vectors is included.
The number of instances you will need depends on a number of factors. The number of documents, the size of the documents and the type of data (image vs text). When dealing with low search volumes that primarily involve text or when low latency is not crucial, using CPU inference nodes can be a cost-effective solution. On the other hand, GPU inference nodes provide a significant performance boost when indexing and searching with images and are recommended for indexing large datasets and processing high volume, low latency searches. For multimodal models marqo.CPU.large is recommended as a minimum.
The estimates for storage capacity provided in our calculator assume your are using a model that produces 768 dim. vectors.
The only changes you need to make are to update your URL and API key when accessing Marqo.
You will be billed at the end of the month for total inference and shard hours used. Usage is rounded up to 15-minute increments.
Discover the transformative capabilities of Marqo. Schedule your personalized demo now and elevate your workflow.
Book Demo