HF++ Logo

Navigating the Embeddings Landscape 🔍

Cut through the noise and pinpoint the ideal Embedding Model to propel your AI initiative forward.

Embeddings*

ModelSagemaker Endpoint Cost / Month **EC2 Cost / Month ***Model Size (GB)Best ForLanguageTraining DataEmbedding DimensionsMax TokensOpen Source or ProprietaryReleased Year
multilingual-e5-large-instruct1.12Translation, Classification, Textual Similarity1024512Open Source2024
LaBSE1.88Translation1024512Open Source2021
multilingual-e5-large2.24Translation1024512Open Source2023
GritLM-7B14.48Classification, Clustering, Pair Classification, Reranking, Retrieval409632768Open Source2024
voyage-lite-02-instructN/AClassification, Clustering, Pair Classification, Retrieval, Textual Similarity10244000Proprietary2024
GritLM-8x7B93.41Classification, Clustering, Retrieval409632768Open Source2024
e5-mistral-7b-instruct14.22Classification, Clustering, Pair Classification, Reranking, Retrieval, Textual Similarity, Summarization.409632768Open Source2024
Cohere-embed-english-v3.0N/AClassification, Clustering, Retrieval1024512Proprietary2024
text-embedding-3-largeN/AClustering, Reranking, Retrieval, Summarization30728191Proprietary2024
ember-v11.34Clustering, Reranking1024512Open Source2023
UAE-Large-V11.34Pair Classification, Reranking, Textual Similarity300512Open Source2023
mxbai-embed-large-v10.67Pair Classification, Reranking, Textual Similarity, Summarization1024512Open Source2024
bge-large-en-v1.51.34Pair Classification, Reranking, Summarization1024512Open Source2023
mxbai-embed-2d-large-v10.67Textual SimilarityUp to 1024 (user defined)512Open Source2024

* Allowed for commercial usage

** Cost to deploy an embedding model as a single instance of Sagemaker endpoint

*** Cost to an embedding model as a single instance of EC2 endpoint but you need to dockerize the model and perform all the infra related steps

Crafted by seasoned machine learning engineers with extensive backgrounds in top-tier tech companies.

Placeholder Image 1
Placeholder Image 2
Placeholder Image 3

© 2024 Sagify. All rights reserved

Github