MTEB is a Python framework for evaluating embeddings and retrieval systems for both text and image. MTEB covers more than 1000 languages and diverse tasks, from classics like classification and clustering to use-case specialized tasks such as legal, code, or healthcare retrieval.

You can get started using mteb, check out our documentation.

Overview
📈 Leaderboard The interactive leaderboard of the benchmark
Get Started.
🏃 Get Started Overview of how to use mteb
🤖 Defining Models How to use existing model and define custom ones
📋 Selecting tasks How to select tasks, benchmarks, splits etc.
🏭 Running Evaluation How to run the evaluations, including cache management, speeding up evaluations etc.
📊 Loading Results How to load and work with existing model results
Overview.
📋 Tasks Overview of available tasks
📐 Benchmarks Overview of available benchmarks
🤖 Models Overview of available Models
Contributing
🤖 Adding a model How to submit a model to MTEB and to the leaderboard
👩‍💻 Adding a dataset How to add a new task/dataset to MTEB
👩‍💻 Adding a benchmark How to add a new benchmark to MTEB and to the leaderboard
🤝 Contributing How to contribute to MTEB and set it up for development