AI Tools Directory

6,397 tools

OlympicArena

a benchmark for evaluating AI models across multiple academic disciplines like math, physics, chemistry, biology, and mo

Freemium

MMToM-QA

a multimodal question-answering benchmark designed to evaluate AI models' cognitive ability to understand human beliefs

Freemium

MMedBench

a benchmark that evaluates large language models' ability to answer medical questions across multiple languages.

Freemium

MathEval

a comprehensive benchmarking platform designed to evaluate large models' mathematical abilities across 20 fields and nea

Freemium

LLMEval

focuses on understanding how large language models perform in various scenarios and analyzing results from an interpretability pe

Freemium

LawBench

a benchmark designed to evaluate large language models in the legal domain.

Freemium

InfiBench

a benchmark designed to evaluate large language models (LLMs) specifically in their ability to answer real-world coding-

Freemium

FELM

a meta-benchmark that evaluates how well factuality evaluators assess the outputs of large language models (LLMs).

Freemium

PromptFoundry

A simple prompt engineering and evaluation tool for developers building AI applications.

Freemium

PromptHub

Full stack prompt management tool designed to be usable by technical and non-technical team members. Test, version, coll

Freemium

Parea AI

Platform and SDK for AI Engineers providing tools for LLM evaluation, observability, and a version-controlled enhanced p

Freemium

DreamBench++

a benchmark for evaluating the performance of large language models (LLMs) in various tasks related to both textual and

Freemium

CompMix

a benchmark evaluating QA methods that operate over a mixture of heterogeneous input sources (KB, text, tables, infoboxe

Freemium

Berkeley Function-Calling Leaderboard

evaluates LLMs' ability to call external functions and tools.

Freemium

AlpacaEval

An automatic evaluator for instruction-following language models using the Nous benchmark suite.

Freemium

Manag.ai

Your all-in-one prompt management and observability platform. Craft, track, and perfect your LLM prompts with ease.

Freemium

MovieLens-1M

dataset, embodying varied social traits and preferences.

Freemium

BIRME

Bulk Image Resizing Made Easy 2.0 (Online & Free)

Free

Izlo

Prompt management tools for teams. Store, improve, test, and deploy your prompts in one unified workspace.

Freemium