AI Tools Directory

1,173 tools

FELM logo

FELM

a meta-benchmark that evaluates how well factuality evaluators assess the outputs of large language models (LLMs).

Freemium
CompMix logo

CompMix

a benchmark evaluating QA methods that operate over a mixture of heterogeneous input sources (KB, text, tables, infoboxe

Freemium
Berkeley Function-Calling Leaderboard logo

Berkeley Function-Calling Leaderboard

evaluates LLM's ability to call external functions/tools.

Freemium
AlpacaEval logo

AlpacaEval

An Automatic Evaluator for Instruction-following Language Models using Nous benchmark suite.

Freemium
MovieLens-1M logo

MovieLens-1M

dataset, embodying varied social traits and preferences.

Freemium
form logo

form

. Please keep the alphabetical order and in the correct category.

Freemium
!["Buy Me A Coffee" logo

!["Buy Me A Coffee"

](https://www.buymeacoffee.com/filipecalegario)

Freemium
![Stargazers over time logo

![Stargazers over time

](https://starchart.cc/filipecalegario/awesome-generative-ai)

Freemium
Frea Buckler ~ Artist logo

Frea Buckler ~ Artist

obras usadas para criar essa rede [(19) derrick has started yet another project on Twitter: "Just sent @buntworthy a dem

Freemium
Confluence logo

Confluence

a generative art project by Devi Parikh on BrainDrops.

Freemium
Computer Vision Art Gallery : CVPR 2021 logo

Computer Vision Art Gallery : CVPR 2021

artworks dealing with computer vision technologies

Freemium
LAION logo

LAION

Large-scale Artificial Intelligence Open Network

Freemium
Carolina logo

Carolina

General Corpus of Contemporary Brazilian Portuguese with provenance and typology information - Corpus Geral do Português

Freemium
Taskbase logo

Taskbase

Virtual assistants packaged with AI powered software.

Freemium
M3CoT logo

M3CoT

a benchmark that evaluates large language models on a variety of multimodal reasoning tasks, including language, natural

Freemium
OneKE logo

OneKE

A bilingual Chinese-English knowledge extraction model with knowledge graphs and natural language processing technologie

Freemium
AutoGen | Microsoft logo

AutoGen | Microsoft

multi-agent conversation framework as a high-level abstraction by Microsoft [[github](https://github.com/microsoft/autog

Freemium
ChatArena logo

ChatArena

building multi-agent environments for LLMs

Freemium
Eden AI logo

Eden AI

provides a unique API connected to the AI engines

Freemium
LiveBench logo

LiveBench

A Challenging, Contamination-Free LLM Benchmark.

Free
Evaluating LLMs is a minefield logo

Evaluating LLMs is a minefield

talk by Princeton professor Arvind Narayanan

Freemium
LLM Use Case Leaderboard logo

LLM Use Case Leaderboard

a leaderboard that features LLM use cases.

Freemium
LMExamQA logo

LMExamQA

a leaderboard that benchmarks foundation models with Language-Model-as-an-Examiner.

Freemium
Marvin logo

Marvin

AI engineering framework for building natural language interfaces

Freemium