AI Tools Directory

6,396 tools

Harmonai's Dance Diffusion logo

Harmonai's Dance Diffusion

Open-Source AI Audio Generation Tool For Music Producers – Weights & Biases

Free
Taskbase logo

Taskbase

Virtual assistants packaged with AI powered software.

Freemium
FramePack logo

FramePack

next-frame prediction neural network structure that generates videos progressively

Freemium
Emergent Mind logo

Emergent Mind

The latest AI news, curated & explained by GPT-4.

Freemium
The Chinese Book for Large Language Models logo

The Chinese Book for Large Language Models

An Introductory LLM Textbook Based on [*A Survey of Large Language Models*](https://arxiv.org/abs/2303.18223).

Freemium
BUILD GPT: HOW AI WORKS logo

BUILD GPT: HOW AI WORKS

explains how to code a Generative Pre-trained Transformer, or GPT, from scratch.

Freemium
Alexander Rush Series logo

Alexander Rush Series

high quality and educational materials you don't want to miss.

Freemium
Arthur Shield logo

Arthur Shield

A paid product for detecting toxicity, hallucination, prompt injection, etc.

Paid
Weights & Biases logo

Weights & Biases

Machine learning experiment tracking, dataset versioning, hyperparameter search, visualization, and collaboration

Freemium
Guardrails.ai logo

Guardrails.ai

A Python library for validating outputs and retrying failures. Still in alpha, so expect sharp edges and bugs.

Freemium
Tune Studio logo

Tune Studio

Playground for devs to finetune & deploy LLMs

Freemium
WHOOPS! logo

WHOOPS!

a benchmark dataset testing AI's ability to reason about visual commonsense through images that defy normal expectations

Freemium
We-Math logo

We-Math

a benchmark that evaluates large multimodal models (LMMs) on their ability to perform human-like mathematical reasoning.

Freemium
VisualWebArena logo

VisualWebArena

a benchmark designed to assess the performance of multimodal web agents on realistic visually grounded tasks.

Freemium
TAT-DQA logo

TAT-DQA

a large-scale Document Visual Question Answering (VQA) dataset designed for complex document understanding, particularly

Freemium
SuperLim logo

SuperLim

a Swedish language understanding benchmark that evaluates natural language processing (NLP) models on various tasks such

Freemium
SuperBench logo

SuperBench

a benchmark platform designed for evaluating large language models (LLMs) on a range of tasks, particularly focusing on

Freemium
SciBench logo

SciBench

benchmark designed to evaluate large language models (LLMs) on solving complex, college-level scientific problems from d

Freemium
PubMedQA logo

PubMedQA

a biomedical question-answering benchmark designed for answering research-related questions using PubMed abstracts.

Freemium
OlympicArena logo

OlympicArena

a benchmark for evaluating AI models across multiple academic disciplines like math, physics, chemistry, biology, and mo

Freemium
MMToM-QA logo

MMToM-QA

a multimodal question-answering benchmark designed to evaluate AI models' cognitive ability to understand human beliefs

Freemium
MMedBench logo

MMedBench

a benchmark that evaluates large language models' ability to answer medical questions across multiple languages.

Freemium
MathEval logo

MathEval

a comprehensive benchmarking platform designed to evaluate large models' mathematical abilities across 20 fields and nea

Freemium
LLMEval logo

LLMEval

focuses on understanding how these models perform in various scenarios and analyzing results from an interpretability pe

Freemium