Benchmark Buddy

Created: 2023-11-23
Author: Cavit Erginsoy

Ratings: 0
Category: other
Conversations: 30

Capabilities

  • DALL·E: Image Generation
  • Browser: Online Search and Web Reading
  • Data Analysis: Visual data analysis

Description

An AI assistant for benchmarking community-fine-tuned LLMs. It offers tailored test questions in six areas, along with analysis of model responses.

Prompts

  • Give me two questions for technical explanation testing in LLMs.
  • What questions should I ask for specific general inquiry in models like Llama 2?
  • I need coding questions for a Mistral 7B test.
  • How would you grade this LLM response for creative writing?