AI Score Calculation

Evaluate performance with your own weightings

How does it work?

The AI Score Calculator lets you compute any conversational agent's performance with the standardized AI Score framework. Enter the results from your chatbot's test runs, and the system automatically generates:

  • The final AI Score (in %)
  • The corresponding letter grade (A–E)
  • The visual badge associated with the letter
  • The different criteria used for obtaining the AI Score

The goal

This score summarizes how reliable, consistent, and robust a chatbot is when facing real-world questions and follow-up challenges.

Optionally configure your own weightings to adapt the evaluation to your specific needs and test different scenarios.

Parameter Configuration
Default values: IP=0.7, R=0.2, SCA=0.1, LR=0.25. The sum of IP + R + SCA must equal 1.
Run 1
Question # Initial Correct Follow-up Correct Inconsistency Memory Loss
1
2
3
4
5
6
7
8
9
10
Run 2
Question # Initial Correct Follow-up Correct Inconsistency Memory Loss
1
2
3
4
5
6
7
8
9
10
Run 3
Question # Initial Correct Follow-up Correct Inconsistency Memory Loss
1
2
3
4
5
6
7
8
9
10
Run 4
Question # Initial Correct Follow-up Correct Inconsistency Memory Loss
1
2
3
4
5
6
7
8
9
10
Run 5
Question # Initial Correct Follow-up Correct Inconsistency Memory Loss
1
2
3
4
5
6
7
8
9
10