Execution Runner

Execute benchmark test cases against the MindSurf API and capture responses

Running

0

Pool Available

0

Pool In Use

0

Total Runs

0

Start New Execution
Configure and start a benchmark execution against the MindSurf API

Uses MindSurf API with session management

Safety Critical(0/3 selected)
CDR00s
RPR00s
HRR00s
Conversational Quality(0/7 selected)
BERTScore00s
Length_Ratio00s
Diversity00s
Conversation_Relevancy00s
Role_Adherence00s
Context_Retention00s
Conversation_Completeness00s
Therapeutic Value(0/3 selected)
CAS00s
Empathy00s
TAS00s
Selected: 0 metrics~0 entries | ~0s

Please select at least one metric to execute

Running Executions
Currently active benchmark runs

No executions running