How to Evaluate Chat Quality Using Standard NLP Benchmarks
02. March 2022
Until now, language model-based assessments of chatbots have only produced a score for overall quality, often neglecting the context of the dialogue. With models that are trained on GLUE tasks, this has come to an end....
read more