The performance on tau-bench is very poor.

#11
by Saoyu - opened

Have you run any tests on tau-bench? I tried it and found the results quite disappointing. Has the official team evaluated the model on tau-bench, and do any prompts need to be modified to improve its performance?
