The performance on tau-bench is very poor.
#11, opened by Saoyu
Has the official team evaluated the model on tau-bench? I ran it myself and found the results quite disappointing. If you have tested it, do certain prompts need to be modified to improve the model's performance?