Meituan LongCat team releases VitaBench
2025-10-20 17:05:06

Meituan's LongCat team today officially released VitaBench (Versatile Interactive Tasks Benchmark), a large-scale agent evaluation benchmark designed to mirror real-life scenarios and target complex problems. Built on three high-frequency real-life scenarios (ordering takeout, dining in restaurants, and traveling), VitaBench provides an interactive evaluation environment with 66 tools and comprehensive task design across these scenarios.