More information: https://blog.nilenso.com/blog/2025/09/25/swe-benchmarks/ to_complete
Agentic / voice benchmarks
- EVA-Bench — end-to-end evaluation of voice agents across enterprise domains (airline, ITSM, healthcare HRSD)
More information: https://blog.nilenso.com/blog/2025/09/25/swe-benchmarks/ to_complete