Technical report v2.0
Travel AI Benchmark 2026: ChatGPT, Perplexity, and tripbot under stress test
Comprehensive AI benchmark with 150 real DACH travel requests across six categories. Evaluated for accuracy, actionability, and response time, transparently and reproducibly.
Released
14 February 2026
Subject
Travel intelligence v2.1
Dataset
N = 150 queries
Abstract
This report evaluates deterministic accuracy with complex travel requirements
Benchmark performance
Higher is better
tripbot (state of the art): 94.5%
Perplexity: 87.1%
ChatGPT: 81.6%
The results show
Technical evaluation
Detailed metrics about
| Model | Overall | Constraint match | Actionability | Latency (s) |
|---|---|---|---|---|
| tripbot | 94.50 | 95.80 | 96.00 | 0.23 |
| ChatGPT | 81.57 | 77.17 | 81.35 | 0.46 |
| Perplexity | 87.07 | 92.92 | 86.40 | 1.55 |
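As a quick cross-check of the table, the following minimal Python sketch ranks the three models by their overall score and formats each score to one decimal place, which matches the percentages shown in the performance chart. The `Result` class and the helper names `rank_by_overall` and `headline_percent` are illustrative assumptions, not part of the report. The values are copied from the table, and nothing here reproduces how the overall score itself was computed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    """One row of the metrics table (field names are illustrative)."""
    model: str
    overall: float
    constraint_match: float
    actionability: float
    latency_s: float


# Values copied verbatim from the table above.
RESULTS = [
    Result("tripbot", 94.50, 95.80, 96.00, 0.23),
    Result("ChatGPT", 81.57, 77.17, 81.35, 0.46),
    Result("Perplexity", 87.07, 92.92, 86.40, 1.55),
]


def rank_by_overall(results):
    """Return the rows sorted from highest to lowest overall score."""
    return sorted(results, key=lambda r: r.overall, reverse=True)


def headline_percent(result):
    """Format the overall score to one decimal place, as in the chart."""
    return f"{result.overall:.1f}%"


ranking = rank_by_overall(RESULTS)
fastest = min(RESULTS, key=lambda r: r.latency_s)

for row in ranking:
    print(f"{row.model}: {headline_percent(row)} overall, {row.latency_s:.2f} s")
print(f"Lowest latency: {fastest.model}")
```

Sorting by a different column only requires changing the `key` function, for example `lambda r: r.constraint_match`.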
Methodology
Our test environment is
Sampling data
150 anonymized German travel prompts, segmented into 6 travel cases (flight, hotel, package holiday, visa, weather, inspiration).
Evaluation protocol
Two-stage blind rating process with