Question 1

Is Claude 3 Haiku faster than Gemini 1.5 Flash?

Accepted Answer

Speed depends on two separate things: the model's own inference time and the network path between you and the provider. AIPerf runs a live test from your connection and measures both time to first token (TTFT) and tokens per second (TPS) for Claude 3 Haiku and Gemini 1.5 Flash, so the faster model for you depends on your location and current provider load.

Question 2

What is measured when comparing Claude 3 Haiku and Gemini 1.5 Flash?

Accepted Answer

Time to First Token (TTFT, in milliseconds) measures how long until the first token streams back. Tokens Per Second (TPS) measures sustained generation throughput after that. AIPerf also isolates client network latency so you can tell whether a slow response is the model or your connection.

Claude 3 Haiku vs Gemini 1.5 Flash: Which AI Is Faster?

The short answer

What the numbers mean

Other comparisons