Convert AI Compute Units: FLOPS, TFLOPS, PFLOPS
Understand and compare AI hardware performance metrics.
About this tool
AI compute is measured in FLOPS (floating-point operations per second) when describing hardware throughput, and in total FLOPs (floating-point operations) when describing training runs. A modern GPU like the NVIDIA H100 delivers 204 TFLOPS of FP16 performance, while training GPT-3 required approximately 3.14 × 10²³ FLOPs. These numbers are hard to grasp on their own; this converter puts them in context by comparing them to real hardware and historical AI milestones.
Quick Fact
Training GPT-3 (175B parameters) required an estimated 3.14 × 10²³ FLOPs — equivalent to running an RTX 4090 at full speed for approximately 120 years.
Common Use Cases
→ Hardware Comparison
Compare the compute of an RTX 4090 vs A100 vs H100 in a common unit to evaluate price-performance.
→ Training Cost Estimation
Convert published training compute (e.g. the 10²⁴–10²⁵ FLOPs estimated for GPT-4) to GPU-hours on specific hardware.
→ Research Papers
Understand compute requirements cited in ML papers and reproduce or compare them to your own hardware.
→ AI Infrastructure Planning
Plan data center GPU clusters by calculating total PFLOPS needed for your training and inference workloads.
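The training-cost conversion in the use cases above can be sketched in a few lines of Python. The peak-TFLOPS table reuses the FP16 figures quoted on this page; the utilization parameter is an illustrative assumption, since real training runs typically achieve only 30–50% of peak.

```python
# FP16 peak throughput per GPU, in TFLOPS (figures quoted on this page).
PEAK_TFLOPS_FP16 = {
    "RTX 4090": 82.6,
    "A100": 77.97,
    "H100": 204.0,
}

def gpu_hours(total_flops: float, gpu: str, utilization: float = 1.0) -> float:
    """GPU-hours needed to deliver total_flops at a fraction of peak speed."""
    effective_flops = PEAK_TFLOPS_FP16[gpu] * 1e12 * utilization
    return total_flops / effective_flops / 3600

# GPT-3's estimated 3.14e23 FLOPs on a single H100 running at peak:
print(f"{gpu_hours(3.14e23, 'H100'):,.0f} GPU-hours")
```

Passing a realistic utilization (e.g. `utilization=0.4`) scales the estimate up accordingly; peak figures are best-case numbers.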
Frequently Asked Questions
What is a TFLOP in AI?
A TFLOP (teraFLOP) is one trillion (10¹²) floating-point operations; a TFLOPS rating means the hardware sustains that many per second. It's the standard unit for measuring AI hardware performance. For example, the NVIDIA H100 GPU delivers 204 TFLOPS in FP16, while the RTX 4090 delivers 82.6 TFLOPS.
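The unit ladder (FLOPS → GFLOPS → TFLOPS → PFLOPS → EFLOPS, each step a factor of 1,000) reduces to a one-line conversion. A minimal sketch, not the converter tool itself:

```python
# FLOPS unit prefixes; each step up the ladder is a factor of 1,000.
UNITS = {"FLOPS": 1.0, "GFLOPS": 1e9, "TFLOPS": 1e12,
         "PFLOPS": 1e15, "EFLOPS": 1e18}

def convert(value: float, src: str, dst: str) -> float:
    """Convert a throughput figure between FLOPS unit prefixes."""
    return value * UNITS[src] / UNITS[dst]

print(convert(204, "TFLOPS", "PFLOPS"))  # H100 FP16 peak expressed in PFLOPS
```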
How much compute is needed to train an LLM?
Training compute scales roughly with model size times training data (a common approximation is ≈ 6 × parameters × training tokens). GPT-3 (175B parameters) required an estimated 3.14 × 10²³ FLOPs. Larger frontier models like GPT-4 are estimated to require 10²⁴–10²⁵ FLOPs. At H100 throughput, that's on the order of thousands of single-GPU years.
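To see where the GPU-years figure comes from, here is a back-of-envelope sketch; the 204 TFLOPS peak is the H100 FP16 figure used on this page, and the 35% utilization default is an illustrative assumption:

```python
SECONDS_PER_YEAR = 365 * 24 * 3600

def gpu_years(total_flops: float, peak_tflops: float = 204.0,
              utilization: float = 0.35) -> float:
    """Single-GPU years to deliver total_flops at the given utilization."""
    effective_flops = peak_tflops * 1e12 * utilization
    return total_flops / effective_flops / SECONDS_PER_YEAR

print(round(gpu_years(3.14e23)))  # GPT-3 scale
print(round(gpu_years(1e25)))     # frontier-model scale
```

In practice this total is spread across tens of thousands of GPUs running for a few months, rather than one GPU for millennia.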
What is the difference between FP16 and FP32 performance?
FP16 (16-bit floating point) lets GPUs perform roughly 2–4× more operations per second than FP32 (32-bit) because each number uses half the bits, halving memory traffic and doubling the work done per register and tensor-core operation. AI training and inference have largely shifted to FP16 and BF16 to exploit this performance advantage.
How does H100 compare to A100?
The NVIDIA H100 SXM delivers approximately 204 TFLOPS in FP16, versus 77.97 TFLOPS for the A100 — about 2.6× more raw compute. The H100 also has faster memory bandwidth (3.35 TB/s vs 2 TB/s) and NVLink interconnect improvements that benefit large model training.
What is a petaFLOP-day?
A petaFLOP-day is a unit of total compute equal to 10¹⁵ floating-point operations sustained for 24 hours, or 8.64 × 10¹⁹ total FLOPs. It's commonly used to measure AI training runs. GPT-3 required approximately 3,640 petaFLOP-days to train.
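The petaFLOP-day arithmetic above is easy to verify with the GPT-3 figure from this page:

```python
# One petaFLOP-day: 1e15 FLOPS sustained for 24 hours.
PFLOP_DAY = 1e15 * 24 * 3600  # = 8.64e19 total FLOPs

def to_pflop_days(total_flops: float) -> float:
    """Express a total compute budget in petaFLOP-days."""
    return total_flops / PFLOP_DAY

print(round(to_pflop_days(3.14e23)))  # GPT-3's estimated training compute -> 3634
```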
Other AI tools
Token Calculator
Estimate token count from text length for any LLM.
Model Size Estimator
Calculate how much GPU memory a model needs based on parameter count.
API Cost Estimator
Estimate LLM API costs based on token usage across major providers.
Context Window Calculator
See how much text fits inside an LLM's context window.