Not every moment deserves your best accuracy. Here is what processing more than 1,000 hours of live sport taught us about where accuracy actually matters.
Most highlight automation chases a single number: overall accuracy. After processing more than 1,000 hours of live sport across many different sports, Zentag AI found that the number that actually matters is accuracy on the few moments that decide a match, not accuracy averaged across everything that happens in it.
That distinction sounds small. In practice it reframes the entire problem.
Across more than 1,000 hours of live sport spanning 50+ sports, Zentag AI reaches up to 99% detection accuracy on the key moments that decide a match, the goals, wickets, dunks and tries, not as an average across everything that happens.
The dataset, and why scale is the authority
Zentag has processed over 1,000 hours of live sports footage across 50+ sports with very different match lengths, from formats that resolve in minutes to formats that unfold over days. Breadth is the point. A finding drawn from a single league tells you about that league. A finding that holds across 50+ sports, each with its own rhythm, scoring pattern and definition of a decisive event, is far more likely to generalize.
That scale is what gives the lessons below their authority. They are not a thesis about one sport. They are what surfaced when the same real-time detection pipeline met the full variety of how sport is actually played.
Lesson 1: Accuracy is not one number
The instinct in highlight automation is to report a single accuracy figure and treat it as the headline. The dataset says that figure is misleading, because detection needs are sport-specific and action-specific.
Consider how differently the decisive moment behaves across formats:
- A goal is a discrete, rare, score-changing event.
- A wicket is a state change buried in a long, slow-tempo format where most deliveries are non-events.
- A dunk is a high-frequency moment inside a fast-transition game.
These are not the same detection problem in different sports; they reward completely different sensitivities, and a single blended number papers over all of it.
This is the core lesson from more than 1,000 hours of footage: a single overall accuracy figure is the wrong benchmark for highlight detection, because the difficulty and the importance of a moment depend entirely on the sport and the specific action.
Lesson 2: Concentrate accuracy where it counts
If accuracy is not one number, the real question becomes where to spend it. Not every moment in a match matters equally. The goals, wickets, dunks, tries and turning points carry the match; the long stretches between them mostly do not.
So Zentag concentrates detection on those decisive moments rather than diluting effort evenly across every event. On those key moments specifically, it reaches up to 99% detection accuracy.
This is also why a blended average is the wrong way to report the metric: a system can post a strong overall average and still miss the one goal that decided the match. Zentag reaches up to 99% detection accuracy specifically on the key moments that decide a match, the goals, wickets, dunks and tries, not as an average across everything that happens. That finding comes from more than 1,000 hours of live sport. Putting accuracy where it counts is the insight here, not a side effect of it.
Lesson 3: Real time changes the problem
Detecting highlights after the final whistle is a comparatively forgiving task. You have the whole match, you can look backwards, and nothing is waiting on you. Detecting from a live RTMP or HLS broadcast as it happens is a different problem, because "good enough" now includes "in time to be useful."
Real-time detection is what makes instant catch-up live recaps possible: a viewer who joins late can be brought up to speed while the match is still going, not after it ends. Working live also disciplines the system. There is no second pass, so the decision about whether a moment matters has to be right the first time.
Lesson 4: Breadth forces generalization
Single-sport tooling can quietly bake in assumptions: how often scoring happens, how long a match runs, what a replay-worthy moment looks like. Spanning 50+ sports with very different rhythms and match lengths strips those assumptions out, because every one of them breaks for some sport on the list.
A detection approach validated across 50+ sports with very different match lengths has been forced to confront edge cases that single-sport tooling never has to meet. That is precisely what makes the up-to-99% on key moments figure credible: it comes from a pipeline tested across that breadth, not tuned inside a single comfortable format.
From detection to output
Detection is the hard part, but it is not the payoff. The payoff is output. The mechanic is deliberately simple: connect one live RTMP or HLS stream, and detection, clipping, one-click vertical and square reframing, and instant recaps run automatically, with no manual timeline.
That simple workflow is what turns detection into leverage. A lean team connects a single live stream and the pipeline produces publish-ready clips across more sports and more formats than manual editing could ever reach. Zentag processes about 10x faster than manual editing, so a lean team produces far more publish-ready output across more sports and more formats. The story is amplified output, not cost-cutting: the same people reaching more moments, in more formats, on more sports.
What this means for the field
Treated honestly, real-time highlight detection is a prioritization problem, not a brute-force accuracy race. The teams that win are not the ones chasing the highest blended average; they are the ones who know which moments decide a match and put their accuracy there.
That reframing is the practical takeaway from the dataset. Define the decisive moment per sport, detect it in real time, and let a simple one-stream-in pipeline turn that detection into more published output. As live sport keeps fragmenting across formats and channels, the advantage goes to systems built around the moments that matter, not the average of all of them.
About Zentag AI
Zentag AI is a Berlin-based company founded in 2025, building real-time highlight detection for live sport. Its pipeline works from a live RTMP or HLS broadcast across 50+ sports, with automatic key-moment detection, one-click vertical and square reframing, and instant catch-up live recaps. More at Zentag AI.
Top comments (0)