Interviews, insight & analysis on digital media & marketing

From Scale to Precision: Why YouTube Needs a Smarter Matchmaker

By Matthew Taylor, Global Senior Client Partner, TXT Media

On a big match night during the 2026 World Cup, you can almost watch attention move in real time. Fans start on the big screen for the build-up, jump to YouTube on their phones for instant highlights, bounce into creator watch-alongs, then scroll Shorts for reactions before bed.  The same person shifts across screens, formats and mindsets in minutes – yet the ad experience they see along the way can feel wildly inconsistent. Sometimes it is perfectly in the moment; other times, it feels like it belongs to a different day entirely. 

That, for me, captures the YouTube challenge better than any deck. YouTube is no longer just a video platform; it is the connective tissue between live sport, creator culture, music, gaming, education and everyday entertainment. It is where people go to feel part of something – whether that is a global event like the World Cup or a niche community.  The question for brands is not whether to be there, but how to show up in a way that feels natural to the content, the context and the audience’s intent. 

The opportunity is enormous. With billions of monthly viewers and ad revenues that rival, and often outperform, both social and linear TV, YouTube remains one of the most powerful canvases for advertisers.  Yet for all its scale, clients do not always feel in control of where ads appear, what emotional environment surrounds them, or whether the message is truly aligned with the content it runs alongside. After more than 20 years in media across TV, AV, mobile, CTV and YouTube, that tension between scale and suitability has been one of the few constants. 

For years, the market has relied on tools like channel lists, keyword targeting and broad exclusions to manage that tension.  They served a purpose in a simpler, more linear era. A keyword can tell you what a video appears to be about, but it cannot tell you how it feels, what values it reflects, who is present in it, or whether the tone quietly undermines a brand’s message.  In a world of creator content, live streams, Shorts and AI-generated video, those blunt instruments are increasingly out of sync with how people actually consume media.  This is exactly the challenge solutions like Cognify were designed to address: moving from surface-level classification to true contextual understanding of video environments. 

The next wave of YouTube innovation has to focus on what is really happening inside the video.  To move from blunt targeting to genuine precision, content needs to be analysed at a much deeper level: frame by frame, second by second, across multiple dimensions at once.  That means looking at text and speech, visual context, objects and scenes, sentiment and mood, and where appropriate, indicative demographic signals. Only then can advertisers understand not just what a video is labelled as, but what it is actually saying – and how it makes people feel. 

This matters because relevance is rarely binary.  A video can tick every box on a surface-level checklist and still be the wrong environment once you see it in full.  A kids’ brand might avoid explicit content yet end up next to chaotic or anxiety-inducing material.  A premium beauty brand might be technically “brand safe” but running alongside content that jars with its aspirational positioning. Deeper contextual intelligence allows advertisers to identify videos that genuinely match the brief – and just as importantly, to quietly exclude those that do not. 

When this is done well, the way YouTube is planned starts to look very different. Brands can brief not just on reach and demographics, but on the kind of stories, emotions and values they want to sit alongside. They can articulate the role of creative – what it says, who it speaks to, what it represents – and the type of environment that will help it land.  Advanced tools can then curate inventory accordingly and, crucially, provide transparent reporting that shows why specific placements were chosen, not just where spend went. 

This is not about ripping up existing setups or forcing planners to start from scratch. It is about enriching what already works.  At TXT Media, Cognify was built to sit within current YouTube planning and buying workflows rather than replace them.  It adds a more sophisticated layer of contextual understanding, transparency and control over the video environments where campaigns appear.  For advertisers, approaches like this mean stronger brand safety, less wastage and a higher likelihood that creative will be seen in moments where it can genuinely perform.

The same principles apply beyond YouTube. Whether it is location intelligence around events like the World Cup, creator-driven experiences, or CTV and live programming, the conversation is moving away from “how many impressions did we buy?” towards “how intelligently did we curate the environments our brand appeared in?” That is where media feels less like a blunt instrument and more like a carefully designed experience. 

Looking back over two decades in this industry, one thing has remained consistent: advertisers want visibility, control and confidence. They want to know their creative is appearing in the right place, for the right person, at the right moment – not just in aggregate, but in the lived reality of an individual’s media journey. Until now, delivering that consistently on YouTube at scale has been hard. 

That is why the conversation around YouTube needs to move beyond raw reach.  The platform will always be a giant.  What brands need now is not more volume for its own sake, but more intelligence behind it. The future lies in being present in the right context, with the right message, in the moments that matter most – whether that is a World Cup watch-along, a creator’s honest review, or a quiet moment of late-night scrolling.

Opinion

More posts from ->