Olamide Olaniyan

Posted on May 4

What Facebook Video Transcripts Reveal Before Sales Calls Do

#programming #tutorial #ai #webdev

Most teams treat competitor videos like brand content.

They watch a reel, nod a little, maybe save the link, then move on.

I think that leaves a lot of value on the table.

Because once you stop watching those videos as content and start treating them as data, a more useful question shows up:

what are they saying repeatedly, and what does that tell us about the market right now?

That is where transcripts become interesting.

Not because transcripts are glamorous, but because they make short-form video searchable, classifiable, and diffable.

And once you can do that, you can spot changes in:

positioning
offers
objections
proof points
CTA strategy

Often before those changes become obvious in sales calls or broader market chatter.

This is the workflow I would build if I wanted Facebook reels and video posts to become a structured competitor-intelligence source instead of random tabs in a browser.

Why Transcripts Matter More Than Views

Views tell you reach.

Transcripts tell you messaging.

If I am monitoring competitors, messaging is usually more useful.

I want to know:

which promises keep showing up
which customer pains they repeat
what proof they lead with
whether the CTA changed this week
whether they are moving upmarket, downmarket, or more self-serve

That is what transcripts make easier to capture.

The Workflow I Like

The workflow is simple.

Collect public Facebook video or reel URLs
Pull the transcript for each one
Split transcript text into short statements
Classify those statements into buckets
Compare those buckets across time

You do not need a giant NLP stack to get useful output.

You just need enough structure to stop relying on memory.

JavaScript Version: Transcript Bucketing for Competitor Research

This starts with the Facebook transcript route and then turns transcript text into reusable messaging buckets.

const headers = {
  'X-API-Key': process.env.SOCIAVAULT_API_KEY,
};

async function fetchJson(url) {
  const response = await fetch(url, { headers });
  if (!response.ok) {
    throw new Error(`Request failed with ${response.status}`);
  }
  return response.json();
}

function extractTranscript(data = {}) {
  if (typeof data === 'string') return data;
  if (typeof data.transcript === 'string') return data.transcript;
  if (typeof data.text === 'string') return data.text;
  if (Array.isArray(data.captions)) {
    return data.captions.map(item => item.text || item.caption || '').join(' ');
  }
  return '';
}

function splitStatements(text) {
  return text
    .split(/[.!?\n]+/)
    .map(part => part.trim())
    .filter(Boolean)
    .filter(part => part.length > 20);
}

function classifyStatement(statement) {
  const value = statement.toLowerCase();

  if (/(struggling|frustrated|hard|difficult|problem|stuck|wasting time)/.test(value)) return 'pain';
  if (/(customers|teams|creators|brands|founders|agencies)/.test(value)) return 'audience';
  if (/(results|case study|revenue|saved|grew|increase|improved)/.test(value)) return 'proof';
  if (/(free trial|book a demo|get started|sign up|dm us|click the link)/.test(value)) return 'cta';
  if (/(price|pricing|plan|subscription|monthly|annual|cost)/.test(value)) return 'offer';
  return 'positioning';
}

async function analyzeTranscript(url) {
  const json = await fetchJson(
    `https://api.sociavault.com/v1/scrape/facebook/post/transcript?url=${encodeURIComponent(url)}`
  );

  const transcript = extractTranscript(json.data || {});
  const statements = splitStatements(transcript);
  const buckets = {
    pain: [],
    audience: [],
    proof: [],
    offer: [],
    cta: [],
    positioning: [],
  };

  for (const statement of statements) {
    buckets[classifyStatement(statement)].push(statement);
  }

  return {
    url,
    transcriptLength: transcript.length,
    bucketCounts: Object.fromEntries(Object.entries(buckets).map(([key, value]) => [key, value.length])),
    examples: Object.fromEntries(Object.entries(buckets).map(([key, value]) => [key, value.slice(0, 3)])),
  };
}

analyzeTranscript('https://www.facebook.com/reel/1535656380759655')
  .then(report => console.log(report))
  .catch(error => console.error(error));

This is already enough to make transcript review much more systematic.

Instead of saying "their content feels more salesy lately," I can actually show which buckets increased.

Python Version: Compare Multiple Videos Over Time

Once you want weekly or monthly tracking, Python is a comfortable way to batch it.

import os
import re
import requests
from collections import Counter


HEADERS = {'X-API-Key': os.environ['SOCIAVAULT_API_KEY']}


def fetch_json(url):
    response = requests.get(url, headers=HEADERS, timeout=30)
    response.raise_for_status()
    return response.json()


def extract_transcript(data):
    if isinstance(data, str):
        return data
    if isinstance(data, dict):
        if isinstance(data.get('transcript'), str):
            return data['transcript']
        if isinstance(data.get('text'), str):
            return data['text']
        if isinstance(data.get('captions'), list):
            return ' '.join((item.get('text') or item.get('caption') or '') for item in data['captions'])
    return ''


def split_statements(text):
    parts = re.split(r'[.!?\n]+', text)
    return [part.strip() for part in parts if part.strip() and len(part.strip()) > 20]


def classify_statement(statement):
    value = statement.lower()
    if re.search(r'(struggling|frustrated|hard|difficult|problem|stuck|wasting time)', value):
        return 'pain'
    if re.search(r'(customers|teams|creators|brands|founders|agencies)', value):
        return 'audience'
    if re.search(r'(results|case study|revenue|saved|grew|increase|improved)', value):
        return 'proof'
    if re.search(r'(free trial|book a demo|get started|dm us|click the link|sign up)', value):
        return 'cta'
    if re.search(r'(price|pricing|plan|subscription|monthly|annual|cost)', value):
        return 'offer'
    return 'positioning'


def analyze_transcript(url):
    json_data = fetch_json(
        f'https://api.sociavault.com/v1/scrape/facebook/post/transcript?url={url}'
    )
    transcript = extract_transcript(json_data.get('data') or {})
    statements = split_statements(transcript)

    buckets = Counter(classify_statement(statement) for statement in statements)
    examples = {}
    for bucket in ['pain', 'audience', 'proof', 'offer', 'cta', 'positioning']:
        examples[bucket] = [statement for statement in statements if classify_statement(statement) == bucket][:3]

    return {
        'url': url,
        'transcriptLength': len(transcript),
        'bucketCounts': dict(buckets),
        'examples': examples,
    }


urls = [
    'https://www.facebook.com/reel/1535656380759655',
    'https://www.facebook.com/reel/1535656380759656',
]

reports = [analyze_transcript(url) for url in urls]
print(reports)

That is enough to answer questions like:

did their CTA mix change?
are they leaning harder into proof recently?
are they talking about pricing more often?
did they shift from pain-led messaging to offer-led messaging?

That is useful market intelligence from public video content.

Where This Gets Especially Useful

I like transcript analysis most in four cases.

1. Competitor sales messaging

Watch the words they keep repeating.

2. Offer changes

Notice when more pricing, trial, or package language starts showing up.

3. Objection mining

Competitor videos often answer customer concerns directly.

4. Content planning

You can see which positioning angles the market is saturating.

That helps your own team avoid parroting everyone else.

Honest Alternatives

You can absolutely do this manually.

For a handful of videos, manual review is fine.

But once you are tracking multiple competitors across time, transcripts are what make the workflow scalable.

You can also push this further with embeddings or LLM summarization later. I would just avoid starting there.

The basic transcript bucketing layer is already useful and much easier to trust.

Final Take

Competitor videos are not just content samples. They are messaging assets hiding in plain sight.

Transcripts turn them into something you can actually analyze over time.

If you want the public transcript layer without building the collection plumbing from scratch, SociaVault is a good place to start.

Then do the part that actually matters: classify what they keep saying, compare it across time, and use that signal before the market feels obvious.

That is usually where the edge is.

webdev #python #javascript #marketing #competitiveanalysis

DEV Community