← All guides

YouTube transcript to ChatGPT: feed any video to an LLM

ChatGPT and Claude can't watch a video — but they're excellent at reading one. Get the transcript, paste it into a chat, and you can summarize, question, and repurpose a 60-minute video in seconds. Here's the exact workflow.

Step-by-step: get a transcript into ChatGPT or Claude

  1. Get the transcript. Paste the video URL into the box on the VidWords homepage and click Get transcript. It works for any public video that has captions — auto-generated or creator-uploaded — including Shorts, youtu.be links, and live replays. You'll see the full text as readable paragraphs with clickable timestamps.
  2. Copy it as plain text. Switch the view to plain text and copy, or use Export → TXT for a clean block with no timecodes or markup. (SRT, VTT, CSV, and JSON exports are there too if you need them, but TXT is what an LLM wants.)
  3. Open ChatGPT or Claude. Start a fresh chat. Any model with a reasonable context window works — ChatGPT, Claude, Gemini, or a local model.
  4. Paste with a prompt. Type your instruction first, then paste the transcript below it. Putting the instruction before the text helps the model frame the task. See the example prompts below.
  5. Refine with follow-ups. Read the answer, then keep asking in the same chat: "expand point 3," "give me the exact quote," "reformat as a checklist." The transcript stays in context, so follow-ups are cheap and accurate.

That's the whole loop. The only thing that ever causes friction is a transcript that's too long for the model's context window — covered further down.

Prompts that work

The transcript is just raw material; the prompt decides what you get out. Here are four that consistently produce good results. Replace [paste transcript here] with the copied text.

1. Summarize:

Below is the transcript of a YouTube video. Give me:
1. A one-sentence TL;DR.
2. 5–8 key points in the order they appear.
Do not add anything that isn't in the transcript.

[paste transcript here]

2. Key takeaways and action items:

From this video transcript, extract every concrete
recommendation, step, tool, or number mentioned, as a
bulleted list of action items. Skip the filler and intros.

[paste transcript here]

3. Ask the video questions:

Here's a video transcript. Answer only from it, and say
"not covered" if the answer isn't there:
- What budget did they recommend?
- List every product they named.
- What was the main counter-argument?

[paste transcript here]

4. Turn it into an article:

Rewrite this video transcript as a clear, structured blog
post with a headline and subheadings. Keep the speaker's
points and examples; drop the spoken filler. Stay faithful
to what was actually said.

[paste transcript here]

The line telling the model not to invent information matters more than it looks — it keeps the output anchored to the real video instead of plausible-sounding guesses. If you want the full repurposing workflow, the turn YouTube videos into blog posts guide goes deeper.

When the video is too long for the context window

A typical talking-head video runs roughly 130–160 words per minute, so a 90-minute podcast can be 13,000+ words. Most chat models handle that fine today, but very long videos — or smaller/older models — can hit the context limit. Two fixes:

You might not need to leave VidWords at all

Pasting into ChatGPT is great when you want full control of the prompt. But VidWords has its own AI summaries and chat built right next to the transcript, so for most "what's in this video?" questions you never have to copy anything. Click AI Summary for the key points, or use the chat panel to ask the video direct questions — answers come straight from the transcript, with the timestamped text sitting right beside them so you can verify any claim in seconds. Because it's built for full transcripts, the long-video context problem above simply doesn't come up. The AI summary guide walks through it, and YouTube video to text covers the extraction side.

A note on privacy

When you paste a transcript into ChatGPT or Claude, the text goes to that provider under their terms — fine for public videos, worth a second thought for anything sensitive. VidWords' transcript extraction is free for 25 videos per month with no account, and the built-in AI keeps the whole flow in one place if you'd rather not spread a transcript across services. Plan details are on the pricing page.

FAQ

Can ChatGPT read a YouTube link directly?

Not reliably. ChatGPT can't watch video, and link-reading is hit-or-miss depending on the model and tools enabled. Pasting the actual transcript text is faster and far more accurate.

What format should I paste — TXT or SRT?

Plain TXT. Timestamps in SRT/VTT add noise and burn tokens without helping the model understand the content. Copy the plain-text view or export TXT.

Is feeding the transcript to an LLM free?

VidWords transcript extraction is free for 25 videos per month with no account. Whatever you do with the text in ChatGPT or Claude depends on your plan with them. VidWords' own AI features run on its credit system — see pricing.

Get a transcript to paste into ChatGPT →