transcript-corpus

Here are 3 public repositories matching this topic...

Spanish-first bilingual transcript corpus from the Samuel y Audrey YouTube channel, featuring 643 travel video records with Spanish and English transcripts, SRT payloads, metadata, CSV/JSONL exports, schema, citation, license, manifest, checksums, and llms files.

  • Updated May 23, 2026

English transcript corpus from the Samuel & Audrey YouTube travel and food channel, featuring 1,397 full video transcripts and 233,285 cue-level segment records from 2012–2026, with metadata, SRT payloads, CSV/JSONL exports, schema, citation, license, manifest, checksums, and llms files.

  • Updated May 22, 2026
