YCSEP_v2

This database contains transcript segments with associated audio from v2 of the YouTube Corpus of Singapore English Podcasts, transcribed with a fine-tuned ASR model (Coats et al., forthcoming). You can search by transcript text (e.g. I just think) or POS tags (e.g. PRP RB VB). Use the button to download a CSV for the current page. For more information about the resource, see Coats, Steven, Carmelo Alessandro Basile, Cameron Morin and Robert Fuchs. (2025). The YouTube Corpus of Singapore English Podcasts. English World-Wide. https://doi.org/10.1075/eww.25018.coa. A static version of the corpus is available at doi.org/10.7910/DVN/B7JRID
?
Tip: column filters below persist while you sort and run text search.
Filter by Channel:
Download CSV (Page)
Loading results...
ID▴▾ Channel▴▾ Video ID▴▾ Speaker▴▾ Start▴▾ End▴▾ Audio Text▴▾ POS▴▾