So I was screwing around with this recording of Coatzospan Mixtec from the UCLA Phonetics Lab Archive:
As usual for this archive, there is a nice tabular transcription:
Here’s the mp3:
(WAV version here.)
But what’s missing is timestamps. There are 73 words, each with a prompt followed by the enunciation of the word. It’s not great audio quality, but the waveform seems clear enough:
I’d like to add the timestamps for each word, but doing that by hand sounds like a grand old pain in ye butt. It would be easier to get some “guesses” for which bits of the audio are words, and then line up the existing transcriptions with those guesses. It will surely need post-editing, but that would still be easier than starting from scratch and selecting each word in the waveform.
So I have been trying (unsuccessfully) to use the Analyze > Label Sounds… function in Audacity. I have also heard that Praat can do this kind of thing, but I’m a bit of a Praat know-nothing, I will admit.
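To make the idea concrete, here is a rough Python sketch of the kind of “guessing” I mean. It is not a tested solution for this recording. It assumes the WAV version is 16-bit PCM, splits the audio into short frames, treats frames whose RMS energy is above a threshold as sound, and merges them into regions separated by long enough silences. The function names, the threshold of 500, and the 300 ms silence gap are all made-up starting values that would need tuning for this audio. The output is tab-separated start, end and label, which is the plain-text format Audacity reads as a label track.

```python
import math
import struct
import wave


def read_mono_samples(path):
    """Read a 16-bit PCM WAV file; return (samples, sample_rate), channel 0 only."""
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        n_channels = wf.getnchannels()
        rate = wf.getframerate()
        raw = wf.readframes(wf.getnframes())
    values = struct.unpack("<%dh" % (len(raw) // 2), raw)
    return list(values[::n_channels]), rate


def find_sound_regions(samples, rate, frame_ms=20, threshold=500.0,
                       min_silence_ms=300, min_sound_ms=100):
    """Return a list of (start_sec, end_sec) guesses for stretches of sound.

    All default values are assumptions to be tuned by ear and eye.
    """
    frame_len = max(1, int(rate * frame_ms / 1000))
    # Mark each frame as loud or quiet based on its RMS energy.
    loud = []
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        rms = math.sqrt(sum(s * s for s in frame) / len(frame))
        loud.append(rms >= threshold)

    max_quiet = max(1, min_silence_ms // frame_ms)  # quiet frames that end a region
    regions, start, quiet_run = [], None, 0
    for idx, is_loud in enumerate(loud):
        if is_loud:
            if start is None:
                start = idx
            quiet_run = 0
        elif start is not None:
            quiet_run += 1
            if quiet_run >= max_quiet:
                # Region ends just after the last loud frame.
                regions.append((start, idx - quiet_run + 1))
                start, quiet_run = None, 0
    if start is not None:
        regions.append((start, len(loud) - quiet_run))

    frame_s = frame_len / rate
    # Drop very short blips (clicks, pops) and convert frames to seconds.
    return [(s * frame_s, e * frame_s) for s, e in regions
            if (e - s) * frame_s * 1000 >= min_sound_ms]


def to_audacity_labels(regions, words):
    """Pair regions with words in order; extra regions get '?' as a label."""
    lines = []
    for i, (start, end) in enumerate(regions):
        label = words[i] if i < len(words) else "?"
        lines.append("%.3f\t%.3f\t%s" % (start, end, label))
    return "\n".join(lines)


# Hypothetical usage (file names are placeholders):
# samples, rate = read_mono_samples("recording.wav")
# regions = find_sound_regions(samples, rate)
# with open("labels.txt", "w", encoding="utf-8") as f:
#     f.write(to_audacity_labels(regions, list_of_transcribed_words))
```

Since each item is a prompt followed by the word itself, I would expect roughly two regions per item if the thresholds are right, so the pairing with the transcription would probably have to skip every other region. That is a guess about how the detection would behave, not something I have checked.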
Anyone have experience with this sort of thing?