Transcribing sound files (YES!) and then getting the text into ID (How?)
I apologize for this post as I only know half of what I want to know, so I’m posting this as an open-ended question for some brilliant person to come and finish the technique. But first, the background.
Did you know that Adobe Sound Booth CS4 now has a terrific transcribing feature? This means you can import a sound file and the Transcribe command will convert the words in the file into text?
Now, it’s not perfect. And can actually be the source of a fun party game. For instance, can you tell what the following words are supposed to be:
It’s actually the opening words to the latest episode of the InDesign Secrets podcast. And it’s supposed to read:
“…independent resource for all things InDesign-ine-ine-ine-ine-ine. It’s true…”
But the bottom line is that if you import an audio file into Sound Booth and then click the Transcribe button, you get a sort-of transcription of the words.
If you click on each of the words, you will jump to that portion of the audio waveform that contains that text. It’s supposed to make it easier for editors to find things. And (I think) it can be used to create subtitles in Flash videos.
But here’s my problem.
The text is not selectable as one complete story. Each word is only a separate markers in the box.
Now, you can export the text, but the file format is XML.
I know that InDesign can import XML, but when I import the XML file I get the following structure in the XML pane:
And when I open the XML file in a text editor I get:
(The text goes on and on.)
Now, I KNOW there is real text in there, somewhere. I just need someone who understands this stuff to give me some hints as to what I can do to extract it.