LibriVox Forum Index LibriVoxHome Page - Guides for Listeners & Volunteers (the LibriVox wiki) - Search the Catalog
 
 FAQFAQ   SearchSearch   MemberlistMemberlist   UsergroupsUsergroups   RegisterRegister 
 ProfileProfile   Log in to check your private messagesLog in to check your private messages   Log inLog in 
Help Improve Open Source Speech Recognition
Goto page 1, 2, 3  Next
 
Post new topic   Reply to topic    LibriVox Forum Index -> Suggestions, Comments, News & Discussion
View previous topic :: View next topic  
Author Message
kmaclean



Joined: 02 Mar 2007
Posts: 19

PostPosted: Fri Mar 02, 2007 4:25 pm    Post subject: Help Improve Open Source Speech Recognition Reply with quote

Hi,

I am the admin for the VoxForge project. VoxForge collects transcribed speech audio for the creation of Acoustic Models for use with Open Source Speech Recognition Engines ('SRE's). An Acoustic Model is basically a file that contains the statistical representations of sounds that make up the words in a large corpus of spoken audio. Currently, most Acoustic Models included with Open Source SREs are closed source (i.e. they don't provide the source audio because of license restrictions). VoxForge hopes to address this problem.

We are looking for submissions of audio books to help us with our goal. We are looking for audio books in uncompressed format (i.e. before you compress your audio book to mp3 for submission to Librivox), up to a 48kHz sampling rate at 16 bits per sample. Please consider submitting your ebook to VoxForge.

We've set up a page on the VoxForge site (called Uploads) that allows you to submit your audio books to VoxForge using an FTP client of your choice.

thanks in advance,

Ken
_________________
www.voxforge.org
Back to top
View user's profile Send private message
Starlite
LibriVox Admin Team


Joined: 30 Apr 2006
Posts: 13486
Location: Ontario, Canada

PostPosted: Fri Mar 02, 2007 4:42 pm    Post subject: Reply with quote

Great idea Ken but I'm afraid once a project is in the catalogue, most of us dump the raw, uncompressed files as they are sooo huge and we have limited space on our hard drives.

Will keep it in mind for the future though.

Esther Smile
_________________
“I am only one, but still I am one. I cannot do everything, but still I can do something; and because I cannot do everything, I will not refuse to do something that I can do.” Helen Keller
Back to top
View user's profile Send private message
kayray
LibriVox Admin Team


Joined: 26 Sep 2005
Posts: 9674
Location: San Diego, California

PostPosted: Fri Mar 02, 2007 5:04 pm    Post subject: Reply with quote

It's easy enough to take mp3s and re-convert them to .wav. Feel free to use any of our books, Ken!
_________________
Kara
http://kayray.org/
--------
"Mary wished to say something very sensible into her USB MacMice MicFlex, but knew not how." -- Jane Austen (& Kara)
Back to top
View user's profile Send private message Visit poster's website AIM Address
hugh
LibriVox Admin Team


Joined: 26 Sep 2005
Posts: 7035
Location: Montreal, QC

PostPosted: Fri Mar 02, 2007 5:13 pm    Post subject: Reply with quote

hi ken contacted me about this... i thought we might be able to help... we could just put out a request here for say 20 people to offer up their next recordings in wav?

I'll contirbute one or 2...
_________________
hughmcguire.net |bookoven.com
Back to top
View user's profile Send private message Send e-mail
kayray
LibriVox Admin Team


Joined: 26 Sep 2005
Posts: 9674
Location: San Diego, California

PostPosted: Fri Mar 02, 2007 5:18 pm    Post subject: Reply with quote

Would single chapters be useful, or do you need entire complete books? (Sorry, haven't read your project page yet)
_________________
Kara
http://kayray.org/
--------
"Mary wished to say something very sensible into her USB MacMice MicFlex, but knew not how." -- Jane Austen (& Kara)
Back to top
View user's profile Send private message Visit poster's website AIM Address
kristin
LibriVox Admin Team


Joined: 01 Jun 2006
Posts: 4846
Location: Huntsville, AL

PostPosted: Fri Mar 02, 2007 6:07 pm    Post subject: Reply with quote

I still have the wav files from most of the recordings I've done (at least since I got my new computer.) Definitely, from two of my solo projects and my duet with Kara.
_________________
Whereas story is processed in the mind in a straightforward manner, poetry bypasses rational thought and goes straight to the limbic system and lights it up like a brushfire. It's the crack cocaine of the literary world. - Jasper Fforde
Back to top
View user's profile Send private message Send e-mail Yahoo Messenger
kmaclean



Joined: 02 Mar 2007
Posts: 19

PostPosted: Fri Mar 02, 2007 6:09 pm    Post subject: Reply with quote

Starlite wrote:
Great idea Ken but I'm afraid once a project is in the catalogue, most of us dump the raw, uncompressed files as they are sooo huge and we have limited space on our hard drives.

Will keep it in mind for the future though.

Esther Smile

Hi Esther,

thanks for the reply,

Even a portion of what you recorded would be helpful. We need audio from as many different people, reading as much varied text as possible. We are looking for uncompressed audio because it provides the best quality audio for acoustic model creation.

Ken
_________________
www.voxforge.org
Back to top
View user's profile Send private message
kmaclean



Joined: 02 Mar 2007
Posts: 19

PostPosted: Fri Mar 02, 2007 6:22 pm    Post subject: Reply with quote

kayray wrote:
It's easy enough to take mp3s and re-convert them to .wav. Feel free to use any of our books, Ken!

Hi Kara,

MP3 compressed audio converted to wav is not the best audio for the creation of Acoustic Models for Speech Recognition. The compression introduces some noise that might affect the recognition process. Having said that, it may be 'good enough' for our purposes, and we are creating a sub-project within VoxForge (hopefully through Google Summer of Code) to explore this idea further, and are planning to use Librivox mp3 audio.

However, if we can get uncompressed audio, it would be better. Because Speech Recognition Engines work best to recognize the same type of speech audio (i.e. uncompressed) their Acoustic Models were trained with.

thanks,

Ken
_________________
www.voxforge.org
Back to top
View user's profile Send private message
kayray
LibriVox Admin Team


Joined: 26 Sep 2005
Posts: 9674
Location: San Diego, California

PostPosted: Fri Mar 02, 2007 6:25 pm    Post subject: Reply with quote

I understand. And since random chapters are OK, it'll be easy to upload my chapters to you before I delete the .wavs.
_________________
Kara
http://kayray.org/
--------
"Mary wished to say something very sensible into her USB MacMice MicFlex, but knew not how." -- Jane Austen (& Kara)
Back to top
View user's profile Send private message Visit poster's website AIM Address
kmaclean



Joined: 02 Mar 2007
Posts: 19

PostPosted: Fri Mar 02, 2007 6:32 pm    Post subject: Reply with quote

kayray wrote:
Would single chapters be useful, or do you need entire complete books? (Sorry, haven't read your project page yet)


Hi Kayray,

Any contribution would be greatly appreciated.

We need audio contributions from as many different people as possible (covering different dialects and regions), using different equipment (headset mics, desktop boom mics, etc. on laptops, desktops, using an audio card or usb pod) reading a variety of texts.

thanks,

Ken
_________________
www.voxforge.org
Back to top
View user's profile Send private message
kayray
LibriVox Admin Team


Joined: 26 Sep 2005
Posts: 9674
Location: San Diego, California

PostPosted: Fri Mar 02, 2007 6:57 pm    Post subject: Reply with quote

Four chapters of "Persuasion" uploading now. Hope I followed all the protocol correctly :)
_________________
Kara
http://kayray.org/
--------
"Mary wished to say something very sensible into her USB MacMice MicFlex, but knew not how." -- Jane Austen (& Kara)
Back to top
View user's profile Send private message Visit poster's website AIM Address
kmaclean



Joined: 02 Mar 2007
Posts: 19

PostPosted: Fri Mar 02, 2007 7:17 pm    Post subject: Reply with quote

kayray wrote:
Four chapters of "Persuasion" uploading now. Hope I followed all the protocol correctly Smile


Amazing! thank you so much.

When you get a chance, please include a README file with your submission. It's a bit tedious I know, but it will help with the classification of the your submitted audio. In addition, there are a few academics who interested in this project, and such information would help with their speech recognition research (especially microphone type).

Any feedback on improving the process would be greatly appreciated.

thanks,

Ken
_________________
www.voxforge.org
Back to top
View user's profile Send private message
kayray
LibriVox Admin Team


Joined: 26 Sep 2005
Posts: 9674
Location: San Diego, California

PostPosted: Fri Mar 02, 2007 7:24 pm    Post subject: Reply with quote

I did include a README... the upload won't finish for another hour, at least, so that file may not be on your server yet :)

The instructions were clear and straightforward. I remembered to amend my text files to include EVERY word that I spoke (librivox disclaimer, "end of chapter" etc). You didn't specify any filenaming conventions for the .wavs and textfiles, so I just went with something logical.

I am a passionate believer in open source projects, so I'm delighted to help with yours!
_________________
Kara
http://kayray.org/
--------
"Mary wished to say something very sensible into her USB MacMice MicFlex, but knew not how." -- Jane Austen (& Kara)
Back to top
View user's profile Send private message Visit poster's website AIM Address
ductapeguy
LibriVox Admin Team


Joined: 02 Jan 2006
Posts: 1731
Location: Ontario, Canada

PostPosted: Fri Mar 02, 2007 7:47 pm    Post subject: Reply with quote

What an awesome project. I've often felt that speech recognition is a neglected area of Open Source software. I will contribute what I can.
_________________
Sean McGaughey
Librivox: Catalog | ductapeguy.net-- My music and podcasts
Back to top
View user's profile Send private message Visit poster's website
kmaclean



Joined: 02 Mar 2007
Posts: 19

PostPosted: Fri Mar 02, 2007 9:10 pm    Post subject: Reply with quote

kayray wrote:
I did include a README... the upload won't finish for another hour, at least, so that file may not be on your server yet Smile

The instructions were clear and straightforward. I remembered to amend my text files to include EVERY word that I spoke (librivox disclaimer, "end of chapter" etc). You didn't specify any filenaming conventions for the .wavs and textfiles, so I just went with something logical.

I am a passionate believer in open source projects, so I'm delighted to help with yours!

Sorry, I spoke too soon, I've got everything.

thanks for the submission!

Ken
_________________
www.voxforge.org
Back to top
View user's profile Send private message
Display posts from previous:   
Post new topic   Reply to topic    LibriVox Forum Index -> Suggestions, Comments, News & Discussion All times are GMT
Goto page 1, 2, 3  Next
Page 1 of 3

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum


Powered by phpBB © 2001, 2005 phpBB Group