12/28/2023 0 Comments Best audio book ms everThis made it possible to create targeted parsers that could adapt to each book’s idiosyncrasies. To address this, the team searched the collection for large groups of books with a similar look and file format. It became more of an art than a science to find what users would want to hear in a given book.” Though the books display nicely for online readers, they contain all sorts of text you wouldn’t want to hear in your audiobook. “It’s difficult to find even two books in Project Gutenberg that have exactly the same structure. Mark Hamilton, one of the project leads, shares that this was the toughest part. With a high quality text to speech model in hand, the team set out to transcribe as many of Project Gutenberg’s 60,000+ books as possible. The approach uses a deep network that’s trained to mimic the quality and tone of native speakers, can speak a variety of languages, and can even identify and stylize the reading of emotional text. The project uses new advances in neural text to speech to create lifelike voices that sound similar to native human speakers. “With this new technology, our partners were able to create audiobooks of vastly better quality much faster than ever before.” “We had tried to make audiobooks in the past, but the quality just wasn’t very good so we abandoned the effort,” says Project Gutenberg CEO Greg Newby. Project Gutenberg, the oldest online e-book library with over 60,000 works, is acutely aware of these challenges. Humans know to skip page numbers, tables of contents, and footnotes, but algorithms must be clever to avoid these pitfalls. Furthermore, it’s hard for algorithms to understand what to read from an e-book. Automated audiobook production offers a promising alternative, but has historically been plagued with clunky, robotic narration. With book publication rates on the rise, creators are hunting for faster solutions. Recording professional human readers can be time-consuming and costly, requiring hundreds of hours of reading time per book. However, creating audiobooks isn’t quite as easy as pressing play. No matter whether you are learning to read, looking for inclusive reading technology, or about to head out on a long drive, audiobooks can be a great resource. Harnessing AI to Scale Audiobook Production Freeman (MIT), seeks to democratize access to literature to include individuals with visual impairments, language learners, children, and those who simply prefer to listen to their books. This initiative, led by Mark Hamilton (MIT) and Brendan Walsh (Microsoft), along with supervising professor William T. The project leverages new advancements in human-like neural text to speech to bring thousands of beloved books to life in a new, accessible audio format and can even read books in a user’s voice given only 5 seconds of audio. The project releases thousands of free audiobooks to major platforms like Spotify, Apple, and Google podcasts. In a significant step to broaden access to classic literature, Project Gutenberg partnered with the Massachusetts Institute of Technology (MIT) and Microsoft to craft a vast collection of audiobooks using AI.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |