social.coop is one of the many independent Mastodon servers you can use to participate in the fediverse.
A Fediverse instance for people interested in cooperative and collective projects. If you are interested in joining our community, please apply at https://join.social.coop/registration-form.html.

Administered by:

Server stats:

482
active users

#voicetech

0 posts0 participants0 posts today

💡 Interesting read on how one of the biggest commercial players out there plans to use Mozilla Open Voice data to make speech AI more inclusive and open to more language.

💬 Sounds idealistic, but Open Voice datasets are created by unpaid volunteers who donate hours and hours of their speech. Not sure whether I feel comfortable with that, tbh.

💭 Thoughts?

venturebeat.com/ai/nvidia-ente

VentureBeatNvidia takes on Meta and Google in the speech AI technology raceBy Victor Dey

We'll dig deeper into OpenAI Whisper (openai.com/blog/whisper/) this weekend. I'll announce a DateTime for the YT live stream here later.

Note: It's not a presentation but a #StudyWithMe session. You can also join in on the task/Livestream via audio/video through Google Meet.

YT Link: youtube.com/@datadrivenbabe

#Data #VoiceTech #NLP #SpeechTech #AI #ML #MLOps

How much do you know about the sub-field already?

OpenAIIntroducing WhisperWe’ve trained and are open-sourcing a neural net called Whisper that approaches human level robustness and accuracy on English speech recognition. Read Paper View Code View Model Card Whisper examples: Reveal Transcript Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask