Intel Helps Facilitate AI Language Recognition
December 9, 2021 | Intel
At the annual Conference on Neural Information Processing Systems (NeurIPS), two Intel-supported whitepapers on spoken language datasets are being presented. The first paper, The People’s Speech, targets “automatic speech recognition” tasks; the second is Multilingual Spoken Words Corpus (MSWC), which involves “keyword spotting.” Datasets coming out of each project contribute a sizeable volume of rich audio data, and each is among the largest collections available in its class.
The MSWC paper is co-authored by Keith Achorn, an AI frameworks engineer in Intel’s Software and Advanced Technology Group (SATG). Keith talks about his experiences on the project in a blog on the Intel Community site.
The People’s Speech and MSWC projects started in 2018, under the auspices of MLCommons, to identify and chart the 50 most used languages in the world into a single dataset, and then figure out a way to make the data useful. Group members came from Intel, Harvard, Alibaba, Oracle, Landing AI, University of Michigan, Google, Baidu and others.
In today’s diverse international, multilingual work environment, the ability to accurately transcribe and translate becomes increasingly important. With these datasets, a computer using artificial intelligence can “hear” a spoken word and produce an automatic transcript or translation.
Both projects utilize “diverse speech,” which means they better represent a natural environment: background noise, informal speech patterns, and a mixture of recording equipment used in different acoustic environments. This stands apart from highly controlled content such as audiobooks, which are more “sanitized.” Training on diverse speech has been correlated with better accuracy in real-world use.
The People’s Speech project includes tens of thousands of hours of supervised conversational audio. It is now among the world’s largest English speech recognition datasets licensed for academic and commercial usage, and is free to download.
MSWC is an audio speech dataset that has more than 300,000 keywords in dozens of languages, and can be accessed by smart devices. The MSWC dataset spans languages spoken by over 5 billion people, and advances the research and development of voice applications for a wide global audience.
Both datasets will be widely available to users. They are released under highly permissive license terms that allow commercial use.