Kiratas

Google: SoundStorm should make audio generation faster and more efficient

by Keira Austin
May 24, 2023
in World

With SoundStorm, Google has presented an audio AI model that can generate 30 seconds of audio in half a second on a Tensor Processing Unit v4 (TPU v4), one of Google's AI accelerator chips. According to Google, SoundStorm takes as input the semantic tokens generated by the “AudioLM” framework. The audio quality matches that of AudioLM, but SoundStorm is said to produce more consistent output and to run faster because it generates audio tokens in parallel rather than one after another. This is according to a paper on generative audio AI models by the Google research team led by Zalán Borsos.
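
To illustrate why parallel generation saves time, the following minimal sketch counts model passes under two simplified assumptions: a sequential model emits one token per pass, while a parallel model fills in a fixed number of tokens per pass. The function names and the token counts are hypothetical and are not taken from the paper.

```python
import math


def sequential_passes(num_tokens: int) -> int:
    """Passes needed when the model emits exactly one token per pass."""
    return num_tokens


def parallel_passes(num_tokens: int, tokens_per_pass: int) -> int:
    """Passes needed when the model fills in several tokens per pass."""
    if tokens_per_pass < 1:
        raise ValueError("tokens_per_pass must be at least 1")
    return math.ceil(num_tokens / tokens_per_pass)


# Hypothetical numbers, chosen only for illustration.
num_tokens = 1500
print(sequential_passes(num_tokens))     # 1500
print(parallel_passes(num_tokens, 100))  # 15
```

The fewer passes a model needs, the less its running time grows with the length of the audio clip.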

AudioLM

AudioLM does not need text transcripts of its training audio. Instead, it learns from existing audio databases, in this case the LibriSpeech automatic speech recognition corpus, which consists of 1,000 hours of public domain audiobooks. Using machine learning, the audio files are tokenized, i.e. divided into short sound snippets that are represented as discrete tokens. These tokens are then fed into a machine learning model that learns the sound patterns with techniques borrowed from natural language processing.
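
The idea of tokenizing audio can be sketched with a toy example: cut the signal into fixed-size frames and replace each frame with the index of the closest entry in a codebook. This is only an illustration. The hand-written codebook and all function names are hypothetical, and real systems learn their tokenizers from data.

```python
def frame_signal(samples, frame_size):
    """Split samples into consecutive non-overlapping frames, dropping a short tail."""
    last_start = len(samples) - frame_size
    return [samples[i:i + frame_size] for i in range(0, last_start + 1, frame_size)]


def nearest_code(frame, codebook):
    """Return the index of the codebook entry closest to the frame (squared distance)."""
    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(range(len(codebook)), key=lambda i: squared_distance(frame, codebook[i]))


def tokenize(samples, codebook):
    """Map a list of samples to a list of integer tokens."""
    frame_size = len(codebook[0])
    return [nearest_code(frame, codebook) for frame in frame_signal(samples, frame_size)]


# Hypothetical three-entry codebook with two samples per frame.
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, -1.0]]
samples = [0.1, -0.1, 0.9, 1.1, -0.8, -1.2, 0.0]
print(tokenize(samples, codebook))  # [0, 1, 2]
```

The resulting token sequence can be treated much like a sequence of words, which is what makes language modeling techniques applicable to audio.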

The open source model Bark is based on a similar approach. In addition to music, such models can generate speech, including its melody, accent and other properties (prosody). A few seconds of audio input are enough to produce speech that sounds more natural than the output of previous models.

When used in conjunction with SPEAR-TTS, a multi-speaker text-to-speech system, SoundStorm can generate natural-sounding dialogue. The spoken content is controlled via transcripts, the speaking voices via short voice prompts, and speaker changes via annotations in the transcript. Generating 30 seconds of multi-speaker dialogue takes two seconds on a TPU v4.
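
The timings quoted in this article can be put into perspective as a speed-up over real time, that is, seconds of audio produced per second of computation. The helper below is a small sketch. Its name is made up for this example, and the only inputs are the figures reported above.

```python
def real_time_speedup(audio_seconds: float, compute_seconds: float) -> float:
    """Seconds of audio generated per second of computation."""
    if compute_seconds <= 0:
        raise ValueError("compute_seconds must be positive")
    return audio_seconds / compute_seconds


# Figures from the article, both measured on a TPU v4.
print(real_time_speedup(30, 0.5))  # 60.0 (audio generation)
print(real_time_speedup(30, 2))    # 15.0 (multi-speaker dialogue)
```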

Copyright and Identity Theft

Ever-improving audio AI models also offer great potential for abuse, for example identity theft by tricking voice ID systems. Many banks in Europe and the USA offer voice ID as a login option. People whose voices are readily available on the internet can fall victim to such scams.

AI researchers, including those at Google, are therefore also working on techniques that let people distinguish natural sounds from synthetically generated ones. One conceivable approach is to watermark AI-generated audio so that it is easier to tell apart from real recordings.
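
How an audio watermark might work in principle can be sketched with a toy spread-spectrum scheme: a faint pseudo-random pattern derived from a secret key is added to the samples, and a detector checks whether the audio correlates with that pattern. This is not the method used by Google or any particular product. All names, the key, and the strength value are assumptions for illustration, and a practical watermark would have to be inaudible and robust against compression.

```python
import math
import random


def key_pattern(key: int, length: int) -> list:
    """Pseudo-random sequence of +1/-1 values derived from a secret key."""
    rng = random.Random(key)
    return [rng.choice((-1.0, 1.0)) for _ in range(length)]


def embed_watermark(samples, key: int, strength: float = 0.05):
    """Add a faint key-dependent pattern to the audio samples."""
    pattern = key_pattern(key, len(samples))
    return [s + strength * p for s, p in zip(samples, pattern)]


def detect_watermark(samples, key: int, strength: float = 0.05) -> bool:
    """Report whether the samples correlate with the key's pattern."""
    pattern = key_pattern(key, len(samples))
    score = sum(s * p for s, p in zip(samples, pattern)) / len(samples)
    return score > strength / 2


# One second of a 440 Hz tone at 16 kHz stands in for generated audio.
tone = [0.5 * math.sin(2 * math.pi * 440 * t / 16000) for t in range(16000)]
marked = embed_watermark(tone, key=1234)
print(detect_watermark(marked, key=1234))
print(detect_watermark(tone, key=1234))
```

The detector needs the same key that was used for embedding, so only parties who know the key can check for the mark.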

(mack)

Tags: Artificial intelligence, audio, efficient, faster, Generation, Google, SoundStorm

© Kiratas 2023. All Rights Reserved.
