Jump to content

Main menu Navigation ●Main page ●Contents ●Current events ●Random article ●About Wikipedia ●Contact us ●Donate Contribute ●Help ●Learn to edit ●Community portal ●Recent changes ●Upload file

●Create account ●Log in ●Create account ● Log in Pages for logged out editors learn more ●Contributions ●Talk

(Top) 1 Implications 2 Technology 3 See also 4 References

Generative audio

●한국어 Edit links ●Article ●Talk ●Read ●Edit ●View history Tools Actions ●Read ●Edit ●View history General ●What links here ●Related changes ●Upload file ●Special pages ●Permanent link ●Page information ●Cite this page ●Get shortened URL ●Download QR code ●Wikidata item Print/export ●Download as PDF ●Printable version Appearance From Wikipedia, the free encyclopedia

Generative audio refers to the creation of audio files from databases of audio clips.^{[citation needed]} This technology differs from synthesized voices such as Apple's Siri or Amazon's Alexa, which use a collection of fragments that are stitched together on demand.

Audio curves

Generative audio works by using neural networks to learn the statistical properties of an audio source, then reproduces those properties.^[1]

Implications[edit]

With this technology, a person's voice can be replicated to speak phrases that they may have never spoken. This could lead to a synthetic version of a public figure's voice being used against them.^[2]

Technology[edit]

This method uses a generative adversarial network (GAN), a deep machine learning technique where two machine learning models work against each other to create realistic audio.^[3]

References[edit]

^ "Fake news: you ain't seen nothing yet". The Economist. July 2017. Retrieved 2017-07-01.

^ Zotkin, D. N.; Shamma, S. A.; Ru, P.; Duraiswami, R.; Davis, L. S. (April 2003). "Pitch and timbre manipulations using cortical representation of sound". 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). Vol. 5. pp. V–517–20. doi:10.1109/ICASSP.2003.1200020. ISBN 978-0-7803-7663-2. S2CID 10372569.

^ Mobin, Shariq (October 2016). "Voice Conversion using Convolutional Neural Networks". arXiv:1610.08927 [stat.ML].

Retrieved from "https://en.wikipedia.org/w/index.php?title=Generative_audio&oldid=1222108761" Category: ●Sound production Hidden categories: ●Articles with short description ●Short description matches Wikidata ●All articles with unsourced statements ●Articles with unsourced statements from January 2024 ●This page was last edited on 3 May 2024, at 22:58 (UTC). ●Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization. ●Privacy policy ●About Wikipedia ●Disclaimers ●Contact Wikipedia ●Code of Conduct ●Developers ●Statistics ●Cookie statement ●Mobile view