Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio
Ars Technica 2023-01-09
Summary:
Text-to-speech model can preserve speaker's emotional tone and acoustic environment.
Link:
https://arstechnica.com/?p=1908618
From feeds:
Cyberlaw » Ars Technica
Music and Digital Media » Ars Technica