[Note: This item comes from friend Geoff Goodfellow. DLH]
[would DEARLY LOVE to see this technology used to get, say https://en.wikipedia.org/wiki/Douglas_Rain‘s voice — the voice of The https://en.wikipedia.org/wiki/HAL_9000 — onto the iPhone (say, for Siri) or other voice interaction app type “things”!]
I’m not sure how I feel about the upcoming launch of Montreal-based Lyrebird’snew service. The company says its API will let you synthesize speech in anyone’s voice from just a minute-long recording – which means you could, for instance, generate a clip of President Trump declaring war on Canada.
Lyrebird has posted some audio examples that sound pretty convincing (listen below, and find more on this page). The company says that it doesn’t require the speaker to say the words that you’ll use the voice to speak in the audio you generate, and it’ll also be able to create different intonations.
If any of this sounds familiar, it might be because you’re thinking of Adobe’s demo of its similar tech last November. But while Adobe’s Project VoCo requires 20 minutes of audio and appears to use system resources for speech synthesis, Lyrebird only needs a minute-long recording and says it’s close to launching its cloud-based API to process audio and spit out results.
As I wrote when we covered Project VoCo last year, it’s likely that such software will lead to the creation and distribution of plenty of misleading information that people might believe to be genuine.
On its Ethics page, Lyrebird says that its technology “questions the validity of such evidence as it allows to easily manipulate audio recordings.” It added: