Ever wondered what Mona Lisa would look like rapping? Microsoft launches VASA-1 ... trends now

The boundary between what's real and what's not is becoming ever thinner thanks to a new AI tool from Microsoft.

Called VASA-1, the technology transforms a still image of a person's face into an animated clip of them talking or singing.

Lip movements are 'exquisitely synchronised' with audio to make it seem like the subject has come to life, the tech giant claims.

In one example, Leonardo da Vinci's 16th century masterpiece 'The Mona Lisa' starts rapping crudely in an American accent.

However, Microsoft admits the tool could be 'misused for impersonating humans' and is not releasing it to the public.

Microsoft's new tool VASA-1 can generate clips of people talking from a still image and audio of someone talking - but the tech giant isn't releasing it any time soon

VASA-1 takes a static image of a face – whether it's a photo of a real person or an artwork or drawing of someone fictional.

It then 'meticulously' matches this up with audio of speech 'from any person' to make the face come to life.

The AI was trained with a library of facial expressions, which even lets it animate the still image even in real time – so as the audio is being spoken.

In a blog post, Microsoft researchers describe VASA as a 'framework for generating lifelike talking faces of virtual characters'.

'It paves the way for real-time engagements with lifelike avatars that emulate human conversational behaviors,' they say.

'Our method is capable of not only producing precious lip-audio synchronisation, but also capturing a large spectrum of emotions and expressive facial nuances and natural head motions that contribute to the perception of realism and liveliness.'

In terms of use cases, the team thinks VASA-1 could enable digital AI avatars to 'engage with us in ways that are as natural and intuitive as interactions with real humans'.

But experts have shared their concerns around the technology, which if released could make people appear to say things that they never said.

VASA-1 requires a static image of a face – whether it's a photo of a real person or an artwork or drawing of someone imaginary. It 'meticulously' matches this up with audio of speech 'from any person' to make the face come to life

Microsoft's team said VASA-1 is 'not intended to create content that is used to mislead or deceive'

Another potential risk is fraud, as people online could be duped by a fake message from the image of someone they trust.

Jake Moore, a security specialist at ESET, said 'seeing is most definitely not believing anymore'.

'As this technology improves, it is a race against time to make sure everyone is fully aware of what is capable and that they should think twice before they accept correspondence as

Related posts

Rylan Clark reveals he was inundated with messages asking if he was on the run ... trends now

This Morning's Michelle Elman gets 'signature' manicure after splitting from ... trends now

USDA begins testing ground beef supply for BIRD FLU as steak dinners and ... trends now

California reverses years of decline with 67,000-person growth spurt, but ... trends now