00:00
hey I'm Hank iseria I'm here at the New
00:01
York Times to discuss whether uh AI can
00:04
replace
00:06
[Music]
00:07
me I do many voices mostly on The
00:11
Simpsons like motor bartender sing on or
00:15
police chief wium suspect is heartless
00:18
repeat heartless or snake um the
00:24
convict I'm taking this thing to Mexico
00:28
or Professor Frank of course the
00:31
scientist at superintendent chelmer one
00:33
of my personal favorites y the old sea
00:36
captain y Duff Man Of course Duff man
00:40
purveyor of Duff
00:43
[Music]
00:54
beer hi this is Hank is area I'm in some
00:58
mysterious undisclosed location of the
01:00
New York Times building I'm recording my
01:02
voice and we're going to see how AI can
01:05
recreate it and see how closely it can
01:07
match or not
01:09
okay this is Hank aaria I am not
01:13
actually Hank I am a bot and AI do you
01:17
think this actually sounds like me it
01:19
mispronounced my name I'm aaria not
01:23
aaria it felt like what it was which was
01:26
just um a vocal version of printed text
01:31
hey there this is mot bartender speaking
01:33
to you how will I describe my clientele
01:36
uh SLE bags uh scumballs um jerk wads uh
01:41
morons all of them
01:47
apply this is Mo the bartender you
01:50
should come to my Pub sometimes and have
01:52
a
01:52
Duff now was way
01:55
off um it doesn't have enough gravel in
01:58
it and it's missing uh lot of sounds
02:01
that Mo should make it's not here it's
02:04
here if we were trying to sound like a
02:08
robot that would be a pretty good
02:10
version of what we were trying to
02:13
do I didn't really think about AI
02:16
seriously in terms of voice acting until
02:20
about a year or two ago obviously the
02:21
Scarlett Johansson thing Scarlet
02:24
Johansson saying this voice used by open
02:26
ai's virtual assistant Sky Hello I'm
02:29
really excited about teaming up with you
02:31
sounds quote eerily similar to her own I
02:35
think there's a humanness that the AI
02:37
can't do right now at least vocally and
02:40
may never be able to do that involves a
02:42
character's motivation certain emotions
02:45
uh subtleties of physicality facially or
02:48
otherwise that add up to a human
02:54
being the biggest misconception people
02:57
have about voice acting is that it's
02:59
from the neck up but your body has to
03:01
get into it if I'm running and I have
03:03
dialogue it's easier just to be running
03:06
okay hmer I'm coming all right there I'm
03:09
getting tired I'm going to stop running
03:11
now okay Homer as soon as I get this
03:14
wood chopped I'll be there right with
03:17
you I hope that there's Donuts afterward
03:21
you got to kind of do it you know what I
03:22
mean stick something in your mouth that
03:25
makes the the cigar in your mouth sound
03:28
quite convincing now cigars I wouldn't
03:30
actually put one in my mouth but a pen
03:33
is perfectly fine actually this is
03:35
slightly gross I'm going to take it
03:37
[Music]
03:42
out people are going to listen to and
03:45
enjoy and watch what they like and
03:47
they're not going to care whether AI
03:49
generated it or human generated it or
03:52
some combination of the two right now
03:55
what AI generates by itself as motor
03:58
bartender or anything else isn't going
04:00
to cut it but if it does start to cut
04:03
it people are going to listen to it and
04:06
they're going to be grateful that it's
04:07
so readily available like what happened
04:09
you know to the music industry one thing
04:12
I cried a tear because the record
04:13
industry reinvented itself I got to
04:15
listen to all the music for free all of
04:17
a sudden so I don't I don't think people
04:19
are going to feel much differently about
04:21
any of this