hello everyone.welcome this is modern
hello everyone.welcome this is modern fraud detection prevention using deep
fraud detection prevention using deep
fraud detection prevention using deep learning that title was submitted quite
learning that title was submitted quite
learning that title was submitted quite a long time ago so I’d say actually the
a long time ago so I’d say actually the
a long time ago so I’d say actually the talk is probably bit more about machine
talk is probably bit more about machine
talk is probably bit more about machine learning in general now we had a good
learning in general now we had a good
learning in general now we had a good talk earlier on in the day introducing
talk earlier on in the day introducing
talk earlier on in the day introducing some of the concepts we time it behind
some of the concepts we time it behind
some of the concepts we time it behind learning and I’m hoping to sort of build
learning and I’m hoping to sort of build
learning and I’m hoping to sort of build on them really so this talk is going to
on them really so this talk is going to
on them really so this talk is going to be a bit more technical there’s there’s
be a bit more technical there’s there’s
be a bit more technical there’s there’s no maths which you’ll be glad to hear
no maths which you’ll be glad to hear
no maths which you’ll be glad to hear there’s there’s also no code I’ve tried
there’s there’s also no code I’ve tried
there’s there’s also no code I’ve tried to you know explain myself using
to you know explain myself using
to you know explain myself using diagrams and pictures wherever I can but
diagrams and pictures wherever I can but
diagrams and pictures wherever I can but it is a much more technical talk so
it is a much more technical talk so
it is a much more technical talk so hopefully you can get you your teeth
hopefully you can get you your teeth
hopefully you can get you your teeth into it we’ve got the usual slides at
into it we’ve got the usual slides at
into it we’ve got the usual slides at the front saying please rate and engage
the front saying please rate and engage
the front saying please rate and engage so yeah my name’s Phil I’m with try fork
so yeah my name’s Phil I’m with try fork
so yeah my name’s Phil I’m with try fork but we’re in try fot leads so we’re
but we’re in try fot leads so we’re
but we’re in try fot leads so we’re quite distinct from the Danish
quite distinct from the Danish
quite distinct from the Danish mothership yeah I actually am a software
mothership yeah I actually am a software
mothership yeah I actually am a software engineer in my in my professional life
engineer in my in my professional life
engineer in my in my professional life machine learning is a just a bit more of
machine learning is a just a bit more of
machine learning is a just a bit more of a hobby I’m currently working on there
a hobby I’m currently working on there
a hobby I’m currently working on there and Apache meat sauce framework for
and Apache meat sauce framework for
and Apache meat sauce framework for elasticsearch yeah if you’d if you’d
elasticsearch yeah if you’d if you’d
elasticsearch yeah if you’d if you’d like to talk more about any of the
like to talk more about any of the
like to talk more about any of the subjects that I’m about to discuss then
subjects that I’m about to discuss then
subjects that I’m about to discuss then please see me or I’ll see some of my
please see me or I’ll see some of my
please see me or I’ll see some of my colleagues listed at the bottom there
colleagues listed at the bottom there
colleagues listed at the bottom there I’m gonna skip the marketing slides
I’m gonna skip the marketing slides
I’m gonna skip the marketing slides because you will not try and we’re
because you will not try and we’re
because you will not try and we’re split into three or four topics the the
split into three or four topics the the
split into three or four topics the the final one the architectures that is is
final one the architectures that is is
final one the architectures that is is more about how we would do this in
more about how we would do this in
more about how we would do this in production how we would do this in real
production how we would do this in real
production how we would do this in real life it’s it’s interesting but it’s not
life it’s it’s interesting but it’s not
life it’s it’s interesting but it’s not really the core thing of my my talk so
really the core thing of my my talk so
really the core thing of my my talk so I’m going to go through the first three
I’m going to go through the first three
I’m going to go through the first three sections and if we have time we might do
sections and if we have time we might do
sections and if we have time we might do the fourth but I’ll probably end up
the fourth but I’ll probably end up
the fourth but I’ll probably end up speaking for too long and I’ll probably
speaking for too long and I’ll probably
speaking for too long and I’ll probably drop that section I’m going to introduce
drop that section I’m going to introduce
drop that section I’m going to introduce the the reasons why we want to provide
the the reasons why we want to provide
the the reasons why we want to provide some new tools and techniques to apply
some new tools and techniques to apply
some new tools and techniques to apply to fraud to try and make the case to the
to fraud to try and make the case to the
to fraud to try and make the case to the business users as why you should pick up
business users as why you should pick up
business users as why you should pick up on some of these ideas and start
on some of these ideas and start
on some of these ideas and start to run with them I’m gonna then
to run with them I’m gonna then
to run with them I’m gonna then introduce the topic of machine learning
introduce the topic of machine learning
introduce the topic of machine learning and you’ve probably had quite a bit of
and you’ve probably had quite a bit of
and you’ve probably had quite a bit of experience already but if you haven’t
experience already but if you haven’t
experience already but if you haven’t that that’ll be the section that really
that that’ll be the section that really
that that’ll be the section that really explains what’s going on and and why it
explains what’s going on and and why it
explains what’s going on and and why it happens and I’ve also got quite a lot of
happens and I’ve also got quite a lot of
happens and I’ve also got quite a lot of demos as well some of the demos are
demos as well some of the demos are
demos as well some of the demos are quite simple and very general just to
quite simple and very general just to
quite simple and very general just to explain the concepts but the rest of the
explain the concepts but the rest of the
explain the concepts but the rest of the demos are all focused towards fraud
demos are all focused towards fraud
demos are all focused towards fraud prevention focus towards finance and
prevention focus towards finance and
prevention focus towards finance and specifically mortgages okay so let’s
specifically mortgages okay so let’s
specifically mortgages okay so let’s crack on so in order to to do any of
crack on so in order to to do any of
crack on so in order to to do any of this work we need to persuade some
this work we need to persuade some
this work we need to persuade some people to give us some money and there’s
people to give us some money and there’s
people to give us some money and there’s no better reason to get people to give
no better reason to get people to give
no better reason to get people to give us some money if there’s other money at
us some money if there’s other money at
us some money if there’s other money at risk in the UK we’ve got some UK
risk in the UK we’ve got some UK
risk in the UK we’ve got some UK specific facts here in the UK financial
specific facts here in the UK financial
specific facts here in the UK financial crime is defined as I can’t even read
crime is defined as I can’t even read
crime is defined as I can’t even read that screen so I have to read from here
that screen so I have to read from here
that screen so I have to read from here sorry fraud is an act of deception
sorry fraud is an act of deception
sorry fraud is an act of deception intended for personal gain or to cause a
intended for personal gain or to cause a
intended for personal gain or to cause a loss to another party so all of these
loss to another party so all of these
loss to another party so all of these facts and figures it specific to the UK
facts and figures it specific to the UK
facts and figures it specific to the UK but they’re they’re applicable to pretty
but they’re they’re applicable to pretty
but they’re they’re applicable to pretty much every country in the world anybody
much every country in the world anybody
much every country in the world anybody that’s trying to do wrong to do harm for
that’s trying to do wrong to do harm for
that’s trying to do wrong to do harm for their own financial gain is considered
their own financial gain is considered
their own financial gain is considered fraud we’ve got a UK mortgage fraud
fraud we’ve got a UK mortgage fraud
fraud we’ve got a UK mortgage fraud listed there in 2014 a 1.2 million
listed there in 2014 a 1.2 million
listed there in 2014 a 1.2 million properties bought and sold in the UK and
properties bought and sold in the UK and
properties bought and sold in the UK and 83 in every 10,000 of those applications
83 in every 10,000 of those applications
83 in every 10,000 of those applications were fraudulent so that’s not quite 1%
were fraudulent so that’s not quite 1%
were fraudulent so that’s not quite 1% 0.8 3% and when we say when you say
0.8 3% and when we say when you say
0.8 3% and when we say when you say fraud in that that aspect it’s not
fraud in that that aspect it’s not
fraud in that that aspect it’s not necessarily people being like hugely
necessarily people being like hugely
necessarily people being like hugely devious we’re going from the small scale
devious we’re going from the small scale
devious we’re going from the small scale where somebody’s maybe telling a few
where somebody’s maybe telling a few
where somebody’s maybe telling a few fibs about their employment history or
fibs about their employment history or
fibs about their employment history or how much they earn all the way up to
how much they earn all the way up to
how much they earn all the way up to huge huge you know international fraud
huge huge you know international fraud
huge huge you know international fraud in 2013 there was a story of two guys
in 2013 there was a story of two guys
in 2013 there was a story of two guys who had invented there a whole series of
who had invented there a whole series of
who had invented there a whole series of companies that invented estate agents
companies that invented estate agents
companies that invented estate agents that invented surveyors they’ve invented
that invented surveyors they’ve invented
that invented surveyors they’ve invented property businesses and builders and
property businesses and builders and
property businesses and builders and they had supposedly bought a huge tract
they had supposedly bought a huge tract
they had supposedly bought a huge tract of land which they were going to build
of land which they were going to build
of land which they were going to build you know lots of new houses on the
you know lots of new houses on the
you know lots of new houses on the invented or stole the identities of
invented or stole the identities of
invented or stole the identities of other people to take out mortgages on
other people to take out mortgages on
other people to take out mortgages on those respective houses so it turns out
those respective houses so it turns out
those respective houses so it turns out there were tens you know tens to
there were tens you know tens to
there were tens you know tens to hundreds of mortgage applications all
hundreds of mortgage applications all
hundreds of mortgage applications all going in for houses that hadn’t been
going in for houses that hadn’t been
going in for houses that hadn’t been built yet
built yet
built yet but as it turned out they just took that
but as it turned out they just took that
but as it turned out they just took that money paid off the original land the
money paid off the original land the
money paid off the original land the original debt they worked they owned the
original debt they worked they owned the
original debt they worked they owned the land and then just liked it they just
land and then just liked it they just
land and then just liked it they just ran off they completely invented a
ran off they completely invented a
ran off they completely invented a village bought loads and mortgages based
village bought loads and mortgages based
village bought loads and mortgages based upon that and then ran off how how can
upon that and then ran off how how can
upon that and then ran off how how can that even so that the total cost finally
that even so that the total cost finally
that even so that the total cost finally came to it was about 53 million pounds
came to it was about 53 million pounds
came to it was about 53 million pounds and managed to to run away with and they
and managed to to run away with and they
and managed to to run away with and they did finally get caught but they very
did finally get caught but they very
did finally get caught but they very nearly got away with it because it was
nearly got away with it because it was
nearly got away with it because it was just so embarrassing
just so embarrassing
just so embarrassing you know the mortgage company was so
you know the mortgage company was so
you know the mortgage company was so embarrassed to say that this had
embarrassed to say that this had
embarrassed to say that this had happened it almost never even got caught
happened it almost never even got caught
happened it almost never even got caught so it does it does get to quite a large
so it does it does get to quite a large
so it does it does get to quite a large scale and this this actually equates to
scale and this this actually equates to
scale and this this actually equates to approximately 1 billion pounds worth of
approximately 1 billion pounds worth of
approximately 1 billion pounds worth of fraudulent applications so it’s a huge
fraudulent applications so it’s a huge
fraudulent applications so it’s a huge huge number but my interestingly it’s
huge number but my interestingly it’s
huge number but my interestingly it’s not actually the worst case of fraud in
not actually the worst case of fraud in
not actually the worst case of fraud in the UK the worst is actually credit
the UK the worst is actually credit
the UK the worst is actually credit current account fraud so traditionally
current account fraud so traditionally
current account fraud so traditionally what would what people would do is to
what would what people would do is to
what would what people would do is to steal somebody’s information open a
steal somebody’s information open a
steal somebody’s information open a standard bank account current account
standard bank account current account
standard bank account current account some sort from from a traditional bank
some sort from from a traditional bank
some sort from from a traditional bank which you can do quite easily in the UK
which you can do quite easily in the UK
which you can do quite easily in the UK and then use the overdraft or use some
and then use the overdraft or use some
and then use the overdraft or use some facilities to actually withdraw some
facilities to actually withdraw some
facilities to actually withdraw some money and then and then run a runoff
money and then and then run a runoff
money and then and then run a runoff so that actually constitutes the most
so that actually constitutes the most
so that actually constitutes the most fraud in the UK but we’re talking a
fraud in the UK but we’re talking a
fraud in the UK but we’re talking a little bit about mortgages today and
little bit about mortgages today and
little bit about mortgages today and finally we’ve got UK real retail fraud
finally we’ve got UK real retail fraud
finally we’ve got UK real retail fraud much of the business in the UK is
much of the business in the UK is
much of the business in the UK is actually made up of small to medium
actually made up of small to medium
actually made up of small to medium sized enterprise
sized enterprise
sized enterprise the big guys actually don’t they make a
the big guys actually don’t they make a
the big guys actually don’t they make a significant part of the market but not
significant part of the market but not
significant part of the market but not not a huge part small to medium-sized
not a huge part small to medium-sized
not a huge part small to medium-sized businesses they’re estimated losing
businesses they’re estimated losing
businesses they’re estimated losing eighteen billion pounds every year to
eighteen billion pounds every year to
eighteen billion pounds every year to fraudulent transactions so that’s when
fraudulent transactions so that’s when
fraudulent transactions so that’s when somebody goes online buy some clothes or
somebody goes online buy some clothes or
somebody goes online buy some clothes or buy some food or buy some shopping of
buy some food or buy some shopping of
buy some food or buy some shopping of some kind on a credit card and then
some kind on a credit card and then
some kind on a credit card and then maybe they cancel a credit card as soon
maybe they cancel a credit card as soon
maybe they cancel a credit card as soon as I place the order so the the guys on
as I place the order so the the guys on
as I place the order so the the guys on the retail side of having to ship all of
the retail side of having to ship all of
the retail side of having to ship all of this stuff only to find that the person
this stuff only to find that the person
this stuff only to find that the person you know doesn’t exist or
you know doesn’t exist or
you know doesn’t exist or card stolen or stuff like that and that
card stolen or stuff like that and that
card stolen or stuff like that and that amounts to a huge amount as well another
amounts to a huge amount as well another
amounts to a huge amount as well another reason why businesses might want to look
reason why businesses might want to look
reason why businesses might want to look at some of these ideas is that
at some of these ideas is that
at some of these ideas is that legislation so we’ve got one end of the
legislation so we’ve got one end of the
legislation so we’ve got one end of the spectrum where there’s people actually
spectrum where there’s people actually
spectrum where there’s people actually doing wrong to their businesses you
doing wrong to their businesses you
doing wrong to their businesses you might want to try and protect yourself
might want to try and protect yourself
might want to try and protect yourself but also this legislation legal
but also this legislation legal
but also this legislation legal requirements that need to be put in
requirements that need to be put in
requirements that need to be put in place in order to comply two more in
place in order to comply two more in
place in order to comply two more in 2017 there’s new anti money laundering
2017 there’s new anti money laundering
2017 there’s new anti money laundering legislation coming in within the EU so
legislation coming in within the EU so
legislation coming in within the EU so it applies to all EU countries it’s
it applies to all EU countries it’s
it applies to all EU countries it’s extending extending money laundering
extending extending money laundering
extending extending money laundering rules that are already in place but the
rules that are already in place but the
rules that are already in place but the main changes are that the out of scope
main changes are that the out of scope
main changes are that the out of scope limit has dropped to a thousand euros
limit has dropped to a thousand euros
limit has dropped to a thousand euros the previously it was fifteen thousand
the previously it was fifteen thousand
the previously it was fifteen thousand euros and this applies to businesses
euros and this applies to businesses
euros and this applies to businesses that are handling financial transactions
that are handling financial transactions
that are handling financial transactions so it applies to banks obviously
so it applies to banks obviously
so it applies to banks obviously financial institutions credit agencies
financial institutions credit agencies
financial institutions credit agencies stuff like that it also applies to legal
stuff like that it also applies to legal
stuff like that it also applies to legal services in the state services it also
services in the state services it also
services in the state services it also applies to to gambling services
applies to to gambling services
applies to to gambling services basically anybody that’s handling and
basically anybody that’s handling and
basically anybody that’s handling and moving money around has to comply with
moving money around has to comply with
moving money around has to comply with this legislation and what this is saying
this legislation and what this is saying
this legislation and what this is saying is that anybody that has a transaction
is that anybody that has a transaction
is that anybody that has a transaction of over a thousand euros they need to
of over a thousand euros they need to
of over a thousand euros they need to prove to the authorities that they’re
prove to the authorities that they’re
prove to the authorities that they’re doing their due diligence in to prove
doing their due diligence in to prove
doing their due diligence in to prove that that person is a not being
that that person is a not being
that that person is a not being fraudulent and be not using the money
fraudulent and be not using the money
fraudulent and be not using the money for nefarious means like terrorism or
for nefarious means like terrorism or
for nefarious means like terrorism or something like that and finally then
something like that and finally then
something like that and finally then they’re they’re required to submit their
they’re they’re required to submit their
they’re they’re required to submit their information to a central registry of
information to a central registry of
information to a central registry of information and this there’s a well
information and this there’s a well
information and this there’s a well there’s obviously previously concerns
there’s obviously previously concerns
there’s obviously previously concerns there but that’s a bit unclear and how
there but that’s a bit unclear and how
there but that’s a bit unclear and how that’s actually going to be implemented
that’s actually going to be implemented
that’s actually going to be implemented so there’s financial reasons direct
so there’s financial reasons direct
so there’s financial reasons direct financial reasons why you want might you
financial reasons why you want might you
financial reasons why you want might you might want to do it is also legal
might want to do it is also legal
might want to do it is also legal reasons so how do we do it at the moment
reasons so how do we do it at the moment
reasons so how do we do it at the moment well if a traditional company was goat
well if a traditional company was goat
well if a traditional company was goat would go to a software house and ask for
would go to a software house and ask for
would go to a software house and ask for some software to do this they would
some software to do this they would
some software to do this they would probably come up with these some
probably come up with these some
probably come up with these some combination of these four general ideas
combination of these four general ideas
combination of these four general ideas we’ve got the origination based
we’ve got the origination based
we’ve got the origination based technique so most countries have a law
technique so most countries have a law
technique so most countries have a law that requires financial services to
that requires financial services to
that requires financial services to prove the
prove the
prove the they’re talking to the real person
they’re talking to the real person
they’re talking to the real person origination is that’s it that’s what
origination is that’s it that’s what
origination is that’s it that’s what origination is I won one thing I get
origination is I won one thing I get
origination is I won one thing I get really really annoyed about is banks in
really really annoyed about is banks in
really really annoyed about is banks in the UK they’ve got this awful technique
the UK they’ve got this awful technique
the UK they’ve got this awful technique of using automated phone systems to try
of using automated phone systems to try
of using automated phone systems to try and prove you are who you say you are so
and prove you are who you say you are so
and prove you are who you say you are so you go through the whole series of you
you go through the whole series of you
you go through the whole series of you know police typing your ID number please
know police typing your ID number please
know police typing your ID number please type in your address please type in your
type in your address please type in your
type in your address please type in your password please do this please do that
password please do this please do that
password please do this please do that and that takes about three and a half
and that takes about three and a half
and that takes about three and a half minutes and then as soon as you finally
minutes and then as soon as you finally
minutes and then as soon as you finally speak to a real person which is all you
speak to a real person which is all you
speak to a real person which is all you wanted to do in the first place
wanted to do in the first place
wanted to do in the first place as soon as you’ve written speak to a
as soon as you’ve written speak to a
as soon as you’ve written speak to a real person they ask all the same
real person they ask all the same
real person they ask all the same questions again and it turns out they do
questions again and it turns out they do
questions again and it turns out they do this because these businesses aren’t
this because these businesses aren’t
this because these businesses aren’t quite sure that the automated method
quite sure that the automated method
quite sure that the automated method really is proof enough that the personal
really is proof enough that the personal
really is proof enough that the personal methods are actually going through a
methods are actually going through a
methods are actually going through a variety does my head in and some some
variety does my head in and some some
variety does my head in and some some may be less secure instances such as
may be less secure instances such as
may be less secure instances such as insurance agencies and people that are
insurance agencies and people that are
insurance agencies and people that are not necessarily as interested in
not necessarily as interested in
not necessarily as interested in protecting security they can use some
protecting security they can use some
protecting security they can use some really quite dodgy methods like I’ve had
really quite dodgy methods like I’ve had
really quite dodgy methods like I’ve had some cases where people have asked me
some cases where people have asked me
some cases where people have asked me just for my date of birth or just for my
just for my date of birth or just for my
just for my date of birth or just for my postcode or something like that and
postcode or something like that and
postcode or something like that and they’re completely not secure your date
they’re completely not secure your date
they’re completely not secure your date of birth is basically a password you
of birth is basically a password you
of birth is basically a password you were given at birth you can’t change
were given at birth you can’t change
were given at birth you can’t change it’s fixed and you have to live with it
it’s fixed and you have to live with it
it’s fixed and you have to live with it so it’s the worst password that that can
so it’s the worst password that that can
so it’s the worst password that that can ever exist the next group of
ever exist the next group of
ever exist the next group of technologies are rules based so these
technologies are rules based so these
technologies are rules based so these are static rules that are usually
are static rules that are usually
are static rules that are usually provided by analysts saying that you
provided by analysts saying that you
provided by analysts saying that you know no transaction must be bigger than
know no transaction must be bigger than
know no transaction must be bigger than X or you can’t have so many transactions
X or you can’t have so many transactions
X or you can’t have so many transactions within a certain period of time
within a certain period of time
within a certain period of time something like that and they’re and
something like that and they’re and
something like that and they’re and they’re great and they’re okay and they
they’re great and they’re okay and they
they’re great and they’re okay and they catch a reasonable amount of fraud it’s
catch a reasonable amount of fraud it’s
catch a reasonable amount of fraud it’s usually the the accidental types and the
usually the the accidental types and the
usually the the accidental types and the basically the not so intelligent
basically the not so intelligent
basically the not so intelligent fraudsters would try and do something
fraudsters would try and do something
fraudsters would try and do something silly like this but also it u also catch
silly like this but also it u also catch
silly like this but also it u also catch all the good guys as well like like when
all the good guys as well like like when
all the good guys as well like like when you’re abroad you cards always declined
you’re abroad you cards always declined
you’re abroad you cards always declined the first time because they think it’s
the first time because they think it’s
the first time because they think it’s fraudulent or you know any trying to buy
fraudulent or you know any trying to buy
fraudulent or you know any trying to buy a new car from a guy and he you know
a new car from a guy and he you know
a new car from a guy and he you know takes cash and you try and pull out 1500
takes cash and you try and pull out 1500
takes cash and you try and pull out 1500 pounds out of the cash machine you can’t
pounds out of the cash machine you can’t
pounds out of the cash machine you can’t do it because it’s you know it’s against
do it because it’s you know it’s against
do it because it’s you know it’s against their static rules credit checks
their static rules credit checks
their static rules credit checks lots of agencies will gladly accept your
lots of agencies will gladly accept your
lots of agencies will gladly accept your money to provide you with a number
money to provide you with a number
money to provide you with a number that’s it and these numbers are supposed
that’s it and these numbers are supposed
that’s it and these numbers are supposed to represent the worthiness or the the
to represent the worthiness or the the
to represent the worthiness or the the risk that that person provides to your
risk that that person provides to your
risk that that person provides to your business and there is certainly a case
business and there is certainly a case
business and there is certainly a case there’s an argument to use them how
there’s an argument to use them how
there’s an argument to use them how accurate they are is another question
accurate they are is another question
accurate they are is another question aggregation and monitoring so this is
aggregation and monitoring so this is
aggregation and monitoring so this is more of a reactive type of solution
more of a reactive type of solution
more of a reactive type of solution where analysts would be provided with
where analysts would be provided with
where analysts would be provided with the data and they you know perform some
the data and they you know perform some
the data and they you know perform some query or ask a question and try and do
query or ask a question and try and do
query or ask a question and try and do something based upon that so for example
something based upon that so for example
something based upon that so for example you can have some guys that find a
you can have some guys that find a
you can have some guys that find a pattern between you know one cash
pattern between you know one cash
pattern between you know one cash machine for example gave up a large
machine for example gave up a large
machine for example gave up a large amount of money so the analyst will when
amount of money so the analyst will when
amount of money so the analyst will when they check it out so they’re the types
they check it out so they’re the types
they check it out so they’re the types of things that exist in the wild at the
of things that exist in the wild at the
of things that exist in the wild at the moment but now I’m going to start
moment but now I’m going to start
moment but now I’m going to start talking about machine learning and how
talking about machine learning and how
talking about machine learning and how we can use machine learning to improve
we can use machine learning to improve
we can use machine learning to improve some of those technologies and try and
some of those technologies and try and
some of those technologies and try and remove some of the bias or the
remove some of the bias or the
remove some of the bias or the redundancy or the error out of those
redundancy or the error out of those
redundancy or the error out of those technologies okay so following on from
technologies okay so following on from
technologies okay so following on from our excellent presentation this morning
our excellent presentation this morning
our excellent presentation this morning I forgot the first name miss Pitt sorry
I forgot the first name miss Pitt sorry
I forgot the first name miss Pitt sorry if you hear she was talking about how we
if you hear she was talking about how we
if you hear she was talking about how we learn I I also have a couple of slides
learn I I also have a couple of slides
learn I I also have a couple of slides but it’s not it’s it’s a bit more basic
but it’s not it’s it’s a bit more basic
but it’s not it’s it’s a bit more basic I like to introduce my my daughter here
I like to introduce my my daughter here
I like to introduce my my daughter here she’s she’s 18 months old and she’s
she’s she’s 18 months old and she’s
she’s she’s 18 months old and she’s currently going through this process of
currently going through this process of
currently going through this process of learning and it’s really fascinating to
learning and it’s really fascinating to
learning and it’s really fascinating to watch how she does this because there’s
watch how she does this because there’s
watch how she does this because there’s there’s lots of parallels between this
there’s lots of parallels between this
there’s lots of parallels between this and between the state machine learning
and between the state machine learning
and between the state machine learning algorithms at the moment and if we can
algorithms at the moment and if we can
algorithms at the moment and if we can understand how how we learn it actually
understand how how we learn it actually
understand how how we learn it actually helps us to write better algorithms and
helps us to write better algorithms and
helps us to write better algorithms and it helps you to understand the
it helps you to understand the
it helps you to understand the algorithms as well so this is my
algorithms as well so this is my
algorithms as well so this is my daughter with her her mother my wife
daughter with her her mother my wife
daughter with her her mother my wife making some yummy rice crispy crispy
making some yummy rice crispy crispy
making some yummy rice crispy crispy chocolate square things and in the top
chocolate square things and in the top
chocolate square things and in the top picture there she’s doing exactly what
picture there she’s doing exactly what
picture there she’s doing exactly what mom told her please take the rice
mom told her please take the rice
mom told her please take the rice krispies and put them in some baskets
krispies and put them in some baskets
krispies and put them in some baskets and then we can eat them later on but
and then we can eat them later on but
and then we can eat them later on but somewhere along the line she decided to
somewhere along the line she decided to
somewhere along the line she decided to perform some tests
perform some tests
perform some tests she decided if I put this thing in my
she decided if I put this thing in my
she decided if I put this thing in my mouth
mouth
mouth it gonna be good or is he gonna be bad
it gonna be good or is he gonna be bad
it gonna be good or is he gonna be bad so she put it in her mouth and he was
so she put it in her mouth and he was
so she put it in her mouth and he was good
good
good so she completely ignored any
so she completely ignored any
so she completely ignored any instructions from there not because
instructions from there not because
instructions from there not because she’d learned that eating chocolate with
she’d learned that eating chocolate with
she’d learned that eating chocolate with Rice Krispies was a good thing so that’s
Rice Krispies was a good thing so that’s
Rice Krispies was a good thing so that’s a very simple example of how children
a very simple example of how children
a very simple example of how children learn and how algorithms learn in
learn and how algorithms learn in
learn and how algorithms learn in general you you provide them with some
general you you provide them with some
general you you provide them with some tests with some input and then they
tests with some input and then they
tests with some input and then they evaluate that input and decide on some
evaluate that input and decide on some
evaluate that input and decide on some outcome
it takes time however Shoei she’s 18
it takes time however Shoei she’s 18 months and she’s still pretty stupid you
months and she’s still pretty stupid you
months and she’s still pretty stupid you know she can’t work she’s struggling to
know she can’t work she’s struggling to
know she can’t work she’s struggling to put sentences together she she can when
put sentences together she she can when
put sentences together she she can when she walks she falls flat in her face she
she walks she falls flat in her face she
she walks she falls flat in her face she gets spatulas and misses a mouth and
gets spatulas and misses a mouth and
gets spatulas and misses a mouth and hits her eye and it’s too late it’s not
hits her eye and it’s too late it’s not
hits her eye and it’s too late it’s not good so it does take time for this to
good so it does take time for this to
good so it does take time for this to happen this applies to to algorithms as
happen this applies to to algorithms as
happen this applies to to algorithms as well it take time to learn we’ve got
well it take time to learn we’ve got
well it take time to learn we’ve got this great game that she loves which are
this great game that she loves which are
this great game that she loves which are index cards and this is an example of
index cards and this is an example of
index cards and this is an example of how she gets things wrong I mean she’s
how she gets things wrong I mean she’s
how she gets things wrong I mean she’s she’s very good I yeah she’s really good
she’s very good I yeah she’s really good
she’s very good I yeah she’s really good I don’t give you the impression that I’m
I don’t give you the impression that I’m
I don’t give you the impression that I’m a bad father I’m saying she’s rubbish
a bad father I’m saying she’s rubbish
a bad father I’m saying she’s rubbish and get rid of it but no she’s very good
and get rid of it but no she’s very good
and get rid of it but no she’s very good but in some cases she does get it wrong
but in some cases she does get it wrong
but in some cases she does get it wrong the first example on the left there is a
the first example on the left there is a
the first example on the left there is a door however she thinks it’s a house and
door however she thinks it’s a house and
door however she thinks it’s a house and she thinks it’s a house because it’s got
she thinks it’s a house because it’s got
she thinks it’s a house because it’s got four walls and it’s got these features
four walls and it’s got these features
four walls and it’s got these features in the middle which are like squares
in the middle which are like squares
in the middle which are like squares which kind of look like windows but what
which kind of look like windows but what
which kind of look like windows but what she hasn’t learned yet is that a house
she hasn’t learned yet is that a house
she hasn’t learned yet is that a house actually needs a triangle on the top and
actually needs a triangle on the top and
actually needs a triangle on the top and so this is a this is an example of a
so this is a this is an example of a
so this is a this is an example of a misuse of features so there are features
misuse of features so there are features
misuse of features so there are features there but she’s misusing them to come to
there but she’s misusing them to come to
there but she’s misusing them to come to the wrong conclusion the second one she
the wrong conclusion the second one she
the wrong conclusion the second one she calls this a chicken because she doesn’t
calls this a chicken because she doesn’t
calls this a chicken because she doesn’t quite understand the concept of a bird I
quite understand the concept of a bird I
quite understand the concept of a bird I think she she struggles to to to
think she she struggles to to to
think she she struggles to to to understand classes of things she’s quite
understand classes of things she’s quite
understand classes of things she’s quite happy to learn that that thing is
happy to learn that that thing is
happy to learn that that thing is definitely a bird and that thing is
definitely a bird and that thing is
definitely a bird and that thing is definitely a teddy and that thing is
definitely a teddy and that thing is
definitely a teddy and that thing is definitely mommy and that thing is his
definitely mommy and that thing is his
definitely mommy and that thing is his dad went it around but she struggles
dad went it around but she struggles
dad went it around but she struggles with things so that’s a chicken so
with things so that’s a chicken so
with things so that’s a chicken so that’s so that’s okay but that’s just an
that’s so that’s okay but that’s just an
that’s so that’s okay but that’s just an example of a Mis classification and then
example of a Mis classification and then
example of a Mis classification and then finally we’ve got the third picture and
finally we’ve got the third picture and
finally we’ve got the third picture and apparently that’s a tiger now I went out
apparently that’s a tiger now I went out
apparently that’s a tiger now I went out when I show this cat she kind of looks
when I show this cat she kind of looks
when I show this cat she kind of looks at me and goes I’m not sure what it is
at me and goes I’m not sure what it is
at me and goes I’m not sure what it is and then I look at the car go
and then I look at the car go
and then I look at the car go I’m not sure that is either idea I think
I’m not sure that is either idea I think
I’m not sure that is either idea I think sometimes she goes for a cat sometimes
sometimes she goes for a cat sometimes
sometimes she goes for a cat sometimes she goes for
she goes for
she goes for there sometimes I don’t know I don’t
there sometimes I don’t know I don’t
there sometimes I don’t know I don’t even know what it is it looks like
even know what it is it looks like
even know what it is it looks like something sort of ran over it it’s like
something sort of ran over it it’s like
something sort of ran over it it’s like a cat that’s been ran over basically and
a cat that’s been ran over basically and
a cat that’s been ran over basically and that’s a great example of just bad data
that’s a great example of just bad data
that’s a great example of just bad data so in real life you will get that data
so in real life you will get that data
so in real life you will get that data and there’s a big cleaning method that’s
and there’s a big cleaning method that’s
and there’s a big cleaning method that’s required to try and prevent you from
required to try and prevent you from
required to try and prevent you from getting this bad data because you will
getting this bad data because you will
getting this bad data because you will come to the wrong result so just to
come to the wrong result so just to
come to the wrong result so just to prove that it’s not just her age I’ve
prove that it’s not just her age I’ve
prove that it’s not just her age I’ve got an example for all of you so take a
got an example for all of you so take a
got an example for all of you so take a look at this picture and I’m just going
look at this picture and I’m just going
look at this picture and I’m just going to watch you for a second right so so
to watch you for a second right so so
to watch you for a second right so so for all the programmers out there this
for all the programmers out there this
for all the programmers out there this is like a human equivalent of like a
is like a human equivalent of like a
is like a human equivalent of like a stack overflow so what you start doing
stack overflow so what you start doing
stack overflow so what you start doing is you try and focus in on their eyes
is you try and focus in on their eyes
is you try and focus in on their eyes but then you realize that she’s got eyes
but then you realize that she’s got eyes
but then you realize that she’s got eyes in a different place so you kind of jump
in a different place so you kind of jump
in a different place so you kind of jump across and then you realize the mouth is
across and then you realize the mouth is
across and then you realize the mouth is in the wrong place so you jump again and
in the wrong place so you jump again and
in the wrong place so you jump again and you’re up and down and up and down and
you’re up and down and up and down and
you’re up and down and up and down and if you stare at it long enough you start
if you stare at it long enough you start
if you stare at it long enough you start to feel sick so to that and but but all
to feel sick so to that and but but all
to feel sick so to that and but but all this is proving is that you’ve learnt
this is proving is that you’ve learnt
this is proving is that you’ve learnt some specific things over time you have
some specific things over time you have
some specific things over time you have you know decade’s worth of experience to
you know decade’s worth of experience to
you know decade’s worth of experience to say what a face which should look like
say what a face which should look like
say what a face which should look like and when it doesn’t look like that you
and when it doesn’t look like that you
and when it doesn’t look like that you don’t quite know how to process it and
don’t quite know how to process it and
don’t quite know how to process it and we can get it wrong no humans are
we can get it wrong no humans are
we can get it wrong no humans are completely infallible fallible sorry
completely infallible fallible sorry
completely infallible fallible sorry they’re wrong choice of words they’re
they’re wrong choice of words they’re
they’re wrong choice of words they’re completely fallible ok so moving on to
completely fallible ok so moving on to
completely fallible ok so moving on to the more technical topics here machine
the more technical topics here machine
the more technical topics here machine learning comprises a four-ish sort of
learning comprises a four-ish sort of
learning comprises a four-ish sort of distinct components they’re all trying
distinct components they’re all trying
distinct components they’re all trying to do slightly separate different things
to do slightly separate different things
to do slightly separate different things the first item is dimensionality
the first item is dimensionality
the first item is dimensionality reduction so when we think of data it
reduction so when we think of data it
reduction so when we think of data it has a number of dimensions and by
has a number of dimensions and by
has a number of dimensions and by dimensions are basically mean like a
dimensions are basically mean like a
dimensions are basically mean like a single point of information so if you
single point of information so if you
single point of information so if you imagine a 10 by 10 grayscale picture
imagine a 10 by 10 grayscale picture
imagine a 10 by 10 grayscale picture that has like a hundred dimensions a
that has like a hundred dimensions a
that has like a hundred dimensions a hundred pixels in there which all
hundred pixels in there which all
hundred pixels in there which all represent a distinct piece of data the
represent a distinct piece of data the
represent a distinct piece of data the problem with that is that with images
problem with that is that with images
problem with that is that with images it’s ok but for many other types of data
it’s ok but for many other types of data
it’s ok but for many other types of data it’s really hard to try and visualize
it’s really hard to try and visualize
it’s really hard to try and visualize what’s going on so you’ve got to
what’s going on so you’ve got to
what’s going on so you’ve got to compress that space down into two or
compress that space down into two or
compress that space down into two or three dimensions in order to actually
three dimensions in order to actually
three dimensions in order to actually see what’s going on so that’s the act of
see what’s going on so that’s the act of
see what’s going on so that’s the act of dimensionality reduction we’ve got
dimensionality reduction we’ve got
dimensionality reduction we’ve got clustering where we’re trying to assign
clustering where we’re trying to assign
clustering where we’re trying to assign an output to a certain class
an output to a certain class
an output to a certain class quite often we know what class it should
quite often we know what class it should
quite often we know what class it should belong to or at least we should know how
belong to or at least we should know how
belong to or at least we should know how many classes there are at least so
many classes there are at least so
many classes there are at least so clustering is the process of trying to
clustering is the process of trying to
clustering is the process of trying to group things together into distinct
group things together into distinct
group things together into distinct classes we’ve got classification which
classes we’ve got classification which
classes we’ve got classification which is linked to clustering where that’s
is linked to clustering where that’s
is linked to clustering where that’s more asking the question exactly where
more asking the question exactly where
more asking the question exactly where do I put the line to say that’s Class A
do I put the line to say that’s Class A
do I put the line to say that’s Class A and that’s Class B and finally
and that’s Class B and finally
and that’s Class B and finally regression which is trying to predict a
regression which is trying to predict a
regression which is trying to predict a value based upon their previous inputs
value based upon their previous inputs
value based upon their previous inputs we’ve also got different types of
we’ve also got different types of
we’ve also got different types of learning as well learning is the key
learning as well learning is the key
learning as well learning is the key thing that’s this really enabled deep
thing that’s this really enabled deep
thing that’s this really enabled deep learning to to come to the forefront is
learning to to come to the forefront is
learning to to come to the forefront is that the new training techniques that
that the new training techniques that
that the new training techniques that have been developed are so much more
have been developed are so much more
have been developed are so much more powerful than they were in the past
powerful than they were in the past
powerful than they were in the past training can be split into supervised
training can be split into supervised
training can be split into supervised and unsupervised learning supervised
and unsupervised learning supervised
and unsupervised learning supervised learning is where you have an expected
learning is where you have an expected
learning is where you have an expected result so it’s a it’s labeled so you say
result so it’s a it’s labeled so you say
result so it’s a it’s labeled so you say that this raw data is supposed to belong
that this raw data is supposed to belong
that this raw data is supposed to belong to Class A this is supposed to be the
to Class A this is supposed to be the
to Class A this is supposed to be the number one or this person is fraudulent
the algorithm is then trained the
the algorithm is then trained the parameters of the algorithm and then
parameters of the algorithm and then
parameters of the algorithm and then tuned to try and produce that same
tuned to try and produce that same
tuned to try and produce that same result and the the measure of
result and the the measure of
result and the the measure of performance for that algorithm is
performance for that algorithm is
performance for that algorithm is compared to the true result versus the
compared to the true result versus the
compared to the true result versus the predicted Frizzle and then when you were
predicted Frizzle and then when you were
predicted Frizzle and then when you were to use this in in real life if you had
to use this in in real life if you had
to use this in in real life if you had new data coming in then you would use
new data coming in then you would use
new data coming in then you would use those pre learnt weights and you would
those pre learnt weights and you would
those pre learnt weights and you would predict an output based upon that for
predict an output based upon that for
predict an output based upon that for unsupervised
unsupervised
unsupervised you’ve got no results so you don’t know
you’ve got no results so you don’t know
you’ve got no results so you don’t know exactly what class it’s supposed to
exactly what class it’s supposed to
exactly what class it’s supposed to belong to algorithms are trained in you
belong to algorithms are trained in you
belong to algorithms are trained in you need to decide on on what’s going to
need to decide on on what’s going to
need to decide on on what’s going to provide you with a measure of how good
provide you with a measure of how good
provide you with a measure of how good your algorithms be trained so some some
your algorithms be trained so some some
your algorithms be trained so some some of them deciding whether data are close
of them deciding whether data are close
of them deciding whether data are close or far away so since this measure of
or far away so since this measure of
or far away so since this measure of distance between data the there’s also
distance between data the there’s also
distance between data the there’s also may be other reasons why you want to do
may be other reasons why you want to do
may be other reasons why you want to do it as well and you can provide your own
it as well and you can provide your own
it as well and you can provide your own we’re talking about
we’re talking about
we’re talking about customized or personalized customized
customized or personalized customized
customized or personalized customized functions to actually cost whether your
functions to actually cost whether your
functions to actually cost whether your output is going to be labeled as class 1
output is going to be labeled as class 1
output is going to be labeled as class 1 or class 2 if something is important but
or class 2 if something is important but
or class 2 if something is important but in the real in the real world most data
in the real in the real world most data
in the real in the real world most data is usually semi-supervised
is usually semi-supervised
is usually semi-supervised you usually start off with some label
you usually start off with some label
you usually start off with some label data and usually a lot more that is
data and usually a lot more that is
data and usually a lot more that is unlabeled so you can kind of combine
unlabeled so you can kind of combine
unlabeled so you can kind of combine these two things together to maybe you
these two things together to maybe you
these two things together to maybe you can use the labeled stuff to start to
can use the labeled stuff to start to
can use the labeled stuff to start to bring out some of the clusters and then
bring out some of the clusters and then
bring out some of the clusters and then apply the unlabeled data to you know
apply the unlabeled data to you know
apply the unlabeled data to you know really filling the pattern a bit more so
really filling the pattern a bit more so
really filling the pattern a bit more so let’s talk about some specific
let’s talk about some specific
let’s talk about some specific algorithms I’m going to talk about to
algorithms I’m going to talk about to
algorithms I’m going to talk about to every every guy’s got his own favorite
every every guy’s got his own favorite
every every guy’s got his own favorite algorithm this first one is called a
algorithm this first one is called a
algorithm this first one is called a decision tree and there’s various
decision tree and there’s various
decision tree and there’s various different types of decision tree but
different types of decision tree but
different types of decision tree but we’re going to stick to the simple one
we’re going to stick to the simple one
we’re going to stick to the simple one for now and they can be used for
for now and they can be used for
for now and they can be used for classification and regression and the
classification and regression and the
classification and regression and the idea is that they predict the target of
idea is that they predict the target of
idea is that they predict the target of the target value of a class or a value
the target value of a class or a value
the target value of a class or a value or something based upon some very simple
or something based upon some very simple
or something based upon some very simple decision rules so is it less than 10 or
decision rules so is it less than 10 or
decision rules so is it less than 10 or bigger than 10 is it is it labeled a or
bigger than 10 is it is it labeled a or
bigger than 10 is it is it labeled a or labeled B the example we’ve got there on
labeled B the example we’ve got there on
labeled B the example we’ve got there on the right is quite morbid actually this
the right is quite morbid actually this
the right is quite morbid actually this is a decision tree that’s been learned
is a decision tree that’s been learned
is a decision tree that’s been learned from the data provided from the Titanic
from the data provided from the Titanic
from the data provided from the Titanic manifests and this is predicting whether
manifests and this is predicting whether
manifests and this is predicting whether you’re going to survive if you were on
you’re going to survive if you were on
you’re going to survive if you were on the Titanic or not so the first question
the Titanic or not so the first question
the Titanic or not so the first question it asks is is the sex male so if it was
it asks is is the sex male so if it was
it asks is is the sex male so if it was yes then it goes down to one side of the
yes then it goes down to one side of the
yes then it goes down to one side of the tree on the Left if it was no it goes
tree on the Left if it was no it goes
tree on the Left if it was no it goes down the right side of the tree so if
down the right side of the tree so if
down the right side of the tree so if you were female you had a pretty good
you were female you had a pretty good
you were female you had a pretty good chance of 0.73 so 73% chance of
chance of 0.73 so 73% chance of
chance of 0.73 so 73% chance of surviving and that represents 36% of the
surviving and that represents 36% of the
surviving and that represents 36% of the entire population inside the Titanic or
entire population inside the Titanic or
entire population inside the Titanic or as if you were male and if you were
as if you were male and if you were
as if you were male and if you were above 9.5 then you’ve got a fairly big
above 9.5 then you’ve got a fairly big
above 9.5 then you’ve got a fairly big chance that you’re going to die
chance that you’re going to die
chance that you’re going to die unfortunately 61% of all males of a 9.5
unfortunately 61% of all males of a 9.5
unfortunately 61% of all males of a 9.5 died and you can see that you can go
died and you can see that you can go
died and you can see that you can go down the tree and you can make a
down the tree and you can make a
down the tree and you can make a decision based upon these rules so the
decision based upon these rules so the
decision based upon these rules so the idea of the algorithm is to train these
idea of the algorithm is to train these
idea of the algorithm is to train these parameters these rules these decision
parameters these rules these decision
parameters these rules these decision points to optimally make the right
points to optimally make the right
points to optimally make the right decision
decision
decision so it’s conceptually quite simple it can
so it’s conceptually quite simple it can
so it’s conceptually quite simple it can handle categorical data which is great
handle categorical data which is great
handle categorical data which is great because some algorithms can’t but it
because some algorithms can’t but it
because some algorithms can’t but it well decision trees specifically can
well decision trees specifically can
well decision trees specifically can ooph it quite badly but there are lots
ooph it quite badly but there are lots
ooph it quite badly but there are lots of methods
of methods
of methods to to use decision trees in a different
to to use decision trees in a different
to to use decision trees in a different way to prevent the overfitting so don’t
way to prevent the overfitting so don’t
way to prevent the overfitting so don’t worry about that too much and decision
worry about that too much and decision
worry about that too much and decision trees are usually one of the simplest
trees are usually one of the simplest
trees are usually one of the simplest and sometimes effective enough to solve
and sometimes effective enough to solve
and sometimes effective enough to solve a problem the next algorithm and what’s
a problem the next algorithm and what’s
a problem the next algorithm and what’s surrounded by lots of hype at the moment
surrounded by lots of hype at the moment
surrounded by lots of hype at the moment is deep learning so deep learning is
is deep learning so deep learning is
is deep learning so deep learning is it’s really good because you remember
it’s really good because you remember
it’s really good because you remember those classes of types of algorithms at
those classes of types of algorithms at
those classes of types of algorithms at the start there he actually does all of
the start there he actually does all of
the start there he actually does all of them he does the dimensionality
them he does the dimensionality
them he does the dimensionality reduction the classification the
reduction the classification the
reduction the classification the regression and the clustering it could
regression and the clustering it could
regression and the clustering it could do all of it it’s a holy grail of
do all of it it’s a holy grail of
do all of it it’s a holy grail of algorithms no other algorithm can
algorithms no other algorithm can
algorithms no other algorithm can actually do all the same things the idea
actually do all the same things the idea
actually do all the same things the idea is that it’s actually trying to model
is that it’s actually trying to model
is that it’s actually trying to model our learning process in our brain
our learning process in our brain
our learning process in our brain basically it seems to model the neurons
basically it seems to model the neurons
basically it seems to model the neurons and the synapses in your brain to do the
and the synapses in your brain to do the
and the synapses in your brain to do the similar sort of tasks it’s it’s
similar sort of tasks it’s it’s
similar sort of tasks it’s it’s simplified somewhat but that’s that’s
simplified somewhat but that’s that’s
simplified somewhat but that’s that’s the general idea so the hope here is
the general idea so the hope here is
the general idea so the hope here is that if we can produce a model that of
that if we can produce a model that of
that if we can produce a model that of our brain that then we can merit right
our brain that then we can merit right
our brain that then we can merit right algorithms to perform things that our
algorithms to perform things that our
algorithms to perform things that our brain can do quite easily like
brain can do quite easily like
brain can do quite easily like recognition classification things like
recognition classification things like
recognition classification things like that so the pros and cons again it’s
that so the pros and cons again it’s
that so the pros and cons again it’s very versatile can be used for lots of
very versatile can be used for lots of
very versatile can be used for lots of different tasks
different tasks
different tasks the key improvement really is that it
the key improvement really is that it
the key improvement really is that it begins to remove the requirement of
begins to remove the requirement of
begins to remove the requirement of feature engineering so with all of the
feature engineering so with all of the
feature engineering so with all of the other algorithms your algorithm will
other algorithms your algorithm will
other algorithms your algorithm will live or die based upon what features you
live or die based upon what features you
live or die based upon what features you give the input you need to work really
give the input you need to work really
give the input you need to work really hard with other algorithms to to say
hard with other algorithms to to say
hard with other algorithms to to say that this is the most important feature
that this is the most important feature
that this is the most important feature I’m going to keep that and use that but
I’m going to keep that and use that but
I’m going to keep that and use that but those are the ones are completely
those are the ones are completely
those are the ones are completely redundant I’m going to remove them and
redundant I’m going to remove them and
redundant I’m going to remove them and that takes a significant amount of time
that takes a significant amount of time
that takes a significant amount of time with deep learning it has the ability of
with deep learning it has the ability of
with deep learning it has the ability of internally during the training stage of
internally during the training stage of
internally during the training stage of either completely removing parameters or
either completely removing parameters or
either completely removing parameters or completely keeping parameters purely
completely keeping parameters purely
completely keeping parameters purely based upon how well it fits the data how
based upon how well it fits the data how
based upon how well it fits the data how well the training process goes so it
well the training process goes so it
well the training process goes so it removes the bias that comes from
removes the bias that comes from
removes the bias that comes from removing data or adding data that you’re
removing data or adding data that you’re
removing data or adding data that you’re not sure it should be there or not the
not sure it should be there or not the
not sure it should be there or not the the main con actually there’s a suppose
the main con actually there’s a suppose
the main con actually there’s a suppose there’s a couple of cons the biggest one
there’s a couple of cons the biggest one
there’s a couple of cons the biggest one is it can be hard to visualize as soon
is it can be hard to visualize as soon
is it can be hard to visualize as soon as you start getting into
as you start getting into
as you start getting into neural network sizes that are quite deep
neural network sizes that are quite deep
neural network sizes that are quite deep it can be quite hard to visualize and
it can be quite hard to visualize and
it can be quite hard to visualize and conceptualize I’m hopefully going to try
conceptualize I’m hopefully going to try
conceptualize I’m hopefully going to try and prove that wrong in a little bit but
and prove that wrong in a little bit but
and prove that wrong in a little bit but um that’s that’s the problem number one
um that’s that’s the problem number one
um that’s that’s the problem number one and problem number two can be quite
and problem number two can be quite
and problem number two can be quite computationally expensive but that’s
computationally expensive but that’s
computationally expensive but that’s that’s true for kind of lots of these
that’s true for kind of lots of these
that’s true for kind of lots of these algorithms really so how do they
algorithms really so how do they
algorithms really so how do they actually work well they all it works
actually work well they all it works
actually work well they all it works primarily by trying to conceptualize
primarily by trying to conceptualize
primarily by trying to conceptualize things so there’s this idea that that
things so there’s this idea that that
things so there’s this idea that that neural networks are acting like a
neural networks are acting like a
neural networks are acting like a hierarchy of of concepts and the the
hierarchy of of concepts and the the
hierarchy of of concepts and the the whole goal really is to take those
whole goal really is to take those
whole goal really is to take those images also take your data and produce a
images also take your data and produce a
images also take your data and produce a concept something that accurately
concept something that accurately
concept something that accurately describes what is provided at the input
describes what is provided at the input
describes what is provided at the input so we’ve got the couple of the concepts
so we’ve got the couple of the concepts
so we’ve got the couple of the concepts on the left there we’ve got a street an
on the left there we’ve got a street an
on the left there we’ve got a street an animal and a person but you can see that
animal and a person but you can see that
animal and a person but you can see that you don’t
you don’t
you don’t the to the bottom ones the person and
the to the bottom ones the person and
the to the bottom ones the person and the animal there they’re actually linked
the animal there they’re actually linked
the animal there they’re actually linked by another concept you know they’re both
by another concept you know they’re both
by another concept you know they’re both animals is just one of them’s human so
animals is just one of them’s human so
animals is just one of them’s human so the great thing about the delayering
the great thing about the delayering
the great thing about the delayering concept is that you can actually start
concept is that you can actually start
concept is that you can actually start to tag things that are similar but not
to tag things that are similar but not
to tag things that are similar but not quite the same based upon your training
quite the same based upon your training
quite the same based upon your training data so to be more specific this says is
data so to be more specific this says is
data so to be more specific this says is a an example of how you would go about
a an example of how you would go about
a an example of how you would go about conceptualizing an image so each pixel
conceptualizing an image so each pixel
conceptualizing an image so each pixel within the image that’s the dashed lines
within the image that’s the dashed lines
within the image that’s the dashed lines there that would be passed into the
there that would be passed into the
there that would be passed into the input of our deep learning and it would
input of our deep learning and it would
input of our deep learning and it would start to reduce concepts around those
start to reduce concepts around those
start to reduce concepts around those pixels so the first layer might decide
pixels so the first layer might decide
pixels so the first layer might decide that there’s a you know part of a tire
that there’s a you know part of a tire
that there’s a you know part of a tire or a pile of a rim or an end plate or
or a pile of a rim or an end plate or
or a pile of a rim or an end plate or something like that usually very small
something like that usually very small
something like that usually very small discreet kind of local things within the
discreet kind of local things within the
discreet kind of local things within the image the next layer might start to
image the next layer might start to
image the next layer might start to build in that concept and build a
build in that concept and build a
build in that concept and build a concept of a tire or a full wing or a
concept of a tire or a full wing or a
concept of a tire or a full wing or a real wing and then finally we get to the
real wing and then finally we get to the
real wing and then finally we get to the classification and in this case is an f1
classification and in this case is an f1
classification and in this case is an f1 car but you can imagine that if you then
car but you can imagine that if you then
car but you can imagine that if you then showed the algorithm a normal car it
showed the algorithm a normal car it
showed the algorithm a normal car it could reuse some of those concepts they
could reuse some of those concepts they
could reuse some of those concepts they all they still have wheels they still
all they still have wheels they still
all they still have wheels they still have you know cockpits or our bodies
have you know cockpits or our bodies
have you know cockpits or our bodies away probably don’t have wings I don’t
away probably don’t have wings I don’t
away probably don’t have wings I don’t know maybe maybe in Leeds I don’t don’t
know maybe maybe in Leeds I don’t don’t
know maybe maybe in Leeds I don’t don’t about Denmark
about Denmark
about Denmark but you can reuse some of these concepts
but you can reuse some of these concepts
but you can reuse some of these concepts and that kind of shows the applicability
and that kind of shows the applicability
and that kind of shows the applicability to not just not just problems that it’s
to not just not just problems that it’s
to not just not just problems that it’s already seen but also future problems
already seen but also future problems
already seen but also future problems that it hasn’t seen and so just to
that it hasn’t seen and so just to
that it hasn’t seen and so just to finish this section off really just
finish this section off really just
finish this section off really just machine learning in the news or deep
machine learning in the news or deep
machine learning in the news or deep learning in them in the news the the one
learning in them in the news the the one
learning in them in the news the the one I really like that’s accessible to
I really like that’s accessible to
I really like that’s accessible to anybody really is the Google the new
anybody really is the Google the new
anybody really is the Google the new Google Translate app that takes pictures
Google Translate app that takes pictures
Google Translate app that takes pictures of signs or text in a different language
of signs or text in a different language
of signs or text in a different language and it translates that text but the real
and it translates that text but the real
and it translates that text but the real the cool USP of the whole thing is that
the cool USP of the whole thing is that
the cool USP of the whole thing is that it actually takes the image and replaces
it actually takes the image and replaces
it actually takes the image and replaces the image with the correct text in your
the image with the correct text in your
the image with the correct text in your language so here we’ve got a Russian
language so here we’ve got a Russian
language so here we’ve got a Russian sign and it’s replaced it with the
sign and it’s replaced it with the
sign and it’s replaced it with the English here actually I say he says
English here actually I say he says
English here actually I say he says access the city but according to my
access the city but according to my
access the city but according to my friend who who speaks Russian it
friend who who speaks Russian it
friend who who speaks Russian it actually means exit to village so not
actually means exit to village so not
actually means exit to village so not access to City exit to village but it’s
access to City exit to village but it’s
access to City exit to village but it’s not quite as grandiose if we showed if
not quite as grandiose if we showed if
not quite as grandiose if we showed if Google showed us science and exit to
Google showed us science and exit to
Google showed us science and exit to village so it’s probably why they
village so it’s probably why they
village so it’s probably why they changed it and then we’ve got the the
changed it and then we’ve got the the
changed it and then we’ve got the the images at the bottom and this is a new
images at the bottom and this is a new
images at the bottom and this is a new chip developed by IBM it’s been a few
chip developed by IBM it’s been a few
chip developed by IBM it’s been a few years in the making actually but
years in the making actually but
years in the making actually but effectively it’s a a deep learning
effectively it’s a a deep learning
effectively it’s a a deep learning neural network type infrastructure
neural network type infrastructure
neural network type infrastructure inside a chip so obviously you’ve got
inside a chip so obviously you’ve got
inside a chip so obviously you’ve got the cause and you used to the cause
the cause and you used to the cause
the cause and you used to the cause imagine the cause parallelized massively
imagine the cause parallelized massively
imagine the cause parallelized massively so instead of having you know one call
so instead of having you know one call
so instead of having you know one call we’ve got tens of thousands in this case
we’ve got tens of thousands in this case
we’ve got tens of thousands in this case is actually a million there’s a a
is actually a million there’s a a
is actually a million there’s a a million neurons in this chip so it’s
million neurons in this chip so it’s
million neurons in this chip so it’s able to do a million parallel tasks all
able to do a million parallel tasks all
able to do a million parallel tasks all at the same time and when we go through
at the same time and when we go through
at the same time and when we go through some of the examples in in a minute
some of the examples in in a minute
some of the examples in in a minute we’re going to be talking about like
we’re going to be talking about like
we’re going to be talking about like image sizes like they’re 10 10 by 10 100
image sizes like they’re 10 10 by 10 100
image sizes like they’re 10 10 by 10 100 input pixels that go down to maybe 2 to
input pixels that go down to maybe 2 to
input pixels that go down to maybe 2 to 2 outputs on there 2 dimensions on the
2 outputs on there 2 dimensions on the
2 outputs on there 2 dimensions on the output so that’s kind of nothing in
output so that’s kind of nothing in
output so that’s kind of nothing in comparison to what this could do and
comparison to what this could do and
comparison to what this could do and this is actually in hardware as well so
this is actually in hardware as well so
this is actually in hardware as well so it’s super fast super low power and
it’s super fast super low power and
it’s super fast super low power and should produce some really interesting
should produce some really interesting
should produce some really interesting applications ok so it’s just to solidify
applications ok so it’s just to solidify
applications ok so it’s just to solidify the howdy learning works I’m going to
the howdy learning works I’m going to
the howdy learning works I’m going to take you through an example which is a
take you through an example which is a
take you through an example which is a description
description
description of some some numbers here so the the
of some some numbers here so the the
of some some numbers here so the the idea of this task is to recognize some
idea of this task is to recognize some
idea of this task is to recognize some handwritten digits and to classify them
handwritten digits and to classify them
handwritten digits and to classify them as a number from 0 to 9 so it’s a really
as a number from 0 to 9 so it’s a really
as a number from 0 to 9 so it’s a really classic here machine learning example
classic here machine learning example
classic here machine learning example but it’s really great to use in the
but it’s really great to use in the
but it’s really great to use in the example as an example because it’s very
example as an example because it’s very
example as an example because it’s very easy to understand very very easy for
easy to understand very very easy for
easy to understand very very easy for everybody to understand it’s just trying
everybody to understand it’s just trying
everybody to understand it’s just trying to recognize what that number is and the
to recognize what that number is and the
to recognize what that number is and the first thing we notice when we start
first thing we notice when we start
first thing we notice when we start looking at the data so the first step in
looking at the data so the first step in
looking at the data so the first step in any in any data analysis job is to have
any in any data analysis job is to have
any in any data analysis job is to have a look at the data and the first thing
a look at the data and the first thing
a look at the data and the first thing we notice is that if you actually if you
we notice is that if you actually if you
we notice is that if you actually if you look at that that top left number there
look at that that top left number there
look at that that top left number there so I’m not not completely sure whether
so I’m not not completely sure whether
so I’m not not completely sure whether that’s a 5 or a that’s 3 and this
that’s a 5 or a that’s 3 and this
that’s a 5 or a that’s 3 and this immediately brings problems because this
immediately brings problems because this
immediately brings problems because this data is actually labeled so every one of
data is actually labeled so every one of
data is actually labeled so every one of these examples you’ll see so each each
these examples you’ll see so each each
these examples you’ll see so each each number is an example you can see that
number is an example you can see that
number is an example you can see that it’s been inverted from maybe you’ve
it’s been inverted from maybe you’ve
it’s been inverted from maybe you’ve somebody written pen on white paper and
somebody written pen on white paper and
somebody written pen on white paper and it’s being inverted and then reduced to
it’s being inverted and then reduced to
it’s being inverted and then reduced to a fixed pixel size and then sent it as
a fixed pixel size and then sent it as
a fixed pixel size and then sent it as well and the first thing that we can see
well and the first thing that we can see
well and the first thing that we can see is we’re already not sure whether that’s
is we’re already not sure whether that’s
is we’re already not sure whether that’s a 3 or a 5 and so somebody’s gone
a 3 or a 5 and so somebody’s gone
a 3 or a 5 and so somebody’s gone through and labeled this data as being a
through and labeled this data as being a
through and labeled this data as being a 3 or a 5 but I’m not convinced that
3 or a 5 but I’m not convinced that
3 or a 5 but I’m not convinced that that’s actually correct so we’re giving
that’s actually correct so we’re giving
that’s actually correct so we’re giving our algorithm potentially dodgy data
our algorithm potentially dodgy data
our algorithm potentially dodgy data already so there are in mind whenever
already so there are in mind whenever
already so there are in mind whenever you’re trying to train data that your
you’re trying to train data that your
you’re trying to train data that your your label data might not be right in
your label data might not be right in
your label data might not be right in the first place because it’s usually
the first place because it’s usually
the first place because it’s usually it’s usually labeled by by humans so
it’s usually labeled by by humans so
it’s usually labeled by by humans so what we then do with each example is we
what we then do with each example is we
what we then do with each example is we feed it into an input layer so I’m
feed it into an input layer so I’m
feed it into an input layer so I’m trying to stay away from the term neural
trying to stay away from the term neural
trying to stay away from the term neural network although I’ve mentioned it a
network although I’ve mentioned it a
network although I’ve mentioned it a couple of times because that it’s been
couple of times because that it’s been
couple of times because that it’s been around since the 80s but it it sounds
around since the 80s but it it sounds
around since the 80s but it it sounds complicated but it’s really not all the
complicated but it’s really not all the
complicated but it’s really not all the neural network is you have a node where
neural network is you have a node where
neural network is you have a node where some data goes in and then you have have
some data goes in and then you have have
some data goes in and then you have have links to an annexe subset of nodes and
links to an annexe subset of nodes and
links to an annexe subset of nodes and those are those links all have weights
those are those links all have weights
those are those links all have weights that it’s as simple as that all we do is
that it’s as simple as that all we do is
that it’s as simple as that all we do is we alter the weights within the the
we alter the weights within the the
we alter the weights within the the network in order to perform a task so
network in order to perform a task so
network in order to perform a task so I’ll try and refrain from using that
I’ll try and refrain from using that
I’ll try and refrain from using that terminology so our input layer is
terminology so our input layer is
terminology so our input layer is usually the same size as the size of the
usually the same size as the size of the
usually the same size as the size of the data so here we’ve got made maybe 10 by
data so here we’ve got made maybe 10 by
data so here we’ve got made maybe 10 by 10 pixels so we’ve got 100 inputs
10 pixels so we’ve got 100 inputs
10 pixels so we’ve got 100 inputs have one input for each pixel we then
have one input for each pixel we then
have one input for each pixel we then pass that data through to what’s known
pass that data through to what’s known
pass that data through to what’s known as a hidden layer and we call it hidden
as a hidden layer and we call it hidden
as a hidden layer and we call it hidden layer a bit basically because it’s not
layer a bit basically because it’s not
layer a bit basically because it’s not an input or an output it’s something in
an input or an output it’s something in
an input or an output it’s something in the middle it’s not directly observable
the middle it’s not directly observable
the middle it’s not directly observable and the way in which they’re connected
and the way in which they’re connected
and the way in which they’re connected is with a weight and during the training
is with a weight and during the training
is with a weight and during the training process those weights could be you know
process those weights could be you know
process those weights could be you know completely removed by setting it to zero
completely removed by setting it to zero
completely removed by setting it to zero or you know completely kept by sitting
or you know completely kept by sitting
or you know completely kept by sitting it’s all one and that’s all the training
it’s all one and that’s all the training
it’s all one and that’s all the training process is doing so what’s really great
process is doing so what’s really great
process is doing so what’s really great at this point is that those weights
at this point is that those weights
at this point is that those weights actually they combine in the next layer
actually they combine in the next layer
actually they combine in the next layer so you might have learnt that the
so you might have learnt that the
so you might have learnt that the weights that have been learned for that
weights that have been learned for that
weights that have been learned for that one particular neuron in the hidden
one particular neuron in the hidden
one particular neuron in the hidden layer can actually be treated as like a
layer can actually be treated as like a
layer can actually be treated as like a feature this is this is the beginnings
feature this is this is the beginnings
feature this is this is the beginnings of a concept so it’s saying that given
of a concept so it’s saying that given
of a concept so it’s saying that given that one neuron that one item in the
that one neuron that one item in the
that one neuron that one item in the hidden layer there that has that has
hidden layer there that has that has
hidden layer there that has that has certain weights on each of the input
certain weights on each of the input
certain weights on each of the input pixels so if we if if we were to make
pixels so if we if if we were to make
pixels so if we if if we were to make that the output layer there we could
that the output layer there we could
that the output layer there we could imagine that if that was the the output
imagine that if that was the the output
imagine that if that was the the output layer for the number one the weights
layer for the number one the weights
layer for the number one the weights would represent a shape that looks
would represent a shape that looks
would represent a shape that looks something like the number one generally
something like the number one generally
something like the number one generally in hidden layers you have multiple
in hidden layers you have multiple
in hidden layers you have multiple hidden layers so you’re trying to get
hidden layers so you’re trying to get
hidden layers so you’re trying to get the algorithm to learn these small steps
the algorithm to learn these small steps
the algorithm to learn these small steps these small increments of of concept and
these small increments of of concept and
these small increments of of concept and what we can actually do is to say that
what we can actually do is to say that
what we can actually do is to say that for for that one hidden layer we can go
for for that one hidden layer we can go
for for that one hidden layer we can go back and say what does the input layer
back and say what does the input layer
back and say what does the input layer have to look like in order to fully
have to look like in order to fully
have to look like in order to fully activate that one neuron and only that
activate that one neuron and only that
activate that one neuron and only that one neuron so this is an example of that
one neuron so this is an example of that
one neuron so this is an example of that hidden feature layer here and it might
hidden feature layer here and it might
hidden feature layer here and it might look a bit abstract but you you can just
look a bit abstract but you you can just
look a bit abstract but you you can just about start to make out that it’s
about start to make out that it’s
about start to make out that it’s starting to learn this kind of ghostly
starting to learn this kind of ghostly
starting to learn this kind of ghostly images of numbers in there and that’s
images of numbers in there and that’s
images of numbers in there and that’s because it’s starting to learn some of
because it’s starting to learn some of
because it’s starting to learn some of these concepts if you were to use a
these concepts if you were to use a
these concepts if you were to use a number of hidden layers and say you know
number of hidden layers and say you know
number of hidden layers and say you know don’t don’t try and learn the number all
don’t don’t try and learn the number all
don’t don’t try and learn the number all in one go it might come up with features
in one go it might come up with features
in one go it might come up with features that are like edges maybe it could learn
that are like edges maybe it could learn
that are like edges maybe it could learn the edge of the stick of a7 or maybe you
the edge of the stick of a7 or maybe you
the edge of the stick of a7 or maybe you can start to learn some curves of a nine
can start to learn some curves of a nine
can start to learn some curves of a nine or something like that and these are the
or something like that and these are the
or something like that and these are the hidden features that are in the middle
hidden features that are in the middle
hidden features that are in the middle of all these these networks
of all these these networks
of all these these networks so then finally we would produce an
so then finally we would produce an
so then finally we would produce an output layer which usually amounts to
output layer which usually amounts to
output layer which usually amounts to the number of possible classifications
the number of possible classifications
the number of possible classifications that we want to make so for our output
that we want to make so for our output
that we want to make so for our output layer we would have 10 we would have 0
layer we would have 10 we would have 0
layer we would have 10 we would have 0 to 9 and each one of those nodes would
to 9 and each one of those nodes would
to 9 and each one of those nodes would represent a number and at the output
represent a number and at the output
represent a number and at the output layer if we were to actually put one of
layer if we were to actually put one of
layer if we were to actually put one of these examples in you’d never get 100%
these examples in you’d never get 100%
these examples in you’d never get 100% you always get this the we’re talking
you always get this the we’re talking
you always get this the we’re talking earlier about how they’re they’re not
earlier about how they’re they’re not
earlier about how they’re they’re not deterministic but you kind of they are
deterministic but you kind of they are
deterministic but you kind of they are deterministic in the sense that they
deterministic in the sense that they
deterministic in the sense that they have fixed weight so you can follow the
have fixed weight so you can follow the
have fixed weight so you can follow the path of those weights through the data
path of those weights through the data
path of those weights through the data however we’re never quite sure like
however we’re never quite sure like
however we’re never quite sure like going back to that previous example
going back to that previous example
going back to that previous example we’re never quite sure whether it’s a 5
we’re never quite sure whether it’s a 5
we’re never quite sure whether it’s a 5 or a 3 so we’re going to the algorithm
or a 3 so we’re going to the algorithm
or a 3 so we’re going to the algorithm will probably decide that I’m 50 percent
will probably decide that I’m 50 percent
will probably decide that I’m 50 percent sure that it’s a 5 but there’s a 40%
sure that it’s a 5 but there’s a 40%
sure that it’s a 5 but there’s a 40% chance there could be a 3 so all of the
chance there could be a 3 so all of the
chance there could be a 3 so all of the numbers that are generated basically the
numbers that are generated basically the
numbers that are generated basically the the classification is made by picking
the classification is made by picking
the classification is made by picking the highest of those numbers so in this
the highest of those numbers so in this
the highest of those numbers so in this case would say that the 5 is the
case would say that the 5 is the
case would say that the 5 is the classification for this example because
classification for this example because
classification for this example because that add the highest value at the output
but what’s really cool as well is that
but what’s really cool as well is that we can actually rather than try and tell
we can actually rather than try and tell
we can actually rather than try and tell it to classify the objects by only
it to classify the objects by only
it to classify the objects by only having 10 outputs we can actually
having 10 outputs we can actually
having 10 outputs we can actually produce the same number of outputs and
produce the same number of outputs and
produce the same number of outputs and inputs and say ask the algorithm please
inputs and say ask the algorithm please
inputs and say ask the algorithm please try and reconstruct the image based upon
try and reconstruct the image based upon
try and reconstruct the image based upon your hidden you know concepts and
your hidden you know concepts and
your hidden you know concepts and representations so what we can do here
representations so what we can do here
representations so what we can do here is given a certain output please reduce
is given a certain output please reduce
is given a certain output please reduce reproduce that input and then we could
reproduce that input and then we could
reproduce that input and then we could do some comparison to see how well it’s
do some comparison to see how well it’s
do some comparison to see how well it’s performed so this is an example of what
performed so this is an example of what
performed so this is an example of what a reconstruction actually looks like and
a reconstruction actually looks like and
a reconstruction actually looks like and if I just flick backwards or forwards
if I just flick backwards or forwards
if I just flick backwards or forwards between what was real what was the real
between what was real what was the real
between what was real what was the real input and what was the learned concepts
input and what was the learned concepts
input and what was the learned concepts about that you can kind of see that the
about that you can kind of see that the
about that you can kind of see that the learned concepts are kind of like a
learned concepts are kind of like a
learned concepts are kind of like a drunk blurred version of the real number
drunk blurred version of the real number
drunk blurred version of the real number and that’s because they’re kind of
and that’s because they’re kind of
and that’s because they’re kind of learning they did what the most likely
learning they did what the most likely
learning they did what the most likely look is for that particular number and
look is for that particular number and
look is for that particular number and and what’s really interesting is in the
and what’s really interesting is in the
and what’s really interesting is in the real data with what we won’t show
real data with what we won’t show
real data with what we won’t show whether that’s 3 or 5 but if you look at
whether that’s 3 or 5 but if you look at
whether that’s 3 or 5 but if you look at the drunk verse
the drunk verse
the drunk verse it actually looks a little bit more than
it actually looks a little bit more than
it actually looks a little bit more than a five and this is saying that the
a five and this is saying that the
a five and this is saying that the algorithm was decided um well but it’s
algorithm was decided um well but it’s
algorithm was decided um well but it’s probably been labeled as a five so that
probably been labeled as a five so that
probably been labeled as a five so that so the algorithm has has learnt that of
so the algorithm has has learnt that of
so the algorithm has has learnt that of those features as a five so when you try
those features as a five so when you try
those features as a five so when you try and reconstruct it it looks more like a
and reconstruct it it looks more like a
and reconstruct it it looks more like a five and then finally we talked about
five and then finally we talked about
five and then finally we talked about dimensionality reduction so what we can
dimensionality reduction so what we can
dimensionality reduction so what we can do is take that high dimensional output
do is take that high dimensional output
do is take that high dimensional output so in this case we have ten discrete
so in this case we have ten discrete
so in this case we have ten discrete classes from zero to nine and we can
classes from zero to nine and we can
classes from zero to nine and we can flatten them into space so we don’t have
flatten them into space so we don’t have
flatten them into space so we don’t have ten dimensions to plot all our data so
ten dimensions to plot all our data so
ten dimensions to plot all our data so we can’t we can’t plot the 50% of the
we can’t we can’t plot the 50% of the
we can’t we can’t plot the 50% of the five to thirty percent of the for the
five to thirty percent of the for the
five to thirty percent of the for the twenty percent of the three and so on
twenty percent of the three and so on
twenty percent of the three and so on and so on all on a graph because we
and so on all on a graph because we
and so on all on a graph because we don’t have that many dimensions so what
don’t have that many dimensions so what
don’t have that many dimensions so what we can do is flatten all of that into
we can do is flatten all of that into
we can do is flatten all of that into two dimensions and this is what this
two dimensions and this is what this
two dimensions and this is what this process is here and what it shows you is
process is here and what it shows you is
process is here and what it shows you is how well the data are clustering
how well the data are clustering
how well the data are clustering together so we can see if I have stand
together so we can see if I have stand
together so we can see if I have stand very close to my screen I can see that
very close to my screen I can see that
very close to my screen I can see that the number Seven’s at the bottom are
the number Seven’s at the bottom are
the number Seven’s at the bottom are quite well clustered there the number of
quite well clustered there the number of
quite well clustered there the number of eights are okay in the top left but then
eights are okay in the top left but then
eights are okay in the top left but then we’ve also got some very strange
we’ve also got some very strange
we’ve also got some very strange features like so let’s take the five and
features like so let’s take the five and
features like so let’s take the five and a three example you see the fives in the
a three example you see the fives in the
a three example you see the fives in the orange in the middle they’re pretty well
orange in the middle they’re pretty well
orange in the middle they’re pretty well mixed with the three and that’s kind of
mixed with the three and that’s kind of
mixed with the three and that’s kind of because there must be quite a lot of
because there must be quite a lot of
because there must be quite a lot of examples that look like a five or look
examples that look like a five or look
examples that look like a five or look like a three so they’re quite well mixed
like a three so they’re quite well mixed
like a three so they’re quite well mixed so that means to actually perform the
so that means to actually perform the
so that means to actually perform the classification the algorithm is gonna
classification the algorithm is gonna
classification the algorithm is gonna have to work really hard to try and you
have to work really hard to try and you
have to work really hard to try and you know pull those apart so this is what
know pull those apart so this is what
know pull those apart so this is what you would generally do on the output is
you would generally do on the output is
you would generally do on the output is you would you would try and visualize
you would you would try and visualize
you would you would try and visualize the data in such a way that we as humans
the data in such a way that we as humans
the data in such a way that we as humans can couldn’t understand it that could be
can couldn’t understand it that could be
can couldn’t understand it that could be in 2d or in 3d okay so hopefully that
in 2d or in 3d okay so hopefully that
in 2d or in 3d okay so hopefully that that section kind of introduced you to
that section kind of introduced you to
that section kind of introduced you to two deep learning and some of the ideas
two deep learning and some of the ideas
two deep learning and some of the ideas and some of the terminology so when I
and some of the terminology so when I
and some of the terminology so when I come to some of the financial demos
come to some of the financial demos
come to some of the financial demos there this should be much easier to
there this should be much easier to
there this should be much easier to understand so first example is a
understand so first example is a
understand so first example is a traditional example using a rules-based
traditional example using a rules-based
traditional example using a rules-based approach and in this case we’ve been a
approach and in this case we’ve been a
approach and in this case we’ve been a little bit fancy we use in graph
little bit fancy we use in graph
little bit fancy we use in graph database typically graphed over it
database typically graphed over it
database typically graphed over it databases aren’t used as much as we’d
databases aren’t used as much as we’d
databases aren’t used as much as we’d like but they do perform really well in
like but they do perform really well in
like but they do perform really well in a
a
a in a fraud based scenario so just
in a fraud based scenario so just
in a fraud based scenario so just quickly recap if you don’t know a graph
quickly recap if you don’t know a graph
quickly recap if you don’t know a graph database is a another new SQL database
database is a another new SQL database
database is a another new SQL database but its power really is the description
but its power really is the description
but its power really is the description of the data so the data can only ever be
of the data so the data can only ever be
of the data so the data can only ever be either a node or a relationship a node
either a node or a relationship a node
either a node or a relationship a node is like a thing or a noun whereas a
is like a thing or a noun whereas a
is like a thing or a noun whereas a relationship is is a link or a
relationship is is a link or a
relationship is is a link or a relationship or a or a verb that
relationship or a or a verb that
relationship or a or a verb that basically connects two concepts together
basically connects two concepts together
basically connects two concepts together and the key selling point really is that
and the key selling point really is that
and the key selling point really is that sometimes you’ve got data that is just
sometimes you’ve got data that is just
sometimes you’ve got data that is just better described in a graph like
better described in a graph like
better described in a graph like structure so for example when we’re
structure so for example when we’re
structure so for example when we’re talking about fraud and and finance and
talking about fraud and and finance and
talking about fraud and and finance and stuff
stuff
stuff you’ve got the concepts of people and
you’ve got the concepts of people and
you’ve got the concepts of people and accounts and those people and accounts
accounts and those people and accounts
accounts and those people and accounts are all linked to different things
are all linked to different things
are all linked to different things they’re linked to an address a link to a
they’re linked to an address a link to a
they’re linked to an address a link to a current account and so on so for example
current account and so on so for example
current account and so on so for example we’ve got the traditional the
we’ve got the traditional the
we’ve got the traditional the traditional social media use case where
traditional social media use case where
traditional social media use case where we’ve got bobs these Bobby’s friends
we’ve got bobs these Bobby’s friends
we’ve got bobs these Bobby’s friends with Jane we’ve got a chair contained
with Jane we’ve got a chair contained
with Jane we’ve got a chair contained within a room Jane bought a book and so
within a room Jane bought a book and so
within a room Jane bought a book and so on but the real power is that once
on but the real power is that once
on but the real power is that once you’ve modeled it in this way you can
you’ve modeled it in this way you can
you’ve modeled it in this way you can perform complex queries that you
perform complex queries that you
perform complex queries that you wouldn’t be able to do in a traditional
wouldn’t be able to do in a traditional
wouldn’t be able to do in a traditional relational database so when you wanted
relational database so when you wanted
relational database so when you wanted to do so to go back to the social media
to do so to go back to the social media
to do so to go back to the social media example again when you wanted to do like
example again when you wanted to do like
example again when you wanted to do like who is friends with my friend you have
who is friends with my friend you have
who is friends with my friend you have to do some crazy joined with your SQL in
to do some crazy joined with your SQL in
to do some crazy joined with your SQL in order to get that to work with a graph
order to get that to work with a graph
order to get that to work with a graph database you can just pop you can just
database you can just pop you can just
database you can just pop you can just hop through the graph it makes it really
hop through the graph it makes it really
hop through the graph it makes it really really fast so in their fraud situation
really fast so in their fraud situation
really fast so in their fraud situation we might model our data to something
we might model our data to something
we might model our data to something like this we might have an account
like this we might have an account
like this we might have an account holder in the middle and they have
holder in the middle and they have
holder in the middle and they have relationships with phone numbers or
relationships with phone numbers or
relationships with phone numbers or national insurance numbers things like
national insurance numbers things like
national insurance numbers things like that and then we can perform queries on
that and then we can perform queries on
that and then we can perform queries on that if we would like to but when you
that if we would like to but when you
that if we would like to but when you start viewing that in detail and
start viewing that in detail and
start viewing that in detail and actually viewing how these connections
actually viewing how these connections
actually viewing how these connections are connecting things together
are connecting things together
are connecting things together interesting patterns start to come out
interesting patterns start to come out
interesting patterns start to come out and especially if you’re visualizing it
and especially if you’re visualizing it
and especially if you’re visualizing it in this way as well it’s much easier to
in this way as well it’s much easier to
in this way as well it’s much easier to visualize data in this way than it is in
visualize data in this way than it is in
visualize data in this way than it is in a table for example so in this example
a table for example so in this example
a table for example so in this example we’ve got three account holders in red
we’ve got three account holders in red
we’ve got three account holders in red having the red yep they’re red and
having the red yep they’re red and
having the red yep they’re red and they’re linked in various different ways
they’re linked in various different ways
they’re linked in various different ways we’ve got all three of them are sharing
we’ve got all three of them are sharing
we’ve got all three of them are sharing the same address so who could be dodgy I
the same address so who could be dodgy I
the same address so who could be dodgy I actually had a person in another talk
actually had a person in another talk
actually had a person in another talk excuse me
excuse me
excuse me that III was suggesting that all three
that III was suggesting that all three
that III was suggesting that all three people sharing the same address that
people sharing the same address that
people sharing the same address that could be dodgy and and she was like no
could be dodgy and and she was like no
could be dodgy and and she was like no no no no when thousands of people are
no no no when thousands of people are
no no no when thousands of people are sharing the same address then it’s dodgy
sharing the same address then it’s dodgy
sharing the same address then it’s dodgy three is fine don’t worry about it so
three is fine don’t worry about it so
three is fine don’t worry about it so I’m like okay so but we could set up a
I’m like okay so but we could set up a
I’m like okay so but we could set up a rule there to say you know how many
rule there to say you know how many
rule there to say you know how many people are using the same address and
people are using the same address and
people are using the same address and you could do that in the traditional
you could do that in the traditional
you could do that in the traditional database but where the power really
database but where the power really
database but where the power really comes in is when you start linking these
comes in is when you start linking these
comes in is when you start linking these these things together and searching for
these things together and searching for
these things together and searching for these larger rings and groups within the
these larger rings and groups within the
these larger rings and groups within the data so if we imagine that directly two
data so if we imagine that directly two
data so if we imagine that directly two people aren’t sharing the same national
people aren’t sharing the same national
people aren’t sharing the same national insurance number for example which is
insurance number for example which is
insurance number for example which is illegal in the UK maybe there’s a third
illegal in the UK maybe there’s a third
illegal in the UK maybe there’s a third party which is linking these National
party which is linking these National
party which is linking these National Insurance numbers together so you
Insurance numbers together so you
Insurance numbers together so you actually start to form these rings
actually start to form these rings
actually start to form these rings within the data which are kind of not
within the data which are kind of not
within the data which are kind of not not natural this shouldn’t really be
not natural this shouldn’t really be
not natural this shouldn’t really be rings in the data and graph databases
rings in the data and graph databases
rings in the data and graph databases are really good at viewing and spotting
are really good at viewing and spotting
are really good at viewing and spotting these rings so that’s the kind of
these rings so that’s the kind of
these rings so that’s the kind of technology that would exist in the wild
technology that would exist in the wild
technology that would exist in the wild today if we were asked to to perform a
today if we were asked to to perform a
today if we were asked to to perform a job like this but where we’re really
job like this but where we’re really
job like this but where we’re really interested in is bringing some machine
interested in is bringing some machine
interested in is bringing some machine learning techniques to some of these
learning techniques to some of these
learning techniques to some of these ideas so the first idea I had was quite
ideas so the first idea I had was quite
ideas so the first idea I had was quite a typical one really and that’s why
a typical one really and that’s why
a typical one really and that’s why that’s why I did it because it was quite
that’s why I did it because it was quite
that’s why I did it because it was quite easy to do but basically if we could use
easy to do but basically if we could use
easy to do but basically if we could use vocal fingerprints for origination it
vocal fingerprints for origination it
vocal fingerprints for origination it would just solve just the the main
would just solve just the the main
would just solve just the the main reasons really it would save the user a
reasons really it would save the user a
reasons really it would save the user a significant amount of time the user
significant amount of time the user
significant amount of time the user experience would would you know be huge
experience would would you know be huge
experience would would you know be huge hugely improved not having to wait on
hugely improved not having to wait on
hugely improved not having to wait on the phone for 20 minutes just because
the phone for 20 minutes just because
the phone for 20 minutes just because some stupid automated system took you to
some stupid automated system took you to
some stupid automated system took you to the wrong place so if we can use their
the wrong place so if we can use their
the wrong place so if we can use their person’s voice as a form of
person’s voice as a form of
person’s voice as a form of authentication origination then we’ll be
authentication origination then we’ll be
authentication origination then we’ll be able to save time be able to save
able to save time be able to save
able to save time be able to save machines and be able to save their the
machines and be able to save their the
machines and be able to save their the power of people on the other end of the
power of people on the other end of the
power of people on the other end of the phone so to do this what we’d have to do
phone so to do this what we’d have to do
phone so to do this what we’d have to do is to record the customers voice
is to record the customers voice
is to record the customers voice we then pre-process the data in some way
we then pre-process the data in some way
we then pre-process the data in some way to clean it up and put it in a format
to clean it up and put it in a format
to clean it up and put it in a format that’s that’s capable of being put into
that’s that’s capable of being put into
that’s that’s capable of being put into an algorithm in this case we would trade
an algorithm in this case we would trade
an algorithm in this case we would trade a deep learning model but it could be
a deep learning model but it could be
a deep learning model but it could be any algorithm and then we’d store that
any algorithm and then we’d store that
any algorithm and then we’d store that fingerprint for future verification in
fingerprint for future verification in
fingerprint for future verification in the online scenario so once you’ve got
the online scenario so once you’ve got
the online scenario so once you’ve got set up the user would come on you’d
set up the user would come on you’d
set up the user would come on you’d rerecord his voice again maybe against
rerecord his voice again maybe against
rerecord his voice again maybe against the preset phrase maybe against new
the preset phrase maybe against new
the preset phrase maybe against new phrase and then you’d compare that
phrase and then you’d compare that
phrase and then you’d compare that result of the fingerprint and that would
result of the fingerprint and that would
result of the fingerprint and that would prove whether that person is you know
prove whether that person is you know
prove whether that person is you know really who they say they are so this is
really who they say they are so this is
really who they say they are so this is the pre-processing stage in action so
the pre-processing stage in action so
the pre-processing stage in action so this is a bit of signal processing which
this is a bit of signal processing which
this is a bit of signal processing which is converting the the time signature of
is converting the the time signature of
is converting the the time signature of the the audio file into frequency into
the the audio file into frequency into
the the audio file into frequency into the frequency domain so what you’re
the frequency domain so what you’re
the frequency domain so what you’re seeing there is a plot of the frequency
seeing there is a plot of the frequency
seeing there is a plot of the frequency components versus time so red is strong
components versus time so red is strong
components versus time so red is strong and that green blue a color is is weak
and that green blue a color is is weak
and that green blue a color is is weak so it’s saying that you know you can see
so it’s saying that you know you can see
so it’s saying that you know you can see there the gaps in between the data
there the gaps in between the data
there the gaps in between the data they’re a kind of where that paused to
they’re a kind of where that paused to
they’re a kind of where that paused to say the words and I think if we’re if it
say the words and I think if we’re if it
say the words and I think if we’re if it works yeah so this is some example data
works yeah so this is some example data
works yeah so this is some example data that I used in my learning and this is
that I used in my learning and this is
that I used in my learning and this is three examples of three people saying
three examples of three people saying
three examples of three people saying the same phrase don’t ask me what that
the same phrase don’t ask me what that
the same phrase don’t ask me what that phrase actually means I don’t know what
phrase actually means I don’t know what
phrase actually means I don’t know what anything but anyway you can tell
anything but anyway you can tell
anything but anyway you can tell yourself that those three voices sounded
yourself that those three voices sounded
yourself that those three voices sounded sometimes a little bit different but in
sometimes a little bit different but in
sometimes a little bit different but in that last example completely different
that last example completely different
that last example completely different and what we’re trying to do is to to
and what we’re trying to do is to to
and what we’re trying to do is to to make the deep learning think the same
make the deep learning think the same
make the deep learning think the same okay so once we’ve put it into our deep
okay so once we’ve put it into our deep
okay so once we’ve put it into our deep learning model we’ve done the training
learning model we’ve done the training
learning model we’ve done the training and we’ve produced an output our output
and we’ve produced an output our output
and we’ve produced an output our output in this case is between these three
in this case is between these three
in this case is between these three different people so you could have three
different people so you could have three
different people so you could have three outputs and then again we’ve compressed
outputs and then again we’ve compressed
outputs and then again we’ve compressed that we’ve squashed that under the
that we’ve squashed that under the
that we’ve squashed that under the screen into two dimensions and this is a
screen into two dimensions and this is a
screen into two dimensions and this is a plot that shows how close all of those
plot that shows how close all of those
plot that shows how close all of those voices were between so we’ve got a
voices were between so we’ve got a
voices were between so we’ve got a couple of different points in there and
couple of different points in there and
couple of different points in there and the the different colors there – Bob
the the different colors there – Bob
the the different colors there – Bob Steve and Dave they correspond to the
Steve and Dave they correspond to the
Steve and Dave they correspond to the three different examples the three
three different examples the three
three different examples the three different people giving the example
different people giving the example
different people giving the example sorry and each individual point is a
sorry and each individual point is a
sorry and each individual point is a specific phrase that they said so we had
specific phrase that they said so we had
specific phrase that they said so we had ten ten different phrases that they said
ten ten different phrases that they said
ten ten different phrases that they said and you can see that all of these
and you can see that all of these
and you can see that all of these examples are clustering together quite
examples are clustering together quite
examples are clustering together quite well so if we then took another they’re
well so if we then took another they’re
well so if we then took another they’re the same people but using a different
the same people but using a different
the same people but using a different spoken example so not the same examples
spoken example so not the same examples
spoken example so not the same examples how would that perform
how would that perform
how would that perform new data so I think we go again so the
new data so I think we go again so the
new data so I think we go again so the top line now in the results that was the
top line now in the results that was the
top line now in the results that was the the raw result the raw output of those
the raw result the raw output of those
the raw result the raw output of those three neurons throught for that file and
three neurons throught for that file and
three neurons throught for that file and it’s saying that one of the new your
it’s saying that one of the new your
it’s saying that one of the new your honors have 0.98
honors have 0.98
honors have 0.98 the 10.1 another 100.1 as well and
the 10.1 another 100.1 as well and
the 10.1 another 100.1 as well and that’s saying that you know Bob
that’s saying that you know Bob
that’s saying that you know Bob definitely pretty sure 19 percent sure
definitely pretty sure 19 percent sure
definitely pretty sure 19 percent sure that that was definitely Bob
there you go 97 percent chance that was
there you go 97 percent chance that was Steve there 96 percent it was Dave so
Steve there 96 percent it was Dave so
Steve there 96 percent it was Dave so that was that example quite a simple
that was that example quite a simple
that was that example quite a simple example in sense that it only used a
example in sense that it only used a
example in sense that it only used a very small data set but it’s you know
very small data set but it’s you know
very small data set but it’s you know it’s instructive and it kind of points
it’s instructive and it kind of points
it’s instructive and it kind of points towards things that we could do in the
towards things that we could do in the
towards things that we could do in the future given much more data I mean like
future given much more data I mean like
future given much more data I mean like every phone call we pick up these days
every phone call we pick up these days
every phone call we pick up these days there’s always a we are recording your
there’s always a we are recording your
there’s always a we are recording your voice for verification training purposes
voice for verification training purposes
voice for verification training purposes so there must be huge vast databases of
so there must be huge vast databases of
so there must be huge vast databases of people’s voices out there ok so next is
people’s voices out there ok so next is
people’s voices out there ok so next is ample decision trees so this is an
ample decision trees so this is an
ample decision trees so this is an example of decision tree that we showed
example of decision tree that we showed
example of decision tree that we showed earlier on and this is predicting
earlier on and this is predicting
earlier on and this is predicting mortgage default so amazingly two banks
mortgage default so amazingly two banks
mortgage default so amazingly two banks – – sorry two mortgage providers in the
– – sorry two mortgage providers in the
– – sorry two mortgage providers in the u.s. went bust as usual of course and
u.s. went bust as usual of course and
u.s. went bust as usual of course and were bailed out by the US taxpayer so we
were bailed out by the US taxpayer so we
were bailed out by the US taxpayer so we owned by the US government so Freddie
owned by the US government so Freddie
owned by the US government so Freddie but Freddie Mac and Fannie Mae and as
but Freddie Mac and Fannie Mae and as
but Freddie Mac and Fannie Mae and as part of their I don’t know as part of
part of their I don’t know as part of
part of their I don’t know as part of their reprisal basically a slap on the
their reprisal basically a slap on the
their reprisal basically a slap on the wrist the the government forced them to
wrist the the government forced them to
wrist the the government forced them to release lots of their data to the public
release lots of their data to the public
release lots of their data to the public and amazingly they they publicized a
and amazingly they they publicized a
and amazingly they they publicized a whole data set of mortgage applications
whole data set of mortgage applications
whole data set of mortgage applications and also historical accounts of what
and also historical accounts of what
and also historical accounts of what happened to those mortgage applications
happened to those mortgage applications
happened to those mortgage applications so you can say that they did told us
so you can say that they did told us
so you can say that they did told us whether that person then defaulted in
whether that person then defaulted in
whether that person then defaulted in the future so the task here is given
the future so the task here is given
the future so the task here is given some given some oh dear I’m running over
some given some oh dear I’m running over
some given some oh dear I’m running over time off to speed up given some data is
time off to speed up given some data is
time off to speed up given some data is it possible to predict whether that
it possible to predict whether that
it possible to predict whether that person’s going to default so the first
person’s going to default so the first
person’s going to default so the first the first problem is the whole data
the first problem is the whole data
the first problem is the whole data cleaning problem like we saw
cleaning problem like we saw
cleaning problem like we saw the previous talk it’s the vast majority
the previous talk it’s the vast majority
the previous talk it’s the vast majority of time to spend cleaning data
of time to spend cleaning data
of time to spend cleaning data I’m gonna skip over that so if we were
I’m gonna skip over that so if we were
I’m gonna skip over that so if we were to flatten all of the data that was
to flatten all of the data that was
to flatten all of the data that was recovered into a an image before we put
recovered into a an image before we put
recovered into a an image before we put it through the algorithm this is kind of
it through the algorithm this is kind of
it through the algorithm this is kind of what it looks like it’s very
what it looks like it’s very
what it looks like it’s very intermingled and mixed can’t quite
intermingled and mixed can’t quite
intermingled and mixed can’t quite understand what’s going on so a decision
understand what’s going on so a decision
understand what’s going on so a decision tree is is learning all of these rules
tree is is learning all of these rules
tree is is learning all of these rules and based upon the outcome of those
and based upon the outcome of those
and based upon the outcome of those rules is rather yes the person defaulted
rules is rather yes the person defaulted
rules is rather yes the person defaulted no they didn’t default so we had
no they didn’t default so we had
no they didn’t default so we had approximately 20,000 samples total 50-50
approximately 20,000 samples total 50-50
approximately 20,000 samples total 50-50 split a random forest classifier so it’s
split a random forest classifier so it’s
split a random forest classifier so it’s a type of decision tree algorithm but is
a type of decision tree algorithm but is
a type of decision tree algorithm but is better does not over fit as much only 11
better does not over fit as much only 11
better does not over fit as much only 11 input features so the main problem here
input features so the main problem here
input features so the main problem here is I don’t actually think we’ve got
is I don’t actually think we’ve got
is I don’t actually think we’ve got enough data to do a really good job but
enough data to do a really good job but
enough data to do a really good job but we’ll see what we can do and the one
we’ll see what we can do and the one
we’ll see what we can do and the one great thing about decision trees is that
great thing about decision trees is that
great thing about decision trees is that actually gives you a measure of
actually gives you a measure of
actually gives you a measure of importance for all of those variables so
importance for all of those variables so
importance for all of those variables so here we’ve got the variables that were
here we’ve got the variables that were
here we’ve got the variables that were inputted to the algorithm at the bottom
inputted to the algorithm at the bottom
inputted to the algorithm at the bottom and it shows their respective importance
and it shows their respective importance
and it shows their respective importance of those variables on there on the
of those variables on there on the
of those variables on there on the left-hand side so you can see actually
left-hand side so you can see actually
left-hand side so you can see actually the credit score is in second place so
the credit score is in second place so
the credit score is in second place so I’m not sure that the credit reference
I’m not sure that the credit reference
I’m not sure that the credit reference agencies would be too happy that you
agencies would be too happy that you
agencies would be too happy that you know they could only explain 0.25 of the
know they could only explain 0.25 of the
know they could only explain 0.25 of the data so 25% of the data could only be
data so 25% of the data could only be
data so 25% of the data could only be explained by the credit score alone so
explained by the credit score alone so
explained by the credit score alone so not not a great result for them and
not not a great result for them and
not not a great result for them and actually the most important measure was
actually the most important measure was
actually the most important measure was the HPI origination which was the house
the HPI origination which was the house
the HPI origination which was the house price index origination for that local
price index origination for that local
price index origination for that local area so this is saying that a person who
area so this is saying that a person who
area so this is saying that a person who took out a mortgage in a very local area
took out a mortgage in a very local area
took out a mortgage in a very local area it’s very dependent on the prices within
it’s very dependent on the prices within
it’s very dependent on the prices within that area as to whether they’re going to
that area as to whether they’re going to
that area as to whether they’re going to default or not and this is kind of a
default or not and this is kind of a
default or not and this is kind of a typical really in the US you can see
typical really in the US you can see
typical really in the US you can see like vast tracts of like places like
like vast tracts of like places like
like vast tracts of like places like Detroit that you know as soon as some of
Detroit that you know as soon as some of
Detroit that you know as soon as some of the jobs left everybody just lost their
the jobs left everybody just lost their
the jobs left everybody just lost their jobs in the whole house price area then
jobs in the whole house price area then
jobs in the whole house price area then crashed and then people couldn’t afford
crashed and then people couldn’t afford
crashed and then people couldn’t afford to sell because they couldn’t sell it so
to sell because they couldn’t sell it so
to sell because they couldn’t sell it so that’s kind of why that’s so important
that’s kind of why that’s so important
that’s kind of why that’s so important interesting result and then final
interesting result and then final
interesting result and then final example I’m having to move rather
example I’m having to move rather
example I’m having to move rather quickly here because I’ve only got two
quickly here because I’ve only got two
quickly here because I’ve only got two minutes left but is it possible to take
minutes left but is it possible to take
minutes left but is it possible to take that data
that data
that data and try and see whether there’s
and try and see whether there’s
and try and see whether there’s something strange going on without in
something strange going on without in
something strange going on without in the data so basically this is an
the data so basically this is an
the data so basically this is an unlabeled example we’re not telling it
unlabeled example we’re not telling it
unlabeled example we’re not telling it what to learn here so how do we do that
what to learn here so how do we do that
what to learn here so how do we do that well there’s a deep learning technique
well there’s a deep learning technique
well there’s a deep learning technique called an autoencoder which basically it
called an autoencoder which basically it
called an autoencoder which basically it takes the inputs and it restricts the
takes the inputs and it restricts the
takes the inputs and it restricts the number of hidden neurons to only a few
number of hidden neurons to only a few
number of hidden neurons to only a few concepts he’s saying you’ve really got a
concepts he’s saying you’ve really got a
concepts he’s saying you’ve really got a pick and choose what data you use and
pick and choose what data you use and
pick and choose what data you use and generate some concepts that are really
generate some concepts that are really
generate some concepts that are really quite strict and then we try and
quite strict and then we try and
quite strict and then we try and reproduce the output again and we’re
reproduce the output again and we’re
reproduce the output again and we’re comparing the output against the input
comparing the output against the input
comparing the output against the input as a measure of how well we’ll have done
as a measure of how well we’ll have done
as a measure of how well we’ll have done so basically those restrictions in the
so basically those restrictions in the
so basically those restrictions in the middle maybe only two neurons you know
middle maybe only two neurons you know
middle maybe only two neurons you know yes and no something like that is that
yes and no something like that is that
yes and no something like that is that possible to reconstruct the data so we
possible to reconstruct the data so we
possible to reconstruct the data so we can do that so there’s the same data as
can do that so there’s the same data as
can do that so there’s the same data as before slightly it’s a different random
before slightly it’s a different random
before slightly it’s a different random sample so it might look slightly
sample so it might look slightly
sample so it might look slightly different we’ve got an input layer a
different we’ve got an input layer a
different we’ve got an input layer a number of hidden layers that are
number of hidden layers that are
number of hidden layers that are compressing the data down into smaller
compressing the data down into smaller
compressing the data down into smaller and smaller neurons and then we’re
and smaller neurons and then we’re
and smaller neurons and then we’re reconstructing again back to the input
reconstructing again back to the input
reconstructing again back to the input layer and doing a comparison to see how
layer and doing a comparison to see how
layer and doing a comparison to see how well we did but what we can do then is
well we did but what we can do then is
well we did but what we can do then is plot in two or three D one of those
plot in two or three D one of those
plot in two or three D one of those hidden layers to actually view those
hidden layers to actually view those
hidden layers to actually view those concepts and what we’ve learnt and
concepts and what we’ve learnt and
concepts and what we’ve learnt and finally this is the result of that
finally this is the result of that
finally this is the result of that process and the left-hand side we’ve got
process and the left-hand side we’ve got
process and the left-hand side we’ve got a 2d representation and you can start to
a 2d representation and you can start to
a 2d representation and you can start to see there’s actually some structure
see there’s actually some structure
see there’s actually some structure within that data so most generally you
within that data so most generally you
within that data so most generally you can see that the people that defaulted
can see that the people that defaulted
can see that the people that defaulted the each ruse on that graph or on the on
the each ruse on that graph or on the on
the each ruse on that graph or on the on the left-hand side and the people that
the left-hand side and the people that
the left-hand side and the people that didn’t default on the right-hand side
didn’t default on the right-hand side
didn’t default on the right-hand side and within there if you look on the
and within there if you look on the
and within there if you look on the right-hand side there’s a couple of
right-hand side there’s a couple of
right-hand side there’s a couple of orange dots and that’s saying that the
orange dots and that’s saying that the
orange dots and that’s saying that the vast majority of people in there didn’t
vast majority of people in there didn’t
vast majority of people in there didn’t default but one or two people did now an
default but one or two people did now an
default but one or two people did now an analyst might start to ask why so it
analyst might start to ask why so it
analyst might start to ask why so it could be something quite innocent you
could be something quite innocent you
could be something quite innocent you know maybe the person lost his
know maybe the person lost his
know maybe the person lost his high-powered job went to prison
high-powered job went to prison
high-powered job went to prison something like that but it’s kind of
something like that but it’s kind of
something like that but it’s kind of indicative that something else is going
indicative that something else is going
indicative that something else is going on and this is where the analyst would
on and this is where the analyst would
on and this is where the analyst would come in and start investigating that
come in and start investigating that
come in and start investigating that data so these are completely unlabeled
data so these are completely unlabeled
data so these are completely unlabeled and the algorithm has absolutely no idea
and the algorithm has absolutely no idea
and the algorithm has absolutely no idea what it means
what it means
what it means and it still takes human to do some
and it still takes human to do some
and it still takes human to do some analysis and to do some investigation to
analysis and to do some investigation to
analysis and to do some investigation to figure out what has happened but these
figure out what has happened but these
figure out what has happened but these kinds of tools lead the analysts in the
kinds of tools lead the analysts in the
kinds of tools lead the analysts in the right direction as opposed to just
right direction as opposed to just
right direction as opposed to just taking a random Sam
taking a random Sam
taking a random Sam and then finally on the right hand side
and then finally on the right hand side
and then finally on the right hand side we’ve got a 3d representation of the
we’ve got a 3d representation of the
we’ve got a 3d representation of the same data and this is where it becomes
same data and this is where it becomes
same data and this is where it becomes really really powerful you can imagine
really really powerful you can imagine
really really powerful you can imagine like if you could get that graph and you
like if you could get that graph and you
like if you could get that graph and you can like look into it and and move it
can like look into it and and move it
can like look into it and and move it and turn it around and you can start to
and turn it around and you can start to
and turn it around and you can start to see clusters in 3d space and that’s when
see clusters in 3d space and that’s when
see clusters in 3d space and that’s when it starts to become immersive and given
it starts to become immersive and given
it starts to become immersive and given enough time it takes it takes a certain
enough time it takes it takes a certain
enough time it takes it takes a certain amount of time for any analyst to
amount of time for any analyst to
amount of time for any analyst to analyze data but given enough time they
analyze data but given enough time they
analyze data but given enough time they will be able to learn to see patterns
will be able to learn to see patterns
will be able to learn to see patterns within that data which will help them to
within that data which will help them to
within that data which will help them to investigate things that they haven’t
investigate things that they haven’t
investigate things that they haven’t seen before and I think I better stop
seen before and I think I better stop
seen before and I think I better stop there because I’ve completely run out of
there because I’ve completely run out of
there because I’ve completely run out of time so thank you very much for
time so thank you very much for
time so thank you very much for listening
Be First to Comment