Francois Chollet @ Tokyo University
@fchollet
title: Deep Learning: current capabilities, limitations, and future perspectives
software engineer at Google Brain
some small implementation details turned out to be critical
- like "how do you encode knowledge and reasoning in a computer"
- scale makes a difference in deep learning (vs linear regression)
- very large parametric models trained on many many samples
- a layer is a geometric transformation that turns one vector into another
- a sequence of simple transformations that turns a high-dimensional input space into a lower-dimensional representation space
- enough data
- == a dense sampling of the 'input × output' space
- chess requires no innate priors
- therefore humans and models alike have to start learning from scratch
- therefore models can achieve arbitrary levels of skill
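The "layer as a geometric transformation" idea above can be sketched in plain NumPy. This is a toy illustration (not code from the talk): each layer is just an affine map followed by a pointwise nonlinearity, and stacking them chains simple transformations from one vector space to another.

```python
import numpy as np

def dense(x, w, b):
    # One layer: an affine geometric transformation (x @ w + b)
    # followed by a pointwise ReLU nonlinearity.
    return np.maximum(0.0, x @ w + b)

rng = np.random.default_rng(0)
# A chain of simple transforms: 784-d input -> 64-d -> 32-d -> 10-d output.
w1, b1 = rng.standard_normal((784, 64)) * 0.01, np.zeros(64)
w2, b2 = rng.standard_normal((64, 32)) * 0.01, np.zeros(32)
w3, b3 = rng.standard_normal((32, 10)) * 0.01, np.zeros(10)

x = rng.standard_normal((1, 784))      # one input vector
h = dense(dense(x, w1, b1), w2, b2)    # intermediate representations
y = h @ w3 + b3                        # final linear map into the output space
print(y.shape)  # (1, 10)
```

Training then amounts to tuning the parameters (w, b) of this hard-coded chain of geometric transforms, which is the framing the later "program synthesis" point contrasts against.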
limitations
- extreme sensitivity to adversarial perturbations
- etc from his book
- it can match templates of arbitrary complexity
deep learning is pattern recognition
- what can pattern recognition solve?
- any problem that can be mapped to pattern recognition can be solved
- humans can cover a lot of ground with very little data
- extreme abstraction of meaning from data
ai of the future
- needs
- better metrics
- the right kpi
- many benchmarks today focus on skill, not intelligence
- an ambitious new benchmark is needed to measure progress
- can a strong player in StarCraft 1 play StarCraft 2 well?
- yes, within a few rounds
- it looks different, with different features and different units, but the player is able to generalize over them.
- richer models
- many interesting problems cannot be expressed as a stack of layers
- learning in "machine learning" will be more program synthesis than tuning the parameters of a hard-coded geometric transform
- future AI systems will blend pattern recognition (geometric intelligence) with abstraction & reasoning (symbolic intelligence)
- but how do we scale symbolic intelligence?
- we frame it as a differentiable model and search it?
- we can't learn these modules on every new task (too complex, too little data)
- we'll need a library of reusable symbolic & geometric modules
- in order to create stronger priors
- lifelong learning
- abstract literally means 'reusable'
long term vision
- meta learning
- look at GitHub, grab the libraries that solve parts of the problem, put them together, solve the problem, then contribute the result back into the library (e.g. npm / GitHub)
- a model that learns to map
- inputs (problems) to solution functions (solution space == npm library / github)
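The "problems to solution functions" mapping above can be caricatured as a lookup into a shared function library. A toy sketch under invented names (in the vision, a learned meta-model would do this retrieval and composition, not a hand-written dict):

```python
# A shared library of reusable solution functions (the npm/GitHub analogy).
LIBRARY = {
    "sort": sorted,
    "reverse": lambda xs: list(reversed(xs)),
    "dedupe": lambda xs: list(dict.fromkeys(xs)),  # order-preserving dedupe
}

def solve(problem_tag, data):
    # Map a problem description to a solution function, then apply it.
    fn = LIBRARY[problem_tag]
    return fn(data)

print(solve("sort", [3, 1, 2]))    # [1, 2, 3]
print(solve("dedupe", [1, 1, 2]))  # [1, 2]
```

The interesting (open) part is learning the mapping itself and adding newly synthesized functions back to the library, which is what makes this lifelong meta-learning rather than a static lookup.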