<!DOCTYPE html>
<html lang="en">
<head>
<!-- Basic Page Needs
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<meta charset="utf-8">
<title>Ankesh Anand</title>
<meta name="description" content="PhD student in Artificial Intelligence at Mila, Montreal">
<meta name="author" content="Ankesh Anand">
<meta name="keywords" content="ankesh anand, ankesh, anand, mila">
<!-- Mobile Specific Metas
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- FONT
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<link href="//fonts.googleapis.com/css?family=Raleway:400,300,600" rel="stylesheet" type="text/css">
<link href="https://fonts.googleapis.com/css?family=Source+Sans+Pro|Vollkorn" rel="stylesheet">
<link rel="stylesheet" href="https://use.fontawesome.com/releases/v5.5.0/css/all.css"
integrity="sha384-B4dIYHKNBt8Bc12p+WXckhzcICo0wtJAoU8YZTY5qE0Id1GSseTk6S+L3BlXeVIU" crossorigin="anonymous">
<link rel="stylesheet" href="https://cdn.rawgit.com/jpswalsh/academicons/master/css/academicons.min.css">
<!-- CSS
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<link rel="stylesheet" href="css/normalize.css">
<link rel="stylesheet" href="css/skeleton.css">
<link rel="stylesheet" href="css/custom.css">
<link rel="stylesheet" href="https://use.typekit.net/qks0rmn.css">
<!-- Favicon
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<link rel="icon" type="image/png" href="img/favicon-32x32.png" sizes="32x32"/>
<link rel="icon" type="image/png" href="img/favicon-16x16.png" sizes="16x16"/>
</head>
<body>
<!-- Primary Page Layout
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<div class="container">
<nav class="navbar">
<ul class="navbar-list">
<li class="navbar-item"><a href="/blog">Blog</a> </li>
</ul>
</nav>
<div class="row" style="margin-top: 1%">
<div class="three columns sidebar">
<img class="headshot" src="img/headshot.jpg" alt="Ankesh Anand">
<h4 class="avatar-title">Ankesh Anand</h4>
<span><a href="mailto:[email protected]">[email protected]</a></span>
<span> <a href="https://twitter.com/ankesh_anand"> <i class="fab fa-twitter"></i> </a>
/ <a href="https://github.com/ankeshanand"> <i class="fab fa-github"></i> </a>
/ <a href="https://linkedin.com/in/ankeshanand"> <i class="fab fa-linkedin"></i> </a>
/ <a href="Resume.pdf"><i class="ai ai-cv"></i></a>
/ <a href="https://scholar.google.co.in/citations?user=HEozmkMAAAAJ&hl=en"><i
class="ai ai-google-scholar"></i></a>
</span>
</div>
<div class="one columns"> </div>
<div class="eight columns intro">
<p>
Hi! I am Ankesh Anand, a Research Scientist in AI at <a href="https://deepmind.com">Google DeepMind</a>, London, working on RL for reasoning and new capabilities in Gemini models.
Most recently, I played a key role in building <a href="https://deepmind.google/technologies/gemini/flash-thinking/">Gemini Flash Thinking</a> and <a href="https://deepmind.google/technologies/project-mariner/">Project Mariner</a>.
</p>
<p>I previously did a PhD at <a href="https://mila.quebec/en/">Mila</a>,
with <a href="https://aaroncourville.wordpress.com">Aaron Courville</a> on Self-Supervised Learning
and Reinforcement Learning. I have also worked as a research intern at
<a href="https://deepmind.com/">DeepMind</a>, London with <a href="http://www.jesshamrick.com/">Jessica Hamrick</a>,
and at <a href="https://www.microsoft.com/en-us/research/lab/microsoft-research-montreal/"> Microsoft
Research, Montreal </a> with <a href="https://rdevon.github.io/">Devon Hjelm</a> and
<a href="https://www.microsoft.com/en-us/research/people/phbachma/">Philip Bachman</a>.
</p>
<p>Earlier, I graduated from <a href="http://www.iitkgp.ac.in/">IIT Kharagpur</a> with a Bachelor's and
Master's in Mathematics and Computing. I have also spent time at VISA, HackerEarth and Google Summer of
Code.
</p>
<p>
Check out my recent <a href="https://ankeshanand.com/blog/2022/01/08/rl-fine-tuning.html">blog post</a>
on how we should think about RL in the era of foundation models!
</p>
<h5>Publications</h5>
<hr/>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/muzero_ssl.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/2111.01587">Procedural Generalization
by Planning with Self-Supervised World Models</a></span>
<div class="pub-authors"><span class="fw6">Ankesh Anand</span>, Jacob Walker, Yazhe Li,
Eszter Vértes, Julian Schrittwieser, Sherjil Ozair, Théophane Weber, Jessica B. Hamrick
</div>
<span class="pub-venue">ICLR 2022</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/sgi.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/2106.04799">Pretraining Representations
for Data-Efficient Reinforcement Learning</a></span>
<div class="pub-authors">Max Schwarzer, Nitarshan Rajkumar, Michael Noukhovitch,
<span class="fw6">Ankesh Anand</span>, Laurent Charlin, Devon Hjelm, Philip Bachman, Aaron Courville
</div>
<span class="pub-venue">NeurIPS 2021</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/mpr.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/2007.05929">Data-Efficient RL with
Self-Predictive Representations</a></span>
<div class="pub-authors">Max Schwarzer*, <span class="fw6">Ankesh Anand*</span>, Rishab Goel,
R Devon Hjelm, Aaron Courville, Philip Bachman
</div>
<span class="pub-venue">ICLR 2021, Spotlight</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/STDIM.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/1906.08226">Unsupervised State Representation
Learning in Atari</a></span>
<div class="pub-authors"><span class="fw6">Ankesh Anand*</span>, Evan Racah*, Sherjil Ozair*,
Yoshua Bengio, Marc-Alexandre Côté, R Devon Hjelm
</div>
<span class="pub-venue">NeurIPS 2019</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/FastText.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/1811.05013">Blindfold Baselines for Embodied QA</a></span>
<div class="pub-authors"><span class="fw6">Ankesh Anand</span>, Eugene Belilovsky, Kyle Kastner,
Hugo Larochelle, Aaron
Courville
</div>
<span class="pub-venue">ViGIL Workshop at NeurIPS 2018</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/home.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/1711.11017">HoME: a Household Multimodal Environment</a></span>
<div class="pub-authors">Simon Brodeur, Ethan Perez*, <span class="fw6">Ankesh Anand*</span>,
Florian Golemo*,
Luca Celotti, Florian Strub, Jean Rouat, Hugo Larochelle, Aaron Courville
</div>
<span class="pub-venue">ICLR 2018, Workshop Track</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/mmgan.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/1707.08273">MMGAN: Manifold Matching Generative Adversarial Networks</a></span>
<div class="pub-authors">Noseong Park, <span class="fw6">Ankesh Anand</span>, Joel Moniz, Kookjin
Lee, Tanmoy Chakraborty,
J Choo, H Park, Youngmin Kim
</div>
<span class="pub-venue">ICPR 2018</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/clickbaits.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title">
<a href="https://arxiv.org/abs/1612.01340">We used Neural Networks to Detect Clickbaits: You won't believe what happened Next!
</a> </span>
<div class="pub-authors"><span class="fw6">Ankesh Anand</span>, Tanmoy Chakraborty, Noseong Park
</div>
<span class="pub-venue">ECIR 2017</span>
</div>
</div>
</div>
</div>
</div>
<!-- Global site tag (gtag.js) - Google Analytics -->
<script async src="https://www.googletagmanager.com/gtag/js?id=UA-132764710-1"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag() {
dataLayer.push(arguments);
}
gtag('js', new Date());
gtag('config', 'UA-132764710-1');
</script>
<!-- End Document
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
</body>
</html>