-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathindex.html
315 lines (285 loc) · 17.1 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
<!DOCTYPE html>
<html lang="en">
<!-- Global site tag (gtag.js) - Google Analytics -->
<head>
<meta charset='utf-8'>
<script async src="https://www.googletagmanager.com/gtag/js?id=UA-109153516-1"></script>
<script>
window.dataLayer = window.dataLayer || [];
function gtag() { dataLayer.push(arguments); }
gtag('js', new Date());
gtag('config', 'UA-109153516-1');
</script>
<head>
<!-- Basic Page Needs
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<meta charset="utf-8">
<title>Rithesh Kumar</title>
<meta name="description" content="MSc student in Artifical Intelligence at Mila, Montreal">
<meta name="author" content="Rithesh Kumar">
<meta name="keywords" content="Rithesh Kumar, rithesh, kumar, mila, ai researcher">
<!-- Mobile Specific Metas
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- FONT
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<link href="//fonts.googleapis.com/css?family=Raleway:400,300,600" rel="stylesheet" type="text/css">
<link href="https://fonts.googleapis.com/css?family=Source+Sans+Pro|Vollkorn" rel="stylesheet">
<link rel="stylesheet" href="https://use.fontawesome.com/releases/v5.5.0/css/all.css"
integrity="sha384-B4dIYHKNBt8Bc12p+WXckhzcICo0wtJAoU8YZTY5qE0Id1GSseTk6S+L3BlXeVIU" crossorigin="anonymous">
<link rel="stylesheet" href="https://cdn.rawgit.com/jpswalsh/academicons/master/css/academicons.min.css">
<!-- CSS
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<link rel="stylesheet" href="css/normalize.css">
<link rel="stylesheet" href="css/skeleton.css">
<link rel="stylesheet" href="css/custom.css">
<link rel="stylesheet" href="https://use.typekit.net/qks0rmn.css">
<!-- Favicon
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<link rel="icon" type="image/png" href="img/favicon-32x32.png" sizes="32x32" />
<link rel="icon" type="image/png" href="img/favicon-16x16.png" sizes="16x16" />
</head>
<body>
<!-- Primary Page Layout
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
<div class="container">
<div class="row" style="margin-top: 10%">
<div class="three columns sidebar">
<img class="headshot" src="img/headshot.jpg">
<h4 class="avatar-title">Rithesh Kumar</h4>
<span><a href="mailto:[email protected]">[email protected]</a></span>
<span> <a href="https://twitter.com/ritheshkumar_"><i class="fab fa-twitter"></i> <a />
/ <a href="https://github.com/ritheshkumar95"> <i class="fab fa-github"></i> </a>
/ <a href="https://www.linkedin.com/in/rithesh-kumar-b0479488/"> <i class="fab fa-linkedin"></i>
</a>
/ <a href="public/CV_RitheshKumar.pdf"><i class="ai ai-cv"></i></a>
/ <a href="https://scholar.google.ca/citations?user=hJjeVsQAAAAJ&hl=en"><i
class="ai ai-google-scholar"></i></a>
</span>
</div>
<div class="one columns"> </div>
<div class="eight columns intro">
<p>
Hello! I am Rithesh Kumar, an AI researcher with expertise in deep learning and generative modeling.
Currently, I am a Research Scientist and member of the Audio Research Group at <a href="https://research.adobe.com/">Adobe Research</a>.
<br><br>
Previously, I was the Technical Lead for the Overdub Research team at <a
href="https://www.descript.com">Descript Inc.</a> In this time, I built and shipped 4+ text-to-speech models behind the
flagship <a href="https://www.descript.com/overdub">Overdub</a> feature capable of ultra-realistic voice cloning and
performing corrections on recordings through text. Recently, I also led the development of the
<a href="https://www.descript.com/regenerate"> Regenerate </a>feature that leverages instant voice cloning technology to make bad edits sound seamless and natural.
</p>
<p> Currrently, I live in Toronto, Ontario 🇨🇦.</p>
<h5>Education</h5>
<hr />
<p> I completed
my MSc
in Computer Science (specializing in Artificial Intelligence)
at the <a href="https://mila.quebec/en/">Mila lab</a> in Université de Montréal supervised
by <a href="http://www.iro.umontreal.ca/~bengioy/yoshua_en/">Yoshua Bengio</a>. During my MSc, I had
the excellent opportunity to intern at <a
href="https://www.descript.com/lyrebird-ai?source=lyrebird">Lyrebird</a> and <a
href="https://www.microsoft.com/en-us/research/lab/microsoft-research-montreal/">Microsoft
Research - Montréal</a>.
<p>Earlier, I graduated from <a href="http://www.ssn.edu.in/">SSN College of Engineering</a>
(affiliated to <a href="https://www.annauniv.edu/">Anna University</a>) with a Bachelors
in Computer Science and Engineering. I spent the final 2 years of my undergrad learning about
deep learning, spending a summer at the <a href="http://serre-lab.clps.brown.edu/">Serre Lab</a>
in
Brown University and collaborating with <a href="http://www.iro.umontreal.ca/~bengioy/yoshua_en/">
Prof. Yoshua Bengio</a> at the Mila
lab.
</p>
</p>
<h5>Publications</h5>
<hr />
<div class="row publication">
<div class="pub-image four columns">
<img src="img/rvqgan.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/2306.06546">High-Fidelity Audio Compression with Improved RVQGAN</a></span>
<div class="pub-authors"><span class="fw6">Rithesh
Kumar*</span>, Prem Seetharaman*, Alejandro Luebs, Ishaan Kumar, Kundan Kumar
</div>
<span class="pub-venue">Poster Presentation (Spotlight) - NeurIPS 2023 </span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/vampnet.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/2307.04686">VampNet: Music Generation via Masked Acoustic Token Modeling</a></span>
<div class="pub-authors">Hugo Flores Garcia, Prem Seetharaman, <span class="fw6">Rithesh
Kumar</span>, Bryan Pardio
</div>
<span class="pub-venue">Poster Presentation - ISMIR 2023 </span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/cargan.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/2110.10139">Chunked Autoregressive GAN
for Conditional Waveform Synthesis</a></span>
<div class="pub-authors">Max Morrison, <span class="fw6">Rithesh
Kumar</span>, Kundan Kumar,
Prem Seetharaman, Aaron Courville, Yoshua Bengio
</div>
<span class="pub-venue">Poster Presentation - ICLR 2022</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/nugan.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/2010.11362">NU-GAN: High Resolution
Neural Upsampling With GANs</a></span>
<div class="pub-authors"><span class="fw6">Rithesh Kumar</span>, Kundan Kumar, Vicki Anand,
Yoshua Bengio,
Aaron Courville
</div>
<!-- <span class="pub-venue">Hosted at Arxiv</span> -->
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/melgan.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/1910.06711">MelGAN: Generative
Adversarial Networks for Conditional Waveform Synthesis </a></span>
<div class="pub-authors">Kundan Kumar*, <span class="fw6">Rithesh Kumar*</span>, Thibault de
Boissiere, Lucas Gestin, Wei Zhen Teoh, Jose Sotelo, Alexandre de Brébisson, Yoshua
Bengio,
Aaron Courville
</div>
<span class="pub-venue">Poster Presentation - NeurIPS 2019</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/ebm.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/1901.08508">Maximum Entropy
Generators
for Energy-based Models</a></span>
<div class="pub-authors"><span class="fw6">Rithesh Kumar</span>, Sherjil Ozair, Anirudh
Goyal,
Aaron Courville, Yoshua Bengio
</div>
<span class="pub-venue">Masters Thesis</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/piano_roll.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/1811.07426">Harmonic Recomposition
using
Conditional Autoregressive Modeling</a></span>
<div class="pub-authors">Kyle Kastner, <span class="fw6">Rithesh Kumar</span>,
Tim Coojimans, Aaron Courville
</div>
<span class="pub-venue">Poster Presentation - Joint Workshop on Machine Learning for Music
(ICML
2018)</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/obamanet.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://arxiv.org/abs/1801.01442">ObamaNet: Photo-realistic
lip-sync from text</a></span>
<div class="pub-authors"><span class="fw6">Rithesh Kumar</span>, Jose Sotelo, Kundan Kumar,
Alexandre de Brébisson, Yoshua Bengio
</div>
<span class="pub-venue">Oral Presentation - Machine Learning for Creativity and Design
Workshop
(NeurIPS 2017)</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/samplernn.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title">
<a href="https://arxiv.org/abs/1612.07837"> SampleRNN: An Unconditional End-to-End
Neural
Audio Generation Model
</a> </span>
<div class="pub-authors">Soroush Mehri, Kundan Kumar, Ishaan Gulrajani,
<span class="fw6">Rithesh Kumar</span>, Shubham Jain, Aaron Courville, Yoshua Bengio
</div>
<span class="pub-venue">Poster Presentation - ICLR 2017</span>
</div>
</div>
<h5>Select Projects</h5>
<hr />
<div class="row publication">
<div class="pub-image four columns">
<img src="img/vqvae.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a href="https://github.com/ritheshkumar95/pytorch-vqvae">Reproducing
Neural Discrete Representation Learning</a></span>
<div class="pub-authors"><span class="fw6">Rithesh Kumar</span>, Tristan Deleu, Evan Racah
</div>
<span class="pub-venue">Final project - <a
href="https://ift6135h18.wordpress.com/">Representation
Learning</a></span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/handwriting.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a
href="https://github.com/ritheshkumar95/pytorch_handwriting_synthesis">Reproducing
Handwriting Synthesis and Prediction</a></span>
<div class="pub-authors"><span class="fw6">Rithesh Kumar</span>
</div>
<span class="pub-venue">Open source project</span>
</div>
</div>
<div class="row publication">
<div class="pub-image four columns">
<img src="img/im2latex.png">
</div>
<div class="pub-desc eight columns">
<span class="pub-title"><a
href="https://github.com/ritheshkumar95/im2latex-tensorflow">Reproducing
What You Get Is
What
You See: Visual Markup Decompiler</a></span>
<div class="pub-authors"><span class="fw6">Rithesh Kumar</span>, Rithesh Rohan, U.
Sivashanmugam
<!-- </div> -->
<span class="pub-venue">Undergraduate Thesis</span>
</div>
</div>
</div>
<footer class="page-footer font-small blue">
<div class="footer-copyright text-center"> Shout-out to <a href="https://ankeshanand.com">
Ankesh
Anand</a> for the
website template.
</div>
</footer>
</div>
</div>
<!-- Footer -->
<!-- Footer -->
<!-- End Document
–––––––––––––––––––––––––––––––––––––––––––––––––– -->
</body>
</html>