Performance metrics #1

uditsaxena · 2018-04-30T04:22:33Z

Hey, could you please talk about the performance metrics of this pytorch implementation?

Thanks

hengruo · 2018-04-30T04:45:54Z

Thanks for your attention! I just finished this model. It now reaches EM/F1 = 70.5/77.2 after 20 epochs. I will release more detailed metrics soon.

uditsaxena · 2018-04-30T04:53:42Z

Thanks, could you also talk about training time?

Ramondy · 2018-05-14T11:21:19Z

Thanks for sharing! We just started playing with your code as part of a school project. The performance we get is horrible: EM/F1 = 0.02/3.94 after 10 epochs. It seems the model is not learning. We are digging in... Any ideas that might help? @uditsaxena our training time is a little over an hour per epoch on an AWS p2.xlarge.

hengruo · 2018-05-22T01:01:43Z

@Ramondy I got the same results after my destructive changes... I'm trying to reimplement it to rescue this model and make the code cleaner.

rsps950551 · 2018-05-23T09:43:33Z

I have the same problem as Ramondy. Any ideas that might help?

huaiwen · 2018-05-24T09:28:37Z

I have the same problem: EM: 0.025, F1: 4.1858

BangLiu · 2018-07-04T02:31:25Z

@hengruo May I ask what the current best performance you can get is? I found a few things that differ from the paper:

Your learning rate is not fixed at 0.001 after 1000 steps. When I use the same learning rate scheduler, performance keeps improving, but it changes quite slowly after some epochs.
It seems you didn't use an exponential moving average for the parameter weights. The decay of 0.9999 you set is for the scheduler, where it actually acts as the gamma parameter.
If I fix the learning rate at 0.001 after 1000 epochs, my performance is not good and often changes dramatically during each epoch (my code is mostly based on your implementation, but multi-head attention is implemented differently, as an isolated module).
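
To make the two points above concrete, here is a minimal sketch in plain Python. It assumes the warm-up is a logarithmic ramp from 0 to 0.001 over the first 1000 steps (one common reading of the paper's "inverse exponential increase") and that the 0.9999 decay is meant for averaging the weights, not for the learning rate. The names `warmup_lr` and `EMA` are hypothetical and are not taken from this repository.

```python
import math

BASE_LR = 1e-3       # target learning rate after warm-up
WARMUP_STEPS = 1000  # number of warm-up steps


def warmup_lr(step, base_lr=BASE_LR, warmup_steps=WARMUP_STEPS):
    """Learning rate at 0-based `step`: logarithmic ramp, then constant."""
    if step + 1 >= warmup_steps:
        return base_lr
    # log(1) = 0 at step 0, and the ratio reaches 1 at the last warm-up step.
    return base_lr * math.log(step + 1) / math.log(warmup_steps)


class EMA:
    """Exponential moving average of parameter values (not of the learning rate)."""

    def __init__(self, decay=0.9999):
        self.decay = decay
        self.shadow = {}

    def register(self, name, value):
        # With tensors, a detached copy should be stored here instead.
        self.shadow[name] = value

    def update(self, name, value):
        # shadow <- decay * shadow + (1 - decay) * current value
        averaged = self.decay * self.shadow[name] + (1.0 - self.decay) * value
        self.shadow[name] = averaged
        return averaged


if __name__ == "__main__":
    for step in (0, 9, 99, 999, 5000):
        print(step, warmup_lr(step))
```

In PyTorch, a function like `warmup_lr(step) / BASE_LR` could be passed as `lr_lambda` to `torch.optim.lr_scheduler.LambdaLR`, with `update` called on every parameter after each optimizer step and the averaged values used at evaluation time. By contrast, passing 0.9999 as `gamma` to an exponential scheduler multiplies the learning rate by 0.9999 at every scheduler step, which is a different thing from averaging the weights.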

Looking forward to your reply!

susht3 · 2018-07-04T08:09:41Z

What is the performance now?

sherlockhommel · 2018-07-19T08:44:43Z

I played around with the configuration a little, but I am barely reaching an F1 of 20 (even after more than 30,000 steps). I see that the default configuration in config.py differs from the paper (e.g. number of heads in multi-head attention = 2 instead of 8, batch_size, etc.).

Could someone with better results share the exact configuration they used? I think this would help others get up to speed and start the real experimentation ;) Thanks!
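
For anyone comparing settings, a small sketch like the following might help list where a local configuration departs from the paper. The values are the ones I understand the QANet paper to report; the key names are hypothetical and may not match the names used in config.py, and `diff_config` is only an illustrative helper.

```python
# Values as I read them in the QANet paper.
# Key names are hypothetical and may not match this repository's config.py.
PAPER_CONFIG = {
    "num_heads": 8,
    "batch_size": 32,
    "hidden_size": 128,
    "learning_rate": 0.001,
    "warmup_steps": 1000,
    "ema_decay": 0.9999,
}


def diff_config(current, reference=PAPER_CONFIG):
    """Return {key: (current_value, reference_value)} for every key that differs."""
    return {
        key: (current.get(key), ref_value)
        for key, ref_value in reference.items()
        if current.get(key) != ref_value
    }


if __name__ == "__main__":
    # Only num_heads = 2 comes from this thread; keys left unset show up as None.
    my_config = {"num_heads": 2}
    for key, (mine, paper) in diff_config(my_config).items():
        print(f"{key}: mine={mine}, paper={paper}")
```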

hengruo · 2018-07-19T12:41:07Z

@susht3 @fkaupmann Two people have now reached about 65.0 F1 after 25,000 iterations. I've updated the README. If you would like to know the training details, see the other issue about memory explosion; they discussed it there.

ZhaoyueCheng · 2018-07-19T15:45:58Z

I tried to train with the default parameters, but F1/EM stays very low: F1 is around 10 even after training for a long time. Is there anything I need to pay attention to while training the model? Thanks!

zhijuny · 2019-06-08T08:59:53Z

> I tried to train with the default parameters, but F1/EM stays very low: F1 is around 10 even after training for a long time. Is there anything I need to pay attention to while training the model? Thanks!

Have you solved the problem?

LilAnthony123 · 2020-07-22T08:25:38Z

Hello, what is the performance now? I got an F1/EM score below 10.

zhijuny · 2020-07-22T08:35:48Z

Sorry, I gave up on this project a long time ago.

