about the training implementation #5

chenypic · 2018-05-16T13:29:39Z

Great work. Thanks for your code. Do you have a plan to publish the training implementation? I really want to follow your job.

MarvinTeichmann · 2018-05-16T14:23:44Z

ConvCRFs can be trained using PyTorch. Training is straight forward and can be done like any other neural network. Iterate over the training data, apply softmax cross-entropy loss and use the pytorch autograd package to backprop.

I strongly recommend that you implement your own pipeline. Having a good understanding of your training process is quite crucial in deep learning.

I am considering to make my pipeline public, however the code is currently quite messy, undocumented and will not work out of the box. I think implementing your own pipeline by following some of the pytorch tutorials is much more rewarding and easiert then trying to make mine work.

Edit: I deleted part of my earlier response to increase my overall niceness. You can find the full response in the changelog.

chenypic · 2018-05-16T15:39:13Z

Thanks for your detailed response. I appreciate it, and I agree with you. I will implement my own pipeline according to your paper and my task.

SHMCU · 2018-08-27T03:15:12Z

Hi Marvin,
I wrote a script to train the convCRF using nll loss. I treat the air plane image as a two class segmentation problem. At the beginning the training went well, the segmentation was improving, but if I keep train it, it would not converge. It reaches the min loss value then the loss stated to increase and the segmentation become worse. Finally, the result become look like the noisy unary. Could you give me some suggestions on what problem this could be? Thank you very much!

Hai

prio1988 · 2018-08-28T20:46:40Z

Hi Hai,

may I ask you why have you used the nll loss and not the cross entropy loss in the training?

Thanks

hsu-z2 · 2018-08-28T20:54:53Z

Hi prio1988,

I think nll loss is actually multiclass cross entropy, right? It should also work when I set the model to work on only two classes, that is background and foreground. Right?

prio1988 · 2018-08-28T21:00:04Z

Nll loss assume that you have already applied a logSoftMax layer on the top of your network. The multi class cross entropy loss is the torch.nn.CrossEntropyLoss. I think that probably you should use the last one. Instead I am still wondering why to apply a logsoftmax on the unary instead that just a softmax.

SHMCU · 2018-08-28T21:07:40Z

Oh, thank you for the very good suggestion! I will dig into the problem of logsoftmax+nll or softmax+crossEntropyLoss.I read somewhere that logsoftmax is numerically more stable than softmax. On Tuesday, August 28, 2018, 5:00:05 PM EDT, prio1988 <[email protected]> wrote: Nll loss assume that you have already applied a logSoftMax layer on the top of your network. The multi class cross entropy loss is the torch.nn.CrossEntropyLoss. I think that probably you should use the last one. Instead I am still wondering why to apply a logsoftmax on the unary instead that just a softmax. — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

prio1988 · 2018-08-28T21:09:37Z

If you use the crossEntropyLoss you can avoid also the softmax. It is done internally by the loss.

SHMCU · 2018-08-28T21:11:58Z

OK. Then that would be much better. Since the implementation of crossEntropyLoss already considered the numerical stability issues.Thank you! On Tuesday, August 28, 2018, 5:09:38 PM EDT, prio1988 <[email protected]> wrote: If you use the crossEntropyLoss you can avoid also the softmax. It is done internally by the loss. — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

HqWei · 2019-01-16T13:04:58Z

I have trained it however I get the following error:
RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn

HqWei · 2019-01-16T13:07:11Z

Is there any one having tried training?

qiqihaer · 2019-03-26T07:15:35Z

I have trained it however I get the following error:
RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn

I have the same problem. Have you solved it?

pvthuy · 2020-06-02T09:00:04Z

@HqWei @qiqihaer Could you share a portion of your code for training convCRF?

SHMCU · 2020-06-02T19:01:01Z

There is a paper called PAC-CRF, you may find the convCRF implementation there. On Tuesday, June 2, 2020, 02:00:21 AM PDT, pvthuy <[email protected]> wrote: @HqWei @qiqihaer Could you share a portion of your code for training convCRF? — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or unsubscribe.

pvthuy · 2020-06-03T01:59:59Z

@SHMCU It's very helpful. Thank you very much!

GITSHOHOKU · 2021-11-22T04:23:48Z

@SHMCU It's very helpful. Thank you very much!

Hi, did you solve the in-place operation problem?
Should we set CRF iteration step to 1 to avoid this error? I tried it on PACCRF and the same problem occured.

GITSHOHOKU · 2021-11-22T04:27:34Z

ConvCRFs can be trained using PyTorch. Training is straight forward and can be done like any other neural network. Iterate over the training data, apply softmax cross-entropy loss and use the pytorch autograd package to backprop.

I strongly recommend that you implement your own pipeline. Having a good understanding of your training process is quite crucial in deep learning.

I am considering to make my pipeline public, however the code is currently quite messy, undocumented and will not work out of the box. I think implementing your own pipeline by following some of the pytorch tutorials is much more rewarding and easiert then trying to make mine work.

Edit: I deleted part of my earlier response to increase my overall niceness. You can find the full response in the changelog.

Hi, I have a question about the training step with this wonderful CRF impletement.
Should we set CRF iteration step=1 in training step ? And in inference step to set it bigger than 1?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

about the training implementation #5

about the training implementation #5

chenypic commented May 16, 2018

MarvinTeichmann commented May 16, 2018 •

edited

Loading

chenypic commented May 16, 2018

SHMCU commented Aug 27, 2018

prio1988 commented Aug 28, 2018

hsu-z2 commented Aug 28, 2018

prio1988 commented Aug 28, 2018

SHMCU commented Aug 28, 2018 via email

prio1988 commented Aug 28, 2018

SHMCU commented Aug 28, 2018 via email

HqWei commented Jan 16, 2019 •

edited

Loading

HqWei commented Jan 16, 2019

qiqihaer commented Mar 26, 2019

pvthuy commented Jun 2, 2020

SHMCU commented Jun 2, 2020 via email

pvthuy commented Jun 3, 2020

GITSHOHOKU commented Nov 22, 2021

GITSHOHOKU commented Nov 22, 2021

about the training implementation #5

about the training implementation #5

Comments

chenypic commented May 16, 2018

MarvinTeichmann commented May 16, 2018 • edited Loading

chenypic commented May 16, 2018

SHMCU commented Aug 27, 2018

prio1988 commented Aug 28, 2018

hsu-z2 commented Aug 28, 2018

prio1988 commented Aug 28, 2018

SHMCU commented Aug 28, 2018 via email

prio1988 commented Aug 28, 2018

SHMCU commented Aug 28, 2018 via email

HqWei commented Jan 16, 2019 • edited Loading

HqWei commented Jan 16, 2019

qiqihaer commented Mar 26, 2019

pvthuy commented Jun 2, 2020

SHMCU commented Jun 2, 2020 via email

pvthuy commented Jun 3, 2020

GITSHOHOKU commented Nov 22, 2021

GITSHOHOKU commented Nov 22, 2021

MarvinTeichmann commented May 16, 2018 •

edited

Loading

HqWei commented Jan 16, 2019 •

edited

Loading