Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

request for README of pytorch-ssd-mmd-coral #6

Open
chrisliu54 opened this issue Jul 4, 2018 · 8 comments
Open

request for README of pytorch-ssd-mmd-coral #6

chrisliu54 opened this issue Jul 4, 2018 · 8 comments

Comments

@chrisliu54
Copy link

Hi, I run the examples/ssd/train_visda.py script after figure out the directory(using visda18 dataset), and I came across several errors as follows:
image

After I run next time(without any change), I got the following error message:
image

When debugging, I found tensor operation buggy in the following code:

xmin = w - boxes[:,2]
xmax = w - boxes[:,0]

Specifically, when tensor pass throught the line above, the actual dimension is of 1d, not 2d.

I'm still puzzled about the dataset folder configuration. Hope the README file of the code coming out soon : )

@MInner
Copy link
Collaborator

MInner commented Jul 5, 2018

As for the "Connection refused" error, install and run wisdom server in parallel, or comment lines that start it. Not sure about the latter, are you sure you downloaded our gt files and provided paths to them?

@chrisliu54
Copy link
Author

chrisliu54 commented Jul 5, 2018

Yep, I just downloaded the gt files mentioned in this page and modified the corresponding path in the source code. I used the images extracted from train.tar as training data and visda18-detection-train.txt/visda18-detection-test.txt as gt boxes.

The latter problem was actually caused by dimension mismatch. Sometimes a boxes with only 1 dimension was passed to the function. I fixed this problem by modifying the code this way:

xmin = w - boxes[..., 2]
xmax = w - boxes[..., 0]

@chrisliu54
Copy link
Author

BTW, according to this page, I found the "Connection refued" error can be solved by setting num_worker=0 in when defining torch.utils.data.DataLoader.

@chrisliu54
Copy link
Author

Well, another issue appeared when processing this line:

cls_loss[cls_targets<0] = 0 # set ignored loss to 0

I got the error message below:
image

@heinzermch
Copy link

heinzermch commented Jul 5, 2018

The whole code is written in a very sloppy way. I recommend to use TorchCV directly, which is what this project is based on anyway. Plus at least the implementation of Self-Ensemble is wrong (not that it would work for this task). I did not check the implementations of the other baselines, as I don't have a good understanding of them.

@MInner
Copy link
Collaborator

MInner commented Jul 6, 2018

@heinzermch yep, I suspected that self-assembling code was probably wrong logically, but I would appreciate if you could point me more specifically towards what exactly is wrong with this implementation. As for the quality of code in the repo, it is pretty much torchcv SSD example with data readers for this specific dataset and somewhat different run logic since you have to train supervisely on source, unsupervisely on target, and eval on different splits of both. Pull requests are welcome :)

@krumo
Copy link

krumo commented Aug 11, 2018

@chrisliu54 hi, did you solve this error?

@chrisliu54
Copy link
Author

After I turned to use official repo of TorchCV, some weird errors has gone since then : )

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants