thanks your work，and a question about GPU #4

YRQ66 · 2023-02-11T14:08:31Z

Thank you for your work. I want to ask about the requirements for the size of gpu memory to train

rainyl · 2023-02-12T02:14:45Z

In my situation, I trained it using a RTX 3080Ti GPU with 12G memory and a batch size of 16, the GPU memory was almost fully occupied. I think with lower batch size, 8G may be also enough but need more time.
Hoping my answer can help you :)

YRQ66 · 2023-02-12T14:38:10Z

I found a problem. When I tried to train, the verification process took a lot of time, as shown in the figure. In addition, I found that the dataset is different from the common MathJax fonts. At present, I am trying to build a larger dataset, which is expected to exceed 100K. It is expected that I will open it here soon. In the future, I will also try to build a handwritten dataset (which may take longer). Thank you again for your work

rainyl · 2023-02-13T02:39:01Z

the verification process took a lot of time

Well, I guess it may be due to the first time of loading data from hard drive, when setting pin_memory=True for DataLoader, the rest data loading process will be fast.
Also, the current collate_fn is not clever and need to be improved.
If the validation process is really slow, you can set conf.eval_batch to a smaller value, the default is 0x3f3f3f and in most condition the whole validation dataset will be included.

the dataset is different from the common MathJax fonts

In order to enhance the dataset, I included several common formula fonts (the font will influence the OCR result greatly), I remember that the font used by MathJax is similar to TimesNewRoman or STIX or XITS (please correct it if I am wrong), so the formula rendered by MathJax should be recognizable by this program.

It is really great that you are trying to complete the dataset, I think for open-sourced LaTex formula OCR project, the most important thing is actually the high-quality dataset itself, the current dataset contains a lot of errors, many formula can't render or rendered not as expected. I am looking forward to your works.

YRQ66 · 2023-02-13T12:04:47Z

Thank you for your help, which will continue to inspire me

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

thanks your work，and a question about GPU #4

thanks your work，and a question about GPU #4

YRQ66 commented Feb 11, 2023

rainyl commented Feb 12, 2023

YRQ66 commented Feb 12, 2023

rainyl commented Feb 13, 2023

YRQ66 commented Feb 13, 2023

thanks your work，and a question about GPU #4

thanks your work，and a question about GPU #4

Comments

YRQ66 commented Feb 11, 2023

rainyl commented Feb 12, 2023

YRQ66 commented Feb 12, 2023

rainyl commented Feb 13, 2023

YRQ66 commented Feb 13, 2023