
A question about zero-grad settings in VL-adapter's multitask.py file. #19

Open

y2sman opened this issue May 27, 2024 · 0 comments

Thanks for your brilliant work.

In the training loop of multitask.py, I see the following:

```python
batch['log_train_accuracy'] = self.args.log_train_accuracy

# self.optim.zero_grad()
if self.args.fp16 and _use_native_amp:
    with autocast():
        if self.args.distributed:
            results = self.model.module.train_step(batch)
        else:
            results = self.model.train_step(batch)
else:
    if self.args.distributed:
        results = self.model.module.train_step(batch)
    else:
        results = self.model.train_step(batch)

loss = results['loss']
```

Looking at the code, it appears that you train without zeroing the gradients before backpropagation: the `self.optim.zero_grad()` call is commented out.

Is there a reason why this works?
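For reference, here is a minimal PyTorch sketch (not from this repo) of why the missing `zero_grad()` looks surprising: `backward()` accumulates into `.grad` rather than overwriting it, so unless the gradients are zeroed somewhere else in the loop (for example, right after `optimizer.step()`) or the accumulation is intentional (gradient accumulation across micro-batches), each step would see the sum of all previous gradients.

```python
import torch

# Minimal sketch (assumed toy setup, not VL-adapter code): one scalar parameter.
w = torch.ones(1, requires_grad=True)

# First step: d/dw of (2 * w) is 2.
(2 * w).sum().backward()
print(w.grad)  # tensor([2.])

# Second step without zeroing: backward() adds to the existing .grad.
(2 * w).sum().backward()
print(w.grad)  # tensor([4.]) -- accumulated, not replaced

# Zeroing (what optimizer.zero_grad() does for all parameters) restores
# the expected per-step gradient.
w.grad.zero_()
(2 * w).sum().backward()
print(w.grad)  # tensor([2.])
```

If the actual loop calls `self.optim.zero_grad()` after `self.optim.step()` instead of before the forward pass, the behavior is equivalent to the usual pattern; otherwise the gradients from every batch would be summed.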
