MLM task is broken in the new update #1322

pooryapzm opened this issue Jun 9, 2021 · 1 comment

pooryapzm commented Jun 9, 2021

Describe the bug
The MLM task is broken: model creation fails when building the MLM head.

To Reproduce

  1. Tell us which version of jiant you're using: 2.2.0
  2. Describe the environment where you're using jiant, e.g., "MacBook CPU"

Expected behavior
It should create the model and start training.

Screenshots
It throws the following exception when it tries to create the MLM head:
(screenshot: exception traceback, "Screen Shot 2021-06-09 at 12 14 54 PM")

Additional context
The issue happens in the following line:

head = head_class(task, **kwargs)

It happens because head_class here is JiantMLMHeadFactory, which gets called with arguments even though its initializer doesn't accept any. A workaround is to instantiate the factory first by adding the following lines:

        if head_class == JiantMLMHeadFactory:
            # the factory takes no constructor arguments, so instantiate it
            # first and then call the instance with the task arguments
            head_class = head_class()
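
For context, here is a hedged sketch of how that dispatch could look with the workaround in place; only head_class, task, **kwargs, and JiantMLMHeadFactory come from the snippets above, the surrounding function is assumed for illustration:

    def create_head(head_class, task, **kwargs):
        if head_class == JiantMLMHeadFactory:
            # JiantMLMHeadFactory.__init__ takes no arguments, so build the
            # factory instance before calling it like the other head classes
            head_class = head_class()
        head = head_class(task, **kwargs)
        return head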

After this fix, the following line throws an exception:

mlm_head_class = self.registry[task.TASK_TYPE]

It can be fixed by changing it to:

    def __call__(self, task, model_arch, **kwargs):
        """Summary

        Args:
            task (Task): Task used to initialize task head
            model_arch: Model architecture key used to look up the MLM head class in the registry
            **kwargs: Additional arguments required to initialize task head
        """
        mlm_head_class = self.registry[model_arch]
        mlm_head = mlm_head_class(**kwargs)
        return mlm_head
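
A hypothetical invocation of the patched factory (assumes a task object from jiant's setup; the model_arch value and the hidden_size kwarg are illustrative, not taken from jiant's actual config):

    mlm_head_factory = JiantMLMHeadFactory()
    mlm_head = mlm_head_factory(
        task,
        model_arch="bert",  # key into the factory's registry of MLM head classes
        hidden_size=768,    # assumed example of a kwarg forwarded to the head class
    )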

Then, the next issue is with the following line:

input_ids=masked_batch.input_ids,

and it can be fixed by changing the line to:
input_ids=masked_batch.masked_input_ids,
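
As a hedged illustration of the corrected call site (the model call and the attention-mask field name are assumptions for illustration; only masked_batch.masked_input_ids comes from the fix above):

    mlm_output = mlm_model(
        input_ids=masked_batch.masked_input_ids,  # feed the masked token ids, not the originals
        attention_mask=masked_batch.attention_mask,
    )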


Pzoom522 commented Apr 28, 2022

Thanks for the workaround! Additionally, the original line 201 should be further changed to

def __call__(self, task, model_arch, hidden_dropout_prob, **kwargs):

to prevent hidden_dropout_prob from being passed inside **kwargs, which leads to an error.
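
Put together, a hedged sketch of the fully patched method (the body mirrors the registry lookup shown earlier; capturing hidden_dropout_prob explicitly keeps it out of **kwargs so it is not forwarded to a head class that does not accept it):

    def __call__(self, task, model_arch, hidden_dropout_prob, **kwargs):
        # hidden_dropout_prob is consumed here rather than forwarded via **kwargs
        mlm_head_class = self.registry[model_arch]
        mlm_head = mlm_head_class(**kwargs)
        return mlm_head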

Also, as of Transformers v4.5, transformers.models.bert.modeling_bert.BertLayerNorm and transformers.models.bert.modeling_bert.gelu are no longer available. My correction is to change the former to torch.nn.LayerNorm and the latter to x = torch.nn.functional.gelu(x).
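
A hedged sketch of what the patched MLM transform might look like after those substitutions (the class and attribute names are illustrative, not jiant's exact code):

    import torch.nn as nn
    import torch.nn.functional as F

    class MLMHeadTransform(nn.Module):
        def __init__(self, hidden_size, layer_norm_eps=1e-12):
            super().__init__()
            self.dense = nn.Linear(hidden_size, hidden_size)
            # replaces the removed modeling_bert.BertLayerNorm
            self.layer_norm = nn.LayerNorm(hidden_size, eps=layer_norm_eps)

        def forward(self, x):
            x = self.dense(x)
            # replaces the removed modeling_bert.gelu helper
            x = F.gelu(x)
            x = self.layer_norm(x)
            return x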

Hope this helps future users!
