`MLP*`: Doesn't respect input IDs #292

weefuzzy · 2024-12-24T23:20:30Z

The MLP client training code seems to assume that the input and output data / labels are identically ordered, and never checks that this is the case. Much of the time it is, but one can simply 'reorder' the entries of a Dataset – which should remain semantically equivalent because it's an unordered collection – and completely screw up one's training.

There's no particularly nice solution to this: we just have to traverse the ID list of one of the Datasets and rearrange (a copy of) the other to match. The fix currently in progress introduces a new flag (`strict`) to enable this, because it's not cheap.
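To make the reordering step concrete, here is a minimal Python sketch of the idea. It is not the actual fix: it assumes each Dataset can be viewed as a list of IDs plus a parallel list of rows, and the name `align_to_ids` is hypothetical.

```python
def align_to_ids(reference_ids, other_ids, other_rows):
    """Return a copy of other_rows reordered to follow reference_ids.

    Hypothetical helper: IDs are hashable values (e.g. strings) and
    other_rows[i] is the row that belongs to other_ids[i].
    """
    if len(other_ids) != len(other_rows):
        raise ValueError("other_ids and other_rows differ in length")

    # Map each ID to its row position so each lookup is a dict access.
    index_of = {id_: i for i, id_ in enumerate(other_ids)}
    if len(index_of) != len(other_ids):
        raise ValueError("duplicate IDs in other_ids")

    # Both collections must hold exactly the same IDs, in any order.
    if sorted(reference_ids) != sorted(other_ids):
        raise ValueError("the two ID lists do not contain the same IDs")

    # Copy the rows in reference order; the originals are left untouched.
    return [list(other_rows[index_of[id_]]) for id_ in reference_ids]


input_ids = ["a", "b", "c"]
output_ids = ["c", "a", "b"]  # same entries, different order
output_rows = [[20.0], [0.0], [10.0]]

aligned = align_to_ids(input_ids, output_ids, output_rows)
# aligned == [[0.0], [10.0], [20.0]]
```

Even in this toy form the cost is visible: an extra ID lookup table plus a full copy of one data set on every fit, which is presumably why one would want it behind a flag.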

weefuzzy added the bug label Dec 24, 2024

weefuzzy self-assigned this Dec 24, 2024
