Bug in generation code #85

Squire-tomsk · 2024-07-10T13:15:47Z

At this line, a delay pattern mask is generated and applied to the initial audio IDs. Then, at this line, a mask is also generated to revert the delay on the output tokens. However, it generates the mask based on input_ids which already includes the delay pattern.

The text was updated successfully, but these errors were encountered:

henry-tujia · 2024-08-07T07:56:52Z

Here is just regenerating the mask once again to facilitate extracting the actual output of the model from the ids that already contain the mask.

Squire-tomsk · 2024-08-12T12:41:29Z

Yes, I understand the intention. However, the current code does not regenerate the mask in all cases. If the initial input_ids only contain a vector of BOS tokens, it works perfectly. But if you try to generate a continuation of some audio, the input_ids will be modified here line, and the mask generated here line will be incorrect.

Guppy16 · 2024-08-16T16:18:52Z

#110

Does this help?

apresence mentioned this issue Aug 30, 2024

GREAT MODELS, but a number of issues ... #125

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug in generation code #85

Bug in generation code #85

Squire-tomsk commented Jul 10, 2024

henry-tujia commented Aug 7, 2024

Squire-tomsk commented Aug 12, 2024

Guppy16 commented Aug 16, 2024

Bug in generation code #85

Bug in generation code #85

Comments

Squire-tomsk commented Jul 10, 2024

henry-tujia commented Aug 7, 2024

Squire-tomsk commented Aug 12, 2024

Guppy16 commented Aug 16, 2024