Open
Description
Hi, I noticed that when many small batches are rendered (for example, if I apply a lot of additive or shader effects), the performance is reduced greatly, because bufferSubData copies the whole arrays, causing performance drops.
By creating shallow subarrays with the batch size, the performance improves for scenes with lots of graphics and effects such as additive blending.
This is how I implemented it in my fork:.