[llama] Added the fused rotary embedding kernel #216
Job | Run time |
---|---|
5m 26s | |
5m 32s | |
5m 49s | |
4m 3s | |
3m 59s | |
4m 12s | |
6m 40s | |
3m 26s | |
4m 47s | |
1s | |
43m 55s |
Job | Run time |
---|---|
5m 26s | |
5m 32s | |
5m 49s | |
4m 3s | |
3m 59s | |
4m 12s | |
6m 40s | |
3m 26s | |
4m 47s | |
1s | |
43m 55s |