[llama] Added the fused rotary embedding kernel #1708
Job | Run time |
---|---|
8m 53s | |
10m 5s | |
1m 33s | |
1m 42s | |
1m 29s | |
1m 51s | |
1m 42s | |
4m 16s | |
4m 51s | |
1s | |
36m 23s |
Job | Run time |
---|---|
8m 53s | |
10m 5s | |
1m 33s | |
1m 42s | |
1m 29s | |
1m 51s | |
1m 42s | |
4m 16s | |
4m 51s | |
1s | |
36m 23s |