Route gemma-4 sliding-window layers through FlashAttention-2#860
Open
danielhanchen wants to merge 4 commits into
Open
Route gemma-4 sliding-window layers through FlashAttention-2#860danielhanchen wants to merge 4 commits into
danielhanchen wants to merge 4 commits into
background
wait
wait-all
cancel
parallel
Loading