Fixes #12673. record_stream in group offloading is not working properly
#1963
| Job | Run time |
|---|---|
| 33s | |
| 31s | |
| 12m 29s | |
| 5m 19s | |
| 49s | |
| 47m 10s | |
| 3m 37s | |
| 16m 1s | |
| 9m 16s | |
| 4m 35s | |
| 3m 37s | |
| 3m 30s | |
| 5m 4s | |
| 4m 48s | |
| 3m 4s | |
| 3m 40s | |
| 3m 31s | |
| 2h 7m 34s |