refactor: sync gpu before prover

Asynchronous GPU operations may still be in flight when the prover starts, which may inflate the prover's measured run time. Running futhark bench yields faster-than-ZKBoo speeds on the GPU, so this may be an issue.
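
To illustrate the timing concern, here is a minimal sketch in Python. It is a simulation, not this project's code: a single-worker thread pool stands in for an in-order GPU command queue, and `pending_work`, `prover`, and `time_prover` are hypothetical names with placeholder sleep durations.

```python
import time
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in for a GPU command queue: a single worker means
# submitted tasks run strictly in submission order.
queue = ThreadPoolExecutor(max_workers=1)

def pending_work():
    time.sleep(0.2)   # placeholder for earlier async work (e.g. a transfer)

def prover():
    time.sleep(0.05)  # placeholder for the prover itself

def time_prover(sync_first):
    earlier = queue.submit(pending_work)  # async work enqueued before the prover
    if sync_first:
        earlier.result()                  # sync: block until the queue has drained
    start = time.perf_counter()
    queue.submit(prover).result()         # in-order queue: prover waits its turn
    return time.perf_counter() - start

unsynced = time_prover(sync_first=False)
synced = time_prover(sync_first=True)
print(f"without sync: {unsynced:.3f}s, with sync: {synced:.3f}s")
queue.shutdown()
```

Without the sync, the measured interval also covers the earlier work still in the queue. With a Futhark-generated C library, the corresponding synchronisation call would presumably be `futhark_context_sync` on the context before starting the timer.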
