Implement custom fast call fiber for arm32.

The basic calling works, next we measure on both x86 and arm and then decide on how we implement our fiber/'staggered stack' abstraction.
4 jobs from coninuation in 36 seconds (queued for 2 seconds)
Status Job ID Name Coverage
  Build
passed #3452
build_cmake

00:19

 
  Test
failed #3453
run_tests

00:16

 
  Sanitizer
skipped #3455
run_address_sanitizer
skipped #3454
run_thread_sanitizer
 
Test run_tests
compilation terminated.
make[2]: *** [app/invoke_parallel/CMakeFiles/invoke_parallel.dir/main.cpp.o] Error 1
app/invoke_parallel/CMakeFiles/invoke_parallel.dir/build.make:62: recipe for target 'app/invoke_parallel/CMakeFiles/invoke_parallel.dir/main.cpp.o' failed
make[1]: *** [app/invoke_parallel/CMakeFiles/invoke_parallel.dir/all] Error 2
CMakeFiles/Makefile2:392: recipe for target 'app/invoke_parallel/CMakeFiles/invoke_parallel.dir/all' failed
Makefile:140: recipe for target 'all' failed
make: *** [all] Error 2
./ci_scripts/run_tests.sh: line 9: ./bin/tests: No such file or directory
ERROR: Job failed: exit code 1