Commits · 83c6e6220f6d9c779f1f020ae433966804cb98d4 · las3_pub / predictable_parallel_patterns

23 Jan, 2020 4 commits
- Draft of new context switching tasks. · 83c6e622
  FritzFlorian committed Jan 23, 2020
  
  83c6e622 Browse Files
- Add plots to context switch benchmarks. · 5e0ce1f5
  FritzFlorian committed Jan 23, 2020
  
  5e0ce1f5 Browse Files
- Add custom context switch library. · e2092e63
```
The rationale to do an custom implementation is that the existing solutions are quite a bit slower and/or require more memory.
```
  FritzFlorian committed Jan 23, 2020
  e2092e63 Browse Files
- Add benchmark results from running on x86 and arm32. · af75e21a
  FritzFlorian committed Jan 23, 2020
  
  af75e21a Browse Files
22 Jan, 2020 1 commit

Implement custom fast call fiber for arm32. · 3f7b5ad0

The basic calling works, next we measure on both x86 and arm and then decide on how we implement our fiber/'staggered stack' abstraction.

committed Jan 22, 2020

3f7b5ad0 Browse Files

21 Jan, 2020 1 commit

Add custom 'fast fiber call' implementation to comparison. · 5b791d0e

We now cover all implementations that have a chance of being fast.
ARM implementations for our 'fast fiber call' are still missing. After we add them we decide on how to proceed.

committed Jan 21, 2020

5b791d0e Browse Files

20 Jan, 2020 1 commit
- Add alternative context switch implementation (similar to boost). · 3a6b724d
  FritzFlorian committed Jan 20, 2020
  
  3a6b724d Browse Files
13 Jan, 2020 1 commit
- Extend context switch example for arm32. · 540fb8ed
  FritzFlorian committed Jan 13, 2020
  
  540fb8ed Browse Files
10 Jan, 2020 1 commit

Add minimal example for x86_64 user level threads. · 5490e966

We implement a minimal concepts of user level threads. This shows the minimum requirements for our 'staggered' stack implementation: we need to be able to switch to a new stack and allow someone else to continue the calling function right before the switch.

committed Jan 10, 2020

5490e966 Browse Files

04 Jan, 2020 1 commit
- Add fib benchmark. · d054e1ab
  FritzFlorian committed Jan 04, 2020
  
  d054e1ab Browse Files
20 Dec, 2019 1 commit
- Add two 'standardized' benchmarks. · 79ac0243
  FritzFlorian committed Dec 20, 2019
  
  79ac0243 Browse Files
05 Dec, 2019 1 commit

Minor changes for profiling and add more alignment. · 2f539691

The idea is to exclude as many sources as possible that could lead to issues with contention and cache misses. After some experimentation, we think that hyperthreading is simply not working very well with our kind of workload. In the future we might simply test on other hardware.

committed Dec 05, 2019

2f539691 Browse Files

04 Dec, 2019 1 commit
- Working version of our trading-deque · e34ea267
  FritzFlorian committed Dec 04, 2019
  
  e34ea267 Browse Files
02 Dec, 2019 1 commit
- Sketch out idea for lock free trading deque. · 4cf3848f
  FritzFlorian committed Dec 02, 2019
  
  4cf3848f Browse Files
29 Nov, 2019 3 commits

First 'crash free' version. · 1b576824

This version runs through our initial fft and fib tests. However, it is not tested further in any way. Additionally, we added a locking deque, potentially hurting performance and moving away from our initial goal.

committed Nov 29, 2019

1b576824 Browse Files

WIP: Partly functional version. Stealing and continuation tarding works 'most' of the time. · c6dd2fc0

The main issue seems to still be the fact that we have a lock free protocol where a steal can be pending. We plan to remove this fact next by introducing a protocol that works on a single atomic update.

committed Nov 29, 2019

c6dd2fc0 Browse Files

WIP: We plan to fully remove the start property from the cont manager. · adf05e9a

The start_chain property does not make sense, as chains are purely 'virtual', i.e. they only fully exist when walking through the computation (by patching them on important events). We initially added the property as a helper for better runtime and simpler implementation, but we think without it we will not get as much inconsistency in the runtime state. Performance can be 're-added' later on.

committed Nov 29, 2019

adf05e9a Browse Files

27 Nov, 2019 2 commits
- WIP: Major flaws fixed. Edge cases at beginning missing and cleanup for conts missing. · 21733e4c
  FritzFlorian committed Nov 27, 2019
  
  21733e4c Browse Files
- WIP: Refactor memory manager to reduce redundancy. · 69fd7e0c
```
It is still not working, however we now have no more redundant code, making debugging it simpler.
```
  FritzFlorian committed Nov 27, 2019
  69fd7e0c Browse Files
25 Nov, 2019 1 commit

WIP: Add first performance tests of single threaded execution. · 8668cad2

We changed up some of the memory constraints in the lock free deque and will need to see if this is ok. If so, the single threaded performance looks very good.

committed Nov 25, 2019

8668cad2 Browse Files

19 Nov, 2019 1 commit

WIP: Fast path with 'outlined' slow path code in place. · c2d4bc25

Everything so far is untested. We only made sure tha fast path still seems to function correctly. Next up is writing tests for both the fast and slow path to then introduce the slow path. After that we can look at performance optimizations.

committed Nov 19, 2019

c2d4bc25 Browse Files

07 Nov, 2019 1 commit

WIP: First implementation of serial/fast path. · 842b518f

This showcases the expected performance when a task executes a sub-tree without inference from other threads. We target to stay about 6x slower than a normal function call.

committed Nov 07, 2019

842b518f Browse Files

06 Nov, 2019 3 commits
- WIP: Sketch fast path of task manager. · 39d2fbd8
  FritzFlorian committed Nov 06, 2019
  
  39d2fbd8 Browse Files
- WIP: Initialization of continuation chains. · d3b64a85
  FritzFlorian committed Nov 06, 2019
  
  d3b64a85 Browse Files
- WIP: Sketch continuation and taks class. · 740ae661
```
This first sketch of the classes captures what we think is needed in terms of general interface and very mich WIP.
```
  FritzFlorian committed Nov 06, 2019
  740ae661 Browse Files
05 Nov, 2019 1 commit

WIP: re-work static memory allocation for scheduler. · 693d4e9b

We changed how the memory is allocated from passing char* buffers to then store objects into to creating 'fat objects' for all scheduler state. This eases development for us, as we can make changes to data structures without too much effort (e.g. add a second array to manage tasks if required).

committed Nov 05, 2019

693d4e9b Browse Files

02 Oct, 2019 1 commit

Add deconstructor calls to tasks. · 5bc35f9e

Our stack is not calling deconstructors of its elements. This is problematic for e.g. the graph implementation where reference counted images are hold in tasks. To solve this for now we manually call the deconstructor after each tasks (we do so, because a generic, virtual deconstructor adds runtime costs to primitive tasks, requiring us to re-run all benchmarks; with this change we do not need to do this and as we re-work the scheduler anyways we postpone a clean implementation for then).

committed Oct 02, 2019

5bc35f9e Browse Files

01 Oct, 2019 1 commit
- Rework spawn ordering/waiting on pipeline tasks · eca0dd4d
  FritzFlorian committed Oct 01, 2019
  
  eca0dd4d Browse Files
30 Sep, 2019 1 commit
- Remove un-needed iteration in scan algorithm. · 722ddf41
  FritzFlorian committed Sep 30, 2019
  
  722ddf41 Browse Files
16 Sep, 2019 2 commits
- Fix: App used old threading interface. · ef19ea1b
  FritzFlorian committed Sep 16, 2019
  
  ef19ea1b Browse Files
- Rework threads to not be template based. · 4a9ca21d
```
This allows us to more easily handle them and makes their interface closer to std::thread.
```
  FritzFlorian committed Sep 16, 2019
  4a9ca21d Browse Files
02 Sep, 2019 1 commit

Yield thread if there is (probably) no more work. · 65409e0a

The scheduler yields if it failed to steal any work due to the task
lists being empty. This should improve performance on multiprogrammed
systems, as it potentially makes room for other worker threads which
still have work to perform.

committed Sep 02, 2019

65409e0a Browse Files

30 Aug, 2019 2 commits
- Add divison strategies for for-each api. · 68af3068
  FritzFlorian committed Aug 30, 2019
  
  68af3068 Browse Files
- Move scheduling related data structures in correct package. · f3e7df77
  FritzFlorian committed Aug 30, 2019
  
  f3e7df77 Browse Files
02 Aug, 2019 1 commit
- Add notes on futex linux syscall. · 2801278f
```
This might allow us to do lock free, conditional waits in our stealing loop.
```
  FritzFlorian committed Aug 02, 2019
  2801278f Browse Files
01 Aug, 2019 2 commits
- Refactor locking_deque to new interface. · 353a5b17
  FritzFlorian committed Aug 01, 2019
  
  353a5b17 Browse Files
- Change both stack and queue to same offset counters. · e403e498
```
This allows the stack and deque class to use the same offset, making it work better with each other.
```
  FritzFlorian committed Aug 01, 2019
  e403e498 Browse Files
31 Jul, 2019 3 commits
- Add first performance measurements of dataflow API. · a9361609
  FritzFlorian committed Jul 31, 2019
  
  a9361609 Browse Files
- Add simple logo. · 7874c2a2
  FritzFlorian committed Jul 31, 2019
  
  7874c2a2 Browse Files
- Merge branch 'dataflow' into 'master' · aba75f54
```
Merge: Dataflow

See merge request !12
```
  Florian Fritz committed Jul 31, 2019
  aba75f54 Browse Files