« earlier | later » Page 2 of 6
Sorting data in parallel CPU vs GPU | Solarian Programmer
Simple example of benchmarking CPU and GPU sorting.
to ag0803 gpu parallel sort teaching ... on 11 February 2013
Vaguely TBB-ish library for doing combined CPU/GPU computation, shifting data around and picking alternative task implementations as required.
UPC appears to be roughly to C as OpenMP is to Fortran, but with support for distributed computation. (That is, it's a way of doing C-like computation with pointers and shared data across multiple machines. Whether this is a good idea I'm not entirely convinced, but it's always nice to see better compiler support for parallel programming.)
to distributed gcc parallel software upc ... on 25 November 2012
The Secrets of Parallel Pathfinding on Modern Computer Hardware - IntelĀ® Software Network
Interesting (although a bit light on technical detail).
to ag0803 parallel pathfinding ... on 19 May 2012
SIMD < SIMT < SMT: parallelism in NVIDIA GPUs
How parallel constructs in CUDA et al. map to hardware.
A parallel monad from Simon M.
to concurrency haskell parallel ... on 02 September 2011
OpenMP Multi-Threaded Template Library
STL-compatible algorithms parallelised with OpenMP.
High Performance Computing: Training
Tutorial materials on various HPC stuff, including OpenMP and MPI.
Designing and Building Parallel Programs
to concurrency etext parallel teaching ... on 16 May 2011
« earlier | later » Page 2 of 6
- parallel | |
6 | ag0803 |
1 | aspects |
1 | astrophysics |
1 | bcrypt |
1 | bit-twiddling |
1 | c |
1 | ca |
1 | cdc |
1 | ce0813 |
1 | cell |
1 | cilk |
1 | cloud |
1 | code |
1 | compiler |
18 | concurrency |
2 | concurrent |
1 | coroutine |
1 | cpu |
1 | cray |
1 | cryptography |
1 | csp |
1 | design |
2 | distributed |
1 | efficiency |
2 | embedded |
1 | erlang |
1 | etext |
1 | fork |
1 | forth |
1 | functional |
1 | games |
1 | gcc |
1 | gcd |
2 | gpgpu |
2 | gpu |
1 | graphics |
1 | haskell |
1 | history |
1 | hpc |
1 | intel |
2 | java |
1 | journal |
1 | language |
3 | language-design |
1 | life |
1 | lightweight-processes |
1 | lightweight-threads |
1 | lisp |
1 | manycore |
1 | memory |
1 | microcontroller |
1 | mobile |
1 | mpi |
1 | multicore |
1 | nbody |
1 | obfuscated |
1 | oo |
4 | openmp |
1 | optimisation |
1 | os |
1 | ownership |
52 | parallel |
1 | parallel-survey |
1 | parasail |
1 | pathfinding |
2 | patterns |
1 | performance |
2 | pi-calculus |
2 | pict |
1 | power |
8 | programming |
20 | research |
2 | retrocomputing |
1 | rust |
1 | simulation |
6 | software |
1 | sort |
1 | splash |
1 | static-analysis |
9 | teaching |
1 | type-systems |
1 | upc |
1 | valgrind |
2 | vector |
1 | vm |
tasty by Adam Sampson.