« earlier | later » Page 2 of 6
Sorting data in parallel CPU vs GPU | Solarian Programmer edit / delete
Simple example of benchmarking CPU and GPU sorting.
to ag0803 gpu parallel sort teaching ... on 11 February 2013
Vaguely TBB-ish library for doing combined CPU/GPU computation, shifting data around and picking alternative task implementations as required.
UPC appears to be roughly to C as OpenMP is to Fortran, but with support for distributed computation. (That is, it's a way of doing C-like computation with pointers and shared data across multiple machines. Whether this is a good idea I'm not entirely convinced, but it's always nice to see better compiler support for parallel programming.)
to distributed gcc parallel software upc ... on 25 November 2012
The Secrets of Parallel Pathfinding on Modern Computer Hardware - IntelĀ® Software Network edit / delete
Interesting (although a bit light on technical detail).
to ag0803 parallel pathfinding ... on 19 May 2012
SIMD < SIMT < SMT: parallelism in NVIDIA GPUs edit / delete
How parallel constructs in CUDA et al. map to hardware.
simonmar/monad-par - GitHub edit / delete
A parallel monad from Simon M.
to concurrency haskell parallel ... on 02 September 2011
OpenMP Multi-Threaded Template Library edit / delete
STL-compatible algorithms parallelised with OpenMP.
High Performance Computing: Training edit / delete
Tutorial materials on various HPC stuff, including OpenMP and MPI.
Designing and Building Parallel Programs edit / delete
to concurrency etext parallel teaching ... on 16 May 2011
« earlier | later » Page 2 of 6
tasty by Adam Sampson.