gitstatus/ at master · romkatv/gitstatus edit / delete

Speeding up directory listing. This will, however, have horrible performance on NFS... so it's not really true that there's nothing much left to optimise!

to optimisation unix ... on 29 June 2020

How LLVM Optimizes a Function – Embedded in Academia edit / delete

A nice walkthrough of how LLVM's optimisation passes rewrite a simple loop. This assumes you know how LLVM's SSA representation works, but it'll still make sense even if you don't.

to cmp409 compiler llvm loop optimisation ... on 11 September 2018

N4455 No Sane Compiler Would Optimize Atomics edit / delete

"Compilers do optimize atomics, memory accesses around atomics, and utilize architecture-specific knowledge. This paper illustrates a few such optimizations, and discusses their implications."

to atomic cmp201 compiler optimisation ... on 11 September 2018

Compilers - What Every Programmer Should Know About Compiler Optimizations edit / delete

AG0803 students will find this interesting -- a pretty good overview of the basic ideas.

to ag0803 compiler optimisation ... on 22 January 2016

How to optimize Raspberry Pi code using its GPU « Pete Warden's blog edit / delete

Speeding up computer vision code using the VideoCore.

to gpu optimisation performance raspberry-pi videocore vision ... on 24 August 2014

A concrete illustration of practical running time vs big-O notation - The Old New Thing - Site Home - MSDN Blogs edit / delete

One for AG0803 students: why algorithmic complexity isn't everything in a world with complex memory architectures.

to ag0803 complexity data-structures memory optimisation performance ... on 13 August 2014

SSE – Vectorizing conditional code | Félix Abecassis edit / delete

An example of using GCC's vectorisor.

to ag0803 gcc optimisation vectorisation ... on 02 June 2014

Auto-vectorization in GCC - GNU Project - Free Software Foundation (FSF) edit / delete

Has a nice set of examples of things GCC can vectorise.

to gcc optimisation vectorisation ... on 02 June 2014

Blosc edit / delete

"Blosc is a high performance compressor optimized for binary data. It has been designed to transmit data to the processor cache faster than the traditional, non-compressed, direct memory fetch approach via a memcpy() OS call." This is very neat -- it's a compressed data format designed so that you can decompress it into cache very quickly. AG0803 students take note!

to ag0803 cache compression memory optimisation python ... on 28 April 2014

Browser bookmarks: tasty+ | tasty= Log in | Export | Atom