When using the GPU in a straightforward manner, outperforming the CPU is a task of moderate complexity once one gets the knack of it. But compared to the (measured) peak performance, GFLOPs rates are disappointing. The set of experimental results being presented clearly shows what future research should be focussed on to achieve proper fractions of the available performance reserves.