This talk outlines techniques to integrate GPUs into an existing large scale MPI application. Examples are presented for the parallel solution of PDE problems from CFD and CSM using multigrid solvers with FEAST. It is part of the full day GPGPU and CUDA tutorials, held in conjunction with ARCS 2008, Architecture of Computing Systems, Dresden, Germany, in February 2008