In this work the authors present a range-Doppler processor for SAR data focusing running on GPUs. The standard algorithm has been revisited with a fine-grain parallelism over the range gates and the range migration algorithm has been implemented using shared memory to reduce memory transfer overhead. After optimization, an overall 10x speedup has been observed with respect to a CPU reference.