Abstract—The programmable Graphics Processing Unit (GPU) has over the years become an integral part of today’s computing systems. GPU use cases have gradually extended from graphics to a wide range of applications. Since programmable GPUs are now making their way into mobile devices, it is interesting to study these new use cases there as well. To this end, we created a programming environment based on the embedded profile of the recently released Khronos OpenCL standard and ran it against an image processing workload on a mobile device with CPU, GPU, and DSP back-ends. Early results on performance and energy consumption with a CPU+GPU configuration were promising, but they also suggest that there is room for optimization.