In this episode we answer questions about calling functions from kernels and about the use of clFinish. We also discuss basic GPU architecture, memory layout, shared memory, thread blocks, warps, and efficient data loading.