Remove the extraneous '+0' immediate offset part in PTX load/stores, to improve readability of output PTX code.