Fix rounding of directToTotalRatio
This change improves the precision of directToTotalRatio and corrects its rounding. Optimization: this MR also removes 7 calls to BASOP_Util_Divide1616_Scale across 2 functions, diffuse_meta_merge_1x1_fx and full_stream_merge_fx.