add xfails to hrtf_loading tests
for all cases when comparing between ROM and a differing hrtf loaded from file, the respective testcase is now reported as xfail previously, it was reported as fail, but warnings about delay differences were reported, which felt wrong