Skip to content

Stereo Encoder: Higher interharmonic noise-floor for speech, 32 kbps

Basic info

  • Float reference:
  • Fixed point:
    • Encoder (fixed): c89d638e
    • Decoder (fixed): n/a

Bug description

For certain speech segments, the BASOP encoder seems to exhibit a slightly higher inter-harmonic noise floor for certain speech segments. For example for the LTVs, between 0:59.88 and 1:00.80:

Float:

Bildschirmfoto 2025-03-19 um 15.14.27.png

BASOP:

Bildschirmfoto 2025-03-19 um 15.14.36.png

It's not clear to me whether this is a problem of a different ACELP mode or some other issue.

Ways to reproduce

IVAS_cod -stereo 32000 16 ltv16_STEREO.wav ltv16_STEREO_32kbps.192
IVAS_dec stereo 16 ltv16_STEREO_32kbps.192 ltv16_STEREO_32kbps.wav