Text this: Group feature calibration for sound event detection