Interval evaluation of temporal (in)stability for neural machine translation