We Need Statistical Significance Testing in Machine Translation Evaluation
A rule of thumb may yield correct results but can’t be scientifically credible. Illustration by the author. Take any research paper or blog post presenting a new method for AI,…