Skip to content

Benjamin Marie's Blog

Analysis About AI, Natural Language Processing, and Machine Translation

Benjamin Marie's Blog

Analysis About AI, Natural Language Processing, and Machine Translation

  • Blog
  • About
  • Contact
    • Home
    • Benjamin Marie
    • Page 2
Evaluation Machine translation

We Need Statistical Significance Testing in Machine Translation Evaluation

Oct 27, 2022

A rule of thumb may yield correct results but can’t be scientifically credible. Illustration by the author. Take any research paper or blog post presenting a new method for AI,…

Conference LLM Machine translation

COLING 2022 Highlights

Oct 25, 2022

More robust evaluation metrics, language models that don’t understand anything, and better evaluation for grammatical error correction COLING 2022 was held in mid-October in Gyeongju (Republic of Korea). This natural language…

Framework/Tool

Romanize Any Language Without Machine Learning

Oct 15, 2022

A heuristic-based method exploiting Unicode tables

Framework/Tool Machine translation

MBR Decoding: Get Better Results from Many Systems

Oct 15, 2022

Even your state-of-the-art system has flaws that others don’t have Illustration by the author. Made with Noto Color Emoji. New large language models (BLOOM, OPT, GPT-3, NLLB, …) are released almost…

Conference Evaluation Machine translation

A Large-Scale Automatic Evaluation of Machine Translation

Sep 29, 2022

Like every year since 2006, the Conference on Machine Translation (WMT) organized extensive machine translation shared tasks. Numerous participants from all over the world submitted their machine translation (MT) outputs…

Conference Evaluation Machine translation

AMTA 2022 Highlights

Sep 23, 2022

I highlight and sum up the AMTA 2022 papers that I found the most original and interesting. I picked papers from the users (see the proceedings) and the research (see…

Evaluation Framework/Tool Machine translation

compare-mt: Because Scoring Your Systems Is Not Enough

Aug 29, 2022

I present compare-mt. A very simple tool that gives the user a high-level and coherent view of the salient differences between systems. It exploits statistics usually computed by automatic metrics such…

Evaluation GPT LLM Machine translation

Comparing the Uncomparable to Claim the State of the Art: A Concerning Trend

Aug 16, 2022

I describe how uncomparable numbers happen to be compared in scientific papers. Then, I review some concrete examples of flawed comparisons from very well-known and recent work, namely, Open AI’s…

Posts navigation

1 2

« Previous Page

About the author:
Ph.D, research scientist in NLP/AI.
Advocate of the scientific credibility.
Building next-gen AI translation systems: https://slaitor.com

  • Conference
  • Evaluation
  • Framework/Tool
  • GPT
  • LLM
  • Machine translation
  • Scientific credibility

You Missed

Evaluation Scientific credibility

Do Bigger Evaluation Datasets Make Your Results More Significant?

Evaluation Machine translation Scientific credibility

Scientific Credibility in Machine Translation Research: Pitfalls and Promising Trends

Machine translation GPT LLM

AI Won’t Replace Translators

Evaluation Machine translation

Traditional Versus Neural Metrics for Machine Translation Evaluation

Benjamin Marie's Blog

Analysis About AI, Natural Language Processing, and Machine Translation

Copyright © All rights reserved | Blogus by Themeansar.