Auto-translated Chinese national IOI training team report papers

Revision en8, by z4120, 2020-11-30 19:07:41

This is the Chinese national training team report papers translated into English using several computer tools.

I find this to be a much way to read these PDF papers than Google Translate or Foxit Reader Translate (despite the limitations -- see below), so I think it may be useful to other people too.

Original papers download:

Auto-translated papers download:

I've only translated some topics, but I will upload more in the future.

I'll update all the files if I find some better way to translate those.

I could not find any existing post that does the same thing, despite a lot of blog posts that requests it: 1 2 3 and I find it really hard to copy and paste each line into a translator program (or select each line), and translating the whole thing with Google Translate (or similar) will remove the figures/formulas, so the side-by-side comparison was helpful.

Issues/possible improvements/contributions:

  • It's really hard to find a good site/program to translate PDF files. Does anyone know one better than this one?

    The one I'm using fails badly sometimes, stretches or shrinks the text. Sample page (low resolution version). However, it's still better than the alternatives (Foxit Reader Translate, Google document translate), which requires highlighting/copying each sentence, scrolling two windows parallelly, and/or overflows the page width so horizontal scrolling is required.

  • I suppose that the original Chinese characters are still preserved inside the PDF; however direct copy and paste results in corrupted data.

    If anyone can figure out how to extract the Chinese characters without OCR, that would improve the translation quality (because currently the OCR is not perfect, and there are some errors).

    (some metadata in a PDF shows that it was made with Microsoft Word 2013 and/or Acrobat 11.0.0)

  • The images, math formulas and pseudo code listings are not preserved.

    This is a limitation of ABBYY OCR tool. Although it can be fixed manually, I'm not going to do that.

  • You can also write (usually English; however Chinese HTML is still easier to translate than Chinese PDF) blog posts to explain the techniques.

  • Or find existing content (in English) that describes those techniques.

History

 
 
 
 
Revisions
 
 
  Rev. Lang. By When Δ Comment
en10 English z4120 2021-03-14 16:47:11 1235 Migrate to DeepL
en9 English z4120 2021-03-11 08:50:57 969 2020/1 upload
en8 English z4120 2020-11-30 19:07:41 61 First publish (published)
en7 English z4120 2020-11-30 19:03:03 193 Tiny change: '/i7A1.png) (\n [low resol' -> '/i7A1.png)\n ([low resol'
en6 English z4120 2020-11-30 18:54:56 426
en5 English z4120 2020-11-30 18:43:17 1491
en4 English z4120 2020-11-30 07:26:41 70
en3 English z4120 2020-11-30 07:20:55 53
en2 English z4120 2020-11-30 07:19:09 3222
en1 English z4120 2020-11-29 19:59:45 8 Initial revision (saved to drafts)