Aligner

Aligner screenshot

An aligner is a program that aligns Welsh and English texts with each other.

It aligns texts at the sentence level, turning two texts in different languages into a list of sentence pairs.
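As an illustration of what such output might look like, the sketch below reads sentence pairs from tab-separated lines. The line layout (Welsh sentence, English sentence, optional confidence score) and the names `SentencePair` and `parse_aligned_lines` are assumptions made for this example, not a format the aligner is documented to produce.

```python
from typing import Iterable, List, NamedTuple, Optional


class SentencePair(NamedTuple):
    """One aligned Welsh-English sentence pair (hypothetical structure)."""
    welsh: str
    english: str
    score: Optional[float]  # alignment confidence, if the line carries one


def parse_aligned_lines(lines: Iterable[str]) -> List[SentencePair]:
    """Parse 'welsh<TAB>english[<TAB>score]' lines into sentence pairs."""
    pairs = []
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        welsh, english = fields[0].strip(), fields[1].strip()
        if not welsh or not english:
            continue  # skip segments that have no counterpart
        score = None
        if len(fields) > 2:
            try:
                score = float(fields[2])
            except ValueError:
                score = None  # tolerate a non-numeric third column
        pairs.append(SentencePair(welsh, english, score))
    return pairs


# Small made-up sample: two aligned pairs and one blank line.
sample = [
    "Bore da.\tGood morning.\t0.9\n",
    "Diolch yn fawr.\tThank you very much.\n",
    "\n",
]
pairs = parse_aligned_lines(sample)
```

Keeping the score alongside each pair makes it possible to filter out low-confidence alignments later, for instance before the pairs are used as training data.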

Aligners are useful not only for developers who want to create parallel data for training machine translation systems, but also for translators who wish to create translation memories from translations produced with any CAT tool, or from any other texts that are available in two or more languages.


Using the aligner

The Welsh-English Aligner is built on the open-source program Hunalign, together with a dictionary of equivalent Welsh-English words, terms and phrases that helps the program align the two languages effectively.
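A minimal sketch of how Hunalign and such a dictionary might be driven from Python follows. It is not the code from our repository. It assumes that a `hunalign` executable is on the PATH, that Welsh is the source text and English the target, that both input files already hold one sentence per line, and that the dictionary uses the `target @ source` line format described in Hunalign's own documentation. The function names and the file names `cy-en.dic`, `text.cy` and `text.en` are hypothetical.

```python
import subprocess
import tempfile
from pathlib import Path
from typing import Iterable, List, Tuple


def write_dictionary(entries: Iterable[Tuple[str, str]], path: Path) -> None:
    """Write (welsh, english) entries, one 'target @ source' item per line.

    Assumption: Welsh is the source language and English the target.
    """
    lines = [f"{english} @ {welsh}" for welsh, english in entries]
    path.write_text("\n".join(lines) + "\n", encoding="utf-8")


def build_command(dictionary: Path, source: Path, target: Path,
                  executable: str = "hunalign") -> List[str]:
    """Build a Hunalign call for UTF-8 input that prints aligned text."""
    # -utf: count characters rather than bytes; -text: print the aligned
    # sentences themselves rather than only their positions.
    return [executable, "-utf", "-text",
            str(dictionary), str(source), str(target)]


def align(dictionary: Path, source: Path, target: Path) -> str:
    """Run Hunalign and return its standard output as text.

    Only works where Hunalign is installed; not called in the demo below.
    """
    result = subprocess.run(build_command(dictionary, source, target),
                            capture_output=True, check=True)
    return result.stdout.decode("utf-8")


# Demo: write a two-entry dictionary and show the command that would be run.
with tempfile.TemporaryDirectory() as tmp:
    dic_path = Path(tmp) / "cy-en.dic"
    write_dictionary([("bore da", "good morning"), ("diolch", "thanks")],
                     dic_path)
    dictionary_text = dic_path.read_text(encoding="utf-8")

command = build_command(Path("cy-en.dic"), Path("text.cy"), Path("text.en"))
```

Separating `build_command` from `align` keeps the command easy to inspect or log before anything is executed, which helps when processing a large collection of files.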

Our alignment resources on GitHub contain code samples showing how to use Hunalign with Python to align large collections of texts automatically, as well as a tutorial on how translators can use the popular LFAligner app.

To see the code, go to:

techiaith/alinio