Extracting Parallel-text pairs from Wikipedia
I was able to extract almost 15,000 English (en) to Odia (or) parallel text pairs from Wikipedia. Millions can be extracted for a well-resourced language.
Wikipedia's translation tool does not auto-translate Odia
Wikipedia has its own translation tool, which lets you translate an article from one language to another. It mostly uses the Google Translate API to convert sentences in near real time. However, for less fortunate languages such as Odia, where Google Translate is not yet available, you have to translate manually.
Paragraph Alignment
After an editor translates an article, the good thing about the tool is that it keeps the paragraphs of the source and target languages aligned. However, the alignment may not always be accurate. Quoting the MediaWiki page:
Content Translation does not do any kind of automatic alignment and the provided alignment is only best effort based on how the connections were preserved while translation happens. When automatically aligning the sentences, it is good to remember that the translations do not necessarily match 1:1.
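Since the alignment is only best effort, one possible safeguard is to drop pairs that look badly mismatched before using them. The following is a minimal sketch, not the tool's own logic: it assumes the paragraph pairs have already been extracted as (source, target) tuples, and the function name, input shape, and length-ratio thresholds are all illustrative assumptions.

```python
from typing import List, Optional, Tuple

# Hypothetical input shape: each item is a (source_paragraph, target_paragraph)
# tuple. Either side may be None or empty if the translator skipped it.
Pair = Tuple[Optional[str], Optional[str]]


def filter_aligned_pairs(
    pairs: List[Pair],
    min_ratio: float = 0.5,  # assumed threshold, tune per language pair
    max_ratio: float = 2.0,  # assumed threshold, tune per language pair
) -> List[Tuple[str, str]]:
    """Keep pairs where both sides are non-empty and of comparable length."""
    kept: List[Tuple[str, str]] = []
    for source, target in pairs:
        source = (source or "").strip()
        target = (target or "").strip()
        if not source or not target:
            continue  # one side is missing, so there is nothing to align
        # Character-length ratio is a crude proxy for "probably the same content".
        ratio = len(target) / len(source)
        if min_ratio <= ratio <= max_ratio:
            kept.append((source, target))
    return kept


if __name__ == "__main__":
    sample = [
        ("The sun rises in the east.", "A translated paragraph of similar length."),
        ("A long source paragraph that was barely translated.", "x"),
        ("A paragraph with no translation.", None),
    ]
    for src, tgt in filter_aligned_pairs(sample):
        print(src, "->", tgt)
```

A length ratio cannot confirm that two paragraphs mean the same thing. It only removes the obvious mismatches, such as a paragraph that was left untranslated or merged with its neighbour.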