The first version of the Nemlar corpus was produced within the NEMLAR project. It is a set of annotated Arabic texts collected from 13 different domains and contains about 500,000 words. The Arabic Language Processing team (ALP team) of ...
Yearly Archives: 2018
MulTed corpus
The corpus will be available for download soon. Abstract: MulTed is a multilingual, aligned, and tagged parallel corpus. That is, it is multilingual and part-of-speech (PoS) tagged, but the sentence alignment is bilingual, with English as the pivot language. ...