Home / Corpora



Download About The first version of Nemlar corpus was produced within the NEMLAR project. This is a set of annotated Arabic texts collected from 13 different domains and contains about 500,000 words. The Arabic Language Processing team (ALP team) of ...

Read More »

MulTed corpus

The corpus will be available for Download soon Abstract: The MulTed is a multilingual aligned and tagged parallel corpus. i.e.,  it is multilingual and Part of Speech (PoS) tagged, but the sentence-alignment is bilingual, with English as a pivot language. ...

Read More »

OSIAN corpus

The corpus will be available for Download soon You can download sample files Abstract: The Open Source International Arabic News (OSIAN) corpus has been collected from international Arabic news websites like CNN, DW, RT, Aljazeera, among others. With a server-friendly ...

Read More »

ăn dặm kiểu NhậtResponsive WordPress Themenhà cấp 4 nông thônthời trang trẻ emgiày cao gótshop giày nữdownload wordpress pluginsmẫu biệt thự đẹpepichouseáo sơ mi nữhouse beautiful