Matching Data Library

Off-the-Shelf Matching Data

No time to create a customized data corpus? Choose from our Matching Data Library, built and validated in cooperation with TAUS members. The library corpora were compiled by applying TAUS Matching Data search to the TAUS Data Cloud repository.

* Special promotional price: 25% discount on all titles until May 31, 2019.
The discount is only valid for Matching Data purchases in EUR. It is not applicable to purchases with Data Cloud or Partner credits.
 
Convenient
Ready to download whenever you need it
 
Clean data
Data and corpora validated by TAUS members
 
Easy overview
Volumes, segment domain origin and test bed rating
 
Volume discount
25% off bulk purchases of at least 5 corpora
Special Discount
-25%
eBay
E-commerce

E-commerce Corpus


Reliable product descriptions and information are a crucial asset in any e-commerce environment. In these corpora you'll find carefully filtered and cleaned data on a wide variety of product types, making it even easier for your global customers to click the 'Add to shopping cart' button!
French - Dutch
German - Polish
German - Italian
English - Italian
Special Discount
-25%
Universitat Autonoma de Barcelona
Medical / Pharmaceutical

Medical/Pharmaceutical Corpus


High-fidelity MT training data is always important, even more so when it comes to medical subjects. This is a must-have corpus for anyone seeking pharma-related data.
English - Spanish
Special Discount
-25%
RWS Moravia
Customer Support/Help

Customer Support Corpus


Need help getting your customer support content into Dutch? Whether it is for your webshop, product documentation, or website, information related to customer support is usually fairly standardized and therefore best handled with automation.
English - Dutch
Special Discount
-25%
Oracle
Colloquial Text

Colloquial Corpus


Is your chatbot not chatty enough? Does your MT engine look puzzled when it has to deal with informal business communication or user-generated content? This corpus will give the conversation with your local audience a friendly, casual tone.
English - Spanish (International)
English - Portuguese (Brazil)
English - Chinese (PRC)
English - Korean
English - Japanese

Couldn't find what you were looking for?

Do you have a query corpus to submit?
Request Matching Data
Contact us to get more information
Contact us