U.S. State Department Crawled Corpus
International news and global public affairs shift focus frequently, and their language and terminology evolve constantly. U.S. Department of State press releases closely mirror these developments and sometimes originate them. This corpus makes it possible to incorporate these shifts in language and topic into translations of news, diplomatic texts and other current affairs content.
The press releases consist of news articles, diplomatic statements and transcribed press conferences, and cover the period from April 21, 2017 to June 21, 2019. The translations were automatically segmented and aligned, then deduplicated, shuffled and cleaned using common-sense cleaning criteria. Individual translations are in the public domain. We thank the U.S. Department of State and the Office of Language Services for making these translations available.
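
For readers who want to apply similar post-processing to their own aligned data, the deduplication, cleaning and shuffling steps described above could be sketched roughly as follows. This is only an illustration: the function name `clean_pairs`, the `max_ratio` threshold and the specific filters (empty segments, untranslated copies, extreme length ratios) are assumptions chosen for the example, not the criteria actually used to build this corpus.

```python
import random


def clean_pairs(pairs, seed=0, max_ratio=3.0):
    """Filter, deduplicate and shuffle (source, target) sentence pairs.

    The filters below are hypothetical examples of "common-sense" criteria;
    they are not the corpus authors' documented rules.
    """
    seen = set()
    kept = []
    for src, tgt in pairs:
        src, tgt = src.strip(), tgt.strip()
        # Drop pairs where either side is empty.
        if not src or not tgt:
            continue
        # Drop pairs where the target is an untranslated copy of the source.
        if src == tgt:
            continue
        # Drop pairs whose character lengths differ by an implausible factor.
        ratio = max(len(src), len(tgt)) / min(len(src), len(tgt))
        if ratio > max_ratio:
            continue
        # Drop exact duplicates of a pair that was already kept.
        if (src, tgt) in seen:
            continue
        seen.add((src, tgt))
        kept.append((src, tgt))
    # Shuffle with a fixed seed so the result is reproducible.
    random.Random(seed).shuffle(kept)
    return kept
```

A fixed seed keeps the shuffle reproducible, and the length-ratio threshold would likely need tuning per language pair, since some languages are systematically more compact than others.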