Terms of Use
USAGE: The Simultaneous Interpretation Corpus 2021 may be used for research purposes. Commercial use is prohibited.
REDISTRIBUTION: The interpretation transcripts in the corpus shall not be redistributed. It is permissible, however, to cite examples from the corpus for the purpose of presenting research results.
CITATION: All reports/publications using the Simultaneous Interpretation Corpus 2021 must acknowledge its use. We recommend that this is done through a citation to the paper describing the corpus and a link to the corpus page:
Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura. Large-Scale English-Japanese Simultaneous Interpretation Corpus: Construction and Analyses with Sentence-Aligned Data Proceedings of the 18th International Conference on Spoken Language Translation, pp. 226--235, 2021
NAIST-SIC 2021: NAIST Simutaneous Interpretation Corpus 2021 https://dsc-nlp.naist.jp/data/NAIST-SIC/2021/
WARRANTY: The corpus is provided WITH NO WARRANTY.
Download
You can download the Simultaneous Interpretation Corpus 2021 (519,700 bytes, Xzipped tar file) upon the agreement of the Terms of Use above.
Related Paper
Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura.Large-Scale English-Japanese Simultaneous Interpretation Corpus: Construction and Analyses with Sentence-Aligned Data.
Proceedings of the 18th International Conference on Spoken Language Translation, pp. 226--235, 2021