Terms of Use
USAGE: The Simultaneous Interpretation Corpus 2022 may be used for research purposes. Commercial use is prohibited.
REDISTRIBUTION: The interpretation transcripts in the corpus shall not be redistributed. It is permissible, however, to cite examples from the corpus for the purpose of presenting research results.
CITATION: All reports/publications using the Simultaneous Interpretation Corpus 2022 must acknowledge its use. We recommend that this is done through a citation to the paper describing the corpus and a link to the corpus page:
Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura. Large-Scale English-Japanese Simultaneous Interpretation Corpus: Construction and Analyses with Sentence-Aligned Data Proceedings of the 18th International Conference on Spoken Language Translation, pp. 226--235, 2021
NAIST-SIC 2022: NAIST Simutaneous Interpretation Corpus 2022 https://dsc-nlp.naist.jp/data/NAIST-SIC/2022/
WARRANTY: The corpus is provided WITH NO WARRANTY.
Download
You can download the Simultaneous Interpretation Corpus 2022 (9,448,288 bytes, Xzipped tar file) upon the agreement of the Terms of Use above.
Related Paper
Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura.Large-Scale English-Japanese Simultaneous Interpretation Corpus: Construction and Analyses with Sentence-Aligned Data.
Proceedings of the 18th International Conference on Spoken Language Translation, pp. 226--235, 2021