Terms of Use

USAGE: The Simultaneous Interpretation Corpus 2022 may be used for research purposes. Commercial use is prohibited.

REDISTRIBUTION: The interpretation transcripts in the corpus shall not be redistributed. It is permissible, however, to cite examples from the corpus for the purpose of presenting research results.

CITATION: All reports/publications using the Simultaneous Interpretation Corpus 2022 must acknowledge its use. We recommend that this be done through a citation to the paper describing the corpus and a link to the corpus page:

Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura.
Large-Scale English-Japanese Simultaneous Interpretation Corpus: Construction and Analyses with Sentence-Aligned Data
Proceedings of the 18th International Conference on Spoken Language Translation, pp. 226--235, 2021
NAIST-SIC 2022: NAIST Simultaneous Interpretation Corpus 2022
https://dsc-nlp.naist.jp/data/NAIST-SIC/2022/

WARRANTY: The corpus is provided WITH NO WARRANTY.


Download

You can download the Simultaneous Interpretation Corpus 2022 (9,448,288 bytes, xz-compressed tar file) upon agreeing to the Terms of Use above.
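
After downloading, the archive can be checked against the byte count stated above and then unpacked. The following is a minimal Python sketch, not an official tool: the function name `check_and_extract` and the paths in the usage note are hypothetical placeholders, and the real archive filename is not given on this page.

```python
import tarfile
from pathlib import Path

# Size stated on this page for the corpus archive (in bytes).
EXPECTED_SIZE = 9_448_288


def check_and_extract(archive, dest, expected_size=EXPECTED_SIZE):
    """Check the archive's size, unpack it into dest, and return member names.

    Hypothetical helper for illustration; it only compares byte counts and
    does not verify a checksum.
    """
    archive = Path(archive)
    actual = archive.stat().st_size
    if actual != expected_size:
        raise ValueError(
            f"size mismatch: expected {expected_size} bytes, got {actual}"
        )
    # "r:xz" opens an xz-compressed tar file for reading.
    with tarfile.open(archive, mode="r:xz") as tar:
        names = tar.getnames()
        # Use the safer "data" extraction filter where this Python supports it.
        kwargs = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
        tar.extractall(dest, **kwargs)
    return names
```

For example, `check_and_extract("path/to/downloaded-archive.tar.xz", "corpus")` would unpack the archive into a `corpus` directory, assuming the placeholder path is replaced with the location of the downloaded file. A size mismatch usually points to an incomplete download.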


Related Paper

Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura.
Large-Scale English-Japanese Simultaneous Interpretation Corpus: Construction and Analyses with Sentence-Aligned Data.
Proceedings of the 18th International Conference on Spoken Language Translation, pp. 226--235, 2021

Contact

Katsuhito Sudoh, Associate Professor, Nara Institute of Science and Technology

sudoh [0x40] is.naist.jp

