Terms of Use
USAGE: NAIST-SIC-Aligned-ST may be used for research purposes. Commercial use is prohibited.
REDISTRIBUTION: The interpretation transcripts in the corpus shall not be redistributed. It is permissible, however, to cite examples from the corpus for the purpose of presenting research results.
CITATION: All reports/publications using NAIST-SIC-Aligned-ST must acknowledge its use. We recommend that this is done through a citation to the paper describing the corpus and a link to the corpus page:
Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura. Tagged End-to-End Simultaneous Speech Translation Training Using Simultaneous Interpretation Data Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), pp. 363--375, 2023
NAIST-SIC-Aligned-ST: https://dsc-nlp.naist.jp/data/NAIST-SIC/Aligned-ST/
WARRANTY: The corpus is provided WITH NO WARRANTY.
Download
You can download NAIST-SIC-Aligned-ST (4,853,143 bytes, ZIP file) upon the agreement of the Terms of Use above.
Related Paper
Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura.Tagged End-to-End Simultaneous Speech Translation Training Using Simultaneous Interpretation Data.
Proceedings of the 20th International Conference on Spoken Language Translation, pp. 363--375, 2023