Terms of Use

USAGE: The English-to-Japanese Chunk-wise Monotonic Translation Evaluation Dataset 2024 may be used for research purposes. Commercial use is prohibited.

REDISTRIBUTION: The translation text in the corpus shall not be redistributed. It is permissible, however, to cite examples from the corpus for the purpose of presenting research results.

CITATION: All reports/publications using the English-to-Japanese Chunk-wise Monotonic Translation Evaluation Dataset 2024 must acknowledge its use. We recommend that this is done through a citation to the paper describing the corpus and a link to the corpus page:

福田りょう,土肥康輔,須藤克仁,中村哲.
原発話に忠実な英日同時機械翻訳の実現に向けた順送り訳評価データ作成.
情報処理学会研究報告 第259回自然言語処理研究会,2024.
NAIST-SIC-Aligned-ST: https://dsc-nlp.naist.jp/data/NAIST-SIC/Aligned-ST/

WARRANTY: The corpus is provided WITH NO WARRANTY.


Download

You can download NAIST-SIC-Aligned-Chunk_Mono-EJ (43,485 bytes, ZIP file) upon the agreement of the Terms of Use above.


Related Paper

Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura.
Tagged End-to-End Simultaneous Speech Translation Training Using Simultaneous Interpretation Data.
Proceedings of the 20th International Conference on Spoken Language Translation, pp. 363--375, 2023

Contact

Katsuhito Sudoh, Associate Professor, Nara Institute of Science and Technology

sudoh [0x40] is.naist.jp


Go back to NAIST SIC Home