NAIST-SIC: NAIST Simultaneous Interpretation Corpus

This corpus is collection of an English-Japanese/Japanese-English simultaneous interpretation corpus.


There are several variations:
  • NAIST Simultaneous Interpretation Corpus (NAIST-SIC) 2014 (formerly distributed as Simultaneous Translation Corpus (STC))
  • NAIST Simultaneous Interpretation Corpus (NAIST-SIC) 2021
  • NAIST Simultaneous Interpretation Corpus (NAIST-SIC) 2022
  • NAIST-SIC-Aligned
  • NAIST-SIC-Aligned-ST
  • NAIST English-to-Japanese Chunk-wise Monotonic Translation Evaluation Dataset 2024