Resources

- Corpora
  1. BiPaR: A bilingual MRC dataset on novels [Jing et al. 2019].
  2. Dataset for Shallow Discourse Annotation for Chinese TED Talks.
  3. A Test Suite for Evaluating Discourse Phenomena in Document-level Neural Machine Translation.
  4. RiSAWOZ: A Large-Scale Multi-Domain Wizard-of-Oz Dataset with Rich Semantic Annotations for Task-Oriented Dialogue Modeling.
  5. TED-CDB: A Large-Scale Chinese Discourse Relation Dataset on TED Talks.
  6. Chinese WPLC: A Chinese Dataset for Evaluating Pretrained Language Models on Word Prediction Given Long-Range Context.

- Code