Corpus of Sentences rated with Human Complexity Judgments

The Corpus of Sentences rated with Human Complexity Judgments contains 1,123 Italian sentences and 1,200 English sentences rated by humans with a judgment of complexity. The datasets of sentences used for the task were taken from two different manually revised treebanks: the newspaper section of the Italian Universal Dependency Treebank (IUDT) for the Italian experiment, and the automatically converted Wall Street Journal section of the Penn Treebank for the English experiment.

More info: Corpus of Sentences rated with Human Complexity Judgments