The Corpus of Sentences rated with Human Complexity Judgments contains 1,123 Italian sentences and 1,200 English sentences rated by humans with a judgment of complexity. The datasets of sentences used for the task were taken from two different manually revised treebanks: the newspaper section of the Italian Universal Dependency Treebank (IUDT) for the Italian experiment, and the automatically converted Wall Street Journal section of the Penn Treebank for the English experiment.
