11.txt: the original file distributed on Project Gutenberg
alice.txt: the raw text extracted from 11.txt
alice.txt.conll: the text annotated with part-of-speech tags (in CoNLL format)
alice.txt.json: the text annotated with dependency trees (in JSON format)
build_alice.sh: the script to build these files.
See 11.txt for the Project Gutenberg License. The part-of-speech tags and dependency trees were produced by Stanford CoreNLP.
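
A minimal sketch of how alice.txt.conll might be read in Python. It assumes the usual CoNLL conventions: one token per line, tab-separated fields, and a blank line between sentences. The exact column layout of this file is not documented here, so the function (read_conll, a hypothetical name) returns the raw fields of each token without interpreting them.

```python
from typing import Iterable, List


def read_conll(lines: Iterable[str]) -> List[List[List[str]]]:
    """Group CoNLL-style lines into sentences.

    Assumption: tokens are one per line with tab-separated fields,
    and sentences are separated by blank lines. The meaning of each
    column is left to the caller.
    """
    sentences: List[List[List[str]]] = []
    current: List[List[str]] = []
    for line in lines:
        line = line.rstrip("\r\n")
        if not line.strip():
            # A blank line closes the current sentence, if any.
            if current:
                sentences.append(current)
                current = []
            continue
        current.append(line.split("\t"))
    # Flush the last sentence when the input does not end with a blank line.
    if current:
        sentences.append(current)
    return sentences
```

To use it on the file, pass an open file handle, for example read_conll(open("alice.txt.conll", encoding="utf-8")), then inspect the first sentence to see which column holds the part-of-speech tag.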