A dataset containing 5 of H.C. Andersen's fairy tales, translated into English. The UTF-8 plain text was sourced from http://www.andersenstories.com/.
fairy_tales
A character vector with 5 elements.
This dataset is not representative of the amount of text needed to generate good word vectors. It is included only for use in examples.