Limitations of this model

Massive amounts of data are required

Wide variation in document length may cause problems

Highly stylized prose and inconsistent spelling can confuse the model

Examples: blogs and literature
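
A rough way to check a corpus against the document-length and spelling limitations listed above, sketched in Python. The function names `length_spread` and `spelling_variants`, the whitespace tokenizer, and the idea of treating case and hyphen/apostrophe differences as "the same word" are all illustrative assumptions, not part of the model described here.

```python
import statistics
from collections import defaultdict


def length_spread(docs):
    """Return (min, max, coefficient of variation) of token counts per document.

    Assumes a non-empty list of strings and naive whitespace tokenization.
    """
    lengths = [len(doc.split()) for doc in docs]
    mean = statistics.mean(lengths)
    # Population standard deviation relative to the mean; 0.0 if all docs are empty.
    cv = statistics.pstdev(lengths) / mean if mean else 0.0
    return min(lengths), max(lengths), cv


def spelling_variants(docs):
    """Group surface forms that differ only by case, hyphens, or apostrophes.

    This is a crude stand-in for real orthographic normalization.
    """
    groups = defaultdict(set)
    for doc in docs:
        for token in doc.split():
            token = token.strip('.,;:!?"()')
            key = token.lower().replace("-", "").replace("'", "")
            if key:
                groups[key].add(token)
    # Keep only keys that appear under more than one surface form.
    return {key: sorted(forms) for key, forms in groups.items() if len(forms) > 1}
```

A high coefficient of variation suggests the corpus mixes very short and very long documents; a large number of variant groups suggests the text may need normalizing before modeling. Both thresholds depend on the corpus and are left to the reader.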