about summary refs log tree commit diff stats
path: root/kgramstats.cpp
Commit message (Collapse)AuthorAgeFilesLines
* guess what! the algorithmKelly Rauchenberger2015-12-301-31/+56
| | | | | | | this time it's a literal algorithm again not canonizing away punctuation newlines are actually considered new sentences now we look for the end of a sentence and then start after that
* You guessed it,,, twerked the algoKelly Rauchenberger2015-11-231-44/+41
|
* Added malapropismsKelly Rauchenberger2015-11-221-68/+93
|
* I may have made things better. I may have made things worse.Kelly Rauchenberger2015-11-221-5/+5
|
* Added some newline recognitionKelly Rauchenberger2015-07-241-31/+55
|
* Took into account question marks and exclamation marksKelly Rauchenberger2015-07-191-2/+2
|
* Stopped using C++11 because yamlcpp didn't like itKelly Rauchenberger2015-07-191-3/+6
|
* Kerjiggered the algorithmsKelly Rauchenberger2015-07-191-21/+166
|
* Modified kgram shortening rateKelly Rauchenberger2014-04-221-1/+1
|
* Stripped empty tokens from corpusFeffernoose2013-10-061-2/+8
|
* Rewrote weighted random number generatorFeffernoose2013-10-051-33/+37
| | | | | | The previous method of picking which token was the next one was flawed in some mysterious way that ended up picking various words that occurred only once in the input corpus as the first word of the generated output (most notably, "hysterically," "Anarchy," "Yorkshire," and "impunity.").
* Changed incidence of random kgram-trimmingFeffernoose2013-10-041-4/+10
| | | | Also added better terminal output
* Weighed token casing and presence of periodsFeffernoose2013-10-011-25/+67
| | | | | | | | Tokens which differ only by casing or the presence of an ending period are now considered the same token. When tokens are generated, they are cased based on the prevalence of Upper/Title/Lower casing of the token in the input corpus, and similarly, a period is added to the end of the word based on how often the same token was ended with a period in the input corpus.
* Wrote programFeffernoose2013-10-011-0/+110