rawr-ebooks - Generates nonsense statistically similar to an input corpus

	Commit message (Collapse)	Author	Age	Files	Lines
...
\| * \|	Stripped empty tokens from corpus	Feffernoose	2013-10-06	1	-2/+8
\| \| \|
* \| \|	Split rawr-ebooks and rawr-gen	Feffernoose	2013-10-06	4	-7/+94
\| \|/ \|/\| \| \| \| \|	Also wrote README
* \|	Program no longer recalculates kgramstats repeatedly within each run	Feffernoose	2013-10-05	1	-11/+11
\|/
*	Rewrote weighted random number generator	Feffernoose	2013-10-05	2	-34/+39
\| \| \| \| \| \|	The previous method of picking which token was the next one was flawed in some mysterious way that ended up picking various words that occurred only once in the input corpus as the first word of the generated output (most notably, "hysterically," "Anarchy," "Yorkshire," and "impunity.").
*	Changed incidence of random kgram-trimming	Feffernoose	2013-10-04	1	-4/+10
\| \| \| \|	Also added better terminal output
*	Weighed token casing and presence of periods	Feffernoose	2013-10-01	2	-28/+76
\| \| \| \| \| \| \| \|	Tokens which differ only by casing or the presence of an ending period are now considered the same token. When tokens are generated, they are cased based on the prevalence of Upper/Title/Lower casing of the token in the input corpus, and similarly, a period is added to the end of the word based on how often the same token was ended with a period in the input corpus.
*	Wrote program	Feffernoose	2013-10-01	6	-1/+336
\|
*	Started automake stuff	Feffernoose	2013-09-30	4	-0/+38
\|
*	Initial commit	Kelly Rauchenberger	2013-09-30	3	-0/+356