Rewrote weighted random number generator

The previous method of picking which token was the next one was flawed in some mysterious way that ended up picking various words that occurred only once in the input corpus as the first word of the generated output (most notably, "hysterically," "Anarchy," "Yorkshire," and "impunity.").
author: Feffernoose <fefferburbia@gmail.com> 2013-10-05 19:14:53 -0400
committer: Feffernoose <fefferburbia@gmail.com> 2013-10-05 19:14:53 -0400
commit: eb076ca2c6c8932fd251419563cf0078c5ee0914 (patch)
tree: bcd96acd0613fafa27b847cc5937420755b3d748 /kgramstats.h
parent: 92a4a0e7db8336f8ccc11c053dc29847a303ad88 (diff)
download: rawr-ebooks-eb076ca2c6c8932fd251419563cf0078c5ee0914.tar.gz
rawr-ebooks-eb076ca2c6c8932fd251419563cf0078c5ee0914.tar.bz2
rawr-ebooks-eb076ca2c6c8932fd251419563cf0078c5ee0914.zip
1 files changed, 2 insertions, 1 deletions
diff --git a/kgramstats.h b/kgramstats.h
index 248b193..b40e1ab 100644
--- a/kgramstats.h
+++ b/kgramstats.h

@@ -23,9 +23,10 @@ private:
                int titlecase;
                int uppercase;
                int period;
+                string* token;
        } token_data;
        int maxK;
-        map<kgram, map<string, token_data*>* >* stats;
+        map<kgram, map<int, token_data*>* >* stats;
 };
 void printKgram(kgram k);
author	Feffernoose <fefferburbia@gmail.com>	2013-10-05 19:14:53 -0400
committer	Feffernoose <fefferburbia@gmail.com>	2013-10-05 19:14:53 -0400
commit	eb076ca2c6c8932fd251419563cf0078c5ee0914 (patch)
tree	bcd96acd0613fafa27b847cc5937420755b3d748 /kgramstats.h
parent	92a4a0e7db8336f8ccc11c053dc29847a303ad88 (diff)
download	rawr-ebooks-eb076ca2c6c8932fd251419563cf0078c5ee0914.tar.gz rawr-ebooks-eb076ca2c6c8932fd251419563cf0078c5ee0914.tar.bz2 rawr-ebooks-eb076ca2c6c8932fd251419563cf0078c5ee0914.zip