about summary refs log tree commit diff stats
Commit message (Collapse)AuthorAgeFilesLines
* Fixed final closing delimiters appearing on new lineKelly Rauchenberger2016-06-051-1/+1
|
* Updated libtwitter++ (user stream disconnection bug)Kelly Rauchenberger2016-06-021-0/+0
|
* Updated libtwitter++ (error handling with bad responses)Kelly Rauchenberger2016-06-012-2/+10
|
* Added ability to require a minimum number of corpora in generated outputKelly Rauchenberger2016-05-312-47/+91
| | | | Also fixed a bug with tokenizing multiple corpora.
* Fixed config-example.ymlKelly Rauchenberger2016-05-311-2/+2
|
* Update README.mdKelly Rauchenberger2016-05-311-20/+50
|
* Merge branch 'master' of http://github.com/hatkirby/rawr-ebooksKelly Rauchenberger2016-05-291-0/+1
|\
| * Added #include <algorithm>Kelly Rauchenberger2016-05-261-0/+1
| |
* | Newlines, colons, and semicolons are now valid terminatorsKelly Rauchenberger2016-05-295-27/+72
|/
* Fixed ebooks and gen ignoring the form distributionKelly Rauchenberger2016-05-262-8/+8
|
* Updated libtwitter++ (streaming connection churn)Kelly Rauchenberger2016-05-241-0/+0
|
* Updated libtwitter++ (retweet bug)Kelly Rauchenberger2016-05-221-0/+0
|
* ebooks now prefills replies correctlyKelly Rauchenberger2016-05-222-2/+2
| | | | It also now does not reply to itself.
* Projects using libtwitter++ no longer have to include its dependenciesKelly Rauchenberger2016-05-212-2/+2
|
* Pulled the ebooks functionality out into a libraryKelly Rauchenberger2016-05-208-374/+443
|
* Now ignoring retweets as they aren't really mentionsKelly Rauchenberger2016-05-202-17/+20
|
* Fixed error with replying to tweetsKelly Rauchenberger2016-05-201-0/+1
|
* Fixed case-insensitive thingKelly Rauchenberger2016-05-201-1/+1
|
* Added ability for rawr ebooks to respond to tweetsKelly Rauchenberger2016-05-205-40/+72
| | | | Now using libtwitter++ instead of twitcurl!
* Changed "full sentence mode" to "don't stop believing" modeKelly Rauchenberger2016-03-101-14/+1
|
* Member hiding is funKelly Rauchenberger2016-03-082-3/+3
|
* Full sentences mode!Kelly Rauchenberger2016-03-084-5/+18
|
* rawr-ebooks now tells you how long it will wait before the next tweetKelly Rauchenberger2016-02-281-1/+30
|
* Removed aspell session editingKelly Rauchenberger2016-02-281-4/+0
| | | | This wasn't really necessary since it was completely automated anyway, and it caused crashes for reasons that I haven't looked into with some bad corpuses.
* Added yaml-cpp as a vendor submoduleKelly Rauchenberger2016-02-283-5/+11
|
* Reverted to an older kgram cut rateKelly Rauchenberger2016-02-201-13/+9
|
* Added percentage display to preprocessing stageKelly Rauchenberger2016-02-201-4/+52
|
* Modified kgram cut rate. It's do or die.Kelly Rauchenberger2016-02-171-10/+13
|
* Attemped to fix line-endings for WindowsKelly Rauchenberger2016-02-173-0/+20
|
* Fixed issue when names.txt was not presentKelly Rauchenberger2016-02-151-24/+13
| | | | Also removed any code mentioning $noun$ because it turns out the current version of the canonical corpus doesn't even use it anymore.
* Tweaked kgram cut rate some more (it never ends)Kelly Rauchenberger2016-02-151-1/+1
|
* Tweaked kgram cut rate AGAINKelly Rauchenberger2016-02-141-1/+2
|
* Fixed incorrect diversity of tokens containing the letters aemnouKelly Rauchenberger2016-02-141-1/+1
|
* Tweaked kgram cut rate againKelly Rauchenberger2016-02-141-2/+2
|
* Fixed problem wherein "$name$'s" was considered a form of "name's"Kelly Rauchenberger2016-02-141-8/+6
|
* Fixed issue where queries with both the wildcard token and a terminating ↵Kelly Rauchenberger2016-02-131-14/+5
| | | | token would reset the prefix
* Merge in changes to older kgram cutting strategyKelly Rauchenberger2016-02-091-9/+21
|\
| * Tweaked kgram cut rate againKelly Rauchenberger2016-02-091-4/+8
| |
* | Tweaked the kgram cutting rate againKelly Rauchenberger2016-02-071-1/+1
| |
* | Changed how kgram cutting worksKelly Rauchenberger2016-02-061-17/+9
|/ | | | Whereas cutting occurred randomly before, now a token will be cut from the search kgram whenever the previously generated token was guaranteed by its search kgram (that is, it was the only token that could follow that specific query).
* Merge branch 'master' of http://github.com/hatkirby/rawr-ebooksKelly Rauchenberger2016-02-031-0/+1
|\
| * Added #include <cstring> to kgramstatsKelly Rauchenberger2016-02-031-0/+1
| |
* | Added some more emoticonsKelly Rauchenberger2016-02-031-1/+5
| |
* | Declared old-style $name$ and $noun$ canonicalKelly Rauchenberger2016-02-031-0/+6
|/ | | | Without this, they get mixed in by the spell checker with "name" and "noun."
* Token generator now uses aspell to link different spellings of a wordKelly Rauchenberger2016-02-032-4/+58
| | | | This is the grand scheme for the multi-formed word design.
* Terminator characters in the middle of tokens are no longer strippedKelly Rauchenberger2016-02-032-13/+25
| | | | Emoticon checking is also now case sensitive, and a few more emoticons were added to the list.
* Fixed issue where closing opened delimiters wouldn't pop them off the stackKelly Rauchenberger2016-02-011-0/+2
| | | | This would cause a random quotation mark, for instance, to appear at the end of a tweet if a quote had been opened and closed naturally within the tweet.
* Switched to pkg-config for finding libcurlKelly Rauchenberger2016-02-011-5/+5
|
* Added emoji freevarKelly Rauchenberger2016-02-0113-65/+1064
| | | | Strings of emojis are tokenized separately from anything else, and added to an emoticon freevar, which is mixed in with regular emoticons like :P. This breaks old-style freevars like $name$ and $noun$ so some legacy support for compatibility is left in but eventually $name$ should be made into an actual new freevar. Emoji data is from gemoji (https://github.com/github/gemoji).
* Rewrote how tokens are handledKelly Rauchenberger2016-01-298-266/+406
| | | | | | A 'word' is now an object that contains a distribution of forms that word can take. For now, most word just contain one form, the canonical one. The only special use is currently hashtags. Malapropisms have been disabled because of compatibility issues and because an upcoming feature is planned to replace it.