Commit Graph

5 Commits

Author SHA1 Message Date
Anna 815ba1c1ad fix: no longer consider empty custom filters 2021-04-25 15:19:42 -04:00
Anna d00b3b0845 feat: better handle puncutation
Certain symbols are turned into one space so the model sees multiple
words instead of one. Previously "[RP]Hi" would turn into "RPHi" and
be its own token. Now it turns into "RP" and "Hi", counting as two
tokens. This change increased the model's accuracy.

Also make "18", "http", "https", and LGBT-related words into stop
words (meaning they're ignored). Each of these stop words made the
model more accurate and reduced unwanted bias.

Messages destined for ML are now normalised by the plugin in the same
way the model's input is for training. This should make the results
come closer to expected.
2021-02-17 20:01:34 -05:00
Anna 536431bf26 refactor: clean up slightly 2021-01-02 08:23:41 -05:00
Anna 76462ff628 feat: add machine learning mode 2020-12-22 20:23:54 -05:00
Anna 9ad9d89951 feat: add option to filter unjoinable ilvl PFs 2020-08-23 10:59:48 -04:00