Commit Graph

430 Commits

Author SHA1 Message Date
a0e4777741 chore: update csv library 2021-03-03 23:35:57 -05:00
3602f54fa1 refactor: break ui code into parts 2021-03-03 23:34:52 -05:00
d78f2f6dd7 fix(trainer): remove auto-translate textvalue artifacts 2021-03-03 20:36:53 -05:00
df604139a5 fix: process pf strings as sestrings 2021-03-03 20:29:24 -05:00
4e90f8f4f4 refactor: use property instead of field 2021-03-03 20:15:11 -05:00
4148df1237 feat(data): add more data 2021-03-03 19:55:54 -05:00
9b1ee168b9 fix(build): provide correct arguments 2021-03-03 19:52:38 -05:00
d040be3285 fix(build): only update model when it actually changes 2021-03-03 19:47:22 -05:00
891d4f5aae feat(data): add more data 2021-03-03 18:05:57 -05:00
c272dc5f85 feat: add some debug data collection buttons 2021-03-03 18:05:43 -05:00
51ccd9ea0f fix(trainer): clean up some warnings 2021-03-02 23:01:16 -05:00
ce15b07636 fix(trainer): accept more invalid input interactively 2021-03-02 22:58:22 -05:00
4d35c0bac6 feat(data): add more data 2021-03-02 21:16:34 -05:00
d9dc0af591 feat(build): update model on every push 2021-03-02 21:16:13 -05:00
81b54c35aa feat: add explainer to test results 2021-03-02 13:25:05 -05:00
1ef5c9f1b5 feat: add community ad filtering
Also add tooltip on filter hover with description.
2021-03-02 13:19:47 -05:00
f3ccead078 fix: actually make multiboxing work 2021-03-02 13:08:42 -05:00
b7da161f75 refactor: always use TextUnformatted 2021-03-02 12:54:07 -05:00
f6ba0106ed fix: make multi-boxing work 2021-03-02 12:53:56 -05:00
090c4eff3c feat(trainer): make test output more obvious 2021-03-02 12:53:19 -05:00
fd256722a1 feat(data): add more 2021-03-02 12:44:14 -05:00
348c610ca1 fix(trainer): use correct path for csv 2021-03-02 12:43:47 -05:00
268c7d5ec4 fix(build): specify latest fedora 2021-03-02 12:39:41 -05:00
2fd2e54401 feat: add automated model creation 2021-03-02 12:38:30 -05:00
6c9dd9164b feat(trainer): run on net5 and accept csv path
Hopefully will use this to automate model deployment.
2021-03-02 04:52:36 -05:00
65558fa199 feat(data): ignore "blu" and add more 2021-02-26 12:07:19 -05:00
919341230f chore(defs): cut out the global section 2021-02-25 19:26:54 -05:00
f0094eed18 chore: bump version to 1.6.0 2021-02-25 18:57:03 -05:00
b811c762e6 refactor: reduce indent level 2021-02-24 21:02:41 -05:00
04ef5e84cc feat: add update button escape hatch 2021-02-24 21:01:15 -05:00
2b07613722 chore: increase priority and remove experimental line 2021-02-24 20:59:16 -05:00
435f0f8e1e feat: add model status information 2021-02-24 20:58:47 -05:00
76717dd200 chore: remove old test suite
As definitions are no longer being updated, the test suite for them
can be removed. The ML trainer already has statistics on how accurate
it is for both training mode and model creation mode, as well as an
interactive mode to test new messages.
2021-02-24 20:26:16 -05:00
33f5421e79 chore: fix modes 2021-02-24 20:23:28 -05:00
8c8b3f1b39 feat: begin removing support for defs
Remove the global section, which filtered Free Company ads and RP
ads. Prevent reporting of messages that were filtered by
definitions. Make the ML mode default and mark definitions mode as
obsolete.
2021-02-24 20:23:28 -05:00
83e6b20333 feat(data): add more 2021-02-24 20:01:41 -05:00
c038adc4e9 fix(trainer): replace newlines automatically 2021-02-24 20:01:35 -05:00
7a0614b4bb refactor: handle events in their own classes
Also deprecate /prmt and replace with /nosol.
2021-02-24 20:01:12 -05:00
819ac1b457 feat(data): add more 2021-02-21 15:50:17 -05:00
2181649b22 feat: add "come" and "join" as stop words 2021-02-21 15:50:05 -05:00
9bf074c25c fix(defs): add restocker rmt 2021-02-21 12:36:05 -05:00
0dc0c2ef00 feat(data): add more data
Also pull out stop words into field.
2021-02-20 19:25:15 -05:00
c3df0a1f8e feat: add normalisation to pipeline
Add a step to normalise messages to the ML pipeline. This ensures
computed properties run on the raw data (which is actually partially
normalised by the compute context). This prevents properties which
rely on symbols (e.g. "B>") from being unable to work properly when
normalisation happens before they have access to the input.
2021-02-17 21:45:09 -05:00
d00b3b0845 feat: better handle puncutation
Certain symbols are turned into one space so the model sees multiple
words instead of one. Previously "[RP]Hi" would turn into "RPHi" and
be its own token. Now it turns into "RP" and "Hi", counting as two
tokens. This change increased the model's accuracy.

Also make "18", "http", "https", and LGBT-related words into stop
words (meaning they're ignored). Each of these stop words made the
model more accurate and reduced unwanted bias.

Messages destined for ML are now normalised by the plugin in the same
way the model's input is for training. This should make the results
come closer to expected.
2021-02-17 20:01:34 -05:00
e8678f3c7d chore: set plugin to a higher priority 2021-02-17 18:58:58 -05:00
fcfe1bb727 feat(data): add more data 2021-02-17 18:58:34 -05:00
450415f6cc feat: ignore private pfs unless told otherwise 2021-02-16 19:41:59 -05:00
acd8f04937 chore: bump version to 1.5.9 2021-02-16 12:18:58 -05:00
f874d8ac37 feat(data): add more 2021-02-16 12:15:06 -05:00
934fe877e2 refactor: use new new syntax 2021-02-16 12:15:00 -05:00