Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Checking that works more than three fourths of the time #76

Open
cchan opened this issue Dec 10, 2013 · 4 comments
Open

Checking that works more than three fourths of the time #76

cchan opened this issue Dec 10, 2013 · 4 comments

Comments

@cchan
Copy link

cchan commented Dec 10, 2013

For example, when given a multi-select whose answer is "4 2 5 3 1" it accepts any permutation which is pointless.
And random stuff happens too - carboxyl is very different from carbonyl, but they're only one letter apart.

[what happens to all of those "report answer as wrong"-s anyway?]

@cchan
Copy link
Author

cchan commented Apr 27, 2015

By the way, do you have a dataset of "correct" answers versus user answers? I would love to run a bit of machine learning on it if so, and see if I can get some good accuracy.
Thanks.

@alchzh
Copy link

alchzh commented Apr 27, 2015

Hmm, that might be a bit complicated, and for questions that are very often mistaken that could be a problem. It's a cool idea, but I'm not sure if it is possible.

@NielsKornerup
Copy link

What might work (I don't know if this is implemented) is a system where protested incorrect answers are compared and duplicates are placed somewhere where a human can evaluate whether or not to accept them.

@cchan
Copy link
Author

cchan commented Apr 28, 2015

I think it exists. The protests certainly seem to be submitted when you click on the little "Correct" and "Incorrect" labels, but I have no idea where those actually go. There's also those Edit buttons on the questions themselves. (@polarcuke @antimatter15 ?)

So that protest system with human evaluation would be indeed necessary, but if you add a machine learning system to it (plain supervised learning, presumably) it could gradually get better and better. Or possibly not, since the features are pretty complex. Lol, I'll try.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants