Allow participants to prioritize statements #217

colinmegill · 2019-07-05T06:31:02Z

The issue of prioritization of statements in Polis has long been considered but never implemented. This is a thread to discuss a possible implementation.

Goals:

Prioritization would feed into group-informed consensus, de-prioritizing "the sky is blue" statements (all groups agree, but the statement is insignificant)
Prioritization would feed into the order in which statements are shown to users
Prioritization would surface how different groups value different issues

Proposal:

A checkbox on each statement which the user (participant voting) would need to check before agreeing, disagreeing or passing
Checkbox allows users to flag statements as high priority
We assume there is no need for low priority as the interface allows a pass
This functionality would be disable-able in admin config

[ ] this issue is high priority to me

Prioritization matrix

This creates a second comments × participants matrix, in addition to the votes matrix. Each entry in this prioritization matrix is, initially, a 1 or a 0 indicating whether the given user prioritized the given comment.
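
A minimal sketch of how the two matrices could be built side by side. The record layout, the `VoteRecord` name, and the sign convention (1 = agree, -1 = disagree, 0 = pass) are assumptions for illustration, not the Polis schema; rows are participants and columns are statements, so that "each user's row" can be scaled later.

```python
from typing import Iterable, List, NamedTuple, Tuple


class VoteRecord(NamedTuple):
    """Hypothetical record of one vote; not the actual Polis data model."""
    participant: int      # row index
    statement: int        # column index
    vote: int             # assumed convention: 1 agree, -1 disagree, 0 pass
    high_priority: bool   # True if the checkbox was ticked


def build_matrices(
    records: Iterable[VoteRecord], n_participants: int, n_statements: int
) -> Tuple[List[List[int]], List[List[int]]]:
    """Return (votes, priority), both n_participants x n_statements."""
    votes = [[0] * n_statements for _ in range(n_participants)]
    priority = [[0] * n_statements for _ in range(n_participants)]
    for r in records:
        votes[r.participant][r.statement] = r.vote
        priority[r.participant][r.statement] = 1 if r.high_priority else 0
    return votes, priority
```

Unseen statements and passes both show up as 0 in this sketch; a real implementation would probably need to tell them apart.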

Upon analysis, however, we might consider total statements, total votes, and total uses of prioritization as a means of scaling each user's row in the prioritization matrix. We would then apply penalties for overuse, based on the ratios between these three values, using some equation. The resulting weight should be able to approach 1 (that is, abs(-1)) but never go below it: the values in the votes matrix are 1 and -1, and priority should not be able to reduce their weight, only increase it (though we might consider whether to factor passes in to decrease the weight). We might consider:

Naive:

priority_ij = arbitraryScalingConstant / totalUsesOfPrioritization + 1

Perhaps better:

priority_ij = (totalParticipantVotes / totalStatements) * (arbitraryScalingConstant / totalUsesOfPrioritization) + 1

let arbitraryScalingConstant = 100
let totalUsesOfPrioritization = 2

(600 votes / 600 statements) *  (100 / 2 prioritizations) + 1 = 51
(300 votes / 600 statements) * (100 / 2 prioritizations) + 1 = 26
(10 votes / 600 statements) * (100 / 2 prioritizations) + 1 = .0166 * 50 + 1 = 1.83

let totalUsesOfPrioritization = 150

(300 votes / 600 statements) * (100 / 150 prioritizations) + 1 = .5 * .666 + 1 = 1.333
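
The two formulas above can be sketched as plain functions to check the worked numbers. The function names and the choice to return 1.0 (no boost) for a participant who never used prioritization are assumptions; the default constant of 100 comes from the example.

```python
def naive_priority(total_uses: int, scaling_constant: float = 100.0) -> float:
    """Naive weight: scaling_constant / total_uses + 1."""
    if total_uses <= 0:
        return 1.0  # assumption: never prioritized means no boost
    return scaling_constant / total_uses + 1.0


def scaled_priority(
    total_votes: int,
    total_statements: int,
    total_uses: int,
    scaling_constant: float = 100.0,
) -> float:
    """'Perhaps better' weight: participation ratio times the naive term, plus 1."""
    if total_statements <= 0:
        raise ValueError("total_statements must be positive")
    if total_uses <= 0:
        return 1.0  # assumption: never prioritized means no boost
    participation = total_votes / total_statements
    return participation * (scaling_constant / total_uses) + 1.0


if __name__ == "__main__":
    # Same inputs as the worked examples above.
    for votes, uses in [(600, 2), (300, 2), (10, 2), (300, 150)]:
        print(votes, uses, round(scaled_priority(votes, 600, uses), 3))
```

Because both factors are non-negative, the result never drops below 1, which matches the constraint that priority should only increase a vote's weight.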

Possible improvements:

Heavily penalize certain totalParticipantVotes to totalStatements ratios, such as when a user has voted 3 times and there are 1000 statements
Similarly, don't heavily reward extremely high vote counts, i.e., voting on half the comments should probably be about as good as voting on all of them
The difference between the 559th and 560th vote should not be nearly as important as the difference between the 3rd and 4th vote
If totalUsesOfPrioritization is very low relative to totalParticipantVotes, we might also find a way to upweight that, but perhaps we're already effectively doing that by watering down other cases
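
One possible way to get the first three properties is to replace the raw votes/statements ratio with a saturating curve. This is only a sketch: the exponential form, the `steepness` value of 6, and the function names are assumptions chosen for illustration, not something decided in this thread.

```python
import math


def saturating_participation(
    total_votes: int, total_statements: int, steepness: float = 6.0
) -> float:
    """Map the participation ratio to [0, 1) with diminishing returns.

    1 - exp(-steepness * ratio) is concave, so each extra vote adds less
    than the previous one, and very low ratios stay close to 0.
    """
    if total_statements <= 0:
        raise ValueError("total_statements must be positive")
    ratio = min(total_votes / total_statements, 1.0)
    return 1.0 - math.exp(-steepness * ratio)


def damped_priority(
    total_votes: int,
    total_statements: int,
    total_uses: int,
    scaling_constant: float = 100.0,
    steepness: float = 6.0,
) -> float:
    """Same shape as the 'perhaps better' formula, with the damped ratio."""
    if total_uses <= 0:
        return 1.0  # assumption: never prioritized means no boost
    damped = saturating_participation(total_votes, total_statements, steepness)
    return damped * (scaling_constant / total_uses) + 1.0
```

With these assumed settings, voting on half of 1000 statements scores about 0.95 and voting on all of them about 0.998, while 3 votes out of 1000 scores under 0.02. The fourth point (upweighting sparing use of prioritization) is not addressed here.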

Once final values are computed, the prioritization matrix would then be used to multiply the votes matrix, to surface the statements that various opinion clusters found significant.
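
A rough sketch of that last step, reading "multiply" as an element-wise product (both matrices have the same shape). The per-participant weights, the rule that unflagged entries keep weight 1, and the per-group mean used as a score are all assumptions for illustration.

```python
from typing import List, Sequence


def weight_votes(
    votes: Sequence[Sequence[int]],
    flags: Sequence[Sequence[int]],
    row_weights: Sequence[float],
) -> List[List[float]]:
    """Element-wise weighting: flagged votes get the participant's weight,
    unflagged votes keep weight 1 so priority never shrinks a vote."""
    weighted = []
    for vote_row, flag_row, w in zip(votes, flags, row_weights):
        weighted.append([v * (w if f else 1.0) for v, f in zip(vote_row, flag_row)])
    return weighted


def group_statement_scores(
    weighted: Sequence[Sequence[float]], members: Sequence[int]
) -> List[float]:
    """Mean weighted vote per statement over one opinion group's members."""
    n_statements = len(weighted[0])
    return [
        sum(weighted[i][j] for i in members) / len(members)
        for j in range(n_statements)
    ]
```

Comparing these per-group scores across groups is one way the "different groups value different issues" goal could be explored.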

Special thanks to @DZNarayanan and @misscs


patcon · 2020-05-03T16:30:28Z

Also, does this dovetail with crowd moderation #120?

colinmegill · 2020-05-03T17:59:26Z

This will be handled in 'importance', which is specified in another issue. I will fetch the id...


colinmegill · 2020-05-09T05:57:21Z

#133 is closed.
#120 is also an idea we've abandoned, in favor of comment routing, to prevent groups from punishing each other's ideas.

AleksandarPetrov · 2021-02-16T19:54:00Z

Another idea for priority calculation would be to scale depending on whether a person uses more or less prioritization than other people, e.g.

priority_ij = (totalParticipantVotes / totalStatements) * (averageUsesOfPrioritizationAmongAllUsers / totalUsesOfPrioritization) + 1
(variations are possible)
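
As a quick sketch of this variant: the fixed constant is replaced by the average number of prioritizations across participants, so someone who flags less often than average gets a larger boost. The function names and the 1.0 fallback for participants with no prioritizations are assumptions.

```python
from typing import Sequence


def average_uses(uses_per_participant: Sequence[int]) -> float:
    """Mean number of prioritizations across all participants."""
    if not uses_per_participant:
        raise ValueError("need at least one participant")
    return sum(uses_per_participant) / len(uses_per_participant)


def relative_priority(
    total_votes: int,
    total_statements: int,
    total_uses: int,
    avg_uses: float,
) -> float:
    """Weight scaled by how sparingly this participant prioritizes
    compared with the average participant."""
    if total_statements <= 0:
        raise ValueError("total_statements must be positive")
    if total_uses <= 0:
        return 1.0  # assumption: never prioritized means no boost
    return (total_votes / total_statements) * (avg_uses / total_uses) + 1.0
```

For example, with an average of 4 uses, a participant who voted on half the statements gets 2.0 after 2 uses and 1.25 after 8 uses.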

jucor · 2021-03-03T15:14:07Z

Delicious idea. I think we might be getting to a point where it might make sense to explicitly formulate a latent variable model, which could capture each individual's personal use of prioritization and take it into account. Thinking along the lines of Blei 2014.

These latent variable models range from low-rank matrix factorization (recommender-system style, a-la Netflix/Spotify, with a single matrix), to more elaborate membership models.

The added advantage is that it lends itself nicely to adding the clustering as part of the latent variables.

And because it's Bayesian, there should be a way to do some online update of the posterior without having to recompute everything each time there's a new vote.
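
To illustrate what an online posterior update looks like in the simplest case, here is a conjugate Beta-Bernoulli sketch for one participant's tendency to flag statements. This is a stand-in chosen for illustration, not the latent variable model being proposed; the class name and the uniform Beta(1, 1) prior are assumptions.

```python
from dataclasses import dataclass


@dataclass
class PrioritizationRate:
    """Beta posterior over how often one participant ticks the checkbox."""
    alpha: float = 1.0  # prior pseudo-count of flagged votes
    beta: float = 1.0   # prior pseudo-count of unflagged votes

    def update(self, flagged: bool) -> None:
        """Fold in a single new vote without revisiting earlier ones."""
        if flagged:
            self.alpha += 1.0
        else:
            self.beta += 1.0

    @property
    def mean(self) -> float:
        """Posterior mean of the participant's flagging rate."""
        return self.alpha / (self.alpha + self.beta)
```

Each new vote only changes two counters, which is the property that makes per-vote updates cheap; a richer model would need an approximate scheme to get the same effect.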

It's a chunk of work to formalize it, but I'd be game to try. It should surface "naturally" the kind of formulas mentioned in the comments above, from the form of the posterior distributions corresponding to whichever structure we bake into the model.

ThenWho · 2021-03-03T19:41:21Z

Do you need help @jucor ? Not my area but always good to look a bit further.

I thought I mentioned it but apparently not here: in tensors, priority would be just another slice, next to the agree/disagree/pass slices. It's just another type of information really 🙂 , orthogonal to normal voting but combines nicely with everything.

As I said, not my area, but I wouldn't be surprised if it ended up being very similar. So excited you're going to look into this! 😊

jucor · 2021-03-03T21:33:47Z

Thanks Giorgios! Always happy to geek out with others :) Agreed with you, there should be deep connections between both approaches. I need to put pen to paper to formalise a bit -- this week is the conference on Fairness, Accountability, and Transparency, so I don't have 100% of my time available, but it keeps buzzing in my mind :) Let me get back to you once I have the basic equations written down in LaTeX, then we can iterate.

ThenWho · 2021-03-04T12:45:55Z

No worries 🙂 If you plan to work in the open, feel free to drop some URL so that me and others can follow up. Otherwise let's sync down the road! 🙌

jucor · 2021-03-04T12:49:51Z

Happy to work in the open for a good part :) The more eyes, the shallower the problems!

I would normally use Overleaf and LaTeX, but I'm not sure how good they are for open viewing, nor how they match what's used by the pol.is community. Any tool suggestions? Does pol.is have a preferred one that supports equations? @colinmegill ?

colinmegill · 2021-11-21T01:56:53Z

@jucor I finally have an answer here :) DeepNote is really nice.

jucor · 2021-11-21T21:58:48Z

Oh you do?? Answer as in:

method to compute importance, or
user-based definition of importance, or
"Does pol.is have a preferred one that supports equations? " -> yes and it's Deepnote :)

* incorporate changes from https://github.com/chena11356/polis/tree/implement-comment-prioritization-checkbox credit to https://github.com/chena11356 addresses #217
* include high_priority in vote posts to server
* rename "priority_type" to "importance_enabled"
* use actual quotes since HTML escapes are not being respected
* Editing the importance/significance label and help text, moving it up above the vote buttons
* lint fix
* update migration filename

Co-authored-by: Hadjar Homaei <hadjar@gmail.com>

colinmegill self-assigned this Jul 5, 2019

patcon transferred this issue from pol-is/polis-issues May 10, 2020

patcon changed the title ~~Allow users to prioritize statements~~ Allow participants to prioritize statements May 23, 2020

colinmegill mentioned this issue Jun 17, 2020

Bring back 'starring' a comment as checkbox for #217 #342

Open

colinmegill mentioned this issue Jun 25, 2020

Crowdmod minimum # people to show to validate decision to remove #367

Open

patcon added 🔩 p:math 🔩 p:client-participation labels Jul 4, 2020

patcon added the 🎉 enhancement label Jul 30, 2020

compdemocracy deleted a comment from crkrenn Oct 23, 2020

ThenWho mentioned this issue Dec 12, 2020

Take non-returning (stale) participants into account #211

Open

metasoarous mentioned this issue Dec 16, 2020

Add ability to skip routing and direct link to specific statement for voting #696

Closed

colinmegill mentioned this issue Mar 20, 2021

Automatic text summarization of results #915

Open

compdemocracy deleted a comment from patcon Apr 27, 2023

ballPointPenguin mentioned this issue May 3, 2023

Enable priority checkbox for admin and participant #1562

Closed

ballPointPenguin added a commit that referenced this issue May 12, 2023

incorporate changes from https://github.com/chena11356/polis/tree/imp…

4f65ec9

…lement-comment-prioritization-checkbox credit to https://github.com/chena11356 addresses #217

ballPointPenguin mentioned this issue May 12, 2023

Enable importance checkbox for admin and participant #1682

Merged

ballPointPenguin added a commit that referenced this issue May 12, 2023

incorporate changes from https://github.com/chena11356/polis/tree/imp…

e1fcb8e

…lement-comment-prioritization-checkbox credit to https://github.com/chena11356 addresses #217

colinmegill mentioned this issue Oct 18, 2023

Polis v10 (that's binary I suppose) #1725

Open

ballPointPenguin added a commit that referenced this issue Nov 15, 2024

incorporate changes from https://github.com/chena11356/polis/tree/imp…

7e77265

…lement-comment-prioritization-checkbox credit to https://github.com/chena11356 addresses #217

ballPointPenguin added a commit that referenced this issue Nov 23, 2024

incorporate changes from https://github.com/chena11356/polis/tree/imp…

4a85a59

…lement-comment-prioritization-checkbox credit to https://github.com/chena11356 addresses #217

ballPointPenguin added a commit that referenced this issue Dec 4, 2024

incorporate changes from https://github.com/chena11356/polis/tree/imp…

5034077

…lement-comment-prioritization-checkbox credit to https://github.com/chena11356 addresses #217
