After the event, we conducted another review of actual event contributions, including both scheduled and non-scheduled contributions (e.g., food and drink, production and planning), this time using the full WISDOM v2 model so that we could generate relative valuations of all contributions. This second review round is the subject of the report below.
Methods & Results
Materials
- Post-Tiny OHM Survey TEMPLATE (open access)
- Post-Tiny OHM Survey Backend (restricted)
- Post-Tiny OHM Results
Participants
16 members of the cOHMunity self-nominated as reviewers in response to social media posts and personal messages requesting reviewers.
Procedure
Methods will be presented in chronological order using the main stages of WISDOM (see here for more details).
0. Embed Values. The research team nominated 4 questions (dimensions) to ask reviewers, reflecting OHM values:
- “Which contribution are you the most grateful for?”
- “Which contribution was the most unique?”
- “Which offering best supported the OHM vision and mission?”
- “Which offering best supported our principle of Diversity and Inclusion?”
1. Record. Tiny OHM contributions were recorded in our transparent workspace, from which the research team selected a diverse sample of contributions to undergo review. 30 contributions were selected, including 27 event contributions (e.g., performances, volunteering) and 3 review contributions (completion of 1, 4 & 16 surveys, respectively).
2. Review. Reviewers voted between pairs of contributions on each dimension of interest. There were 35 surveys in total: 26 regular surveys containing only event contributions, and 9 meta-review surveys containing event and review contributions. Each survey contained 25 random pairs and was assigned to a single reviewer. Pairings were fully balanced, with every pair combination represented twice (‘A vs B’, and ‘B vs A’).
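For illustration, here is a minimal sketch of how such a fully balanced pairing scheme could be generated and chunked into 25-pair surveys. The function names and the shuffle-and-chunk approach are our own assumptions, not a description of the actual survey backend:

```python
import itertools
import random

def balanced_pairs(contributions):
    """Every ordered pair exactly once, so each unordered pair
    appears twice overall ('A vs B' and 'B vs A')."""
    return list(itertools.permutations(contributions, 2))

def build_surveys(contributions, pairs_per_survey=25, seed=0):
    """Shuffle the balanced pairs and chunk them into fixed-size surveys."""
    pairs = balanced_pairs(contributions)
    random.Random(seed).shuffle(pairs)
    return [pairs[i:i + pairs_per_survey]
            for i in range(0, len(pairs), pairs_per_survey)]
```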
3. Recognise. Pairwise comparison votes were converted into multidimensional ratings via the following process (see the code sketch after this list):
- Votes tallied for each contribution & dimension
- Reviewers awarded 1 Gratitude Unit per pairwise comparison (1 survey containing 25 pairs = 25 Gratitude Units)
- Meta-review votes for review contributions used to fit a linear function
- Values for all non-review contributions interpolated from function
- Dimension ratings reviewed by meta-reviewers, who then assigned weights to each dimension (note that flaws were detected in all but the gratitude dimension)
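To make the baselining step concrete, here is a hedged sketch of the linear fit and interpolation. Only the anchor values (1, 4 & 16 completed surveys at 25 Gratitude Units per survey) come from the procedure above; the vote tallies are invented purely for illustration:

```python
import numpy as np

# Review contributions serve as anchors with known values:
# 1, 4 & 16 completed surveys x 25 Gratitude Units per survey.
anchor_values = np.array([25.0, 100.0, 400.0])
# Meta-review vote tallies for those anchors (illustrative numbers only).
anchor_votes = np.array([3.0, 9.0, 20.0])

# Fit a linear function value = a * votes + b through the anchors.
a, b = np.polyfit(anchor_votes, anchor_values, deg=1)

def interpolated_value(votes):
    """Map a non-review contribution's vote tally onto the anchor scale."""
    return a * votes + b
```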
4. Reward. OHMnoms were calculated as an average across weighted dimension ratings and summed for each contributor. An average ‘cost’ per attendee was calculated and subtracted from each contributor’s score, serving as a proof-of-concept for an OHMnom-neutral event. Other costs were also subtracted from balances, including payments to organisers under the OHMniversal Basic Income scheme (3 recipients, 1 shown in results) and payments to headline acts.
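A minimal sketch of the two calculations described above, assuming one meta-reviewer weight per dimension and a single shared cost pool (function names are hypothetical; the spreadsheet logic may differ):

```python
import numpy as np

def ohmnoms(dimension_ratings, weights):
    """OHMnoms for one contribution: the average across its
    weighted dimension ratings."""
    return float(np.mean(np.asarray(dimension_ratings) * np.asarray(weights)))

def net_balance(summed_ohmnoms, total_cost, n_attendees):
    """Subtract the average 'cost' per attendee from a contributor's
    summed OHMnoms (proof-of-concept for an OHMnom-neutral event)."""
    return summed_ohmnoms - total_cost / n_attendees
```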
5. Respect. Experts and reliable reviewers were identified and acknowledged with numerical indicators. Expertise was calculated as the percentage of total OHMnoms in the category of interest, serving as a proof-of-concept for contributor reputations. Reviewer reliability was calculated across several metrics (sketched in code after this list):
- Intra-rater reliability = % test-retest (within-subjects)
- Inter-rater reliability = % test-retest (between-subjects)
- Self-bias = (% votes by self for own contribution) / (% votes by others)

Note that in the future, reviewer rewards might be moderated according to reliability, incentivising high-quality, honest reviews.
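For illustration, here is a sketch of how these reliability metrics might be computed from raw votes. We assume each vote is stored as a (reviewer, pair, winner) record, where the pair is an unordered set of two contributions; this representation is our own, not the backend’s:

```python
from collections import defaultdict

def rater_reliability(votes):
    """votes: list of (reviewer, pair, winner), where pair is a frozenset
    of two contributions and winner is the contribution voted for.
    Returns (intra, inter): % agreement on repeated pairs judged by the
    same reviewer vs. by different reviewers."""
    by_pair = defaultdict(list)
    for reviewer, pair, winner in votes:
        by_pair[pair].append((reviewer, winner))
    intra, inter = [], []
    for outcomes in by_pair.values():
        for i in range(len(outcomes)):
            for j in range(i + 1, len(outcomes)):
                (r1, w1), (r2, w2) = outcomes[i], outcomes[j]
                (intra if r1 == r2 else inter).append(w1 == w2)

    def pct(agreements):
        return 100 * sum(agreements) / len(agreements) if agreements else None

    return pct(intra), pct(inter)

def self_bias(votes, reviewer, own_contribution):
    """Rate at which `reviewer` voted for their own contribution,
    divided by the rate at which other reviewers voted for it."""
    def win_rate(by_self):
        wins = total = 0
        for r, pair, winner in votes:
            if own_contribution in pair and (r == reviewer) == by_self:
                total += 1
                wins += (winner == own_contribution)
        return wins / total if total else None

    own_rate, others_rate = win_rate(True), win_rate(False)
    if own_rate is None or not others_rate:
        return None
    return own_rate / others_rate
```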
Learnings
- Inclusive. The survey was rated as moderately easy (28%) or very easy (48%), and as comfortable (46%) or neutral (50%), suggesting that the new pairwise comparison format was user-friendly and inclusive.
- Time-efficient. A single pairwise comparison took less than one minute to complete on average, meaning that reviewers can contribute a datapoint in very little time.
- Gratitude is a stable baseline dimension. We also tested Uniqueness, Diversity and Inclusion, and alignment with the OHM Vision, but each of these produced skewed results after baselining, so they were instead baselined against the Gratitude dimension.
Next steps
- Algorithm upgrades (Elo ratings, priors, filter by exposure; see the sketch after this list)
- Compare pre-vs post-event (‘super predictors’)
- Present at AIMOS conference
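As a rough sketch of what the planned Elo upgrade could look like, below is the standard Elo update rule applied to a single pairwise vote. This is not yet part of WISDOM, and the k-factor shown is an illustrative default:

```python
def elo_update(rating_a, rating_b, a_won, k=32):
    """Standard Elo update for one pairwise comparison; returns the
    new ratings for contributions A and B."""
    expected_a = 1 / (1 + 10 ** ((rating_b - rating_a) / 400))
    score_a = 1.0 if a_won else 0.0
    delta = k * (score_a - expected_a)
    return rating_a + delta, rating_b - delta
```

Unlike the fully balanced design used here, an Elo-style scheme would let ratings converge without voting on every contribution pairing, which is what makes it attractive for larger events.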
Conclusion
This experiment marked the first proof-of-concept for the complete WISDOM v2 model. In general it was a success, demonstrating an autonomous, inclusive, community-controlled system that can generate multidimensional ratings across diverse contributions. However, the reviewed contributions represent only a partial sample of all contributions, highlighting a need to improve recording processes and to develop more efficient algorithms that can handle more contributions without fully balancing all contribution pairings. Nevertheless, the results mark our first step toward an autonomous gathering template, which might one day be used to organise similar gatherings in a decentralised, autonomous fashion.
EDIT
Additional contributions were added and meta-reviewed following the AIMOS presentation (recording, slides), including two review contributions (2 & 8 surveys) and another financial contribution ($150). These can be seen in the results spreadsheet but are not reflected in the above text (see report here).