Research for democratic capabilities
There is an ocean of research required to help democracy keep pace with AI.
The gap map contains over 250 research questions linked directly to the capabilities they will improve and the goals they will help meet. If you spot gaps the map is missing, you can suggest improvements to its coverage on the contributions page.
This page includes sections for: what needs to be done first, underserved niches, the most impactful research, and where your effort can have an outsize payoff.
What needs to be done first
Resolving a few key bottlenecks has the potential to massively speed up the overall ecosystem. This list contains goals that, once met, would improve the maturity of their parent capability. It covers only the capabilities we’ve tagged as most urgent, either because improving them speeds up the overall rate of improvement (simulation and measurability) or because they fill critical gaps in getting processes to “good enough” for near-term use cases (resisting manipulation and reaching participants).
Can AI generate its own suggested changes and test them to search the latent space for optimal solutions?
What design variables in deliberative formats can AI systems reliably identify as leverage points for optimization through automated multi-agent simulation?
For what uses, in what contexts and with what level of faithfulness is it helpful or appropriate to use simulations, and what are the philosophical, moral, and political implications?
What simulation fidelity level (agent realism, dialogue authenticity, decision distributions) accurately predicts outcomes for specific deliberative formats under real-world constraints, and where does increased fidelity stop improving predictive value?
How can lessons from speculative execution and speculative decoding help increase the availability of deliberative processes through reduced costs?
What are the key technical blockers (agent behavior calibration, emergent group dynamics modeling, preference faithfulness) to effective and trustworthy multi-agent simulation, and which are tractable with current methods?
What kinds of systems are appropriate for simulation?
How can the impacts of interventions on complex systems be simulated quickly and accurately?
What is the Pareto frontier of speed, accuracy, and usable interactivity?
What consent, anonymization, and data governance protocols (comparing opt-in vs. opt-out, persistent vs. temporary storage, restricted vs. open licensing) enable practitioners to balance participant privacy and autonomy against the research value of maintaining rich deliberative records?
How do downstream effects from participation systematically vary across different deliberative process formats (comparing citizens' assemblies, deliberative polls, mini-publics, and online forums), and what process features predict effect heterogeneity?
What particular knock-on effects from participation (spanning civic engagement, political efficacy, discussion spillover, network influence, or policy awareness) are most important to measure, and what longitudinal methods best capture them without excessive participant burden?
What observable deliberative quality dimensions (such as turn-taking equity, argument depth, perspective inclusion, or respectfulness) can be reliably measured through automated content analysis or human observation in real time, and what does measurement reveal about facilitator behavior changes?
What measurement approaches (comparing explicit belief statements, semantic mapping, implicit preference tasks, or network analysis of argument adoption) best capture individual and group learning and preference shifts while remaining feasible to administer at deliberation intervals?
How do different methods for measuring preference transformation (pre/post surveys, in-process journaling, exit interviews, or network tracking) correlate with one another and with long-term behavioral change, under different deliberative process formats?
What recording modalities (comparing video, audio-only, spatial tracking, or multimodal combinations) most reliably preserve the substance of deliberation while remaining minimally intrusive and respectful of participant discomfort?
Which transcription and annotation approaches (comparing human verbatim, human semantic, hybrid human-AI, or AI-only) best handle cross-talk, non-verbal communication, and emotional valence while maintaining accuracy standards?
How can we design adaptive learning systems that provide personalized learning programs?
What are the best methods for efficiently educating people?
How can individual learning be mediated through group learning to lift all boats?
How can individual learning agents identify and pair learning partners for defined objectives (idea crosspollination, depolarization, information gaps)?
How can AI systems translate, generate and integrate learning materials into diverse formats (text, audio, visual, etc)?
Which evaluation metrics (comparing single-dimension vs. composite indices) are sensitive enough to detect quality differences within similar processes but robust enough for valid comparison across different topics, geographies, and participant populations?
What constellation of outcomes (spanning legitimacy, recommendation quality, participant satisfaction, opinion change, and downstream policy impact) must any democratic process achieve to be considered successful, and how do these vary with process purpose?
How can process outcomes (spanning legitimacy, recommendation quality, participant satisfaction, opinion change, and downstream policy impact) be operationalized as measurable indicators practitioners can feasibly collect?
How can practitioners balance (through adaptive protocols or meta-evaluation frameworks) universal standards for cross-context learning against context-specific adaptations required by local stakeholder concerns and governance structures?
What are the most efficient ways of recruiting participants?
How best to implement global sortition given limited resources or access to population data?
How can we handle the real-world failure modes of recruitment?
What are the best approaches to recruiting a participant pool that captures the complexity and intersections of society while minimising self-selection biases?
What strategies can be used to motivate participation in less-democratic contexts?
For a given budget, location, panel size, and unique quotas, how can we design a recruitment plan that will maximize response rates and the representativeness of the sample?
How to manage recruitment in geographies with extremely poor access and weak digital and physical infrastructure?
How can we quantify the fairness of different approaches to sampling the population? (One possible operationalization is sketched after this list.)
What kinds of recruitment methods reach which kinds of people?
How can we distinguish between legitimate persuasion and manipulative influence in deliberative settings?
What behavioral indicators reliably signal attempts to game deliberative processes?
How can we create standardized integrity assessment frameworks for evaluating completed assemblies?
How can we develop manipulation impact metrics that distinguish between minor and outcome-altering influences?
How can we design information presentation formats that minimize susceptibility to framing effects?
What are the tradeoffs between openness/transparency and manipulation resistance?
How can we develop real-time detection systems for coordinated manipulation attempts during participant recruitment and selection?
How can we quantify and test the manipulation resistance of different assembly design choices?
What conditions allow commitments to remain binding when the regulatory or political environment shifts significantly after the commitment was made?
Under what conditions is it reasonable to not stick with commitments? (e.g. does the reversal of a commitment require an explicit mandate, either through an election or a subsequent deliberative process?)
What mix of carrots and sticks is necessary to protect commitments internally?
What are the internal barriers that prevent commitment from happening (e.g. employee pressure, incentive systems, decision-making culture, organizational structure)?
What role can legal or compliance infrastructure play in embedding deliberative commitments into operations? Under what conditions can it be counter-productive?
How do we measure commitment drift, i.e. the extent to which commitments have not stuck over time?
What properties should commitments have to make them truly adaptable? (e.g. specificity vs. breadth, time boundedness, rules for how commitments evolve over time)
What practices protect commitments from reversal when leadership or staff changes in an organization or government?
Could there be templated approaches to socialising and developing internal commitments?
What are the most common barriers that prevent AI labs from binding to deliberative outcomes? Which barriers are structural versus contingent on political will?
What alternative mechanisms most effectively replicate the functional properties of a legal bind?
Regarding timelines, when does the obligation need to begin? How long a delay, after a decision has been made, is acceptable for a bind to be considered respected? What prevents indefinite deferral?
Is there a demonstrable trade-off between the degree of legal bindingness imposed on AI labs and their capacity for rapid AI innovation? If so, under what governance designs is that trade-off minimized?
How should the degree of bindingness be calibrated to the characteristics of the decision at stake?
Under what conditions should a binding deliberative outcome be legally contestable or reversible?
What are the most common barriers that prevent governments from binding to deliberative outcomes? Which barriers are structural versus contingent on political will?
What existing analogues (e.g. binding arbitration) provide legal precedents, and what do they fail to address for AI governance contexts?
How does the degree of isolation of a citizen participation office affect its resilience to political interference? What level of integration vs. independence optimizes legitimacy?
What legal mechanisms can a private company set up to make deliberative outcomes enforceable?
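For a flavour of what answering the sampling-fairness question above might involve, here is a minimal Python sketch that scores a recruited panel against census marginals using total variation distance. The panel records, attribute names, and census shares are all hypothetical placeholders, and the metric only checks marginals, so it deliberately ignores the intersectional structure that the recruitment questions above flag as an open problem.

```python
from collections import Counter

def marginal_tv_distance(panel, census_share, attribute):
    """Total variation distance between the panel's share of each
    category and the census share, for one demographic attribute.
    0.0 = perfectly representative marginals, 1.0 = maximally skewed."""
    counts = Counter(person[attribute] for person in panel)
    n = len(panel)
    categories = set(census_share) | set(counts)
    return 0.5 * sum(
        abs(counts.get(cat, 0) / n - census_share.get(cat, 0.0))
        for cat in categories
    )

# Hypothetical data: a six-person panel scored against invented census shares.
panel = [
    {"age_band": "18-34", "region": "north"},
    {"age_band": "18-34", "region": "south"},
    {"age_band": "35-54", "region": "north"},
    {"age_band": "35-54", "region": "south"},
    {"age_band": "55+", "region": "north"},
    {"age_band": "55+", "region": "north"},
]
census = {
    "age_band": {"18-34": 0.30, "35-54": 0.35, "55+": 0.35},
    "region": {"north": 0.50, "south": 0.50},
}
for attr, shares in census.items():
    print(attr, round(marginal_tv_distance(panel, shares, attr), 3))
```

A fuller treatment would compare scores like this across recruitment methods and weigh them against response rates and cost, as the budget-constrained recruitment question above asks.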
Underserved niches
Many actors are working on the same obvious problems, leaving other key challenges comparatively neglected. This list contains research questions whose parent capabilities suffer from either “high” or “extreme” neglectedness and “minimal” or “low” maturity. There is ample reason why some of these challenges have been neglected: some call for entire teams to tackle a complex problem, like simulation and forecasting, while others are less glamorous and require specific knowledge of the processes and systems they feed, like output implementability, gathering process data, and activating learning in participants.
Can AI generate its own suggested changes and test them to search the latent space for optimal solutions?
What design variables in deliberative formats can AI systems reliably identify as leverage points for optimization through automated multi-agent simulation?
For what uses, in what contexts and with what level of faithfulness is it helpful or appropriate to use simulations, and what are the philosophical, moral, and political implications?
What simulation fidelity level (agent realism, dialogue authenticity, decision distributions) accurately predicts outcomes for specific deliberative formats under real-world constraints, and where does increased fidelity stop improving predictive value?
How can lessons from speculative execution and speculative decoding help increase the availability of deliberative processes through reduced costs?
What are the key technical blockers (agent behavior calibration, emergent group dynamics modeling, preference faithfulness) to effective and trustworthy multi-agent simulation, and which are tractable with current methods?
What kinds of systems are appropriate for simulation?
How can the impacts of interventions on complex systems be simulated quickly and accurately?
What is the Pareto frontier of speed, accuracy, and usable interactivity?
What consent, anonymization, and data governance protocols (comparing opt-in vs. opt-out, persistent vs. temporary storage, restricted vs. open licensing) enable practitioners to balance participant privacy and autonomy against the research value of maintaining rich deliberative records?
How do downstream effects from participation systematically vary across different deliberative process formats (comparing citizens' assemblies, deliberative polls, mini-publics, and online forums), and what process features predict effect heterogeneity?
What particular knock-on effects from participation (spanning civic engagement, political efficacy, discussion spillover, network influence, or policy awareness) are most important to measure, and what longitudinal methods best capture them without excessive participant burden?
What observable deliberative quality dimensions (such as turn-taking equity, argument depth, perspective inclusion, or respectfulness) can be reliably measured through automated content analysis or human observation in real time, and what does measurement reveal about facilitator behavior changes?
What measurement approaches (comparing explicit belief statements, semantic mapping, implicit preference tasks, or network analysis of argument adoption) best capture individual and group learning and preference shifts while remaining feasible to administer at deliberation intervals?
How do different methods for measuring preference transformation (pre/post surveys, in-process journaling, exit interviews, or network tracking) correlate with one another and with long-term behavioral change, under different deliberative process formats?
What recording modalities (comparing video, audio-only, spatial tracking, or multimodal combinations) most reliably preserve the substance of deliberation while remaining minimally intrusive and respectful of participant discomfort?
Which transcription and annotation approaches (comparing human verbatim, human semantic, hybrid human-AI, or AI-only) best handle cross-talk, non-verbal communication, and emotional valence while maintaining accuracy standards?
What checks and balances are needed when making fully binding decisions?
How can cryptography create locking mechanisms and binding incentive structures?
How can technically binding decisions integrate with AI alignment in gradual ways?
How can we design adaptive learning systems that provide personalized learning programs?
What are the best methods for efficiently educating people?
How can individual learning be mediated through group learning to lift all boats?
How can individual learning agents identify and pair learning partners for defined objectives (idea crosspollination, depolarization, information gaps)?
How can AI systems translate, generate and integrate learning materials into diverse formats (text, audio, visual, etc)?
How to unobtrusively measure individual and group understanding?
How to balance finding common ground within a limited time, while minimally sacrificing depth of final outputs?
What are the best methods for providing impartial robustness checking and critical friend support for output refinement?
How can we measure the concreteness of statements and recommendations?
How can we ensure that outputs go beyond abstract, high-level principles to specific, actionable proposals?
What are the most efficient ways of recruiting participants?
How best to implement global sortition given limited resources or access to population data?
How can we handle the real-world failure modes of recruitment?
What are the best approaches to recruiting a participant pool that captures the complexity and intersections of society while minimising self-selection biases?
What strategies can be used to motivate participation in less-democratic contexts?
For a given budget, location, panel size, and unique quotas, how can we design a recruitment plan that will maximize response rates and the representativeness of the sample?
How to manage recruitment in geographies with extremely poor access and weak digital and physical infrastructure?
How can we quantify the fairness of different approaches to sampling the population?
What kinds of recruitment methods reach which kinds of people?
How can we effectively account for uncertainty in scenario consequences?
How can we enumerate a comprehensive set of scenarios or cases that a policy needs to address?
How can we identify the likelihood that key scenarios are missing?
How can we represent scenarios in an interactive and educational process (not predictive modelling)?
How can we track and mitigate biases within scenario mapping?
How can we develop criteria and methods for prioritizing scenarios based on likelihood, impact, and relevance to deliberative decisions?
How should we best treat low probability but high impact edge cases?
How can deliberative outputs be developed to accommodate revisions over time whilst preserving their intended motivations?
What data triage and routing processes (structured as decision trees vs. algorithmic vs. moderator-driven) enable process organizers to respond to emerging issues during deliberations, measured by time-to-action and intervention appropriateness?
Which visualization and dashboard designs (comparing temporal vs. spatial vs. network-based layouts) best support real-time information use by facilitators under time pressure, and when do practitioners choose to ignore dashboard signals?
How can deliberative outputs be formatted as functions such that they can automatically adapt?
What are the best methods for enabling iterative and ongoing citizen engagement so recommendations can be updated as contexts shift?
What machine translation and annotation approaches (comparing human-in-the-loop vs. automated vs. hybrid) maintain semantic accuracy for multilingual data in international or diverse assemblies, particularly for idioms and context-dependent meaning?
Which open standards and API specifications (building on ActivityPub, NDJSON, or deliberation-specific formats) best enable interoperability between different tools while operating within organizations' existing tech stacks and governance constraints?
What unified data models and schema (using RDF, JSON-LD, or domain-specific approaches) enable structured and unstructured inputs to be harmonized across different deliberative tool ecosystems, without losing fidelity to participants' original contributions? (An illustrative record format is sketched after this list.)
To what extent can AI be used to provide reliable real-time fact-checking within deliberations?
Under what conditions can AI-simulated participants maintain democratic legitimacy?
How can we ensure simulated participants accurately represent missing demographics?
How can automatic logging of key events improve access for verifiers?
How do we balance efficiency with resilience in resource-constrained environments?
How do we communicate changes to stakeholders without undermining confidence in outcomes?
How to enable AI-provided context that is appropriately comprehensive and sufficiently unbiased?
How can we measure and address the "conflict hangover" effect on subsequent deliberations?
What redundancies and buffers are most cost-effective for different types of disruptions?
What are culturally sensitive approaches to conflict that work across different contexts?
What are the tipping points where adaptation compromises core democratic values?
What transparency and consent mechanisms are required for hybrid assemblies?
How can we identify verbal and non-verbal cues that predict conflict escalation in deliberative settings?
How to fairly identify and fill perspective or empirical gaps in the background information?
How to suitably treat information hierarchies and data privacy while accumulating and mapping the information space?
To what extent can a structured repository of interpretive precedents — built from annotated implementation decisions linked back to the deliberative rationale that grounds them — function as a reliable 'case law' for navigating ambiguity in process outputs?
How reliably can language models trained on deliberative transcripts, stated rationales, and value-elicitation outputs distinguish between implementation decisions that are consistent with versus divergent from the normative commitments embedded in process outputs?
How can we measure whether conflict resolution preserved or suppressed minority viewpoints?
How can we define and measure "minimum viable" conditions for different assembly objectives?
What pre-commitments and transparency measures best preserve legitimacy during adaptations?
How do we prevent gaming or manipulation of AI backup systems?
How to develop real-time dashboards that track process health across multiple dimensions?
How do we distinguish between productive tension that enhances deliberation and destructive conflict?
How can we design responsive information systems that provide accurate context in real-time?
What restorative practices are most effective in deliberative settings?
What role can sentiment analysis and emotion recognition play in real-time conflict monitoring?
How can we systematically stress-test assembly designs before implementation?
Can AI generate its own suggested changes and test them to search the latent space for optimal solutions?
What hybrid approaches can combine fast simulation with selective human input to optimize both speed and accuracy for urgent decisions?
What are the best methods to measure the faithfulness of simulations?
What are the best methods to measure the accuracy of simulations?
How can we solve the technical blockers to effective and trustworthy multi-agent simulation and modelling?
How can we develop realistic simulation environments that accurately predict how different deliberative formats will perform according to different design choices?
For what uses, in what contexts, and with what level of faithfulness is it helpful or appropriate to use simulations, and what are the philosophical, moral, political, and other implications?
How can lessons from speculative execution and speculative decoding help increase the availability of deliberative processes through reduced costs?
How can deliberative processes operating at different governance layers be coordinated such that they inform rather than contradict each other, especially when underlying values or priorities differ across regions?
What structural alignment mechanisms enable deliberative outputs from multiple jurisdictions to coherently influence transnational policy bodies while respecting subsidiarity and local democratic autonomy?
Under what conditions do transnationally-integrated deliberative processes strengthen the legitimacy of transnational institutions versus creating legitimacy backlash by appearing to bypass national democratic processes?
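To make the unified-data-model question above concrete, the sketch below shows one hypothetical JSON-LD-style record, written as a Python dictionary, that harmonizes an unstructured spoken contribution with a linked structured vote. The @context URL, field names, and types are invented for illustration and are not an existing standard.

```python
import json

# Hypothetical record harmonizing a structured vote with an unstructured
# spoken contribution under one shared vocabulary. Every field name and
# the @context URL below are illustrative, not part of any real standard.
contribution = {
    "@context": "https://example.org/deliberation-vocab",  # hypothetical
    "@type": "Contribution",
    "id": "contrib-0193",
    "participant": "participant-42",  # pseudonymous identifier
    "session": "assembly-2025-plenary-3",
    "modality": "speech",  # e.g. "speech" | "text" | "vote"
    "content": {
        "transcript": "I worry the proposal ignores rural broadband.",
        "language": "en",
        "annotations": [
            {"type": "topic", "value": "infrastructure"},
            {"type": "stance", "target": "proposal-7", "value": "concern"},
        ],
    },
    "linkedStructuredInput": {
        "@type": "Vote",
        "target": "proposal-7",
        "value": "oppose",
    },
}
print(json.dumps(contribution, indent=2))
```

Keeping the raw transcript alongside machine annotations is one way such a schema could preserve fidelity to participants' original contributions while still supporting interoperable analysis.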
The most impactful research
We have rated twelve capabilities as extremely important because improvements to them will do the most to raise the overall maturity of deliberative processes. These capabilities are particularly load-bearing for process legitimacy, feasibility, and implementation in high-stakes settings. Part of what makes them so important is that the work is technically difficult and requires direct insight into practical implementation barriers.
Can AI generate its own suggested changes and test them to search the latent space for optimal solutions?
What design variables in deliberative formats can AI systems reliably identify as leverage points for optimization through automated multi-agent simulation?
For what uses, in what contexts and with what level of faithfulness is it helpful or appropriate to use simulations, and what are the philosophical, moral, and political implications?
What simulation fidelity level (agent realism, dialogue authenticity, decision distributions) accurately predicts outcomes for specific deliberative formats under real-world constraints, and where does increased fidelity stop improving predictive value?
How can lessons from speculative execution and speculative decoding help increase the availability of deliberative processes through reduced costs?
What are the key technical blockers (agent behavior calibration, emergent group dynamics modeling, preference faithfulness) to effective and trustworthy multi-agent simulation, and which are tractable with current methods?
What kinds of systems are appropriate for simulation?
How can the impacts of interventions on complex systems be simulated quickly and accurately?
What is the Pareto frontier of speed, accuracy, and usable interactivity?
What consent, anonymization, and data governance protocols (comparing opt-in vs. opt-out, persistent vs. temporary storage, restricted vs. open licensing) enable practitioners to balance participant privacy and autonomy against the research value of maintaining rich deliberative records?
How do downstream effects from participation systematically vary across different deliberative process formats (comparing citizens' assemblies, deliberative polls, mini-publics, and online forums), and what process features predict effect heterogeneity?
What particular knock-on effects from participation (spanning civic engagement, political efficacy, discussion spillover, network influence, or policy awareness) are most important to measure, and what longitudinal methods best capture them without excessive participant burden?
What observable deliberative quality dimensions (such as turn-taking equity, argument depth, perspective inclusion, or respectfulness) can be reliably measured through automated content analysis or human observation in real time, and what does measurement reveal about facilitator behavior changes?
What measurement approaches (comparing explicit belief statements, semantic mapping, implicit preference tasks, or network analysis of argument adoption) best capture individual and group learning and preference shifts while remaining feasible to administer at deliberation intervals?
How do different methods for measuring preference transformation (pre/post surveys, in-process journaling, exit interviews, or network tracking) correlate with one another and with long-term behavioral change, under different deliberative process formats?
What recording modalities (comparing video, audio-only, spatial tracking, or multimodal combinations) most reliably preserve the substance of deliberation while remaining minimally intrusive and respectful of participant discomfort?
Which transcription and annotation approaches (comparing human verbatim, human semantic, hybrid human-AI, or AI-only) best handle cross-talk, non-verbal communication, and emotional valence while maintaining accuracy standards?
What checks and balances are needed when making fully binding decisions?
How can cryptography create locking mechanisms and binding incentive structures?
How can technically binding decisions integrate with AI alignment in gradual ways?
How can we design adaptive learning systems that provide personalized learning programs?
What are the best methods for efficiently educating people?
How can individual learning be mediated through group learning to lift all boats?
How can individual learning agents identify and pair learning partners for defined objectives (idea crosspollination, depolarization, information gaps)?
How can AI systems translate, generate and integrate learning materials into diverse formats (text, audio, visual, etc)?
How to unobtrusively measure individual and group understanding?
How to balance finding common ground within a limited time, while minimally sacrificing depth of final outputs?
What are the best methods for providing impartial robustness checking and critical friend support for output refinement?
How can we measure the concreteness of statements and recommendations?
How can we ensure that outputs go beyond abstract, high-level principles to specific, actionable proposals?
Which evaluation metrics (comparing single-dimension vs. composite indices) are sensitive enough to detect quality differences within similar processes but robust enough for valid comparison across different topics, geographies, and participant populations?
What constellation of outcomes (spanning legitimacy, recommendation quality, participant satisfaction, opinion change, and downstream policy impact) must any democratic process achieve to be considered successful, and how do these vary with process purpose?
How can process outcomes (spanning legitimacy, recommendation quality, participant satisfaction, opinion change, and downstream policy impact) be operationalized as measurable indicators practitioners can feasibly collect?
How can practitioners balance (through adaptive protocols or meta-evaluation frameworks) universal standards for cross-context learning against context-specific adaptations required by local stakeholder concerns and governance structures?
What are the most efficient ways of recruiting participants?
How best to implement global sortition given limited resources or access to population data?
How can we handle the real-world failure modes of recruitment?
What are the best approaches to recruiting a participant pool that captures the complexity and intersections of society while minimising self-selection biases?
What strategies can be used to motivate participation in less-democratic contexts?
For a given budget, location, panel size, and unique quotas, how can we design a recruitment plan that will maximize response rates and the representativeness of the sample?
How to manage recruitment in geographies with extremely poor access and weak digital and physical infrastructure?
How can we quantify the fairness of different approaches to sampling the population?
What kinds of recruitment methods reach which kinds of people?
How can we distinguish between legitimate persuasion and manipulative influence in deliberative settings?
What behavioral indicators reliably signal attempts to game deliberative processes?
How can we create standardized integrity assessment frameworks for evaluating completed assemblies?
How can we develop manipulation impact metrics that distinguish between minor and outcome-altering influences? (One possible counterfactual test is sketched after this list.)
How can we design information presentation formats that minimize susceptibility to framing effects?
What are the tradeoffs between openness/transparency and manipulation resistance?
How can we develop real-time detection systems for coordinated manipulation attempts during participant recruitment and selection?
How can we quantify and test the manipulation resistance of different assembly design choices?
What are the best ways of anticipating key objections core power holders may raise against recommendations?
How can deliberative processes produce outputs that meet legal, technical, or administrative requirements without compromising participant ownership?
What are the most effective methods and formats for presenting process outputs to decision makers, and what tools can support this process?
What are the most effective methods of testing the compatibility of outputs with legal/constitutional/jurisdictional or other fundamental constraints on recommendation uptake?
What conditions allow commitments to remain binding when the regulatory or political environment shifts significantly after the commitment was made?
Under what conditions is it reasonable to not stick with commitments? (e.g. does the reversal of a commitment require an explicit mandate, either through an election or a subsequent deliberative process?)
What mix of carrots and sticks is necessary to protect commitments internally?
What are the internal barriers that prevent commitment from happening (e.g. employee pressure, incentive systems, decision-making culture, organizational structure)?
What role can legal or compliance infrastructure play in embedding deliberative commitments into operations? Under what conditions can it be counter-productive?
How do we measure commitment drift, i.e. the extent to which commitments have not stuck over time?
What properties should commitments have to make them truly adaptable? (e.g. specificity vs. breadth, time boundedness, rules for how commitments evolve over time)
What practices protect commitments from reversal when leadership or staff changes in an organization or government?
Could there be templated approaches to socialising and developing internal commitments?
Can AI systems identify their own biases and reasoning errors more reliably than individual humans can identify their own cognitive biases when making sense of inputs?
How much authentic human value is lost at each level of AI involvement (AI note-taker vs. AI facilitator vs. AI co-deliberator) and where is the steepest drop-off in the value-cost curve?
If 'doing the work' of synthesizing and clustering is more valuable than having an AI do it, do participants benefit equally from 'doing this work' or does it privilege those with more skills and stamina?
How to develop an AI facilitator that is attentive to power imbalances, adaptive to group dynamics and effective in guiding groups towards successful outcomes?
How can digital tools assist human facilitators to more effectively facilitate deliberations?
What are the effects of AI facilitation on public perceptions, group dynamics and deliberative quality?
How can delibtech tools expand the space of policy scenarios and considerations in a transparent and fair way?
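As one illustration of the manipulation-impact question above, a counterfactual outcome test recomputes a decision with flagged contributions removed and reports whether the result flips. The Python sketch below applies the idea to a simple majority vote; the upstream detector that flags votes is assumed to exist, and all data are made up.

```python
from collections import Counter

def outcome(votes):
    """Winner of a simple majority vote over {participant_id: choice};
    exact ties return None."""
    tally = Counter(votes.values())
    top = tally.most_common(2)
    if len(top) > 1 and top[0][1] == top[1][1]:
        return None
    return top[0][0]

def manipulation_impact(votes, flagged_ids):
    """Counterfactual impact metric: does removing the votes flagged as
    manipulated (by some assumed upstream detector) change the winner?"""
    clean = {pid: v for pid, v in votes.items() if pid not in flagged_ids}
    observed, counterfactual = outcome(votes), outcome(clean)
    return {
        "observed": observed,
        "counterfactual": counterfactual,
        "outcome_altering": observed != counterfactual,
        "share_flagged": len(flagged_ids) / len(votes),
    }

# Made-up data: seven votes, two of which a detector has flagged.
votes = {f"p{i}": choice for i, choice in enumerate("AAAABBB")}
print(manipulation_impact(votes, flagged_ids={"p0", "p1"}))
```

A more robust version would sweep across many plausible flag sets and report how often the outcome flips, rather than trusting a single detector run.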
Where your effort can have an outsize payoff
Some research projects aren’t as glamorous as others, so some of the most consequential and foundational research is still waiting for someone capable of tackling it. Addressing these questions offers the chance to make an outsize impact with a comparatively small investment of resources. They are rated on a scale of “opportunity” for improving their parent capability, meaning we think there is significant headroom for investment to improve maturity.