Understanding DAOs: An Empirical Study on Governance Dynamics
Priorities Extracted from This Source
#1
Improve governance outcomes when voters face too many proposals and have incomplete preferences
#2
Use proxy voting/delegation to increase legitimacy and voter satisfaction
#3
Assess the benefits and limitations of liquid democracy in blockchain governance
#4
Define delegation representative (dRep) mechanisms and agreement thresholds for delegation
#5
Characterize theoretical approximation guarantees and impossibility bounds for proxy voting
#6
Identify conditions such as voter coherence under which delegation can improve outcomes
#7
Apply governance models to real blockchain systems like Cardano Project Catalyst
#8
Approximation guarantees for proxy selection under coherent instances
#9
Limits and impossibility results for single-delegate proxy selection
#10
Tie-breaking effects on proxy selection outcomes
#11
Robustness of proxy selection under varying coherence assumptions
#12
Computational complexity of finding optimal delegation representatives
#13
Use of multiple delegation representatives to achieve optimal outcomes
#14
Generalization from coherent to approximately coherent instances
#15
Improving proxy selection and approximation guarantees in delegated voting
#16
Understanding majority agreement and common revealed proposals in proxy voting
#17
Empirical evaluation of delegation rates and outcome quality
#18
Use of realistic data and heuristics to study incomplete preferences
#19
Addressing computational tractability in selecting delegate representatives
#20
Future governance design: delegate information models and incentive/rationality models
#21
Exploring extensions such as tie-breaking, alternative distance metrics, voting rules, multi-winner settings, and partial delegation
Document Content
Full text from all 3 processed chunks:
Chunk 0
On the Potential and Limitations of Proxy Voting:
Delegation with Incomplete Votes
Georgios Amanatidis1,5, Aris Filos-Ratsikas2, Philip Lazos3,
Evangelos Markakis3,4,5, Georgios Papasotiropoulos4,6
1University of Essex.
2University of Edinburgh.
3Input Output (IOG).
4Athens University of Economics and Business.
5Archimedes/Athena Research Center.
6University of Warsaw.
Abstract
We study elections where voters are faced with the challenge of expressing pref-
erences over an extreme number of issues under consideration. This is largely
motivated by emerging blockchain governance systems, which include voters with
different weights and a massive number of community generated proposals. In
suchscenarios,itisnaturaltoexpectthatvoterswillhaveincompletepreferences,
as they may only be able to evaluate or be confident about a very small propor-
tion of the alternatives. As a result, the election outcome may be significantly
affected, leading to suboptimal decisions. Our central inquiry revolves around
whether delegation of ballots to proxies possessing greater expertise or a more
comprehensive understanding of the voters’ preferences can lead to outcomes
with higher legitimacy and enhanced voters’ satisfaction in elections where voters
submit incomplete preferences. To explore its aspects, we introduce the following
model: potential proxies advertise their ballots over multiple issues, and each
voter either delegates to a seemingly attractive proxy or casts a ballot directly.
Weidentifynecessaryandsufficientconditionsthatcouldleadtoasociallybetter
outcomebyleveragingtheparticipationofproxies.Weaccompanyourtheoretical
findings with experiments on instances derived from real datasets. Overall, our
results enhance the understanding of the power of delegation towards improving
election outcomes.
Keywords:ComputationalSocialChoice;ProxyVoting;IncompletePreferences
4202
yaM
41
]TG.sc[
2v24650.9032:viXra
1 Introduction
Broadly speaking, an election refers to a voting system in which a set of participants
express their preferences over a set of possible issues or outcomes, and those are
aggregated into a collective decision, typically with a socially desirable objective
in mind. Besides their “traditional” applications such as parliamentary elections or
referenda,electionsoftenunderpinthelivelihoodofmodernsystemssuchasblockchain
governance (Kiayias and Lazos, 2022; Cevallos and Stewart, 2021) or participatory
budgeting (Cabannes, 2004; Aziz and Shah, 2021). Quite often, voters are called to
vote on an extremely high number of issues, rendering the accurate expression of their
preferences extremely challenging. For instance, the Cardano blockchain uses Project
Catalyst (https://projectcatalyst.io) to allocate treasury funds to community projects,
and routinely receives several thousands of proposals per funding round. Another
applicationcomesfromplatformsofcivicparticipation,wheretheusersexpresssupport
on opinions or proposals (Halpern et al, 2023b).
An unfortunate consequence of these election scenarios is that the voters will
inevitably have a confident opinion only for a small number of issues (henceforth
proposals), as investing enough time and effort to inform themselves about the sheer
volume of proposals is clearly prohibitive. In turn, the “direct voting” outcome, even
under the best intentions, will most likely be ineffective in capturing the desires of
society, which it would, had the voters been sufficiently informed.
A well-documented possible remedy to this situation is to allow for proxy voting
(Cohensiusetal,2017),asysteminwhichthevotersdelegate theirvotestoproxies.The
idea is that those proxies have the time and resources to study the different proposals
carefully, and vote on behalf of the voters they represent. This in fact captures voting
applications more broadly, where the reason for delegation might be a reluctance to
express an opinion, lack of specialized knowledge, or even limited interest. When those
proxies are part of an electorate together with other voters and proxies, the resulting
systemisknownasliquid democracy (Kahngetal,2021;CaragiannisandMicha,2019).
Liquid democracy has been scrutinized, with arguments presented in its favor (Becker
et al, 2021; Halpern et al, 2023a; Revel et al, 2022) and against it (Caragiannis and
Micha, 2019; Kahng et al, 2021), and at the same time it is being employed in real-
world situations (Paulin, 2020) including settings similar to the one studied here, like
the Project Catalyst.
A takeaway message from the ongoing debate around delegative voting is that such
processes might indeed be useful under the right circumstances. Extending this line
of thought and motivated by the scenarios presented above, we aim to identify the
potential and limitations of proxy voting with regard to achieving socially desirable
outcomes, in settings with incomplete votes. More precisely, we aim to characterize
what is theoretically possible with delegation, and what is impossible, even under
idealized conditions.
1.1 Our Setting and Contribution
We focus on elections in which the aim is to choose one proposal to be implemented
fromarangeofmultipleproposals.Weintroduceamodelofproxyvoting,wherevoters
2
have intrinsic approval preferences over all proposals, which are only partially revealed
orknowntothevotersthemselves.Asetofdelegation representatives (dReps) canthen
advertise ballots over the proposals and the voters in turn may either delegate to a
proxy, if there is sufficient agreement (i.e., over a certain agreement threshold between
the proxy’s advertised ballot and the voter’s revealed preferences), or vote directly.
The outcome of the election is the proposal with the largest approval score, assembled
by the score from the ballots of the dReps (representing voters who delegated their
vote) and the voters that vote directly. The core question we pose follows:
“Is it possible for the dReps to advertise their preferences appropriately such that the
outcome of the election has an approval score that is a good approximation of the best
possible approval score; which would only be achievable if all voters had full knowledge
of their preferences?”
“Best-Case Scenario”.
We study the aforementioned question by making the following assumptions:
1. The dReps are fully informed about the preferences of the voters, i.e., they know
exactly the vector of intrinsic preferences for each voter.
2. The dReps themselves do not have actual preferences and their only goal is to
achieve the best possible approximation of the optimal approval score. To do so
they coordinate with each other and advertise their types accordingly.
3. Whentherearemultipleproposalswiththemaximumrevealedscore,tiesarebroken
in favor of those with the highest intrinsic scores.
One should not of course expect all of these assumptions to apply in practice. We
would expect the dReps to be only partially informed about the preferences of the
voters(e.g., via some probabilistic model) andto exhibit some sortof rational behavior
(e.g., needing to be appropriately incentivized to advertise ballots that are aligned with
socially-desirable outcomes). Still, studying “best-case scenario” is already instructive
for results in all other regimes. In particular:
• Our negative results (inapproximability bounds) immediately carry over to other
settings as well, regardless of the choice of the dRep information model, the
rationality model for the dReps, or the choice of the tie-breaking rule. In other
words, we show that certain objectives are impossible, even when a set of fully-
informed dReps coordinate to achieve the best outcome, hence they are certainly
impossible for any other meaningful setting.
• Our positive results (approximation guarantees) establish the limits of the afore-
mentioned impossibilities: if something is not deemed impossible by our bounds,
it should be the starting point of investigations for an information/rationality
model chosen for the application at hand. Clearly, if our upper bounds establish
that a certain number of dReps suffices to achieve a certain approximation in the
“best-case scenario” setting, one should expect a slightly larger number of dReps
to be needed in practice.
3
Our Results.
We firstly present a strong impossibility, namely that for any agreement threshold
higher than 50%, the best achievable approximation ratio is linear in the number of
voters. On the positive side, we show that for an appropriate coherence notion of the
instance, which captures the commonalities of the set of proposals that sets of voters
are informed about, meaningful approximations are possible. For the natural case of
an agreement threshold of 50%, we show that a single dRep is capable of achieving
an approximation factor of 3, whereas only 2 dReps are sufficient to elect the optimal
proposal. Most significantly, we present general theoretical upper and lower bounds
on the achievable approximations, depending on the agreement threshold, the number
of dReps, and the coherence of the instance. We complement our theoretical results
with a set of experiments using the MovieLens dataset (Harper and Konstan, 2015), to
measure the effects of proxy voting on realistic incomplete preferences.
1.2 Related Work
We first comment on some works that are closer in spirit to ours. Meir et al (2021)
propose a model with a similar objective, but focusing on the analysis of sortition, i.e.,
the approximation of the welfare achieved by selecting a random small-size committee.
Inarelateddirection,Cohensiusetal(2017)analyzeparticulardelegationmechanisms,
under elections with samples of voters located randomly in a metric space, according
to some distribution. Our approach does not consider any randomization, neither for
the voting rule nor for the preferences. Finally, Pivato and Soh (2020) also consider
the performance of proxy voting, focusing on understanding when the proxy-elected
outcome coincides with the outcome of direct voting. Again, the model of Pivato
and Soh (2020) is randomized, where the voters delegate based on the probability of
agreement to a proxy, and not based on a deterministic distance function. Moreover,
no analysis of approximation guarantees is undertaken in the work of Pivato and Soh
(2020). Our work can be seen as one that contributes to the corpus of findings in favor
of proxy voting (Becker et al, 2021; Halpern et al, 2023a; Revel et al, 2022), albeit in a
markedly different manner.
There is significant work within the field of computational social choice on elections
with incomplete votes. One stream has focused on the identification of possible and
necessarywinnersbyexploringpotentialcompletionsofincompleteprofiles;seethework
of Lang (2020) for an overview. Recent work has concentrated on the computational
complexity of winner determination under various voting rules within the framework of
incomplete information (Imber et al, 2022; Zhou et al, 2019; Baumeister and Dennisen,
2015). Another direction has studied the complexity of centralized interventions to
reduceuncertainty(Alouf-Heffetzetal,2022)(e.g.,byeducatingaselectedsetofvoters
or computing delegations via a centralized algorithm). Furthermore, there has been
an exploration of the effect of minimizing the amount of information communicated
(Kalech et al, 2011; Ayadi et al, 2019) as well as of the interplay between voters’
limited energy and social welfare (Terzopoulou, 2023). Considerable attention has been
devoted to the exploration of efficient extensions of incomplete profiles to complete
ones that satisfy desirable properties (Terzopoulou et al, 2021; Lackner, 2014; Elkind
4
et al, 2015). A conceptually related area focuses on distortion in voting, investigating
the implications of applying rules to preferences that are less refined than the voters’
intrinsicpreferences(Anshelevichetal,2021).Beyondvotingscenarios,similarsolution
concepts have been explored in domains such as fair division (Bouveret et al, 2010),
hedonic and non-cooperative games (Kerkmann and Rothe, 2019; Brill et al, 2016).
2 Election Framework and Definitions
In the current section we formally describe the main attributes of the election setting
we study.
Proposals and Voters. Let C = {1,2,...,m} be a set of candidate proposals,
where for each proposal there are exactly two options: to be elected or not. Let also
N ={1,2,...,n} be a set of voters responsible for determining the elected proposal.
Each voter i∈N is associated with approval preferences v ∈{0,1}m over the set of
i
proposals; we refer those as true or intrinsic preferences. Here, 1 and 0 are interpreted
as “accept” (or “support”) and “reject” (or “oppose”) a proposal, respectively.
Crucial to our model is the fact that voters do not actually know their entire
intrinsic preference vector, but only a subset of it; this could be due to the fact that
they have put additional effort into researching only certain proposals to verify if they
indeed support them or not, but not necessarily all of them. Formally, we will say
that each voter i has revealed preferences v ∈{0,1,⊥}m, where ⊥ denotes that the
(cid:98)i
voter does not have an opinion on the corresponding proposal. As such, we have the
following relations between v and v :
i (cid:98)i
∀j ∈C : (v (j)=v (j))∨(v (j)=⊥).
(cid:98)i i (cid:98)i
The collection of proposals for which a voter i has developed an opinion is referred to
as their revealed set, denoted by R :={j ∈C :v (j)=v (j)}. Let m :=|R |. Each
i (cid:98)i i i i
voter i also has an integer weight w ; in our work, it is without loss of generality to
i
assume that w =1 for all i∈N, as we can simply make w copies of voter i (with the
i i
same preferences), and all of our results go through verbatim.
dReps. A delegation representative (dRep) is a “special” voter whose aim is to attract
as many voters as possible to delegate their votes to her, and then participates in
the election with the combined weight of those voters. In contrast with some of the
literature, and consistently with the “best-case scenario” motivation (see Section 1.1),
we view dReps as unweighted agents, devoid of personal preferences over the proposals,
with the responsibility of facilitating the election of a proposal that attains substantial
support from the voters.
For any proposal j ∈C, every dRep has an advertised, “intended” vote (or type),
t(j)∈{0,1}, which is visible by the voters. We assume here that dReps present votes
forallproposals.1 Wewillsometimesabusethenotationandreferbytbothtothetype
vector of a dRep as well as to the dRep itself. The distance between a voter i and a
dRep of type t is calculated using the Hamming distance function and is dependent on
1 WecouldalsoallowdRepstoabstaininsomeproposals,andthiswouldnotmakeanydifferenceinour
setting.
5
the revealed preferences of i and the advertised type of t in proposals that are revealed
to i. Formally, let t be the projection of the type t to the proposals that belong to R .
|i i
Then we define the distance between a voter i and a dRep t as d(i,t):=H(v ,t ).
(cid:98)i |i
Agreement Threshold. For a voter to delegate their vote, they have to agree with
the dRep in a certain number of proposals. This is captured by a threshold bound,
any agreement above which results in delegation. To make this formal, we will assume
that voter i delegates their vote to a dRep when their distance to the dRep’s type,
taking into account only the voter’s revealed preferences, is at most ⌊mi−ki⌋, where
2
k is a parameter quantifying the reluctance of voter i to entrust their voting power
i
to a proxy. Obviously k ≤m , for every i, and we will mainly focus on scenarios in
i i
which all voters have the same parameter, thus k =k , for every voter i. For example,
i
when k =0, a voter delegates their vote if they agree with the dRep in at least half
of the proposals in their revealed set; we will refer to this case as majority agreement
(see also Fritsch and Wattenhofer (2022); Constantinescu and Wattenhofer (2023) for
a use of a very similar threshold in a difference context). Given a dRep of type t, we
say that t attracts a set of voters A(t):={i∈N :d(i,t)≤⌊mi−ki⌋}. Additionally we
2
define A(D), for a set of dReps D as
m −k
A(D):={i∈N :∃ t∈D s.t. d(i,t)≤⌊ i i⌋}.
2
If a voter is attracted by multiple dReps, we assume they delegate to any of them
arbitrarily; this choice makes our positive results stronger, whereas, notably, our
negative results work for any choice (e.g., even for the more intuitive choice of the
closest, in terms of Hamming distance, accepted dRep).
Preference Profiles. Let V =(v
i
)
i∈N
and V(cid:98) =(v
(cid:98)i
)
i∈N
. We call intrinsic preference
profile P =(N,C,V)avotingprofilethatcontainstheintrinsicpreferencesofthevoters
inN onproposalsfromC.Similarly,wecallrevealedpreferenceprofileP(cid:98) =(N,C,V(cid:98))the
votingprofilethatcontainstheirrevealedpreferences.Finally,P(cid:98)D =(N,C,V(cid:98)∪D)refers
to the preference profile on proposals from C, that contains the revealed preferences of
the voters in N as well as the advertised types of the dReps in D.
Approval Voting Winners. The winner of the election is the proposal with the
highest (weighted) approval score. Formally, let sc(j) denote the score of a proposal
j ∈ C in the profile P, i.e., the total weight of the voters i ∈ N such that v (j) = 1.
i
A proposal j ∈ C is the winning proposal in the profile P if sc(j) ≥ sc(j′),∀j′ ∈ C.
Similarly, we define sc(j) and sc (j) to be the score of a proposal j ∈C in the profile
(cid:98) (cid:98)D
P(cid:98) and P(cid:98)D , respectively. Note that s
(cid:98)
c(j) represents the score that proposal j would
attain if all voters were to vote directly and sc (j) comprises the scores of the dReps
(cid:98)D
(whose weight is the total weight of the voters they have attracted) and the scores
of the voters that have not delegated their votes to any dRep, i.e., that are voting
directly. Let win(P) := argmax{sc(j),j ∈ C} and opt(P) := sc(win(P)). The same
notions can be extended to profiles P(cid:98) and P(cid:98)D .
Tie-breaking for the winner: We assume that argmax{·} returns a single winning
proposal rather than a winning set, according to some tie-breaking rule. Consistently
6
with our discussion on the “best-case scenario” (see Section 1.1), we assume that the
tie-breaking is always in favor of the proposal with the maximum intrinsic score.
Our goal is to select a set of λ dReps that will collectively (by participating in the
election and representing voters according to the submitted thresholds) ensure that a
proposal of high intrinsic approval score will be elected, as formally presented in the
definition of the proxy selection problem, that follows.
proxy selection(P,P(cid:98),k,λ)
Input: An intrinsic voting profile P and a revealed voting profile P(cid:98) on a set C
ofmproposalsandasetN ofnweightedvoters;thevoters’true(resp.revealed)
preferences V (resp. V(cid:98) ); a parameter k, so that a voter i is attracted by a dRep
with type t if d(i,t)≤⌊mi−k⌋; an upper bound λ on the number of dReps.
2
Output: Specify type vectors for all dReps in D, with |D| ≤ λ, such that
win(P)=win(P(cid:98)D ).
The performance of a suggested set of dReps is measured in terms of how well the
intrinsic score of the winning proposal under their presence approximates the highest
intrinsic approval score. Formally:
Definition 1. Let ρ≥1. We say that a set of dReps D achieves a ρ-approximation if
sc(win(P(cid:98)D ))≥
ρ
1sc(win(P)).
One might be inclined to believe that attracting a sufficiently large set of voters
is enough to achieve a significantly low approximation ratio guarantee. The following
propositionestablishesthatthisisnotthecase.Itdemonstratesthattheattractionpart
is only one component towards solving proxy selection, and, therefore, achieving
good approximations requires further insights.
Proposition 1. ItispossibleforasingledReptoattractn−1votersandstillachieve
only an n-approximation.
Proof. Consider an instance with n voters and four proposals. Voters’ intrinsic pref-
erences are presented in the table below, where the revealed preferences are given in
white background.
I I I I
1 2 3 4
voter1 1 1 0 0
voters2,3,...,n 1 0 1 1
It is evident that win(P)=I , and opt(P)=n. At the same time the dRep that
1
votes in favor of all proposals attracts n−1 out of the n voters, namely all voters but
voter 1. This means that sc (I )=|A(D)|=n−1 (where, recall that D is the set of
(cid:98)D 1
dReps, here consisting of the single aforementioned dRep), while sc (I )=n, since
(cid:98)D 2
both the dRep and voter 1 vote in favor of I . However, sc(I )=1= 1opt(P), and
2 2 n
consequently, the dRep that votes in favor of all proposals, attracting n−1 of the
voters, cannot yield an approximation factor better than n.
7
Beforeweproceed,wehighlightthatifthetie-breakingruleisnotinfavorofelecting
the optimal proposal, then the situation can be much worse; in fact, the approximation
can be infinite as demonstrated by the following example.
Example 1. Consider an instance with n voters and m proposals. Voters’ intrinsic
preferences are presented in the table below, where the revealed preferences are given in
white background.
I I I ··· I
1 2 3 m
voter 1 1 0 1 ··· 1
. . . . .
. . . . .
. . . . .
voter n 1 0 1 ··· 1
It holds that win(P) = I , and opt(P) = n. Suppose that we select as dRep the one
1
that advertises 1 for every proposal under consideration. Then, A(D)=N. However,
the presence of this dRep in D implies that sc (j)=n, for every proposal j. Breaking
(cid:98)D
ties in an adversarial manner, results to the election of I , albeit sc(I ) = 0, which
2 2
leads to an infinite approximation factor.
We conclude the section with the definition of an important notion for our work,
that of a coherent set of voters, i.e., sets of voters with the same revealed sets. Several
of our positive results will be parameterized by properties of those sets, such as the
size of the largest coherent set.
Definition 2. A set of voters N′ ⊆N is called coherent if R =R ,∀i,i′ ∈N′. An
i i′
instance of proxy selection is called coherent if N is coherent.
Importantly, given an instance of proxy selection, it is computationally easy
to verify if it is coherent, or to find the largest coherent set of voters, via a simple
algorithm.
Lemma1. Givenaninstanceof proxyselection,wecaninpolynomialtimeidentify
the largest coherent set and its size. Furthermore, we can in polynomial time decide if
the instance is coherent.
Proof. Thefirstpartofthestatementobviouslyimpliesthesecond,asitsimplyevolves
checking whether the size of the largest coherent set is n. Now, given the intrinsic
preferences P of an instance of proxy selection we can find the largest coherent set
as follows. For every voter i, find a set of voters that can form a coherent set together
with i, namely {i′ ∈N :R ≡R }. Among the sets obtained this way, select one of
i i′
maximal cardinality. The described procedure runs in time O(n2m).
3 Theoretical Findings
We start our investigation with the case of a single dRep (λ = 1). Our main result
here is rather negative, namely that no matter how the dRep chooses their vote, the
approximation ratio cannot be better than linear in the number of voters.
8
Theorem 2. For a single dRep and any k > 0, the approximation ratio of proxy
selection is Ω(n).
Proof. Consider an instance with an odd number of m>3 proposals and n=m−1
voters, where k >0,∀i∈[n], such that:
i
- For every voter i∈[n], their preferences with respect to proposal m are as follows:
v (m)=1 and v (m)=⊥.
i (cid:98)i
- The remaining m − 1 proposals are partitioned in m−1 pairs, say
2
{1,2},{3,4},...,{m−2,m−1} and for each one of these m−1 pairs of proposals,
2
say {j,j+1}, there are two distinct voters, namely voters j and j+1, where voter
j votes in favor of both proposals j and j+1 whereas voter j+1 votes in favor of
proposal j but against proposal j+1; for every other proposal j′, the preferences of
these voters are v (j′)=0 and v (j′)=⊥, where i∈{j,j+1}.
i (cid:98)i
Say that P and P(cid:98) are the intrinsic and revealed profiles of the created instance,
respectively.Clearly,win(P)=mandopt(P)=n.WeclaimthatasingledRep,called
t,regardlessoftheiradvertisedtype,willcontributetoelectingaproposalithatsatisfies
sc(i)≤2, leading to an inapproximability of n. Towards this, first notice that t cannot
2
attract both voters of any pair. This is easy to see, as a distance of max{0,⌊mi−ki⌋}
2
for m =2 and k ≥1 means agreement on both revealed proposals. As a result, for
i i
any such pair of voters, t will either attract zero or one voter(s), given that any two
voters do not share the same preferences with respect to both their revealed proposals.
ConsiderfirstthescenariowhereA({t})=∅.Inthiscase,sincesc (m)=0<sc (j)
(cid:98)D (cid:98)D
for any proposal j ̸=m, it holds that there exists a proposal j′ such that win(P )=j′
D
for which sc(j′)=2, or in other words, the direct voting will lead to the election of an
outcome that is being accepted by exactly two voters. On the other hand, if t attracts
at least one voter, say i where i is odd (resp. i is even), then t must have voted in
favor of at least one proposal apart from proposal m, namely for proposal i (resp. for
proposal i−1), or else i would not fully agree with t. But then, voter i+1 (resp. i−1),
who is not attracted by t, is also voting in favor of proposal i (resp. for proposal i−1).
This results to a proposal i such that sc (i) = sc (m)+1. Therefore, the winning
(cid:98)D (cid:98)D
proposal is not m but a proposal i for which sc(i)≤2.
Theorem 2 should be interpreted as a very strong impossibility result since it
holds even in the “best-case scenario” (see the discussion in Section 1.1). A natural
follow-up question is whether some meaningful domain restriction can circumvent this
impossibility.Forthis,wewillappealtothenotionofcoherentsetsofvoters,definedin
Section2andwewillshowaboundedapproximationguaranteethatdegradessmoothly
as the size of the largest coherent set grows and as the agreement threshold decreases.
Theorem 3. For a single dRep, proxy selection admits an approximation ratio of
(cid:110) (cid:111)
min n,3n(k+2) , where S is the largest coherent set of voters in the instance.
2|S|
9
Proof. We denote by dRep0 and dRep1 the delegation representatives whose advertised
types with respect to a proposal j ∈C, are as follows:
(cid:40)
1, if j =win(P),
dRep0(j)= dRep1(j)=1, ∀j ∈C.
0, otherwise.
We begin with the following statement. It is a direct consequence of the proof of
Theorem 14, which we will present later on in the text, but will also come in handy
for the present proof. To maintain the flow of our presentation and prevent redundant
repetition of arguments, we opted to use this forward reference.
Claim 1. Consider a coherent instance of proxy selection, for any set of proposals,
and any set of voters, either 1/3 of the voters agree with dRep0 on at least half the
proposals, or the winning proposal of direct voting receives a score that is at least 1/3
of the votes.
Next, we state and prove the following lemma.
Lemma 2. In a coherent instance of proxy selection, either sc(win(P(cid:98)))≥ 2n ,
3(k+2)
or |A(dRep0)|≥ 2n . Hence, an approximation factor of 3(k+2) can be achieved.
3(k+2) 2
Proof of Lemma 2. Consider a coherent instance, and let R be the set of proposals
thatarerevealedtoallvoters.Ifwin(P)∈R,thentheoptimalproposalwillbeelected
by direct voting. Therefore, we focus on the case where win(P) ∈/ R. Suppose that
there exists a proposal j′ ∈C such that sc(j′)≥ 2n . But then the proof follows by
(cid:98) 3(k+2)
the fact that sc(win(P(cid:98)))≥s
(cid:98)
c(win(P(cid:98)))≥s
(cid:98)
c(j′). So we can also assume that for every
proposal j ∈C, it holds that sc(j)< 2n . To continue with the proof of the lemma,
(cid:98) 3(k+2)
we will need the following claim.
Claim 2. Consider a coherent instance, with R being the set of proposals commonly
revealed to all voters. Suppose also that sc(j)< 2n for any j ∈R. For any r ∈[k]
(cid:98) 3(k+2)
andforanysetS ⊆R ofr proposals,letZ(S ,dRep0)bethesetofvotersthattotally
r r
agree with dRep0 in all proposals of S . Then |Z(S ,dRep0)|≥ (k+2−r)n.
r r (k+2)
Proof of Claim 2. Wewillprovethestatementbyinductiononr.Toproveitforr =1,
we define N := |{i ∈ N : v (j) = 0}|, for any proposal j ∈ C. Then, by the fact
j (cid:98)i
that sc(j)< 2n < n it holds that N ≥n− n ≥ (k+1)n, and the induction
(cid:98) 3(k+2) (k+2) j (k+2) (k+2)
base follows. Say now that the statement holds for a fixed r′ <k and call S ⊆R an
r′
arbitrarysetofr′ proposals.WebuildasetS byaddingtoS anarbitraryproposal
r′+1 r′
j ∈R\S . Observe that by the fact that sc(j)< 2n < n , it holds that at least
r′ (cid:98) 3(k+2) (k+2)
(|Z(S ,dRep0)|− n ) voters from Z(S ,dRep0) are voting against j. Therefore,
r′ (k+2) r′
(k+2−r′)n n (k+2−(r′+1))n
|Z(S ,dRep0)|≥ − = .
r′+1
(k+2) (k+2) (k+2)
Hence, Claim 2 indeed holds.
10
We now continue with proving Lemma 2. We fix r = k and a set S ⊆ R of k
k
proposals, in Claim 2. Note that by the discussion in Section 2, we know that k ≤m
i
for every voter i, and therefore |R|≥k, so that we can choose such a set S . We then
k
apply Claim 1 for the (coherent) subinstance induced by the voters in Z(S ,dRep0)
k
and the proposals in C\S . This implies that either at least |Z(Sk,dRep0)| voters agree
k 3
with dRep0 in at least ⌈|C\Sk|⌉ proposals of C\S , or there is a proposal j ∈C\S
2 k k
with sc(j)≥ |Z(Sk,dRep0)|. Using now Claim 2, we have that |Z(Sk,dRep0)| ≥ 2n , and
(cid:98) 3 3 3(k+2)
thus the second case is infeasible, since we have assumed that sc(j)< 2n , for any
(cid:98) 3(k+2)
j ∈C. Coming to the first case, we note that the voters that agree with dRep0 in at
least ⌈|C\Sk|⌉ proposals of C \S , also agree with dRep0 in all proposals of S , by
2 k k
definition. This leads to a total agreement with dRep0 of at least
(cid:24) (cid:25) (cid:24) (cid:25) (cid:24) (cid:25)
|C\S | m−k m+k
k+ k =k+ ≥ .
2 2 2
Therefore,foreveryvoteri∈Z(S ,dRep0),itholdsthatd(i,dRep0)≤m−⌈m+k⌉=
k 2
⌊m−k⌋. This leads to |A(dRep0)| ≥ |Z(S ,dRep0)| ≥ 2n which, provided that
2 k 3(k+2)
Claim 2 holds, completes the proof of the lemma.
Applying Lemma 2 to the largest coherent set of the given instance immediately
proves the statement of the theorem.
An immediate but noteworthy corollary of Theorem 3 concerns majority agreement
and coherent instances.
Corollary 1. ForasingledRep,proxyselectionforcoherentinstancesandmajority
agreement admits an approximation ratio of 3.
Theattentivereadermighthaveobservedthatformajorityagreementandcoherent
instances, the general impossibility result of Theorem 2 does not apply. In that case,
one might wonder what the best achievable approximation ratio is. To partially answer
this question we offer the following negative result.
Theorem 4. Let ε > 0. For a single dRep, proxy selection does not admit a
(1.6−ε) approximation, even for coherent instances and majority agreement.
Proof. Consider an instance in which N = {v ,v ,...,v } and C = {c ,c ,c ,c }.
1 2 8 1 2 3 4
Suppose that v (c ) = 1 and that v (c ) = ⊥, for every voter i ∈ N. Furthermore,
i 1 (cid:98)i 1
there is exactly one voter whose preferences with respect to proposals c ,c ,c belongs
2 3 4
in {110,101,011,100,010,001} and two voters that are voting for {111}. Note that
in the current proof, for simplicity, we abuse the notation and use strings instead of
ordered tuples to indicate voters’ preferences. In this instance, opt(P)=sc(c )=8
1
andsc(j)=5,∀j ∈C\{c }.Therefore,directvotingcannotresultinanapproximation
(cid:98) 1
factor that is better than 8 = 1.6. We will prove that for any possible choice of
5
advertised ballot of a dRep, and if D ={t}, then sc(win(P(cid:98)D ))≤5, which again results
to the claimed approximation factor. Figure 1 will be helpful as an illustration of the
arguments that are going to be used.
11
Chunk 1
We now continue with proving Lemma 2. We fix r = k and a set S ⊆ R of k
k
proposals, in Claim 2. Note that by the discussion in Section 2, we know that k ≤m
i
for every voter i, and therefore |R|≥k, so that we can choose such a set S . We then
k
apply Claim 1 for the (coherent) subinstance induced by the voters in Z(S ,dRep0)
k
and the proposals in C\S . This implies that either at least |Z(Sk,dRep0)| voters agree
k 3
with dRep0 in at least ⌈|C\Sk|⌉ proposals of C\S , or there is a proposal j ∈C\S
2 k k
with sc(j)≥ |Z(Sk,dRep0)|. Using now Claim 2, we have that |Z(Sk,dRep0)| ≥ 2n , and
(cid:98) 3 3 3(k+2)
thus the second case is infeasible, since we have assumed that sc(j)< 2n , for any
(cid:98) 3(k+2)
j ∈C. Coming to the first case, we note that the voters that agree with dRep0 in at
least ⌈|C\Sk|⌉ proposals of C \S , also agree with dRep0 in all proposals of S , by
2 k k
definition. This leads to a total agreement with dRep0 of at least
(cid:24) (cid:25) (cid:24) (cid:25) (cid:24) (cid:25)
|C\S | m−k m+k
k+ k =k+ ≥ .
2 2 2
Therefore,foreveryvoteri∈Z(S ,dRep0),itholdsthatd(i,dRep0)≤m−⌈m+k⌉=
k 2
⌊m−k⌋. This leads to |A(dRep0)| ≥ |Z(S ,dRep0)| ≥ 2n which, provided that
2 k 3(k+2)
Claim 2 holds, completes the proof of the lemma.
Applying Lemma 2 to the largest coherent set of the given instance immediately
proves the statement of the theorem.
An immediate but noteworthy corollary of Theorem 3 concerns majority agreement
and coherent instances.
Corollary 1. ForasingledRep,proxyselectionforcoherentinstancesandmajority
agreement admits an approximation ratio of 3.
Theattentivereadermighthaveobservedthatformajorityagreementandcoherent
instances, the general impossibility result of Theorem 2 does not apply. In that case,
one might wonder what the best achievable approximation ratio is. To partially answer
this question we offer the following negative result.
Theorem 4. Let ε > 0. For a single dRep, proxy selection does not admit a
(1.6−ε) approximation, even for coherent instances and majority agreement.
Proof. Consider an instance in which N = {v ,v ,...,v } and C = {c ,c ,c ,c }.
1 2 8 1 2 3 4
Suppose that v (c ) = 1 and that v (c ) = ⊥, for every voter i ∈ N. Furthermore,
i 1 (cid:98)i 1
there is exactly one voter whose preferences with respect to proposals c ,c ,c belongs
2 3 4
in {110,101,011,100,010,001} and two voters that are voting for {111}. Note that
in the current proof, for simplicity, we abuse the notation and use strings instead of
ordered tuples to indicate voters’ preferences. In this instance, opt(P)=sc(c )=8
1
andsc(j)=5,∀j ∈C\{c }.Therefore,directvotingcannotresultinanapproximation
(cid:98) 1
factor that is better than 8 = 1.6. We will prove that for any possible choice of
5
advertised ballot of a dRep, and if D ={t}, then sc(win(P(cid:98)D ))≤5, which again results
to the claimed approximation factor. Figure 1 will be helpful as an illustration of the
arguments that are going to be used.
11
Fig. 1: The graph illustrates the voters’ preferences on the set of revealed proposals
in the instance created for the proof of Theorem 4. Each boxed vertex represents the
existence of a voter with the corresponding ballot, with respect to proposals c ,c ,c .
2 3 4
If a dRep advertises the ballot of a vertex v in this graph, they will attract all voters
whose preferences are within a distance of 1 from v.
• If t = 111, then A(D) equals the set of voters whose preferences belong in
{111,110,101,011}. Therefore |A(D)|=5 and hence sc (c )=5. However c is
(cid:98)D 1 2
both approved by the dRep and by a voter that doesn’t belong to A(D), consider,
e.g., the voter who is voting for 100, which leads to sc (c ) = 6 and hence to
(cid:98)D 2
a winning proposal win(P(cid:98)D ) such that s
(cid:98)
c
D
(I′) ≥ 6. Hence, win(P(cid:98)D ) ̸= c
1
, and,
consequently, sc(win(P(cid:98)D ))=5.
• If t = 110, then A(D) equals the set of voters whose preferences are {111,
110,100,010}. Therefore |A(D)| = 5 and hence sc (c ) = 5. Using the same
(cid:98)D 1
rationale to before, one can observe that, again, sc(win(P(cid:98)D ))=5. The proof is
identical for t=101 and t=011.
• Ift=100,thenA(D)equalsthesetofvoterswhosepreferencesare{100,110,101}.
Therefore |A(D)| = 3 and hence sc (c ) = 3. However c is both approved by
(cid:98)D 1 2
the dRep and by the voter whose preference vector is 111, who does not belong
to A(D), which leads to sc(win(P(cid:98)D ))=5. The proof is identical for t=010 and
t=001.
• If t = 000, then |A(D)| = 3, but sc (c ) = 4, which again leads to a winning
(cid:98)D 2
proposal of sc(win(P(cid:98)D ))=5.
We now note that if slightly relax our best-case scenario setting (see Section 1.1)
and consider arbitrary tie-breaking rules, then we can strengthen Theorem 4 to the
following impossibility result.
Proposition 5. Let ε>0. For a single dRep, proxy selection does not admit a
(2−ε)-approximationiftiesarebrokeninfavoroftheproposalwithmaximumrevealed
score.
Proof. The claim is straightforward if we consider an instance of two candidate pro-
posals, namely c and c , and two voters whose preferences are depicted in the table
1 2
below. Note that only voters’ preferences with respect to c are revealed to them.
2
12
c c
1 2
voter 1 1 1
voter 2 1 0
Obviously, opt(P)=2 and opt(P(cid:98))=1. Furthermore any dRep can attract at most
one voter and hence, for any D such that |D|=1, it holds that sc (c )≤sc (c ). In
(cid:98)D 1 (cid:98)D 2
case of ties, these are once again broken in favor of the proposal of maximum approval
score in the revealed profile, i.e. in favor of c , since sc(c )>0=sc(c ). Hence, in any
2 (cid:98) 2 (cid:98) 1
case, win(P(cid:98)D )=c
2
, and thus sc(win(P(cid:98)D ))=1, which equals 1
2
opt(P).
Proposition 6. The instance used in Proposition 5 can be generalized towards proving
that for any acceptance threshold k > 0, there exists an instance in which proxy
selection does not admit a (2−ε)-approximation, for any ε > 0, if λ ≤ 2k−1, for
arbitrary tie-breaking rules.
Proof. Say that λ = 2k−1 and consider a coherent instance, where R = R , for any
i
voter i, with n = 2λ voters and m = R+1. Given the value of k, we define m such
that k =R−1. The preferences of the voters are as follows:
• Allvotersapproveproposal1,whichistheonlyproposalthatdoesnotbelongtoR.
• Only the first λ voters approve proposal 2.
• Only the 1st and 3rd group of λ voters approve proposal 3.
2
• Only the 1st, 3rd, 5th, 7th group of λ voters approve proposal 4.
4
.
.
.
• Only voters {1,3,5,...,n−1} approve proposal m−1.
• None of the voters approve proposal m.
Clearly, the optimal proposal has an approval score of 2λ. Furthermore, ⌈R+k⌉=
2
⌈2R−1⌉ = R and hence, any dRep, can attract at most one voter, and this can be
2
done only by advertising that voter’s ballot. As a consequence, introducing any set
of dReps D, of size no more than λ, will result in sc (1)≤λ, but at the same time
(cid:98)D
sc (2) = sc(2) = λ. Breaking-ties in an adversarial manner, a proposal of intrinsic
(cid:98)D (cid:98)
approval score λ is being elected.
To understand the power of coherence towards achieving good approximations, it
is instructive to explore the limitations of the best possible dReps also on coherent
instances. To this end, we provide a couple of results: the first generalizes Theorem 2
to be parameterized by the size of the largest coherent set, and the second shows
robustness to coherent instances, as long as the agreement thresholds are sufficiently
high. The take-away message of those results is that coherent sets are not a panacea,
andcanresultinmeaningfulapproximationsonlyunderfurtherappropriateconditions.
Theorem 7. Let ε>0. For a single dRep and any k >0, proxy selection does not
admit an (cid:0) n −ε (cid:1)-approximation, where S is the largest coherent set of voters in the
|S|
instance.
Proof. Recall that in the proof of Theorem 2, we designed an instance of proxy
selection, in which every coherent set is formed by two voters, which implied
13
inapproximabilityof n.Atwhatfollows,wewillgeneralizethatconstructiontoinstances
2
with coherent sets of size up to any r >2. We create r/2 copies of each voter (more
precisely, ⌊r/2⌋ copies of each voter with an odd index and ⌈r/2⌉ copies of each voter
with an even index). Clearly, the new number of voters is now n′ =n· r. The analysis
2
from Theorem 2 directly carries over but now a proposal of score at most r gets elected
instead of proposal m which has score n′. The maximum coherent set is of size r, and
hence, the n′ inapproximability.
r
Theorem 8. Let ε>0. For a single dRep and k =m−2c(m), with c(m)∈o(m), the
approximation ratio of proxy selection is Ω(n), even for coherent instances.
Proof. Wearegoingtoconstructaninstancewhereallthevotersformasinglecoherent
set of size n. Moreover, m =m−1 for all i∈[n]. Hence, we just write c rather than
i
c(m), i.e., k =m−1−2c. We assume that m=(n−1)(c+1)+1 and n≥4 (although
any large enough m could be used). Further, in the created instance we have that
• for every voter i∈[n], her preferences with respect to proposal m are as follows:
v (m)=1 and v (m)=⊥,
i (cid:98)i
• voteri∈[n−1]approvesproposals(i−1)(c+1)+1,(i−1)(c+1)+2,...,i(c+1)and
disapproves everything else (except for m, of course, but this does not belong to
R , and the preference of voter n with respect to any proposal j satisfies v (j)=1.
i (cid:98)i
Clearly, the optimal solution satisfies n voters. Similarly to the proof of Theorem 7,
we claim that a single dRep, t, regardless of their advertised type, will contribute to
electing a proposal approved by at most two voters. Towards this, we begin by showing
that t cannot attract more than one voter.
First note that, for i,j ∈[n−1], we have H(v ,v )=2c+2. Also, for i∈[n−1],
(cid:98)i (cid:98)j
we have H(v ,v )=m−1−(c+1)≥3(c+1)+1−1−c−1=2c+2, where we used
(cid:98)i (cid:98)n
both m=(n−1)(c+1)+1 and n≥4 for the inequality. Now, suppose that t attracts
at least two distinct voters, say i,j ∈[n]. Then, by the triangle inequality, we have
(cid:22) (cid:23)
(m−1)−(m−1−2c)
2c+2≤H(v ,v )≤d(i,t)+d(j,t)≤2 =2c,
(cid:98)i (cid:98)j 2
which is a contradiction. We conclude that t may attract at most one voter.
If t does not attract any voters, then all voters vote directly and this leads to the
election of a proposal approved by exactly two voters. On the other hand, if t attracts
one voter, say i, we claim that t must have voted in favor of at least one proposal other
than m. Indeed, if t(j)=0 for every proposal j (except maybe for proposal m), then
(cid:106) (cid:107)
d(i,t) would be at least c+1> (m−1)−(m−1−2c) and they would not have attracted
2
voter i. As a result, a proposal of total approval 2 in the intrinsic profile is elected,
instead of proposal m, and the n/2 inapproximability follows.
We conclude our discussion for the case of one dRep with a complementary result
of a computational nature: a theorem that establishes that finding a dRep to attract
voters in a way that ultimately elects the optimal proposal is computationally hard.
Consequently, proxy selection turns out to be challenging both from the standpoint
of information theory and computational complexity.
14
m proposals c c c
m+1 m+2 m+3
n voters [···] 1 0 0
2 special dummy voters 0 1 1 1
n−1 dummy voters 0 0 1 1
Fig. 2: The instance created in the proof of Theorem 9. Light gray cells correspond
to preferences that are not revealed to the voters and dark gray cells correspond to
preferences that are derived from instance I of mav.
Theorem 9. The decision variant of proxy selection is NP-hard, even for majority
agreement and a single dRep.
Proof. We will establish NP-hardness for the decision version of proxy selection,
where for some parameter r, we want to answer if there exists an advertised type for a
dRep, so that the intrinsic score of the elected outcome is at least r. We will reduce
from the problem minimax approval voting (mav) problem, which is a known
NP-hard problem in voting theory. In mav, we are given an instance I of m binary
proposals and n ballots where v ∈{0,1}m,i∈[n] and we are asked for a vector v for
i
which it holds that max H(v ,v)≤θ, where H is the Hamming distance between
i∈[n] i
two vectors of the same size. For the hardness of mav we refer to LeGrand et al (2007);
Frances and Litman (1997). We note that the NP-hardness has been established for
instances with m being even, and θ = m/2. Given such an instance, we create an
instance I′ of proxy selection as follows:
• We have m′ =m+3 binary candidate proposals: {c ,...,c ,c ,c ,c },
1 m m+1 m+2 m+3
i.e., three additional proposals from I.
• We have n voters corresponding to the voters of I, and an additional number of
n+1 dummy voters, for a total of 2n+1 voters.
• Foreveryvoteri∈[n],belongingtothegroupofthefirstnvoters,theirpreferences
for the first m proposals in I′ are just as they are in I, and they are all revealed,
so that m =m. The remaining three proposals are not visible for these voters
i
and their intrinsic preferences are that v (c )=1, v (c )=v (c )=0.
i m+1 i m+2 i m+3
• For the dummy voters, none of them approve the first m proposals, which are
also not revealed to them. As for the last three proposals, there are exactly two
dummy voters, who will be referred to as the special dummy voters, who approve
all three proposals, and all three are revealed to them. All the remaining n−1
dummy voters approve only the proposals c and c , which are revealed to
m+2 m+3
them, whereas c is disapproved, and also not revealed to them.
m+1
• We set r =n+2 and λ=1, i.e. we have only one dRep available. Hence we are
looking for an advertised type of the dRep, so that the instrinsic score of the
elected outcome is at least n+2.
An illustrative exposition of voters’ ballots can be found in Fig. 2. Before we
proceed, note that the only proposal that has an intrinsic score of n+2 is the proposal
c ,whilealltheothershavelowerscores.Butc cannotbeelectedviaonlydirect
m+1 m+1
15
voting, since it is not revealed to the first n voters. Hence, the question is whether
there exists an advertised type for the dRep that can make c elected.
m+1
For the forward direction, suppose that I is a YES-instance of mav. Then there
exists a vector v ∈{0,1}m for which it holds that max H(v ,v)≤m/2. Consider
i∈[n] i
now that the dRep advertises the type t=(v,1,0,0). The dRep will attract the first n
voters, since they agree with them in at least m/2 of their revealed proposals. She will
not attract any of the dummy voters, all of which disagree with them on two proposals.
Therefore, the dRep will have a weight of n, and since the dummy voters vote directly,
the proposal c will collect a score of n+2 and will be the winner of the election.
m+1
For the reverse direction, suppose that I is a NO-instance of mav. Then for any
possible vector v ∈{0,1}m, it holds that there exists at least one voter i∗ ∈[n], such
that H(v ,v)>m/2. Fix now such an arbitrary vector v ∈{0,1}m. We will consider
i∗
all possible cases for the advertised type of the dRep, t, and show that c cannot
m+1
get elected, i.e, I′ is a NO-instance. We use a natural tie-breaking rule that resolves
any tie in favor of the proposal of maximum approval score in the revealed profile.
Therefore, in all the cases below, any tie that involves proposals c ,c ,c is
m+1 m+2 m+3
broken against c due to the fact that sc(c )<sc(c )=sc(c ).
m+1 (cid:98) m+1 (cid:98) m+2 (cid:98) m+3
• Suppose that t=(v,0,x,y), for any x,y ∈{0,1}. It is easy to see that since the
dRepdoesnotapprovec ,itisnotpossiblethatthisproposalwinstheelection.
m+1
• Suppose that t = (v,1,0,0). In this case, the dRep does not attract any of the
dummy voters. Hence, the proposals c and c receive a score of n+1. As
m+2 m+3
for proposal c , it is crucial to note that since I is a NO-instance of mav, t
m+1
can attract at most n−1 of the first n voters. Therefore together with the two
special dummy voters, the proposal c will have a score of at most n+1. By
m+1
the tie-breaking rule that we have assumed, this means that the winner of the
election will be either c or c .
m+2 m+3
• Suppose that t = (v,1,1,0), or t = (v,1,1,1), or t = (v,1,0,1). WLOG, we
analyze the former. In this case, the dRep attracts all dummy voters. Hence, the
dRep has a weight of at least n+1 and possibly more by some of the first n
voters who delegate to her. Hence, the winner will either be some proposal that is
approved by v and also has the highest approval rate among the voters who did
not delegate to dRep, or there is a tie among all the proposals approved by t. By
the tie-breaking rule, it is not possible that c is elected.
m+1
We have established that no matter what the dRep advertises, it is impossible that
c is elected, and hence I′ is a NO-instance, which concludes the proof.
m+1
Importantly, we note that all of the approximation guarantees in our paper can be
obtained in polynomial time.
3.1 Multiple dReps
WenowturnourattentiontothecaseofmultipledReps(λ≥2),asthisisnotcaptured
by the stark impossibility of Theorem 2. Is it perhaps possible to achieve much better
approximations by using sufficiently many dReps? A reinforcing observation is that for
majority agreement, 2 dReps actually suffice to elect the optimal proposal.
16
Theorem10. Whenλ=2,proxyselectionformajorityagreementcanbeoptimally
solved.
Proof. We denote by dRep0 and dRep1 the delegation representatives whose advertised
typeswithrespecttoaproposalj ∈C areasfollows:dRep0(j)=0,dRep1(j)=1, ∀j ∈
C. Recall that dRep0 differs from dRep0 used e.g. in the proof of Theorem 3. We will
make use of the following lemma, the proof of which is immediate and based on the
fact that any voter’s ballot either contains at least as many zeros as ones or more ones
than zeros.
Lemma 3. In an instance of proxy selection, with k =0, each voter belongs either
to A(dRep0) or to A(dRep1) (or both).
A direct consequence of Lemma 3 is that if D ={dRep0,dRep1} then A(D)=N.
Hence, if λ=2, we can retrieve the optimal solution. This is because each proposal
will garner precisely as much support as win(P) since no delegate representative will
vote in favor of a proposal without concurrently voting in favor of win(P). Therefore,
win(P) will be elected, according to the tie-breaking rule.
But what about instances in which voters are more discerning, indicated by larger
values of k? Whether good approximations with multiple delegation representatives
are achievable in general is still to be determined. We begin with coherent instances
and we provide the following generalization of Theorem 10 specifically for this case.
Theorem 11. When λ = 2k+1, proxy selection for coherent instances and any
k ≥0 can be optimally solved.
Proof. Let R be the set of commonly revealed proposals to the voters of the given
instance. It is without loss of generality here to assume that win(P)∈/ R, or otherwise,
the direct voting would result in the election of win(P). To create a set of dReps D,
we fix an arbitrary set S ⊆ R, of k proposals, and for every possible binary vector
k
on S
k
, i.e., for every σ ∈2Sk , we add to D exactly two dReps, namely t
σ,0
and t
σ,1
,
advertising the following, with respect to a proposal j:
σ(j), if j ∈S k , (cid:26) σ(j), if j ∈S ,
t σ,0 (j)= 1, if j =opt(P), t σ,1 (j)= k
1, otherwise.
0, otherwise.
Toprovethestatement,itissufficient,toshowthatA(D)=N,i.e.,thatD canattract
all voters from N. We fix an arbitrary voter i∈N. Definitely, there is a vector, say σ′,
that defines a pair of dReps in D, say t and t (henceforth denoted by t and t ),
σ′,0 σ′,1 1 2
such that i totally agrees in all proposals of S both with t and t . Formally, if for a
k 1 2
given vector x and a set of proposals Y, we denote by x the projection of x to the
|Y
proposals in Y, the following holds:
max{H(v ,t ),H(v ,t }=0 (1)
(cid:98)i|Sk 1|Sk (cid:98)i|Sk 2|Sk
Let R′ :=R \S =R\S and for z ∈{0,1} we define R′(z):=|{j ∈R′ :v =z}|.
i i k k i i (cid:98)i
17
Then, either R′(0)≥R′(1), or R′(1)>R′(0). Therefore,
i i i i
(cid:22) |R′| (cid:23)
min{H(v ,t ),H(v ,t }≤ i . (2)
(cid:98)i|R i ′ 1|R i ′ (cid:98)i|R i ′ 2|R i ′ 2
Combining Equations (1) and (2), we have that voter i agrees either with t or with
1
t , in at least
2
(cid:24) |R′| (cid:25) (cid:24) m −k (cid:25) (cid:24) m +k (cid:25)
k+ i =k+ i ≥ i
2 2 2
proposals. Consequently, every voter will delegate to a dRep from D and the optimal
proposal will be elected.
Theorem 11 provides a bound on the sufficient number of dReps required to make
surethattheoptimalproposaliselectedandraisesaquestionregardingpositiveresults
(both optimal and approximate) for not-necessarily-coherent instances. In Theorem 13
below we provide a generalized and more refined version of this result, that relates the
achievable approximation with the required number of dReps and the parameters of
the instance, but does not need to assume that the instances are coherent.
3.2 Beyond Coherent Instances
Coherence has been proven to be very useful towards achieving meaningful approxima-
tion guarantees. At what follows, we define a more refined notion, namely a quantified
version of it, which provides further insights into the structure of instances and how
these affect the achievable approximations. In particular, we use the notion of (x,δ)-
coherent sets for sets of voters that have a common set of proposals of size x in their
revealed sets, as well as at most δ additional proposals.
Definition 3. A set of voters N′ ⊆ N is called (x,δ)-coherent if there exists a set
X ⊆C suchthatforeveryi∈N thefollowinghold:X ⊆R ,|X|≥x,and|R \X|≤δ.
i i
Using Definition 3, we generalize the result of Theorem 3, with an additional loss in
the factor that is dependent on the type of (x,δ)-coherent sets that an instance admits.
Theorem 12. For a single dRep and for any δ>0, proxy selection admits an
(cid:110) (cid:111)
approximation ratio of min n,3n(k+δ+2) , where S is the largest (k+δ,δ)-coherent
2|S|
set in the instance.
Proof. By electing any proposal, one can straightforwardly obtain an approximation
factor of n. For the remaining of the proof, say that 2n(k+δ+2) < n. Additionally,
2|S|
for now and for ease of exposition, suppose that for every voter i∈N, it holds that
v (win(P)) = 1, v (win(P)) = ⊥. At the end of the proof we will show that this
i (cid:98)i
assumption can be dropped. For the studied case, we will prove an approximation ratio
of 3n(k+δ+2).
2|S|
The proof follows the same rationale as the proof of Theorem 3, which was based
on applying Lemma 2 to the largest coherent set of the instance. Similarly, to prove
18
Theorem 12, it suffices to prove an analogous lemma, that we state below. Recall that
(cid:40)
1, if j =win(P),
dRep0(j)= dRep1(j)=1, ∀j ∈C.
0, otherwise.
Lemma 4. In an instance of proxy selection, with N being (k+δ,δ)-coherent, it
either holds that sc(win(P(cid:98)))≥ 2n or |A(dRep0)|≥ 2n .
3(k+δ+2) 3(k+δ+2)
Proof. Suppose that there exists a proposal j′ ∈C such that sc(j′)≥ 2n . Then
(cid:98) 3(k+δ+2)
theprooffollowsbythefactthatsc(win(P(cid:98)))≥s
(cid:98)
c(win(P(cid:98)))≥s
(cid:98)
c(j′).Sowecanassume
that for every proposal j ∈C, it holds that sc(j)< 2n . Next, we state a claim
(cid:98) 3(k+δ+2)
whichisageneralizationofClaim2.ItsproofisalmostidenticaltotheproofofClaim2.
Claim 3. Consider an instance where the set of voters N forms a (k+δ,δ)-coherent
set,forsomeδ ≥0.Supposealsothatsc(j)< 2n ,foranyj ∈C.LetX betheset
(cid:98) 3(k+δ+2)
of commonly revealed proposals to all voters, which by definition satisfies |X|≥k+δ.
For any set S ⊆X of r proposals, with r ∈[k+δ], let Z(S ,dRep0) be as defined in
r r
Claim 2. Then |Z(S ,dRep0)|≥ (k+2+δ−r)n.
r (k+δ+2)
We proceed with proving Lemma 4, using Claim 3. Fix r =k+δ in Claim 3, and
let S ⊆X be a set of k+δ proposals. Let also C′ =X\S . We examine the number
r r
of voters who will eventually delegate to dRep0.
We observe that every proposal of C′ belongs to the revealed set of any voter
of Z(S ,dRep0). Therefore, we can apply Corollary 1 for Z(S ,dRep0) and C′. This
r r
implies that either there exists a proposal in C′ approved by Z(Sr,dRep0) voters or at
3
least Z(Sr,dRep0) voters agree with dRep0 on at least ⌈|C′|⌉ proposals. By Claim 3, we
3 2
know that |Z(S ,dRep0)|≥ 2n . Therefore, by the assumption we have made, it is
r k+2+δ
notpossibletohaveaproposalwithascoreof |Z(Sr,dRep0)|,hence,atleast 2·|Z(Sr,dRep0)|
3 3(k+δ+2)
voters of Z(S ,dRep0) agree with dRep0 in at least ⌈|C′|⌉ proposals from C′. But, by
r 2
the definition of Z(S ,dRep0), they also agree on the k+δ proposals of S . In total,
r r
their agreement with dRep0 is at least:
(cid:24) |C′| (cid:25) (cid:24) m −k−δ−δ (cid:25) (cid:24) m +k (cid:25)
k+δ+ ≥k+δ+ i ≥ i ,
2 2 2
wherethefirstinequalityholdsduetothefactthat|C′|=|X|−(k+δ),andm ≤|X|+δ.
i
As a consequence all these voters, i.e., at least 2n in number, will delegate to
3(k+δ+2)
dRep0.
Hence, Lemma 4 holds and Theorem 12 follows, under the assumption that all
voters secretly approve the optimal proposal. Suppose that in a given instance the
assumption doesn’t hold. Focusing on the set of voters, called N′, that satisfy the
assumption and applying the described procedure, one can prove that either there is a
proposal of score at least 2|N′| or dRep0 attracts at least 2|N′| voters. Given
3(k+δ+2) 3(k+δ+2)
that sc(win(P))=|N′|, an approximation ratio of 3(k+δ+2) is implied.
2
19
To conclude the proof, we note that the desired approximation ratio holds true by
focusing on the set S, just like in Theorem 3.
ThemainresultofthesubsectionisarelaxationofTheorem11.UnlikeTheorem11,
thetheoremdoesnotrequireanystructuralassumptions,andrelatestheapproximation
with the number of dReps, the threshold bound and the structure of the instance in
terms of approximate coherence.
Theorem13. Whenλ=min (cid:8) n,ζ2k+1(cid:9),proxyselectionadmitsanapproximation
ratio of γ , where γ is the minimum number of (k,m−k)-coherent sets that can form
3ζ
a partition of N, and ζ ≤γ with ζ ∈N.
Proof. For ease of exposition we assume that for every voter i ∈ N, it holds that
v (win(P))=1, v (win(P))=⊥. We note that this assumption is without any loss in
i (cid:98)i
the approximation factor, likewise in the proof of Theorem 12 in which we proved that
focusing only on the set of voters that are secretly in favor of the optimal proposal
does not affect the approximation ratio.
Trivially, if we set D to consist of one dRep t for every voter v such that
i i
(cid:40)
1,if j =win(P)
t (j)=
i v (j),otherwise
i
thenA(D)=N andhencewin(P(cid:98)D )=win(P).Therefore,withndRepswecanretrieve
the optimal solution and, as a consequence, the claimed approximation factor holds.
We will now proceed with proving that whenever ζ2k+1 <n, we can use a set of dReps
D, where |D|≤ζ2k+1, to elect a proposal j such that sc(j)≥ 3ζopt(P).
γ
We proceed with the following lemma, the proof of which is analogous to the proof
of Theorem 11.
Lemma 5. In an instance of proxy selection, in which N can be partitioned in
at most γ (k,m−k)-coherent sets, proxy selection can be optimally solved if
λ=γ2k+1.
Using 2k+1 dReps for any of the sets described in the statement of the theorem,
as indicated by Lemma 5, we can create a set of dReps D such that |D|≤γ2k+1 and
A(D) = N. This proves the statement of the theorem for ζ = γ. Fix now a number
ζ <γ. Among the γ sets produced by Lemma 5, let us focus on the ζ sets of largest
size. Let also N′ ⊆N be the voters in these sets. Then we can create a set of dReps
D′ ⊆D, such that |D′|≤ζ2k+1 and A(D′)⊇N′. But then, |A(D′)|≥|N′|.
Although all dReps of D′ vote in favor of win(P) and this can indeed be the
winning proposal of profile P , it may also be the case that win(P )̸=win(P). In
D′ D′
fact, if some dReps from D′ that are voting in favor of win(P ), attract voters that
D′
vote against win(P ), the intrinsic score of win(P ) may significantly differ from
D′ D′
sc(win(P )). To avoid this behaviour we suggest to create a new set of dReps, say
D′
D′′, by deleting from D′ all dReps of the form t , i.e., all dReps that are voting in
σ,1
favor of proposals that do not belong in S , as defined in the proof of Theorem 11.
k
Interestingly |D′′|= |D′|. We will now compute |A(D′′)|.
2
20
Suppose first that the number of voters attracted by dReps in D′\D′′ is at most
2|N′|. Then, |A(D′′)|≥|N′|− 2|N′| = |N′|.
3 3 3
Before continuing we define α:=|∪
i∈N(cid:101)
R
i
| and β :=min{|R
i
|,i∈N(cid:101)}, where N(cid:101) is
anyoftheγ (k,m-k)-coherentsetsofthevoters’partition,asdescribedinthestatement
of the Theorem.
We call N the number of voters, attracted by dReps in D′ \D′′ and say that
1
N ≥ 2|N′|. But then, for every voter i∈N it holds that R (1)≥ |Ri| ≥ β. But then,
1 3 1 i 2 2
the total number of approvals in P(cid:98) for proposals in C \S
k
, equals (cid:80)
i∈N1
R
i
(1) ≥
|N |β ≥ 2|N′|β = |N′|β.Alltheseapprovalsarespreadbetweenαproposals.Therefore,
1 2 3 2 3
there exists a proposal j ∈R for which sc(j)≥ |N′|β.
(cid:98) 3α
Consequently, either there is a proposal approved by |N′|β voters or there exists a
3α
set of dReps D′′ that can attract |N′|β voters without voting in favor of a proposal
3α
disapproved by a voter in A(D′′). Using the fact that |N′|≥ ζn≥ ζopt(P), we have
γ γ
an approximation ratio of γ · β . Obviously, if every of the considered ζ, out of the γ
ζ 3α
(k,m-k)-coherent sets, is coherent, then an approximation ratio of γ is implied.
3ζ
An interesting corollary of Theorem 13 (which could also be deduced from the
proof of Theorem 11) is the following: When aiming for an optimal solution with 2k+1
dReps, it’s not a necessity for instances to be coherent; rather, the key factor is the
existence of a set of k proposals commonly revealed to all voters.
We conclude the discussion on generalizations with the following theorem, for the
case of majority agreement (k = 0), which generalizes Corollary 1 by relating the
achievable approximation to the structure of the revealed sets, once again without the
requirement of coherence.
Theorem 14. ForasingledRep,proxy selectionformajorityagreementadmitsan
approximation ratio of min (cid:8) n,3α(cid:9), where α:=|∪ R | and β :=min{|R |,i∈N}.
β i∈N i i
Proof. It is apparent that any instance of proxy selection, is approximable within a
factor of n, since it holds that electing arbitrarily a proposal j ∈C results in a winning
proposal that satisfies sc(j)≥1≥ 1opt(P). Therefore suppose that min{n,3α}= 3α.
n β β
We proceed by showing that there exists a ballot type, the advertisement of which,
results to an attraction of at least half of the voters that vote in favor of the optimal
proposal. Recall that
(cid:40)
1, if j =win(P),
dRep0(j)= dRep1(j)=1, ∀j ∈C.
0, otherwise.
Note that dRep0 and dRep1 are not the “all-1-s” and the “all-0-s” dReps, but rather
they both vote in favor for the optimal proposal. Therefore, the proof of the lemma,
although not involved, is not immediate, in contrast to the proof of Lemma 3.
Lemma 6. In an instance of proxy selection, with k = 0, it holds that
max{|A(dRep0)|,|A(dRep1)|}≥ |N′|, where N′ ={i∈N :v (win(P))=1}.
2 i
21
Chunk 2
Suppose first that the number of voters attracted by dReps in D′\D′′ is at most
2|N′|. Then, |A(D′′)|≥|N′|− 2|N′| = |N′|.
3 3 3
Before continuing we define α:=|∪
i∈N(cid:101)
R
i
| and β :=min{|R
i
|,i∈N(cid:101)}, where N(cid:101) is
anyoftheγ (k,m-k)-coherentsetsofthevoters’partition,asdescribedinthestatement
of the Theorem.
We call N the number of voters, attracted by dReps in D′ \D′′ and say that
1
N ≥ 2|N′|. But then, for every voter i∈N it holds that R (1)≥ |Ri| ≥ β. But then,
1 3 1 i 2 2
the total number of approvals in P(cid:98) for proposals in C \S
k
, equals (cid:80)
i∈N1
R
i
(1) ≥
|N |β ≥ 2|N′|β = |N′|β.Alltheseapprovalsarespreadbetweenαproposals.Therefore,
1 2 3 2 3
there exists a proposal j ∈R for which sc(j)≥ |N′|β.
(cid:98) 3α
Consequently, either there is a proposal approved by |N′|β voters or there exists a
3α
set of dReps D′′ that can attract |N′|β voters without voting in favor of a proposal
3α
disapproved by a voter in A(D′′). Using the fact that |N′|≥ ζn≥ ζopt(P), we have
γ γ
an approximation ratio of γ · β . Obviously, if every of the considered ζ, out of the γ
ζ 3α
(k,m-k)-coherent sets, is coherent, then an approximation ratio of γ is implied.
3ζ
An interesting corollary of Theorem 13 (which could also be deduced from the
proof of Theorem 11) is the following: When aiming for an optimal solution with 2k+1
dReps, it’s not a necessity for instances to be coherent; rather, the key factor is the
existence of a set of k proposals commonly revealed to all voters.
We conclude the discussion on generalizations with the following theorem, for the
case of majority agreement (k = 0), which generalizes Corollary 1 by relating the
achievable approximation to the structure of the revealed sets, once again without the
requirement of coherence.
Theorem 14. ForasingledRep,proxy selectionformajorityagreementadmitsan
approximation ratio of min (cid:8) n,3α(cid:9), where α:=|∪ R | and β :=min{|R |,i∈N}.
β i∈N i i
Proof. It is apparent that any instance of proxy selection, is approximable within a
factor of n, since it holds that electing arbitrarily a proposal j ∈C results in a winning
proposal that satisfies sc(j)≥1≥ 1opt(P). Therefore suppose that min{n,3α}= 3α.
n β β
We proceed by showing that there exists a ballot type, the advertisement of which,
results to an attraction of at least half of the voters that vote in favor of the optimal
proposal. Recall that
(cid:40)
1, if j =win(P),
dRep0(j)= dRep1(j)=1, ∀j ∈C.
0, otherwise.
Note that dRep0 and dRep1 are not the “all-1-s” and the “all-0-s” dReps, but rather
they both vote in favor for the optimal proposal. Therefore, the proof of the lemma,
although not involved, is not immediate, in contrast to the proof of Lemma 3.
Lemma 6. In an instance of proxy selection, with k = 0, it holds that
max{|A(dRep0)|,|A(dRep1)|}≥ |N′|, where N′ ={i∈N :v (win(P))=1}.
2 i
21
Proof of Lemma 6. We will prove that any arbitrary voter i∈N′ belongs to at least
one of A(dRep0) and A(dRep1). We focus on the proposals R′ =R \{win(P)}. Let
i i
R′(0) = |{j ∈ R′ : v (j) = 0}|, and similarly, R′(1) = |{j ∈ R′ : v (j) = 1}|. Then,
i i (cid:98)i i i (cid:98)i
trivially, it holds that either R′(0) ≥ R′(1) or R′(1) > R′(0). If win(P) ∈ R , voter
i i i i i
i agrees with both dRep0 and dRep1 on win(P), whereas if win(P) ̸∈ R , by the
i
definition of the distance function, win(P) does not affect the distance of voter i from
any dRep. Hence in both cases, proposal win(P) does not contribute to the distance of
i from the two dReps, and therefore,
(cid:22) |R′| (cid:23) (cid:22) |R | (cid:23)
min{d(i,dRep0),d(i,dRep1)}≤0+ i ≤ i ,
2 2
wheretheminimumisachievedbydRep0ifR′(0)≥R′(1),andbydRep1otherwise.
i i
From Lemma 6 it holds that either dRep0 or dRep1 can attract at least half of the
voters in N′. If this holds for D ={dRep0}, we are done, because either the optimal
proposal wins the election, via dRep0, or another proposal wins through the voters
who vote directly, in which case the intrinsic score of the winning proposal j is at least
sc (j)≥A(D)≥ |N′| ≥ |N′| ≥ |N′|β = β opt(P).
(cid:98)D 2 3 3α 3α
Otherwise, if dRep1 attracts at least half of the voters in N′, say that all voters
from a set N ⊆ N are being attracted by dRep1. Then, for every voter i ∈ N , let
1 1
R (0) = |{j ∈ R : v = 0}| and R (1) = |{j ∈ R : v = 1}|. It should hold that
i i (cid:98)i i i (cid:98)i
R (1)>R (0), due to the fact that i∈A(dRep1). We distinguish two cases:
i i
• Suppose that |N |≥ 2|N′|, For every voter i∈N it holds that R (1)≥ |Ri| ≥ β.
1 3 1 i 2 2
But then, the total number of approvals in P(cid:98) equals (cid:80)
i∈N1
R
i
(1) ≥ |N
1
|β
2
≥
2|N′|β = |N′|β. All these approvals are spread between α proposals. Therefore,
3 2 3
there exists a proposal j ∈R for which sc(j)≥ |N′|β.
(cid:98) 3α
• Suppose now that |N | < 2|N′|. But in this case, the set of voters in N′ \N
1 3 1
should be attracted by dRep0, again by Lemma 6. In total, these are at least
|N′|− 2|N′| = |N′|. Therefore, if D ={dRep0}, the delegation representative will
3 3
vote with a weight of at least |N′|, only in favor of win(P), which would result
3
in sc (win(P)) ≥ |N′|. Then, either win(P) will win the election achieving an
(cid:98)D 3
optimal solution, or a proposal j ∈ C \{win(P)} will win. In the later case, it
should hold that sc (j) ≥ sc (win(P)) ≥ |N′| ≥ |N′|β. However, dRep0(j) = 0
(cid:98)D (cid:98)D 3 3α
and hence sc (j)=sc(j).
(cid:98)D (cid:98)
We proved that whenever win(P(cid:98)D )̸=win(P), there exists a proposal j ∈C such that
sc(j)≥ |N′|β ≥ β ·opt(P). The fact that sc(j)≥sc(j), concludes the proof.
(cid:98) 3α 3α (cid:98)
4 Experiments
We complement our theoretical results with experiments on the performance of voting
with proxies on realistic data sets where voters exhibit incomplete preferences. We
highlight the effect that the number of dReps, revealed set sizes and thresholds have
22
on the total number of voters who delegate instead of voting directly, as well as the
approximation compared to the optimal outcome.
The first hurdle to overcome is that, by definition, it is not possible to find datasets
containing both revealed and intrinsic preferences, since voters only submit the first of
the two. We use the MovieLens dataset (Harper and Konstan, 2015) to circumvent
this issue, as there is enough contextual information to calculate plausible intrinsic
preferences given the revealed ones. This set contains the reviews (with scores ranging
from 0.5 to 5.0) given by 162.541 users to 62.423 movies. Of course, not every user has
reviewed every movie. Each movie is characterized by a set of genres and also has a
relevancescorefor1.084differenttags,providedbytheusers.Usingthisinformationwe
calculate a plausible intrinsic user-specific “rating” Rat for any non-reviewed movie i:
i
(cid:80)
Rat ·sim(i,j)
Rat = j∈R j ,
i (cid:80) sim(i,j)
j∈R
where sim is a similarity metric (taking into account tag and genre relevance) and R
is the set of movies reviewed by the user, hence Rat is the rating that user gave to
j
moviej.Notethatthesemovieratingsarebetween0.5and5.0,andarethenconverted
to approval preferences by comparing them with the average rating given by that user.
To calculate sim(·,·) we use two vectors per movie i:
• The tag vector: tag ∈[0,1]1084.
i
• The genre vector: genre ∈{0,1}20.
i
The actual similarity is given by
(cid:32) (cid:33)
genre ·genre
sim(i,j)=1.2∥tag i −tag j ∥· 0.5+ i j .
∥genre ∥·∥genre ∥
i j
In addition, we take a random sample of users and movies such that each user has
reviewed at least 5% of the movies and no movie has been reviewed by more than 10%
of the users. Specifically, we begin by sampling a large subset of 13000 users and 150
movies, and filter out the users that have reviewed fewer than 5% of those movies, as
well as the movies that have been reviewed by more than 10% of the users. This makes
the effect of delegation (and using multiple dReps in particular) more pronounced, as
there are few completely uninformed users that would readily delegate and no clear
pick for a best movie. To enable meaningful comparisons, we add a movie that has not
been reviewed by any user, but is approved by all in their intrinsic ratings.
Even though in theory finding the optimal set of dReps is computationally
intractable (see Theorem 9), we use a greedy heuristic, which works well in practice.
Specifically, we can build the type of a dRep t incrementally, proposal by proposal,
settingt(j)=1ort(j)=0dependingonwhichattractsthemostvoters,assumingthat
the voters’ revealed sets only include proposals up to proposal j. We can repeat this
procedure to create multiple dReps, removing the users that have already delegated at
the end of every iteration.
We first measure the fraction of users that opted to delegate, as a function of the
numberofdRepsandeithertheirk (forfixedm )(Figure3,left)ortheirm (Figure3,
i i i
23
1.00
0.75
0.50
0.25
0 10 20 30
NumberofdReps
etaRnoitageleD
1.0
0.8
k =0.0·m m =1.0·m
i i i
k i =0.2·m i 0.6 m i =0.75·m
k =0.4·m m =0.5·m
i i i
k =0.6·m 0.4 m =0.25·m
i i i
k =0.8·m m =0.1·m
i i i
0.2
0 10 20 30
NumberofdReps
Fig. 3: The fraction of voters that chose to delegate as a function of the total number
of dReps.
1.4
1.2
1.0
0 10 20 30
NumberofdReps
oitaRnoitamixorppA
1.6
k =0.5·m m =1.0·m
i i i
k =0.6·m m =0.75·m
i i i 1.4
k =0.7·m m =0.5·m
i i i
k =0.8·m m =0.25·m
i i i
k =0.9·m 1.2 m =0.1·m
i i i
1.0
0 10 20 30
NumberofdReps
Fig. 4: The quality of the approximation as a function of the total number of dReps.
right). In the first case each k ranges from 0 up to 0.4m , indicating users increasing
i i
levels of user agreement required before delegating. In the second, each m ranges from
i
0 to 0.8m, capturing increasingly detailed revealed user preferences (while keeping
k = 0.2m ). To obtain these m s we start from the calculated intrinsic preferences
i i i
(of size m) and we then “hide” some coordinates, uniformly at random, yielding the
sparserrevealedpreferences.Ourresultsshowthatitiseasiertoattractdelegationfrom
voterswithsmallerrevealedsets(leadingtosmallercoherentsets)orwithlowerk ’s.It
i
turns out, that this also does indeed translate to better approximations of the optimal
approval score (see Figure 4; created similarly to Figure 3). These results suggest that,
interestingly, the case of coherent instances studied in many of our theoretical results
seems to be the most challenging in practice.
Notice that while the graphs in Figures 3 and 4 are qualitatively similar, there are
certain important differences. Specifically, the clearly diminishing returns structure
observed when the only objective is to attract delegation in Figure 3 is not present in
Figure4.Thisisbecausewhileitgetsprogressivelymoredifficultto“cluster” voters,the
24
quality of the outcomes increases in ever bigger jumps: at the very top, the difference
between the best and second-best proposals (movies) will be greater than the second-
best and third-best and so on, until their quality plateaus. This is also why the “jumps”
in the approximation are steeper in some regions: a certain number of dReps does not
have any effect (because even though they attract delegation, they cannot change the
winner), but then suddenly this changes.
Each run is repeated 20 times for a more accurate empirical mean and standard
deviation.Allexperimentsranona2021M1ProMacBookwith16gigabytesofmemory
and 10 high-performance cores. The code (available at https://www.plazos.me/code/
DelegatedVoting/ is parallelized and requires a few hours to complete, including some
initial preprocessing of the data.
5 Discussion and Future Directions
In this paper, we proposed and studied a model for proxy voting where the (less-
informed) voters delegate their votes to the (fully-informed) proxies (dReps), once a
certain agreement between their ballots is reached, or they vote directly otherwise. Our
findings encompass essential insights into comprehending what is possible (potential)
and what is not (limitations) in this setting. By identifying structural properties
and other restrictions, we managed to escape the strong impossibilities that we had
established.
The upper and lower bounds presented in our work are not always tight, and
future work could focus on sharpening these bounds. Perhaps more interesting is the
migration from the “best-case scenario” setting that we study in this work. This would
most probably entail the following two components:
- An information model for the dReps. It would be reasonable to assume that each
delegate representative is correctly-informed about each voter i’s approval preference
ofeachproposaljwithsomeprobabilityp ,orthattheyare(perfectlyorimperfectly)
ij
informed about a randomly-chosen set of proposals for each voter.
- A rationality model for the dReps.Delegaterepresentativesmightnothaveanyincen-
tivestocoordinatetowardsthesocially-desirableoutcome,andtheywouldneedtobe
properly incentivized to do that, e.g., via the form of payments. Additionally, dReps
could even have their own preferences regarding the proposals under consideration.
One could think of many other examples or refinements of the above, and the
appropriate choice of information/rationality model for the dReps would depend on
the application at hand. Tie-breaking rules that do not necessarily favor the optimal
proposal also worth studying. Regardless of these choices however, the results of the
“best-casescenario” shouldbethestartingpointofanyinvestigationintothosesettings.
Besidesthoseextensions,otherdirectionsthatweseeaspromisingroutesforfurther
researchonthetopicincludedifferentdistancemetrics,differentvotingrules,themulti-
winner setting, elections on interdependent proposals, as well as a partial delegation
setting where voters can opt to delegate only on proposals for which they have no
opinion.
25
References
Alouf-Heffetz S, Bulteau L, Elkind E, et al (2022) Better collective decisions via
uncertainty reduction. In: Proceedings of the 31st International Joint Conference on
Artificial Intelligence (IJCAI), pp 24–30
Anshelevich E, Filos-Ratsikas A, Shah N, et al (2021) Distortion in social choice
problems: The first 15 years and beyond. In: Proceedings of the 30th International
Joint Conference on Artificial Intelligence (IJCAI) Survey Track., pp 4294–4301
AyadiM,AmorNB,LangJ,etal(2019)Singletransferablevote:Incompleteknowledge
and communication issues. In: Proceedings of the 18th International Conference on
Autonomous Agents and MultiAgent Systems (AAMAS), pp 1288–1296
Aziz H, Shah N (2021) Participatory budgeting: Models and approaches. In: Pathways
Between Social Science and Computational Social Science: Theories, Methods, and
Interpretations. Springer, p 215–236
Baumeister D, Dennisen S (2015) Voter dissatisfaction in committee elections. In: Pro-
ceedingsofthe14thInternationalConferenceonAutonomousAgentsandMultiAgent
Systems (AAMAS), pp 1707–1708
Becker R, D’angelo G, Delfaraz E, et al (2021) Unveiling the truth in liquid democracy
with misinformed voters. In: Proceedings of the 7th International Conference on
Algorithmic Decision Theory (ADT), pp 132–146
BouveretS,EndrissU,LangJ(2010)Fairdivisionunderordinalpreferences:Computing
envy-free allocations of indivisible goods. In: Proceedings of the 19th European
Conference on Artificial Intelligence (ECAI), pp 387–392
Brill M, Freeman R, Conitzer V (2016) Computing possible and necessary equilibrium
actions (and bipartisan set winners). In: Proceedings of the 30th AAAI Conference
on Artificial Intelligence (AAAI), pp 418–424
Cabannes Y (2004) Participatory budgeting: a significant contribution to participatory
democracy. Environment and urbanization 16(1):27–46
Caragiannis I, Micha E (2019) A contribution to the critique of liquid democracy. In:
Proceedings of the 28th International Joint Conference on Artificial Intelligence
(IJCAI), pp 116–122
Cevallos A, Stewart A (2021) A verifiably secure and proportional committee elec-
tion rule. In: Proceedings of the 3rd ACM Conference on Advances in Financial
Technologies, pp 29–42
Cohensius G, Mannor S, Meir R, et al (2017) Proxy voting for better outcomes. In:
Proceedingsofthe16thConferenceonAutonomousAgentsandMultiAgentSystems
26
(AAMAS), pp 858–866
Constantinescu A, Wattenhofer R (2023) Computing the best policy that survives a
vote. In: Proceedings of the 22nd International Conference on Autonomous Agents
and Multiagent Systems (AAMAS), pp 2058–2066
Elkind E, Faliszewski P, Lackner M, et al (2015) The complexity of recognizing
incomplete single-crossing preferences. In: Proceedings of the 29th AAAI Conference
on Artificial Intelligence (AAAI), pp 865–871
Frances M, Litman A (1997) On covering problems of codes. Theory Computing
Systems 30(2):113–119
Fritsch R, Wattenhofer R (2022) The price of majority support. In: Proceedings of
the 21st International Conference on Autonomous Agents and MultiAgent Systems
(AAMAS), pp 436–444
Halpern D, Halpern JY, Jadbabaie A, et al (2023a) In defense of liquid democracy. In:
Proceedings of the 24th ACM Conference on Economics and Computation (EC), p
852
Halpern D, Kehne G, Procaccia AD, et al (2023b) Representation with incomplete
votes.In:Proceedingsofthe37thAAAIConferenceonArtificialIntelligence(AAAI),
pp 5657–5664
Harper FM, Konstan JA (2015) The movielens datasets: History and context. ACM
Transactions on Interactive Intelligent Systems 5(4):1–19
Imber A, Israel J, Brill M, et al (2022) Approval-based committee voting under
incomplete information. In: Proceedings of the 36th AAAI Conference on Artificial
Intelligence (AAAI), pp 5076–5083
Kahng A, Mackenzie S, Procaccia A (2021) Liquid democracy: An algorithmic
perspective. Journal of Artificial Intelligence Research 70:1223–1252
Kalech M, Kraus S, Kaminka GA, et al (2011) Practical voting rules with partial
information. In: Proceedings of the 10th International Conference on Autonomous
Agents and MultiAgent Systems (AAMAS), pp 151–182
Kerkmann AM, Rothe J (2019) Stability in fen-hedonic games for single-player devia-
tions. In: Proceedings of the 18th International Conference on Autonomous Agents
and MultiAgent Systems (AAMAS), pp 891–899
Kiayias A, Lazos P (2022) Sok: Blockchain governance. In: Proceedings of the 4th
ACM Conference on Advances in Financial Technologies (AFT), pp 61–73
Lackner M (2014) Incomplete preferences in single-peaked electorates. In: Proceedings
of the 28th AAAI Conference on Artificial Intelligence (AAAI), pp 742–748
27
Lang J (2020) Collective decision making under incomplete knowledge: possible and
necessary solutions. In: Proceedings of the 29th International Joint Conference on
Artificial Intelligence (IJCAI), pp 4885–4891
LeGrand R, Markakis E, Mehta A (2007) Some results on approximating the minimax
solution in approval voting. In: Proceedings of the 6th International Conference on
Autonomous Agents and MultiAgent Systems (AAMAS), pp 1185–1187
Meir R, Sandomirskiy F, Tennenholtz M (2021) Representative committees of peers.
Journal of Artificial Intelligence Research 71:401–429
Paulin A (2020) An overview of ten years of liquid democracy research. In: The 21st
Annual International Conference on Digital Government Research, pp 116–121
Pivato M, Soh A (2020) Weighted representative democracy. Journal of Mathematical
Economics 88:52–63
Revel M, Halpern D, Berinsky A, et al (2022) Liquid democracy in practice: An
empirical analysis of its epistemic performance. In: ACM Conference on Equity and
Access in Algorithms, Mechanisms, and Optimization
Terzopoulou Z (2023) Voting with limited energy: A study of plurality and Borda.
In: Proceedings of the 22nd International Conference on Autonomous Agents and
MultiAgent Systems (AAMAS), pp 2085–2093
Terzopoulou Z, Karpov A, Obraztsova S (2021) Restricted domains of dichotomous
preferences with possibly incomplete information. In: Proceedings of the 35th AAAI
Conference on Artificial Intelligence (AAAI), pp 5726–5733
Zhou A, Yang Y, Guo J (2019) Parameterized complexity of committee elections
with dichotomous and trichotomous votes. In: Proceedings of the 18th International
Conference on Autonomous Agents and MultiAgent Systems (AAMAS), pp 503–510
28