Abstract 1 Introduction 2 Problem set up 3 Bounding the gap 4 Lower bound 5 Conclusion References Appendix A Alternative multiplicative choice function Appendix B Noncounting lower bound Appendix C Follow-the-regularized-leader interpretation

Multidimensional Blockchain Fees Are (Essentially) Optimal

Guillermo Angeris ORCID Bain Capital Crypto, Boston, MA, USA Theo Diamandis ORCID Bain Capital Crypto, Boston, MA, USA Ciamac Moallemi ORCID Graduate School of Business, Columbia University, New York, NY, USA
Abstract

In this paper we show that, using only mild assumptions, dynamic multidimensional blockchain fee markets have strong performance guarantees, even against worst-case adversaries. In particular, we show that the average welfare gap between the following two scenarios is at most O(1/T), where T is the length of the time horizon considered. In the first scenario, the designer knows all future actions by users and is allowed to fix the optimal prices of resources ahead of time, based on the designer’s oracular knowledge of those actions. In the second, the prices are updated by a very simple algorithm that does not have this oracular knowledge, special cases of which are EIP-4844 and EIP-1559, both fee mechanisms used by the Ethereum blockchain. Roughly speaking, this means that, on average, over a reasonable timescale, there is no difference in welfare between “correctly” fixing the prices, with oracular knowledge of the future, when compared to the proposed algorithm. We show a matching lower bound of Ω(1/T) for any implementable algorithm and also separately consider the case where the adversary is known to be stochastic.

Keywords and phrases:
Blockchains, transaction fees, online optimization, convex optimization
Copyright and License:
[Uncaptioned image] © Guillermo Angeris, Theo Diamandis, and Ciamac Moallemi; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Online algorithms
; Theory of computation Convex optimization ; Information systems Digital cash ; Theory of computation Algorithmic mechanism design
Related Version:
Full Version: https://arxiv.org/abs/2402.08661
Editors:
Zeta Avarikioti and Nicolas Christin

1 Introduction

Public blockchains allow any user to submit a transaction that modifies the shared state of the network. These transactions are independently verified and executed by a decentralized network of full nodes, who must each re-execute the transaction to verify the correct updated state of the blockchain. Each transaction requires some amount of resources, such as compute or storage, to be executed. Because full nodes have finite resources, blockchains must limit the total computational resources that can be consumed per unit of time, usually measured per-block. As user demand may fluctuate, most blockchains implement a transaction fee mechanism in order to allocate finite resources among competing transactions.

Transaction fees.

Transaction fee mechanisms in practice often use a combination of a base fee, which is the minimum cost of entry for a particular block, and a priority fee, which is an additional fee that a user may pay so that a particular contentious transaction is included. In this paper, we concentrate on the base fee for transaction inclusion. These fees are computed algorithmically as part of the blockchain’s consensus mechanism; any transaction not paying the computed base fee is invalid. Thus, the algorithm for computing these base fees is a crucial part of the blockchain’s design, as it ultimately determines which transactions can be included in the blockchain and the blockchain’s robustness to adversarial behavior, such as spam or denial-of-service attacks.

Dynamic base fees.

Because user demand fluctuates, many blockchains have implemented or proposed a dynamic base fee mechanism. These mechanisms update the base fee based on current demand for resources, measured by the resource consumption of the transactions in previous blocks. Most famously, EIP-1559 [14] for the Ethereum blockchain introduced a base fee that adjusts to track a target block usage. Many of these mechanisms can be interpreted as applying gradient descent on the dual of a particular optimization problem [19].

Multidimensional fees.

Calculating transaction fees through a single, joint unit of account fixes the relative prices of all resources. These fixed relative prices inhibit granular price discovery and, ultimately, reduce the throughput of the blockchain. To mitigate this behavior, some blockchains have proposed multidimensional fees, where different resources are priced separately. EIP-4844 [15], to be implemented in the upcoming Dencun update, allows users to submit special pieces of data called blobs [1, 2], which have an independent fee mechanism. In essence, EIP-4844 creates a two-dimensional fee market. SIMD-110 [38] in the Solana ecosystem proposes a local fee controller for each piece of state on the Solana virtual machine. During periods of high user demand, highly contested pieces of state would become more expensive without affecting the price of other state. Similar multidimensional mechanisms have been proposed for the Ethereum ecosystem [3, 13] and other blockchains, such as Avalanche [29] and Penumbra [30], have also implemented multidimensional blockchain fees. Like dynamic base fees, these multidimensional mechanisms can be designed and interpreted by considering the optimization problem that they attempt to solve.

Optimality.

While the design and incentives of particular existing transaction fee mechanisms have been extensively studied, the overall design space has not been well-characterized. In particular, it is unclear what properties we can hope for from a “good” fee mechanism, the tradeoffs present within the space of “good” mechanisms, and what “good” even means in this context, where adversarial behavior may preclude the existence of “true” market-clearing prices for resources. These open questions are important for the many blockchains currently working on updated fee mechanisms, especially because many traditional results do not apply in the adversarial blockchain environment.

This paper.

In this paper, we seek to characterize the optimality of the algorithm for setting multidimensional blockchain fees proposed in [19] when extended to the online setting. To judge the optimality of this algorithm, we rely on the notion of regret from the online convex optimization literature [35, 23]. This quantity measures the difference between two scenarios: in the first, we dynamically update the resource prices as in [19, §3.4]; in the second, we assume that the algorithm knows all future transactions and chooses the best fixed vector of prices given this knowledge. (The second scenario is similar to the current implementation of gas in Ethereum, but, of course, the fixed gas prices of individual resources are not known to be optimal.) We show that the average welfare gap between the two scenarios is at most O(1/T), where T is the length of the time horizon considered, and we construct a corresponding lower bound. Taken together our results show that the proposed algorithm is essentially optimal according to this metric, even under arbitrary adversarial behavior that eludes traditional game theoretic analysis. We also show that, if the adversary is stochastic, the prices do converge to a market clearing price and give an explicit rate of convergence. We point to several additional directions for future work, where we suspect additional tools from online optimization may be used to improve upon the results presented here.

1.1 Related work

There are a number of papers proposing new mechanisms for pricing resources or analyzing mechanisms currently implemented in practice, which we describe next. The most similar in spirit to ours is [6], which cites a preprint of this article for its set up. They show that, when users submitting transactions are patient (i.e., their utility does not depend on when their transaction ultimately gets included), a one-dimensional exponential update rule, akin to EIP-1559, is essentially optimal compared to any baseline. This mechanism is a special case of the mechanism studied in this paper (see (7) and derivation in 3.1). Our mechanism allows for both alternative update rules and more than one resource. We review some of the other relevant literature below.

Blockchain resource pricing.

In the specific case of blockchain resource pricing, these papers can be roughly broken down into three buckets, which we describe next. The first set analyzes past or current fee mechanisms or variations thereof. These papers include some of the original economic analyses of EIP-1559 [33], empirical studies [12, 27], simulation studies [31], and clean theoretical models [25, 26]. All of these papers focus on the single-dimensional resource case, usually under some simplifying assumptions such as, e.g., certain stochastic limits in the case of [26]. The second set of papers proposes simple extensions or characterizations of mechanisms for pricing single resources in a variety of settings [20, 8, 9], including some impossibility results [16]. The final set proposes multidimensional fee mechanisms for a variety of settings [13, 19, 18]. The multidimensional fees work closest in spirit to ours is [18], which proposes a stochastic demand model of resources with stochastic observations. They apply linear-quadratic control theory to set prices that minimize a composite loss function that seeks to balance matching the target block size, while smoothly changing prices.

This work, on the other hand, shows that, even in an adversarial setting, a very simple update rule suffices and has low regret relative to optimally setting the prices. We also show that, in the “easier” stochastic setting, we recover a market-clearing price and provide a rate of convergence. One important case where we differ from many previous analyses is that we do not consider only incentive compatibility, but, indeed, consider worst-case adversarial behavior against the pricing mechanism and provide optimality bounds even in this particularly difficult setting.

Tatonnement processes.

The design of transaction fee mechanisms via gradient descent on an appropriately-chosen dual problem is also related to the literature on tatonnement processes [36, 17]. Many works (e.g., [11] and [32]) use gradient descent-like algorithms to develop dynamic pricing strategies for various types of markets. These algorithms use a dynamic step size to ensure convergence to optimal prices. More similar to our work, [5] uses a fixed step size, and their algorithm, as a result, does not necessarily converge to any single price.

2 Problem set up

In this section, we briefly introduce the blockchain resource allocation problem of [19]. We review how the dual problem leads to a notion of resource prices, which, when set correctly, ensure that block builders include the “optimal” transactions, i.e., those that maximize the utility of the transaction producers, minus the loss incurred by the blockchain for including those transactions.

2.1 The block building problem

We will fix some notation that we use for the remainder of the paper in the paragraphs that follow and describe the problem we are looking to solve.

Transactions and resources.

First, let t=1,,T denote the current blockheight. At blockheight t1, we assume that there are nt transactions from which we may pick a subset to be included in block t. (This list of transactions that we are allowed to choose from is often called the mempool.) Each of these j=1,,nt transactions has some associated utility (qt)jR+ which is gained if it is included in the block. Additionally, transaction j, if included in the block, will consume some amount of the m available resources, which we denote as an m-vector ajRm. (The ith entry of this vector, (aj)i, denotes how much of resource i is consumed by transaction j.) Finally, the vector xt{0,1}nt will denote the transactions that were included in block t: the jth entry (xt)j=1 if transaction j is included in block t and is 0, otherwise.

Total consumption.

This set up lets us write the total resource consumption of all transactions included in block t in a convenient way. Define the m-vector

yt=j=1naj(xt)j=Atxt,

where the matrix AtR+m×nt has jth column aj denoting the resources consumed by transaction j in the mempool. We then have that (yt)i denotes the total amount of resource i is consumed by the transactions included in block t.

Allowable transactions.

We denote the set of allowable transactions in a block by St{0,1}nt, which may include resource limits (e.g., if xSt, then Axb), and interactions among transactions. Examples may include complementarity constraints such as, if xSt, then xj=1 implies that xj=0 for some jj, among many other, potentially very complicated, constraints.

Loss function.

Finally, we denote the network’s “unhappiness” with the resource usage at block t as the loss function t:R+mR{}. This function maps the resource usage at block t, which is always a nonnegative vector, to some number t(yt). Higher values denote less desirable resource usage. We assume this function, which is chosen by the blockchain designer, is convex and lower semicontinuous for the remainder of the paper.

Block building problem.

We now define the block building problem as the problem of optimizing the utility of the included transactions, minus the loss incurred by the network, subject to the transactions being in the allowable transaction set:

maximize qtTxtt(yt) (1)
subject to yt=Atxt
xt𝐜𝐨𝐧𝐯(St).

Here, the variables are the included transactions xtRnt and the total resource consumption of the included transactions, ytRm. Additionally, 𝐜𝐨𝐧𝐯(St) denotes the convex hull of the set St. Replacing St with its convex hull allows “fractional” transactions, such as xj=1/3. These values of xj(0,1) can be interpreted as “spreading out” transaction j over the next 1/xj blocks. We denote the optimal value of problem (1) by s.

Bounds.

In general, it is a good idea to have maximum per-block resource consumption bounds, which we write as bR+m, so we will assume that

St{x{0,1}ntAtxb},

so any allowable choice of transactions xtSt will satisfy

0Atxtb,

as both At and xt are nonnegative. In other words, we assume that the maximum amount of resource i that any set of acceptable transactions may consume is no more than bi. Note that this also implies

0ytb,

for any yt feasible for (1) since yt=Atxt. (For why such limits are necessary, see the discussion in [19, §1].) Since 0ytb for any feasible points, we assume, without loss of generality, that t(y)= whenever yb or y0 for convenience.

Discussion.

We may view this block building problem as the following “idealized” scenario: if we, as the network, knew the welfare generated by each transaction, along with all possible constraints for all transactions then solving problem (1) would give us a way to include transactions as to maximize welfare minus the loss incurred by the network. Of course, in general, almost all of these things are not known (or, potentially even knowable) to the network, so we will show how to approximately solve this problem by, instead, correctly setting the price of the consumed resources for each block.

Resources.

We also note that the number of resources n here is left to the blockchain designer. A valid question might be: what is a reasonable number of resources and just how granular should these resources be in order to ensure that the system behaves as intended? While it may be tempting to have as many different resources as possible in order to have the most granular possible price discovery, we will see later that this may result in a very weak bound, which we show can be matched by an adversary. On the other hand, if many resources are correlated in their usage, it may be good to run some dimensionality reduction technique (such as PCA) on the historical resource utilization and only explicitly price a few “meta” resources.

2.2 The dual problem

In this subsection, we construct a particular dual problem, whose variables are the prices of the m possible resources; correctly setting these prices would solve our original problem (1). We then show that its gradients can be efficiently evaluated, without knowledge of things like the allowable transactions St nor the welfare qt for each block t. This suggests a heuristic in which we use the gradient at each step to update the prices of the resources, which we prove in a later section is (essentially) optimal.

The dual function.

One possible relaxation of (1), for each block t=1,,T, can be constructed by relaxing the equality constraint in (1) to a penalty, parametrized by a vector ptRm, which we will call the prices:

maximize qtTxt(yt)+ptT(ytAtxt)
subject to xt𝐜𝐨𝐧𝐯(St),

where the variables are the same as those of problem (1). If we denote the optimal value of this problem, which is parametrized by the vector pt, as ft(pt), then, since the objective is separable over xt and yt, we can write the optimal value of this problem as

ft(pt)=supy(ptTyt(y))+supx𝐜𝐨𝐧𝐯(St)(qtAtTpt)Tx, (2)

We note that this function ft, which we will call the dual function, is convex since it is the pointwise supremum of a family of functions linear in pt.

Fenchel conjugate.

The first term of (2) can be recognized as the Fenchel conjugate of [10, §3.3] evaluated at pt, which we write as t(pt). This function can be interpreted as choosing the resource allocation y that maximizes the network’s profit minus the loss as a result of resource consumption y. We note that the Fenchel conjugate has a closed-form expression for many commonly used loss functions.

Block building problem.

The second term in (2) is the optimal value of the following problem:

maximize (qtAtTpt)Tx
subject to x𝐜𝐨𝐧𝐯(St),

with variable xRn. We can interpret this problem as the transaction producers’ problem of creating and choosing transactions to be included in a block in order to maximize the utility of the included transactions (qtTx), after netting the fees paid to the network ((AtTpt)Tx). (This problem is sometimes called the block building problem or packing problem that is solved by block producers. These “block producers” include both users and validators.) Since the objective of this problem is linear, it has the same optimal value as the nonconvex problem

maximize (qtAtTpt)Tx (3)
subject to xSt,

with variable xRn. Note that the packing problem problem depends on the problem data associated with block t: the utilities qt, the resources required by the submitted transactions, At, and the associated constraints St for a particular block at time t. We will denote the optimal value of the packing problem as ht(p), where the index t indicates that the problem data is associated with block t.

Gradient.

With this notation, the dual function can be written as

ft(p)=t(p)+ht(p).

We note, for future use, that, when ft is differentiable at p,

ft(p)=yt(p)Atxt(p), (4)

where yt(p) denotes the maximizer for the Fenchel conjugate of t, at price p, while xt(p) denotes the optimal transactions to be included for the block building problem (3) at price p. If ft is not differentiable at p, then any solution to each subproblem suffices. Note that both of these quantities are easy to compute if price p is set at blockheight t1 and block t is submitted with these resource prices p: the former term, y(p), is the solution to a basic problem that often has a closed form, while the latter term, Ax(p), is exactly the total resource consumption of the transactions included in block t.

Bounds on the gradient.

From the discussion in §2.1, we have that the allowable transactions in St can only consume up to some resource limit b, which means that:

0Atxt(p)b,

from problem (3). Additionally, since t(y)= unless 0yb, as noted in §2.1, then, it is not hard to show that

0yt(p)b,

so we receive the simple bound on the gradient of ft at p:

byt(p)Atxt(p)b.

This implies the gradient bound

ft(p)sbs

for any s-norm. Note that, in the case where s<, the norm of b is nondecreasing in the number of elements in b; in other words, the more resources with nonzero limits that we wish to price, the worse this resulting bound is.

2.3 The aggregated dual problem

If the dual function ft is differentiable, then, since it is convex, finding a minimizer of the dual function corresponds exactly to finding a point p satisfying

ft(p)=0.

Using (4), this is the same as finding a set of prices p such that

Atxt(p)=yt(p).

We can interpret the left term as the demand for resources at price p (from users and nodes) and the right term as the supply of resources at price p (from the network). Setting the prices correctly balances the demand and supply of the available resources. One can also show that, when ft is differentiable and this condition is satisfied, the transactions xt(p) and yt(p) exactly solve the original problem (1).

The (aggregated) dual problem.

As blockchain designers, we would like to set prices that, at least on average, render the supply and demand of resources (approximately) equal. That is, we would like to find a set of prices that solves the problem:

minimize t=1Tft(p), (5)

over pRm. Finding a price p that minimizes this objective means that, using (1), over the T blocks, we have

t=1TAtxt(p)=t=1Tyt(p),

i.e., that the demand and supply for resources over the T blocks are both equal. Or, in other words, that the market, over all T blocks, clears on aggregate when the prices p are correctly set. (The case where the objective is not differentiable over p, but is subdifferentiable, means that there is at least one solution to each subproblem, at optimal prices p, for which supply and demand are both equal.)

A basic heuristic.

Since the prices p for the blockchain must be set before the blocks are observed, it would be hard to solve (5) without oracular knowledge of the functions ft. A simple heuristic is then the following: start with some prices pt, set at blockheight t1, observe what happens at block t with these prices, and update the prices slightly (by, say η>0) according to the gradient information:

pt+1=ptηft(pt). (6)

(If ft is not differentiable at pt, then replace the gradient with a subgradient.) From before, we know that the gradient at block t is observable at the prices set in the previous block, pt. This heuristic has roughly the “right behavior”: if our prices are too low for a certain resource, say i, then the demand at these prices exceeds the expected supply, so

(yt(pt))i>(Atxt(pt))i.

From the update step, this would increase the price (pt)i of resource i proportional to this amount. (The opposite is true if the prices at time t1 were set too high.) See [19] for additional discussion.

A multiplicative update.

Of course, we can also imagine any number of other update rules. For example, we may use a multiplicative update:

pt+1=ptexp(ηft(pt)), (7)

where denotes the elementwise (Hadamard) product. This update ensures that the prices are always nonnegative; though the previous update (6) can be modified slightly to accommodate this as well. We note that this update rule, when there is one resource, corresponds exactly to the EIP-4844 update found in Ethereum today.

Discussion.

Of course, the next natural question is: just how well do these heuristics perform? We will show in the remainder of the paper that the average gap between the objective values provided by this heuristic and those given by setting the optimal prices p (which clear the market on average; i.e., those that solve problem (5)) tends to zero as the number of blocks in question, T, grows:

1T(t=1Tft(pt)t=1Tft(p))CT, (8)

for any (reasonable) choice of p. Perhaps the most surprising part is that we will show that this is true even when the allowable transactions St, resource utilizations At, and the welfare qt are controlled by an adversary. We will also show that these algorithms are essentially optimal: no better rate, in terms of T, is possible for any algorithm with such an adversary. The fact that this is possible, along with the original bounds for such problems, come from a long line of work in online convex optimization [35], which we present (and adapt to our specific scenario) in the next section.

3 Bounding the gap

From §2, the main quantity we wish to bound is the difference between the algorithm’s performance (i.e., the aggregate performance of the pt) against the performance of the aggregate market clearing price (i.e., the price p that minimizes (5)):

R=i=1Tft(pt)i=1Tft(p). (9)

This quantity R is called the regret of the algorithm. Bounding this regret for a variety of algorithms for choosing pt is the main goal of the field of online algorithms, for which there are many great resources, cf., [35, 23]. A bound on this quantity R would then also give us a bound of the form of (8) given by R/T, which is called the average regret. We will give a very short and self-contained exposition of online convex optimization, tailored to our special case, in the what remains of this section and in the next section.

Discussion.

Note that this comparison is interesting for a few reasons. First, the price p can only be computed if all of the dual functions ft are known in advance, which would require knowledge that the network does not have. (On the other hand, updating the prices pt via a rule such as (6) is very easy to implement in practice.) The second observation is that it need not be true that pt approaches p, except in very special cases, even when the regret is low. (One case where this is true is, for example, when the ft=f, for some fixed function f, which is a common simplification in some analyses of dynamic fees, or when the ft are i.i.d. drawn from some fixed distribution and strongly convex.)

Linearization.

Since the functions ft are convex, then, by definition [10, §3.1.3] we have that

ft(p)ft(pt)+ft(pt)T(ppt).

Rearranging, this gives an upper bound on the difference in objective value at time t,

ft(pt)ft(p)ft(pt)T(ptp).

For the remainder of the paper, we will write gt=ft(pt) for notational convenience. (If ft is not differentiable at pt, then any subgradient will suffice.) Summing both sides over t then gives

Ri=1TgtT(ptp). (10)

In other words, if we can find a reasonable upper bound for the linear term on the right hand side, we can give a bound on the regret R, or the average regret R/T.

Results.

The main point of this section will be to use bound (10), along with a simple framework to show that the linear update rule (6), with appropriately chosen step size η>0 has average regret bounded from above by

RT=1T(i=1Tft(pt)i=1Tft(p))BMT,

where b2B is an upper bound on the norm of the resource limits, while p2M is a bound on the optimal price. In the case that the adversary is stochastic (i.e., if the ft are random functions, uniformly and identically drawn under certain conditions we state below) there is a much stronger result. Here, the time-averaged price p¯=(1/T)t=1Tpt converges to the optimal value in expectation in that

𝐄[f(p¯)f(p)]B2MT,

using again rule (6), where f(p)=𝐄[f1(p)], for any fixed p. (We can, of course, replace f1 with any other ft.) This also implies that, if the ft are almost surely μ-strongly convex, for some μ>0, then p¯ converges to the optimal price p in expectation. Finally, if the update used is the multiplicative rule of (7), then the average regret can be bounded from above by

RTBMmlog(M/ε)2T.

Here, we define pM, while B=b2, as before, while ε1 is a lower bound on the price used in the first iteration of the rule, p0ε𝟏, where 𝟏 is the all-ones vector and the inequality is elementwise.

3.1 Choice functions and updates

We will focus on the two heuristics presented in §2.3 to update the prices and show that, for each, the regret is low. To do this, we rewrite the algorithms as a special case of a general update rule, sometimes known as FTRL or Follow The Regularized Leader [35, 23], which we will then use to find a bound on the regret. The presentation here is a slight variation of the lovely theorem and proof of [24, §2].

Choice function.

Given some choice function F:RmR{}, we choose the new prices pt+1 given the previous gradients g1,,gt by setting

pt+1=F(η(g1++gt)), (11)

where η>0 will be some specific step size to be chosen later. (The gradient can be replaced with any subgradient whenever the function is not differentiable, but is convex.) The one requirement we will have of this function F is that it is “smooth”; in particular that the Hessian of F (if twice-differentiable) is bounded from above by σI, where σ>0, in some suitable domain DRn.

Gradient descent.

One choice of functions we will use is the norm-squared,

F(z)=12p0+z22.

Since F(z)=p0+z, then

pt+1=p0η(g1++gt)=ptηgt,

which corresponds exactly to the gradient descent rule proposed in (6). This particular choice function is smooth with σ=1 for any domain D.

Multiplicative update.

The second choice function is

F(z)=i=1m(p0)iexp(zi),

which leads to the multiplicative update rule found in (7),

pt+1=p0exp(η(g1++gt))=ptexp(ηgt).

Here exp() is taken elementwise, while denotes the elementwise (Hadamard) product. For the domain D={zzlog(M)}, this choice function is smooth with σ=p0M. (We use log(M) since a bound on the prices of order M would imply a bound on the domain size of order around log(M) – this is a somewhat technical condition that can be avoided by a slightly more careful choice of F; cf., appendix A.) We provide an alternate interpretation of this rule as both follow-the-regularized-leader and online mirror descent in appendix C. More specifically, these multiplicative update rules emerge from regularization by the nonnegative entropy function. This fact might explain some empirical observations of EIP-1559’s behavior.

3.2 Constructing general regret bounds

We will combine two simple observations to give a bound on the linear term of (10). The first is that, if we define the Fenchel conjugate of F over the domain D by

F(p~)=supzD(p~TzF(z)),

then we have the definitional inequality

zTp~F(p~)F(z), (12)

for any choice of p~Rm and zD. The second is that, since F is smooth with constant σ, we have, by a simple exercise in integration:

F(z+z)F(z)+F(z)Tz+σ2z22.

So, given a sequence of g1,,gT, by applying this inequality repeatedly and using the definition of the prices pt given in (11), we have

F(η(g1++gT))F(0)ηt=1TgtTpt+ση22t=1Tgt22.

Finally, setting z=η(g1++gT) in (12) and using the above inequality gives

t=1TgtT(ptp~)F(0)+F(p~)η+ση2t=1Tgt22.

Since the gradients gt are bounded from above by gt2b2=B, this leads to the following bound on the linearized quantity:

t=1TgtT(ptp~)BσT(F(0)+F(p~)), (13)

by choosing the step size

η=F(0)+F(p~)σTB2/2,

for any choice of p~. Since this is an online algorithm which may run indefinitely, we may choose T to trade off between responsiveness of the system and regret. In particular, as T becomes large, our regret bound gets tighter (roughly going as T) but the system’s responsiveness to events becomes slower and, depending on the system’s implementation, the “optimal fixed baseline” of p becomes less realistic.

Regret bound.

Using (13), we can now give a bound on the regret R using the inequality of (10). More specifically, if we know that the optimal aggregate clearing price p lies in some set P, then, using (13), we have

RBσT(F(0)+F(p))BσT(F(0)+suppPF(p)),

or, alternatively, that the average regret over that time horizon, R/T, is

RTBσ(F(0)+suppPF(p))T. (14)

3.3 Regret bounds for updates

Given the set up above, we are now ready to give bounds on the regret for both the gradient-descent-like update rule of (6) and the multiplicative update of (7).

Norm-squared choice function.

For the gradient-descent-like rule (6), we have the choice function F(z)=12p0+z22. Applying the bound (13) gives, since F(0)=p022 and σ=1 for this choice function (from §3.1):

1Tt=1TgtT(ptp~)Bp022+F(p~)2T.

It is not hard to show that F(p~)=12p~22, and, assuming that we have a bound p2M (and similarly for p0), then the average regret is bounded by (14),

RTBMT.

Exponential choice function.

In the multiplicative update rule (7), the exponential choice function is given by F(z)=i=1m(p0)iexp(zi) has F(0)=p01 and σ=M such that, again, using bound (13),

1Tt=1TgtT(ptp~)BM(p01+F(p~))2T. (15)

It is also not hard to show that the conjugate of F is

F(p~)=i=1mp~ilogp~i(p0)ip~i,

and, if we have a bound that 0p~M𝟏 and ε𝟏p0M𝟏 for ε1, then

supp~MF(p~)mmax{Mlog(M/ε)M,0},

which follows from the fact that F is separable and convex so is maximized at the boundary of the domain. For simplicity, we will assume that Me, in which case the first term in the max is nonnegative. Putting it all together, we know that

p01mMandF(p)m(Mlog(M/ε)M).

Plugging these values into (15) gives

RTBMmlog(M/ε)2T,

where, from before, M is the maximum price allowed for any one resource, m is the number of resources, and ε is the minimum starting price. The latter condition cannot be dropped since, if p0=0, no update would ever be able to increase the price, while on the order of log(1/ε) updates are needed to scale the price p0 to any constant. A sufficient condition to ensure that p0ε𝟏 is that there is always some minimal demand for blockspace at a small enough price, as in, e.g., [19, §3.2].

3.4 A stochastic model

In this subsection, we consider a stochastic setting instead of the adversarial one above, which will give us stronger results on the convergence of the prices pt. In particular, given a probability space with sample space Ω and distribution P, we assume that a sample ωtΩ determines the parameters of the block building problem at time t, so that,

qt=q~(ωt),St=S~(ωt),t()=~(,ωt),At=A~(ωt),

where the welfare q~, feasible set S~, losses (,ω) (viewed as a function over ω), and consumption matrices A~ are random variables over Ω. Given a sample ωΩ and a price vector p, the utility in scenario ω for price p is given by the concave function

f~(p,ω)=supy(pTy~(y,ω))+supx𝐜𝐨𝐧𝐯(S~(ω))(q~(ω)A~(ω)Tp)Tx.

(In other words, we replace the problem data in the definition of the dual function (2) with the corresponding sampled versions.) Here, we will show that, in a certain sense, the time-averaged price of the algorithm minimizes the expected utility,

f(p)=𝐄[f~(p,ω)]. (16)

For later, we will define p~argminf. Unlike in the previous set up for adversarial functions, in this problem, the observed functions are random variables ft(p)=f~(p,ωt), where the ωt are drawn i.i.d. according to P. So, in this setting, the gradient update rule of (6),

pt+1=ptηft(pt),

corresponds to an instance of stochastic gradient descent with stepsize given by η>0.

Result.

A standard result in stochastic gradient descent from, say [21, Thm. 9.5], is that,

𝐄[f(p¯)]f(p~)2M2ηT+ηB42.

Here, B=b2 and p~2M is a bound on the optimal price p~ that minimizes the expected utility f given in (16). (We also assume that the initial price p02M.) Finally, the time-averaged price p¯ is defined

p¯=1Tt=1Tpt.

Setting η=2M/B2T then gives the bound:

𝐄[f(p¯)]f(p~)B2MT.

Extensions.

There is an immediate extension to this result. The first is to note that, by assumption, this result relies on i.i.d. draws of the sample ωt, but this can be generalized considerably. If we define ϵt to be the noise in the gradient estimate at block t, i.e.,

ϵt=f(pt)pf~(pt,ωt),

then the above result requires that {ϵt} be zero mean and i.i.d. Under suitable conditions, this can be relaxed to the {ϵt} being a martingale difference sequence [22, §5.2].

3.5 Discussion

With only the basic assumptions that (a) there is some bound on the maximum price p that an adversary may choose and (b) some hard constraint on the block size, which is enforced by the validators, we achieve average regret that scales as 1/T for either update rule, even when the transactions and welfare, which are included in the functions ft, are chosen adversarially. An interesting point is that this shows why a hard constraint on the resources consumed per-block is a good idea: it bounds any adversary’s ability to construct arbitrarily-sized blocks which could make the regret of most reasonable rules quite large. The restriction given in (a), that the adversary has bounded prices, seems strong, but is implied by a number of simple statements: for example, it suffices that the adversary has finite budget and must pay ptTAtxt(pt) for each transaction, which is, in fact, a much stronger requirement. Two interesting questions for future research are: first, is there a more natural model for an adversary in the blockchain setting than the essentially all-powerful one provided here? And, second, given such an adversary, can we achieve better results?

Time-independent step sizes.

In this model, we assumed that the step size η is fixed and can depend on the total length of observed time period, T. Indeed, the step size trades off the ability of the prices p to “react” to observed gradients gt with the average regret. (In particular, as T becomes large, the average regret becomes small, but η must be similarly small and must remain so for the time period in observation.) It is not hard to show that the regret bound in both the adversarial (§3.3) and stochastic case (§3.4) above for the gradient descent rule (6) (i.e., the one derived from the norm-square choice function) also carries through in the case where the step sizes η are decreasing with respect to time, to get an update rule of the form

xt+1=xtηtgt. (17)

Here, ηt=C/t and C>0 is some appropriately chosen constant that, importantly, does not depend on the observation window T, but only on the known bounds for the price p2M and the maximum resource utilizations b2B. The regret for this algorithm, under the same assumptions as §3.3 for the norm-squared choice function, is no more than 2BMT in the adversarial case; see [23, §3.1] for a proof. (The bound for the stochastic case is more refined, but see, e.g., [21, Thm. 5.3] for the statement.) More generally, any sequence {αt} which asymptotically satisfies

t=1Tαt=O(T)

and 1/αT=O(T) has a similar guarantee, up to a potentially different constant.

Strong convexity.

Indeed, if the step sizes of the algorithm are allowed to vary with respect to time, we can get much stronger regret guarantees in the case that the conjugate of the loss t is strongly convex, using the same update rule (17). More specifically, if t is μ-strongly convex, then we can achieve logarithmic regret that, additionally, does not depend on there being any bound on the optimal price p. (One case where t is strongly convex is, e.g., when t(y)=(1/2)(yb)+22 for some target utilizations bR+m and y is bounded by the maximum resource utilizations yb.) In this particular case, we have that for the update rule (17), where the step sizes are ηt=μ/t, the regret is no larger than B2log(T)/μ, making the average regret quite small: no more than B2log(T)/(μT). (For a proof of this, see [23, §3.3.1].)

4 Lower bound

In this section, we will show that, up to a scalar constant, the bounds provided in §3.3 are essentially optimal for any algorithm. In particular, we will show that any rule to choose the prices pt+1, based on the previous t observed gradients g1,,gt, achieves, at best, regret on the order of 1/T, for an adversary who can control the transactions appearing in a block, subject to the constraints of the set S containing some maximum resource usage limit bR+m, for a commonly-used loss. As in the previous section, we assume that the optimal price pRm for the aggregated program is bounded, pM for some value M>0.

4.1 Construction

The construction we provide here is very simple: the adversary is purely stochastic and picks the resource utilizations, up to the bound, from a simple distribution, independently of all of the previous gradients, such that any algorithm achieves 0 objective value, while the best possible price p achieves an objective of CT for some constant C>0.

Loss function and set.

We construct a lower bound for any algorithm which attempts to minimize the dual problem corresponding to the commonly-used loss

(y)={00yb+otherwise, (18)

where bR+m is the resource utilization target. Using this definition, it is not hard to show that the Fenchel conjugate of is

(p)=(b)T(p)+,

where ((p)+)i=max{pi,0} denotes the “positive part” of p, applied elementwise. The set St{0,1}nt we assume satisfies

St{xtR+ntAtxtb},

for some matrix AtRm×nt and max resource utilization bR+m. (We note that this set up reduces exactly to the resource metering of EIP-1559 when the number of resources m=1, where nt is the number of transactions in the mempool of the block builder and b is the block limit.)

Adversary.

Consider a (stochastic) adversary who is able to control the amount of resources used by the network. The adversary picks the welfare to be qt=0 and picks transactions At such that the welfare maximization problem (3) satisfies, at price pt,

Atxt=bεtb~,

where εt{±1}m is uniformly randomly chosen (independently, for each t=1,,T) and

b~i=min{bi,bibi},

for i=1,,m. (The adversary can achieve this by constructing, say, exactly one transaction that is always included with zero welfare and utilizations equal to exactly bεtb~; we assume this in what follows.) This gives

gt=ytAtxt=b(bεtb~)=εtb~. (19)

Proof.

It suffices to show that, in expectation, the regret is Ω(T), which means that there is at least one choice of resource consumption trajectories (gradients {gt}) that leads to large regret, for any possible choices of prices pt by an algorithm. First note that the linear bound (13),

t=1T(ft(pt)ft(p))t=1TgtT(ptp),

is met at exact equality for any nonnegative pt and p using our construction. To see this, note that the transactions were chosen to have zero welfare, qt=0. Using the definition of (18) means that, for any p,pt0:

ft(p)=t(p)+ht(p)=(b)Tp+(qtAtTxt(pt))Tp=(bAtxt(pt))Tp=gtTp,

for every t=1,,T. It will then suffice to show that the linear sum

𝐄[t=1TgtT(ptp)] (20)

is “large” in expectation, for p chosen with knowledge of gt.

The first term of the loss (20) is zero since

𝐄[gtTpt]=𝐄[gt]Tpt=0,

as pt can depend only on the observed gradients, g1,,gt1. The expected regret then reduces to

𝐄[t=1TgtTptgtTp¯]=𝐄[p¯Tt=1Tgt].

Since p is chosen after observing the gradients gt, and we are free to choose any p with pM and p0, we then this term is maximized by choosing pi=M if

t=1T(gt)i0,

and 0 otherwise, for i=1,,m. This means that

𝐄[p¯Tt=1Tgt]=M𝐄[𝟏T(t=1Tgt)+],

where ((z)+)i=max{zi,0} is the “positive part” of zi. Using the definition of gt above (19), we can bound the latter term

M𝐄[𝟏T(t=1Tgt)+]=M(𝟏Tb~)𝐄[(t=1Tεt)+]M(𝟏Tb~)CT,

where εt{±1} for t=1,,T is uniformly and independently drawn and C1/24. The inequality follows from a mechanical (if involved) counting argument, or any number of lower bounds on the expected value of the positive side of a random walk. We reproduce a simple, noncounting proof of this in appendix B. We note that, compared to either of the bounds of §3.3, this is tight up to a twiddle factor of M and a square root factor of m.

5 Conclusion

In this paper, we have shown that two simple update rules: a gradient update rule and a multiplicative update rule, lead to low average regret for the discovery and assignment of prices for blockchain resources, even under adversarial conditions. This implies that, on average, over a long enough timescale, the difference between correctly setting resource prices with oracular knowledge of the future (to roughly match supply and demand over the period of time in question) and these two simple update rules, is very small. We have also shown that, under the assumptions above, this is essentially the best we can hope to do by also showing a matching lower bound. Since Ethereum’s EIP-1559 is the single-dimensional special case of the multiplicative rule presented above, the analysis also applies directly, along with the proposed pricing rule for EIP-4844, which is the two-dimensional special case.

Future work.

Future potential avenues for research include weaker, but potentially more realistic adversarial models (as the adversary has a lot of power in this particular formulation) which may yield tighter bounds to the expected performance. These include models like that of patient bidders [6], where the changes to the set of transactions at each time step are limited. Additionally, a framework to set the number of resources m in a principled way (e.g., one which delineates the tradeoff between more resources versus lower worst-case regret) may be of practical interest. We note that these types of adversarial resource allocation problems also show up in zero knowledge proof markets [37, 34] and in general compute markets [7]. We suspect our results can be easily extended to and are useful for these settings.

References

  • [1] John Adler. Eip-2242: Transaction postdata, 2019. URL: https://eips.ethereum.org/EIPS/eip-2242.
  • [2] John Adler. Multi-threaded data availability on eth 1. Ethresearch, 2019. URL: https://ethresear.ch/t/multi-threaded-data-availability-on-eth-1/5899.
  • [3] John Adler. Always has been (or, wait, it’s all resource pricing? part 2). EthCC, 2022. URL: https://www.youtube.com/watch?v=Zq8uwpX39oI.
  • [4] Guillermo Angeris. A non-counting lower bound for the expected distance of a simple random walk. https://angeris.github.io/blog/content/simple-proof-rademacher/, 2023.
  • [5] Itai Ashlagi, Jacob Leshno, Pengyu Qian, and Amin Saberi. Price discovery in waiting lists: A connection to stochastic gradient descent. Available at SSRN 4192003, 2022.
  • [6] Moshe Babaioff and Noam Nisan. On the optimality of eip-1559 for patient bidders (draft–comments welcome), 2024.
  • [7] Maryam Bahrani and Naveen Durvasula. Resonance: Transaction fees for heterogeneous computation. arXiv preprint arXiv:2411.11789, 2024. doi:10.48550/arXiv.2411.11789.
  • [8] Maryam Bahrani, Pranav Garimidi, and Tim Roughgarden. Transaction fee mechanism design with active block producers. arXiv preprint arXiv:2307.01686, 2023. doi:10.48550/arXiv.2307.01686.
  • [9] Soumya Basu, David Easley, Maureen O’Hara, and Emin Gün Sirer. Stablefees: A predictable fee market for cryptocurrencies. Management Science, 2023.
  • [10] Stephen P. Boyd and Lieven Vandenberghe. Convex Optimization. Cambridge University Press, Cambridge, UK; New York, 2004.
  • [11] Sébastien Bubeck, Nikhil R Devanur, Zhiyi Huang, and Rad Niazadeh. Multi-scale online learning: Theory and applications to online auctions and pricing. Journal of Machine Learning Research, 20(62):1–37, 2019. URL: https://jmlr.org/papers/v20/17-498.html.
  • [12] Vitalik Buterin. Blockchain resource pricing, 2018.
  • [13] Vitalik Buterin. Multidimensional eip 1559. ethresear.ch, 2022.
  • [14] Vitalik Buterin, Eric Conner, Rick Dudley, Matthew Slipper, Ian Norden, and Abdelhamid Bakhta. Eip-1559: Fee market change for eth 1.0 chain, 2019. URL: https://eips.ethereum.org/EIPS/eip-1559.
  • [15] Vitalik Buterin and Ansgar Dietrichs. Eip-4488: Transaction calldata gas cost reduction with total calldata limit, 2021. URL: https://eips.ethereum.org/EIPS/eip-4488.
  • [16] Hao Chung and Elaine Shi. Foundations of Transaction Fee Mechanism Design, pages 3856–3899. SIAM, 2023. doi:10.1137/1.9781611977554.ch150.
  • [17] Richard Cole and Lisa Fleischer. Fast-converging tatonnement algorithms for one-time and ongoing market problems. In Proceedings of the fortieth annual ACM symposium on Theory of computing, pages 315–324, 2008. doi:10.1145/1374376.1374422.
  • [18] Davide Crapis, Ciamac C Moallemi, and Shouqiao Wang. Optimal dynamic fees for blockchain resources. arXiv preprint arXiv:2309.12735, 2023. doi:10.48550/arXiv.2309.12735.
  • [19] Theo Diamandis, Alex Evans, Tarun Chitra, and Guillermo Angeris. Designing Multidimensional Blockchain Fee Markets. In Joseph Bonneau and S. Matthew Weinberg, editors, 5th Conference on Advances in Financial Technologies (AFT 2023), volume 282 of Leibniz International Proceedings in Informatics (LIPIcs), pages 4:1–4:23, Dagstuhl, Germany, 2023. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.AFT.2023.4.
  • [20] Matheus V. X. Ferreira, Daniel J. Moroz, David C. Parkes, and Mitchell Stern. Dynamic posted-price mechanisms for the blockchain transaction-fee market. In Proceedings of the 3rd ACM Conference on Advances in Financial Technologies, pages 86–99, 2021. doi:10.1145/3479722.3480991.
  • [21] Guillaume Garrigos and Robert M Gower. Handbook of convergence theorems for (stochastic) gradient methods. arXiv preprint, 2023. arXiv:2301.11235.
  • [22] J Harold, G Kushner, and George Yin. Stochastic approximation and recursive algorithm and applications. Application of Mathematics, 35(10), 1997.
  • [23] Elad Hazan et al. Introduction to online convex optimization. Foundations and Trends® in Optimization, 2(3-4):157–325, 2016. doi:10.1561/2400000013.
  • [24] Sham M. Kakade, Shai Shalev-Shwartz, and Ambuj Tewari. Regularization techniques for learning with matrices. Journal of Machine Learning Research, 13:1865–1890, 2009. URL: https://api.semanticscholar.org/CorpusID:8477893.
  • [25] Stefanos Leonardos, Barnabé Monnot, Daniël Reijsbergen, Efstratios Skoulakis, and Georgios Piliouras. Dynamical analysis of the eip-1559 ethereum fee market. In Proceedings of the 3rd ACM Conference on Advances in Financial Technologies, pages 114–126, 2021. doi:10.1145/3479722.3480993.
  • [26] Stefanos Leonardos, Daniël Reijsbergen, Barnabé Monnot, and Georgios Piliouras. Optimality despite chaos in fee markets. In Financial Cryptography and Data Security: 27th International Conference, FC 2023, Bol, Brač, Croatia, May 1–5, 2023, Revised Selected Papers, Part II, pages 346–362, Berlin, Heidelberg, 2023. Springer-Verlag. doi:10.1007/978-3-031-47751-5_20.
  • [27] Yulin Liu, Yuxuan Lu, Kartik Nayak, Fan Zhang, Luyao Zhang, and Yinhong Zhao. Empirical analysis of eip-1559: Transaction fees, waiting times, and consensus security. In Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security, CCS ’22, pages 2099–2113, New York, NY, USA, 2022. Association for Computing Machinery. doi:10.1145/3548606.3559341.
  • [28] Brendan McMahan. Follow-the-regularized-leader and mirror descent: Equivalence theorems and l1 regularization. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, pages 525–533. JMLR Workshop and Conference Proceedings, 2011. URL: http://proceedings.mlr.press/v15/mcmahan11b/mcmahan11b.pdf.
  • [29] Patrick O’Grady. Multidimensional fees + fifo mempool. https://github.com/ava-labs/hypersdk/pull/352, 2023.
  • [30] Penumbra Team. Groundwork for multidimensional gas fees. https://github.com/penumbra-zone/penumbra/issues/2970, 2023.
  • [31] Daniël Reijsbergen, Shyam Sridhar, Barnabé Monnot, Stefanos Leonardos, Stratis Skoulakis, and Georgios Piliouras. Transaction fees on a honeymoon: Ethereum’s eip-1559 one month later. In 2021 IEEE International Conference on Blockchain (Blockchain), pages 196–204. IEEE, 2021. doi:10.1109/BLOCKCHAIN53845.2021.00034.
  • [32] Aaron Roth, Aleksandrs Slivkins, Jonathan Ullman, and Zhiwei Steven Wu. Multidimensional dynamic pricing for welfare maximization. ACM Transactions on Economics and Computation (TEAC), 8(1):1–35, 2020. doi:10.1145/3381527.
  • [33] Tim Roughgarden. Transaction fee mechanism design. SIGecom Exch., 19(1):52–55, July 2021. doi:10.1145/3476436.3476445.
  • [34] Uma Roy, John Guibas, Mallesh Pai, Kshitij Kulkarni, and Dan Robinson. Provewith.us. https://www.provewith.us/, 2024.
  • [35] Shai Shalev-Shwartz et al. Online learning and online convex optimization. Foundations and Trends® in Machine Learning, 4(2):107–194, 2012. doi:10.1561/2200000018.
  • [36] Hirofumi Uzawa. Walras’ tatonnement in the theory of exchange. The Review of Economic Studies, 27(3):182–194, 1960.
  • [37] Wenhao Wang, Lulu Zhou, Aviv Yaish, Fan Zhang, Ben Fisch, and Benjamin Livshits. Mechanism design for zk-rollup prover markets. arXiv preprint arXiv:2404.06495, 2024. doi:10.48550/arXiv.2404.06495.
  • [38] Anatoly Yakovenko and Tao Zhu. Simd-110: Exponential fee for write lock accounts, 2024. URL: https://github.com/solana-foundation/solana-improvement-documents/pull/110.

Appendix A Alternative multiplicative choice function

Instead of setting a bound on the gradients such that the multiplicative update choice function of §3.1 has arguments zRm with zlog(M), we can instead choose the function F in the following way: if we know that pM (or if we wish to have a rule that satisfies this condition) then we can set the function F to be, instead,

F(z)=i=1mmin{(p0)iexp(zi),M(zilog(M/(p0)i)+1)}.

We then have that

(F(z))i={(p0)iexp(zi)zilog(M/(p0)i)Motherwise.

It is not hard to show that F is indeed convex by noting that it is both separable and the derivative of any one term is nondecreasing by construction. Additionally, F is smooth with constant mM, which is easy to see by noting that, again, F is separable and the second derivative of each term is bounded from above by M. By construction, we will have that any

p~F(z),

satisfies p~M, as required, while

F(p)=i=1m(pilog(pi(p0)i)pi),

whenever 0pM𝟏 (where we define 0log0=0 for convenience).

Appendix B Noncounting lower bound

In this appendix, we give a simple lower bound for the expected value of a random walk. More specifically, given εt{±1} chosen uniformly and independently for t=1,,T, we will show that there exists a constant C1/12 such that

𝐄[|t=1Tεt|]CT.

Since εt is symmetric about 0 (i.e., εt and εt have the same distribution) then the positive part would satisfy

𝐄[(t=1Tεt)+]C2T124T,

given the above. This material is a shorter version of [4].

Proof.

For succinctness, we will write the sum as

X=t=1Tεt.

The odd moments of X vanish, while the second and fourth moments are

𝐄[X2]=T,𝐄[X4]=3T22T3T2.

which is easy to see from expanding the sum and noting that any term containing an odd power of εt has expectation equal to zero. The bound will come from the simple observation that, for any a0,

𝐄[|X|]aPr(|X|a). (21)

We bound the second term in the inequality from below by expanding the second moment,

𝐄[X2]a2+𝐄[X2𝟏|X|a]a2+𝐄[X4]𝐄[𝟏|X|a]=a2+𝐄[X4]Pr(|X|a), (22)

for any fixed constant a0, where 𝟏|X|a is the zero-one indicator function for the event |X|a. Here, the first inequality follows from the pointwise inequality

X2a2𝟏|X|<a+X2𝟏|X|a,

while the second inequality follows from an application of Cauchy—Schwarz to the latter term and the fact that 𝟏|X|a2=𝟏|X|a. Rearranging the second moment bound (22) above, we find

Pr(|X|a)(𝐄[X2]a2)2𝐄[X4](Ta2)23T2,

where we have implicitly assumed that a2T. Plugging this in to the basic bound (21) on the expectation gives, for any 0aT,

𝐄[|X|]a(Ta2)23T2.

The right hand quantity is maximized at a=T/5 (which is easy to see by using the first-order optimality conditions applied to a) so we get

𝐄[|X|]16755T112T,

which means that C1/12, as required. We note that this constant is loose, as a simple counting argument shows that, in fact

C2/π,

for T large enough, but this bound is enough to show the lower bound holds up to a constant.

Appendix C Follow-the-regularized-leader interpretation

We may interpret the price update rule (11) as follow-the-regularized-leader (FTRL), with the regularizer defined by the choice function. The FTRL update rule is given by

pt+1=argminpDs=1tgsTp+(1/η)R(p),

where R is some strongly convex regularizer and D is the effective domain of the {ft}. From the optimality conditions, we can write

pt+1=(R)1(η(g1++gt)),

assuming the range of (R)1 is contained in the domain D. (Note that R:DRn is an invertible function since R is strongly convex.) Comparing this equation with the choice function-based price update,

pt+1=F(η(g1++gt)),

we immediately see the relationship between the choice function F and the regularizer R, namely that R=(F)1 and vice versa. (Equivalently, this is saying that R and F are Fenchel conjugates of one another.) We may alternatively interpret the update rule as online mirror descent with Bregman divergence defined by the regularizer. See [28] for details.

EIP 1559-style price updates.

This connection allows us to interpret the multiplicative update rule (7) as FTRL. Consider the loss function t(y)=I(y=b) for some target gas usage bR+. The matrix At is simply a row vector of the gas costs for each transaction. The gradient at block t, which has gas price pt, is then

gt=ft(pt)=bAtxt(pt).

From section 3.1, we write the EIP-1559-style multiplicative update rule with p0=1 as

pt+1=ptexp(ηgt)=F(η(g1++gt)).

As discussed above, we may also interpret this update rule as FTRL with the regularizer

R(p)=plog(p)p.

This regularizer achieves its minimum at p=p0=1. Note that it is not symmetric around its minimum; price decreases are more quickly penalized than price increases.

EIP 1559 as online mirror descent.

Alternatively, we may interpret the EIP-1559-style multiplicative price update rule as online mirror descent with Bregman divergence defined by the nonnegative entropy plog(p). This update rule is given by

pt+1=argminp𝐝𝐨𝐦ff(pt)+gtT(ppt)+1ηD(p,pt),

where the first two terms are a linear approximation to the convex function f and the last term is the Bregman divergence regularizer

D(p,pt)=plog(p)ptlog(pt)(log(pt)+1)(ppt).

Dropping constant terms, we can rewrite the update rule as

pt+1 =argminp𝐝𝐨𝐦fgtTp+1η(plog(p/pt)p)
=ptexp(ηgt).

Note that the regularizer D(p,pt) achieves its minimum at p=pt and penalizes price decreases more than price increases. (See figure 1.) This asymmetric penalty may explain some past observations, e.g., that EIP 1559 has a positive overshoot [26].

Figure 1: Bregman divergence corresponding to the nonnegative entropy function, which is used in the EIP 1559-style update rule.