Stable Matching with Interviews
Abstract
In several two-sided markets, including labor and dating, agents typically have limited information about their preferences prior to mutual interactions. This issue can result in matching frictions, as arising in the labor market for medical residencies, where high application rates are followed by a large number of interviews. Yet, the extensive literature on two-sided matching primarily focuses on models where agents know their preferences, leaving the interactions necessary for preference discovery largely overlooked. This paper studies this problem using an algorithmic approach, extending Gale-Shapley’s deferred acceptance to this context.
Two algorithms are proposed. The first is an adaptive algorithm that expands upon Gale-Shapley’s deferred acceptance by incorporating interviews between applicants and positions. Similar to deferred acceptance, one side sequentially proposes to the other. However, the order of proposals is carefully chosen to ensure an interim stable matching is found. Furthermore, with high probability, the number of interviews conducted by each applicant or position is limited to .
In many seasonal markets, interactions occur more simultaneously, consisting of an initial interview phase followed by a clearing stage. We present a non-adaptive algorithm for generating a single stage set of in tiered random markets. The algorithm finds an interim stable matching in such markets while assigning no more than interviews to each applicant or position.
Keywords and phrases:
Stable Matching, Gale–Shapley Algorithm, Algorithmic Game TheoryCopyright and License:
![[Uncaptioned image]](x1.png)
2012 ACM Subject Classification:
Theory of computation Algorithmic game theoryFunding:
The authors acknowledge the support of NSF awards 2209520, 2312156, and a gift from CISCO.Editors:
Raghu MekaSeries and Publisher:

1 Introduction
In many two-sided matching markets used for dating or allocating labor, agents often interact prior to forming matches in order to gain information about their potential partners. These interactions, meant for learning preferences, can generate congestion, resulting in market inefficiencies. For example, in the market for fellowships in sports medicine, on average more than 23 applicants interview for each slot [33]. Similarly, in dating markets, such as the popular platform Tinder, only a small fraction of online conversations culminate in an in-person meeting [20, 15].
This paper considers the problem of how to alleviate interview congestion and improve welfare in two-sided matching markets. We take an algorithmic approach to form interviews and find stable matches while minimizing congestion.
We study this question in the classic two-sided marriage problem by [14]. In the model, there are applicants and positions. Each applicant and position has a publicly known value of and , respectively. The subjective interest of for matching with , denoted by is unknown unless the parties meet each other. The observed utility of in is if and have met, and equal to otherwise. The distributions of s are known, they are all independent, and their expected value is zero. The subjective interest and the observed utility of positions in the applicants are defined similarly.
A matching is interim-stable if (i) all the matched pairs have interviewed each other and (ii) there are no blocking pairs with respect to the observed utilities. One simple approach for finding an interim stable matching is to ask all pairs to interview each other and then find a stable matching with respect to the observed utilities. That involves interviews per agent. On the other hand, even in the special case in which all the s and s are equal to each other, finding an interim-stable matching requires at least interviews per agent.
The main result of the paper is to show that the number of interviews needed to find an interim-stable matching is much closer to the above lower-bound than the upper bound. In fact, it is possible to find an interim stable matching by assigning only a polylogarithmic number of interviews to each applicant or position. This is shown for both adaptive and non-adaptive models.
1.1 Approach
Adaptive algorithm.
We present an adaptive algorithm that finds an interim stable matching while requiring that each applicant and position engage in at most interviews. The algorithm is applicable to a general setting, where the publicly known values assigned to applicants and positions can vary arbitrarily, and the subjective interests are independent and identically distributed.
Theorem (See Theorem 1).
There exists an adaptive algorithm that finds an interim stable matching between positions and applicants such that with high probability every applicant and position participates in at most interviews.
The proposed algorithm expands upon deferred acceptance [14] by dividing the process into distinct interview and proposal stages. Specifically, in the applicant-optimal version, each applicant submits a proposal to the position offering the highest observed utility. The position will accept the “highest” proposal or assign an interview if the proposing applicant has the potential to create a blocking pair, taking into account potential changes in the observed value following the interview.
The main technical ingredient of the analysis is to show the probability that an applicant gets rejected by more than consecutive positions is very small. More precisely, with probability at least , each applicant is matched to position for .
Non-adaptive algorithm.
In the adaptive model, the algorithm is allowed to determine the sequence of interviews based on the results of previous ones. However, in practice, many markets favor a non-adaptive approach in which the interview lists are predetermined beforehand to allow the parties to streamline scheduling and the rest of the logistics.
We present a non-adaptive algorithm for interim stable matching with a poly-logarithmic number of interviews per agent. The algorithm works in tiered markets in which applicants and positions are partitioned into multiple tiers. Two applicants and in the same tier have the same publicly known value, i.e., . On the other hand, if an applicant is in a higher tier than , then it is always preferred independent of the interview outcomes, i.e., for all position , .
Theorem (See Theorem 18).
There exists a non-adaptive algorithm that, with high probability, finds an interim stable matching for tiered markets with no more than interviews for each applicant or position.
The design of the non-adaptive algorithm incorporates two key technical components. First, we observe that in markets where all applicants and positions are in a single tier, requiring each applicant to interview a set of positions chosen uniformly at random combined with an applicant-optimal Gale-Shapley algorithm results in an interim stable matching.
On the other hand, when all positions are in the same tier while applicants are in individual tiers, random interviews prove to be ineffective. This is primarily due to positions exhibiting a preference for applicants in the corresponding higher tiers. Consequently, these higher-tier applicants are given higher priority in the matching process. The challenge arises for the lowest-tier applicant who, despite undergoing a poly-logarithmic number of random interviews, struggles to find the sole available position among the interviewed positions. To address this, a different approach is taken. By fixing the order of positions, interviews are assigned to each applicant with consecutive positions whose indices closely match the tier number of the applicant. This strategy ensures that applicants with similar tier numbers have a substantial overlap of interviews. Consequently, positions that are interviewed by higher-tier applicants will be matched to them and will not be available options for lower-tier applicants.
The non-adaptive algorithm combines the above two technical ingredients with a few other ideas. Specifically, it creates a sequence of bipartite graphs , where each may be a random graph like the first special case we considered, a sequential graph like the second special case, or a small complete bipartite graph when we have two small tiers. We refer the reader to Section 3.3 for the details of the algorithm and its analysis.
Further, in Section 3.4, we demonstrate that if applicants and positions have individual preferences within the tiers prior to the interviews, the exact same ideas and proofs presented in Section 3.3 carry over, with some mild assumptions about the distribution of individual preferences before the interviews and subjective interest arising from the interview.
The extensive literature on two-sided matching has led to a rich theory and successful designs in practice. However, most of the existing work focuses on models in which agents know their own preferences. Little is known about the interaction period in which agents learn about their preferences by interacting with each other. This paper attempts to address this gap by looking at this problem with an algorithmic lens, focusing on extending Gale-Shapley’s deferred acceptance to this setting.
1.2 Related literature
Congestion and frictions in the early stages of two-sided markets stem from the lack of complete information and competition. Several approaches have been proposed to improve the screening process, including signaling, limiting the number of applications, and even coordinating interviews.
Signaling allows applicants to send programs “interest” signals [19, 10, 16], which intend to improve efficiency by helping in screening applications and interviewing applicants that are typically beyond reach. The literature has investigated how a limited number of signals indicating special interest can enhance efficiency. [16] demonstrate the impact of different signal quantities on the matching outcome. Signaling is used in the Economics job market [9] and experimented in residency and fellowship markets [28, 30]. Instead of signaling mechanisms, this paper proposes interview mechanisms that address interview congestion by eliminating interviews that are unlikely to form a match.
Another approach considered is to limit the number of applications applicants can send. [1, 6, 32] demonstrate the benefit of limiting the number of applications when agents have complete information about their own preferences. We note that [6] and [32] further assume that agents are ex-ante symmetric.111Limiting applications has also been proposed in for medical markets [31, 8, 26].
Several papers analyzed games induced by inviting agents for interviews [11, 17], demonstrating various frictions. Other CS papers study how to make interview decisions in the worst-case towards reaching a stable matching [12, 29].
A related approach for how to coordinate interviews is also studied. Specifically, [23] and [2] find that the match rate is high when agents have a similar number of interviews. [18] propose the idea of incorporating overlaps between positions. Our preference model allows for a more heterogeneous tiered market structure, which results in the non-adaptive algorithm assigning agents possibly an unequal number of interviews.
Our paper is inspired by [25, 24] who propose to conduct an interview match that will limit the number of interviews. Similar to our paper, [2] study non-adaptive interview mechanisms that generate many-to-many matching in large random markets with interim stability in the limit. In their model different applicants (positions) have heterogeneous pre-interview utilities of being matched to some position (applicant). Their heterogeneous utilities are a summation of common scores (generated from a continuous distribution) and idiosyncratic scores (and in particular the market is not tiered). Instead, we consider arbitrary homogeneous pre-interview utilities (i.e., just common scores) and propose an adaptive algorithm generating an interim stable matching for any finite market size. [2] further points to the challenge of handling tiered markets.222They offer a heuristic for two-tiered markets; this heuristic adds to higher tier agents, safety interviews with lower tier agents. Instead our non-adaptive algorithm generates an interim-stable matching in every homogeneous tiered market.
This paper relies on the existing literature on random two-sided matching markets. Our non-adaptive algorithm incorporates the findings of [5] and [7] as a fundamental component. These prior studies specifically examine the average ranking of the matched positions for individual applicants in the applicant-proposing Gale-Shapley algorithm when preference lists are randomized. We further extend the tiered market model in [4] to allow for noisy preferences. [4] focuses on minimizing communication to reach an exact stable matching with high probability in a random market when agents know their own preferences. Instead, agents in our model know the public values and have perfect information only over public values and we seek to minimize the number of interviews required to learn about post-interview preferences in order to reach interim stability.
The concurrent work [3] extends our framework in two notable ways. First, it examines markets with heterogeneous pre-interview utilities, addressing scenarios where agents have differing pre-interview preferences. Second, it explores simple decentralized signaling mechanisms to determine which pairs should interview each other.
The authors demonstrate that in a single-tiered random market with sparse signals (), almost interim stability can be achieved. Specifically, the matching becomes perfectly interim stable after removing a vanishingly small fraction of agents. Furthermore, in the case of dense signals (), perfect interim stability can be attained in imbalanced markets through short-side signaling. Additionally, they extend their results to multi-tiered markets and identify conditions under which signaling mechanisms are incentive compatible.
Finally, there is a growing literature in Economics that studies the structure stable matchings when agents have partial knowledge of their own preferences Liu et al. [22], Liu [21]. This literature does not consider the matching dynamics that arise prior to being matched. An exception is [13], which contributes to our understanding of the stages that occur prior to a match.
1.3 Our Model
Let and denote the set of applicants and positions respectively. The utility of applicant for position , can be written as . The quantity reflects the characteristics of the position, like working hours, salary, or prestige, and is public knowledge. Applicant ’s subjective interest in position is captured by , which will only be revealed to the applicant after the interview. We assume ’s are independently sampled from a known symmetric distribution with mean zero. Define the utility of position in applicant , similarly. Note that ’s and ’s are also independent from each other.
We define interim stable matchings. Let the observed utility of in , be if and have interviewed, and equal to otherwise. Define ’s or the observed utility of positions in the applicants similarly. A matching is interim stable if all the matched pairs have interviewed with each other and there are no blocking pairs with respect to observed utilities. In other words, and imply either or . Throughout the paper, we assume the applicants and positions prefer being matched to someone to remaining unmatched.
For ease of notation, we use and to refer to the position or the applicant they are matched to, respectively. Define if is not matched. We also use and to denote the preferences of applicants and positions, e.g., if and only if .
We will study adaptive and non-adaptive algorithms for finding interim stable matching. Adaptive algorithms use the outcome of previous interviews to propose the next one. In contrast, non-adaptive algorithms determine the interviews that should be conducted between all the applicants and agents in one shot.
2 An Adaptive Algorithm
In this section, we present an adaptive algorithm for finding an interim stable matching. Our algorithm extends Gale-Shapley’s deferred acceptance to incorporate interviews between the applicants and positions. Just like in deferred acceptance, one side makes proposals to the other sequentially, but the order of proposals is chosen carefully so that with high probability, the number of interviews needed to obtain interim stability is no more than .
Theorem 1.
Algorithm 1 finds an interim stable matching between positions and applicants. Moreover, with high probability, the number of interviews done by each applicant or position is at most .
Let us start with an informal overview of the algorithm. Consider the applicant-proposing deferred acceptance mechanism. In the beginning, has the largest expected utility and is, therefore, the first choice for all applicants. Similarly, since is the applicant with the largest expected utility, it is ’s first choice. The algorithm asks and to interview each other. The interview may change and ’s preferences. If they are still at the top of each other’s preference list, we can match them and reject all other applicants for ; otherwise, we update the preference list of and and continue the process.
For an unmatched applicant , let be the most preferred position from which the applicant is not yet rejected. At any time, is the position to which applicant is going to propose next. At each step, we consider the position with the smallest such that there exists an applicant where .
Suppose that among all proposals that is receiving, is the one that has the largest observed utility, i.e. . If and have already interviewed each other, then the process is similar to deferred acceptance: if , position rejects applicant ; otherwise, rejects and matches to .
Now consider the case where has not interviewed . If and , rejects without an interview. Otherwise, and interview each other and update their observed utility and preference list. The algorithm ends when all agents are matched. See a formal description below.
2.1 Analysis of Algorithm 1
We will prove Theorem 1 in the rest of this section. We will show that in the course of the algorithm, all positions interview candidates that have a fairly similar global ranking. More specifically, with high probability, every position only interviews applicants where .
We start with a few simple observations about the algorithm. First, note that similar to the Gale-Shapley algorithm, when a position gets matched to an applicant , it remains matched until the end of the algorithm. Further, may only reject to match with a more preferable applicant.
Claim 2.
Suppose that unmatched position and applicant interview each other at some point during the algorithm and both observe non-negative and . Then, the algorithm (tentatively) matches them to each other. Subsequently, may only interview applicants for which .
Proof.
If and , the interview does not change and applicant remains the most preferable applicant proposing to . Hence, the algorithm matches applicant to position in the next iteration.
Now, observe that, based on Algorithm 1 of Algorithm 1, is going to interview an applicant if either or if , which implies .
Observation 3.
If position interviews applicant and , then is interviewed by all where .
Proof.
Before proposing to , proposes to all positions with . Also, based on Algorithm 1 of the algorithm, when , does not reject without an interview.
The next lemma establishes that positions do not interview any applicant with a significantly higher index. Moreover, we show that with the possible exception of positions with the highest index (and lowest expected utility for the applicants), the positions get matched sequentially in the increasing order of their index.
Lemma 4.
With probability of at least , if and interview each other, then . Also, at the time an applicant proposes to ,
-
If , then for ,
-
If , then for .
Proof.
The first part of the lemma holds trivially when . So suppose and let be the set of agents with . At most agents from can be matched to position for at any time. Further, applicants propose to positions in the increasing order of their index. Hence, in order for to get matched to an applicant with , it should reject at least applicants from set . We will show that such an event is extremely unlikely.
Let be the set of the indices of the first applicants interviewed by . By the above argument, . By 2, if interviews some applicant and observes that both and are non-negative, then and get matched. Since and are drawn independently at random from a symmetric distribution with mean zero, . Therefore,
A union bound over the above events implies the probability that there exists an such that and interview each other and is at most .
The second part of the lemma follows by observing that as long as each position with gets matched to one of its first proposals, does not receive any proposals before is matched.
For the rest of the section, we condition on the high probability events stated in Lemma 4. For , let and . Also, let and .
Claim 5.
For any ,
Proof.
Since we condition on Lemma 4, Therefore,
Claim 6.
Suppose that the algorithm is considering the proposal between applicant and position . Then, for any ,
Proof.
We need to prove the statement for . By Lemma 4, when the algorithm is considering pair , we have for . By 5,
which implies,
(1) |
On the other hand,
Lemma 7.
With a probability of at least , if position interviews applicant then .
Proof.
Let and . The statement holds for since . Suppose for some , applicant and position interview each other at some point during the algorithm. At that time, by Lemma 4, all positions with index smaller than are matched to an applicant. Also, by 3, all positions in have already interviewed , but none of them is matched to . We will show that the probability of such an event is at most .
Consider all positions for and define
By 6, there are at most positions in the set of which are matched to an applicant with an index smaller than . The rest are matched to an applicant in . Therefore, .
Define to be indicating whether ’s utility for matching to is higher than being matched to all elements in and to be , , and . By 2, implies that can not be matched to an applicant with an index more than . Further, by 6 at most positions in may be matched to an applicant with index less than , so it is sufficient to bound the probability that .
Observe that for all . Also, note that because of the independence of the subjective component of utilities, ’s are independent. Further, . Using Chernoff bound,
The first inequality is due to assuming is sufficiently large. Applying union bound over all applicants completes the proof.
Proof of Theorem 1.
The statements of Lemma 4 and Lemma 7 hold with high probability, for all applicants . Therefore, every is interviewed only by positions where . Similarly, for every position in Algorithm 1, the position only interviews ’s for which . Therefore, the number of interviews done by each applicant or position is at most .
The proof of stability is fairly similar to the analysis of deferred acceptance. Suppose there exists a blocking pair . This implies that and . Therefore, must have been rejected by during one of the iterations of Algorithm 1. This implies that position is matched with an applicant who has a higher observed utility, and the current match of should not be worse than , i.e., . That is a contradiction, as we initially assumed that is a blocking pair.
3 A Non-Adaptive Algorithm for Tiered Markets
The algorithm in the previous section forms the sequence of interviews adaptively, suggesting each interview based on the outcome of the earlier ones. However, many markets operate in a more simultaneous manner, where there is a stage of interviews followed by a clearing phase. Therefore, it may be more efficient if the interview lists are decided either in advance or independently for each position.
In this section, we present a non-adaptive algorithm for the problem in markets in which applicants and positions are partitioned into tiers, extending the model in [4] to allow for post interaction noise. Applicants have the same ex-ante utility for two positions in the same tier but prefer a position in a higher tier to a lower-tier position with probability 1. The same is true for positions.
More formally, let denote the tiers of positions. Remember the utility of an agent for position , . We will assume that if and only if and are in the same tier, i.e., for some , and if is in a higher tier than , i.e. for some . Further, ’s are all independent and identically distributed, and their distributions are bounded and symmetric around . Therefore, if and belong to the same tier and equal to 1 otherwise. We use relative index of a position to show the index of the position within the tier it belongs to. Formally, for position where , the relative index is equal to . Define the tiers for applicants similarly. We assume that the number of applicants and positions are equal in the general tiered markets, i.e. , and every applicant and position will be matched. However, to develop the algorithm for the general tiered market, we use two subroutines that might need to solve a subproblem with case (see Section 3.1 and Section 3.2).
Our non-adaptive algorithm works in two phases. In the first phase, the algorithm proposes a set of interviews between the two sides. We represent this by a bipartite graph , where each edge denotes an interview between its endpoints. In the second phase, after the interviews are conducted and the corresponding for are revealed, the algorithm finds an interim stable matching between two sides. The main result of this section is that the proposed interviews in the first phase are such that (i) no agent or position has more than interviews, and (ii) after the interviews are done, the algorithm can find an interim stable matching supported by with high probability.
In order to explain the main ideas of the algorithm, it will be instructive to look at two special cases in the following subsections.
3.1 Example 1: Single Tier Structure
In the first example, the applicants and positions are each in their own single tier, i.e. . The number of applicants and positions can be different333Note that in the setting of the general tiered market, we assume , however, the single tier structure serves as a subroutine of the algorithm for the general tiered market and might need to solve a subproblem with . but assume that and are sufficiently large, and without loss of generality, . In this case, first form by connecting each applicant to positions chosen uniformly at random from the .
Each applicant interviews with positions and the distribution of ’s are symmetrically distributed around 0. So, for every , of the ’s are going to be non-negative in expectation. In fact, a simple Chernoff bound shows that with high probability, for every applicant , at least of positions interviewing the applicant have a non-negative .
Claim 8.
Among positions that applicant is interviewing, at least of them have with probability .
Let be the preference list of applicant in the decreasing value of (break ties randomly). It is not hard to see that each is a random permutation chosen over the ordering of positions. Following the same argument, we can form to be the random permutations representing the preference list of position according to the decreasing values of . The above observation allows us to take advantage of the following lemma.
Lemma 9 ([27], Theorem 6.1).
Suppose that we run the applicant-proposing Gale-Shapley algorithm for the above random permutations. With probability , each applicant gets matched to a position with a rank less than in permutation , where .
The above lemma combined with 8 implies that every applicant is matched with a position where .
Lemma 10.
For any and such that , with probability , each applicant gets matched to a position that has interviewed and , where .
3.2 Example 2: One Large Tier vs Multiple Singleton Tiers
In this example, all positions belong to the same tier, but each applicant is in a tier of its own. In other words, the applicants are ex-ante indifferent among positions, but all positions prefer applicant to when .
Before presenting our solution, it may be worthwhile to understand why forming the interview graph randomly in the same fashion as our previous example is not suitable for this variation of the problem. First, note that since agent is the first choice for all the positions, should be matched to its most preferred position in every interim stable matching. Removing applicant and that position, the same argument applies to applicant , and so on.
Now suppose that each applicant interviews random positions similar to the algorithm in Section 3.1. Since we choose random positions for applicant to interview and then applicant chooses the most preferred position after the interviews, all positions have the same probability of being the most preferred position for applicant in this process. Therefore, the process is equivalent to the case that we choose a random position for applicant and remove both, then we choose a random position for and remove both, and so on. Suppose the algorithm successfully matches the first applicants to their positions. When only the last applicant remains, it is crucial to show that the only remaining position should be one of the positions that interviewed. Otherwise, the resulting matching is not interim stable. Since we are removing positions randomly, the probability that the last remaining position is among the position that interviewed is . Therefore, we cannot find an interim stable matching supported within if we want .
Instead, for this example, our algorithm forms by connecting every agent to positions such that , where and . Note that the degree of applicants with an index close to 1 or may be smaller than the rest of the applicants.
In this construction, applicants whose tiers are close to each other interview for sets of positions that are similar. Therefore, there is a high correlation between the positions to which they get matched. As a result, with high probability, for every , the set of positions that have interviewed and are not matched to one of the higher-tier applicants is non-empty. Further, can find a stable match in this set. We will prove this statement in more generality in the next section when analyzing the algorithm for the general case.
3.3 An Algorithm for General Tiered Markets
We are now ready to present our algorithm and its analysis for tiered markets in their full generality. As we said before, the algorithm works in two phases. In phase 1, Algorithm 2 constructs a bipartite interview graph between applicants and positions. This is done possibly in several iterations. In each iteration, , the algorithm adds an edge set between two subsets and to . Further, it identifies whether the applicants or positions are going to be the proposing side in this part of the graph. Note that an applicant or a position can be in multiple s or s.
In the second phase, Algorithm 3 conducts the interviews between all the pairs in and updates the preferences of the two sides. Then, in each iteration , it removes vertices that are matched in earlier iterations and implements either a position or applicant-proposing deferred acceptance algorithm between and . We discuss the details of this algorithm later in the section.
In practice, one should first implement Algorithm 2 to construct the interview graph and then Algorithm 3 to implement the interviews and find the stable matching. But it will be useful for the analysis to couple the two algorithms and consider them together after each iteration .
Algorithm 2 considers applicants and positions starting from the top. Let and be the set of top-tier applicants and positions. Label the sets and in such a way that . We will consider three different cases: (1) If both and are relatively small, i.e. and for , we can afford to set up an interview for each pair between and . (2) If and , we choose a set of size comprised of vertices of with the lowest index and set up an interview between each pair in and . (3) If both and are large than , we use the same approach as Section 3.1. Specifically, for each vertex in , we choose random vertices from the first positions in to interview.
When both and are larger than , we can remove both and the first vertices of from and and move to the next iteration of the algorithm. The situation is more subtle when for a short tier , we choose a set which has a size larger than . In this case, in the same iteration of Algorithm 3, some of the vertices of will remain unmatched. A priori, and without knowing the outcome of the interviews between these two sets, we do not know which vertices of are going to remain unmatched. Therefore, we will not remove any vertex from immediately. Because of this subtlety, we will have to keep track of the “effective cardinality” and . They are updated to be equal to the number of unmatched vertices in and , respectively, at the same iteration in Algorithm 3.
The algorithm continues in the same fashion. At every step, the top non-empty tiers and from each side are selected and named so that . We call the short side and the long side.
In the rest of this section, we prove the correctness of the algorithm and give an upper bound on the number of interviews done by every applicant or position. First, we show that at any time during the algorithm is close to .
Invariant 10.
For a tier , after every iteration during the course of Algorithm 2, .
Proof.
We prove this by induction on the number of iterations. Initially, . Now assume that before the th iteration. If is the short side in iteration , then all vertices of will be removed after this iteration, and thus . Also, if is the long side and the algorithm is in case 3, then by Algorithm 2 of Algorithm 2.
In all other cases, decreases after the iteration. Further, by Algorithm 2, if , we remove vertices of for which .
Also, it is not hard to see that all applications and positions will be removed by the end of Algorithm 2.
Observation 11.
After the last iteration of Algorithm 2, all applicants and positions are removed.
Proof.
Let be all tiers of positions and be all tiers of applicants. Note that at all times during the execution of Algorithm 2 since in all three cases, the effective cardinality of the short side tier is subtracted from both and . Since the algorithm does not terminate until either or both or , then all vertices are removed when the algorithm terminates.
We let be the set of applicants that are endpoints of . We define similarly. Moreover, let and be the set of applicants and positions that are endpoints of , and are unmatched when we run Gale-Shapley in Algorithm 3 between and . Therefore, if and belongs to tiers and , then and at the time of iteration . As we mentioned before, a vertex may appear in multiple s or s. We will show that such sets are consecutive. To do so, we first give a helpful property of the algorithm.
Claim 12.
For a tier that has interview with tiers where for all , define be tier before interviewing with in Algorithm 2 for . Suppose for , then all vertices in with relative indices at most will be selected for interview with , and , where is the initial size of tier before the execution of Algorithm 2.
Proof.
The algorithm always chooses vertices with the lowest index from the long side to interview. Consider the interview with , those are the vertices with a relative index of to . And by definition of Algorithm 2, .
Claim 13.
For each vertex , there exist and such that is in or iff .
Proof.
Let be the tier containing and be the initial size of tier at the start of Algorithm 2. Also, let be the first iteration considering and be the effective size of tier at that point. It is not hard to see that if is the short side in this iteration, or if the short side has size , all the vertices of that have an interview in this iteration will be removed, and we are done. When is the long side and , every vertex in will be interviewed.
The only remaining case to consider is where is on the long side, , and the tiers considered alongside are , where for all . From 12 and the fact that decreases monotonically, as long as is not removed from the graph, it will be selected for interview, which means it will be the endpoint of at least one edge in . That implies is in or by definition.
Lemma 14.
Suppose , , and . Then, with probability , gets matched in iteration of Algorithm 3 to some position such that .
Proof.
Let and be two tiers that vertices of and belong to, respectively. Also, by the definition of , we have at the time of th iteration. If , since , we choose a subset of with size and interview all edges. This set contains vertices that are not matched at th iteration. By Lemma 10, if we choose , then and with probability
will match to a position such that .
Similarly, if , we choose a subset of that contains unmatched vertices. For each vertex of we interview random positions in . Hence, if we choose in Lemma 10, then , and with probability
will match to a position such that .
Lemma 15.
With probability , if a vertex is removed in iteration of Algorithm 2, it will get matched via an edge in .
Proof.
Without loss of generality, assume that the removed vertex in iteration is applicant in tier . Let be the tier of the other side that the algorithm considers in iteration . First, assume that is the short side, , and at iteration . According to Case 1 of Algorithm 2, all vertices of interview all vertices of , hence, the statement hold with probability 1. Now suppose that is the short side and at iteration . Hence, by definition, we have and . Therefore, by Lemma 14, is matched to a position such that with probability , which implies that . Furthermore, if , , and is the long side, . Since for this case we run a position-proposing Gale-Shapley in Algorithm 3, by Lemma 14, all positions in get matched to applicants in with edges . Thus, all vertices of are matched to positions in using edges since we have .
The only case that remains to be investigated is when is the long side in several iterations until is removed at iteration , i.e. , for and . Also, let be the tiers of the other side in each iteration. Let . First, we prove that .
Let be tier before the th iteration of Algorithm 2 for . Also, let be the relative position of in tier before any iteration. During iteration , since , 12 shows that (1) every vertex in with index at most is in ; (2) . For , the first case is that and when there is no iteration on before or when , and the second case is that when . In both cases, we have . Since is removed in iteration , we have , thus
Now consider iteration and suppose that is not matched yet. Then, with a probability of at least , applicant will be matched in iteration because all applicants of are in the same tier, and subjective interests are i.i.d. Moreover, by definition. Therefore, the probability that remains unmatched after iteration is at most
which completes the proof because of our choice of and .
Using a similar argument as Lemma 15, the next lemma gives a bound on the degree of every node in .
Lemma 16.
The number of interviews assigned to every position or applicant is at most
Proof.
We will show the above for every applicant . The proof for positions is the same. Let be the tier that belongs to. If is the tier of the short side at iteration , then has at most incident edges in by definition of Algorithm 2 and Section 3.3. Also, if both and , by Case 3 of the Algorithm 2 and Section 3.3, each vertex in or has at most interviews in iteration . Note that each vertex is in one of the above scenarios at most in one iteration since after that it gets removed. Therefore, similar to the proof of the previous lemma, it only remains to show that the number of interviews is bounded when is the long side in several iterations until is removed at iteration , i.e. , for , and . Let be the tiers of the other side in each iteration.
Define , , and for as before. With a similar argument, we have , and
Thus, . Note that the number of interviews for is equal to . For all tiers except , we have at the time of the iteration . Also, by Section 3.3, we have . Therefore,
which implies the total number of interviews for vertex is at most before it gets removed which concludes the proof.
Lemma 17.
The matching produced by the algorithm is an interim stable matching with probability .
Proof.
By 11, all applicants and positions are removed from the graph after Algorithm 3 finishes. Using union bound for all vertices in Lemma 15, with probability , all vertices are matched using an edge in . Consider tiers and from applicants and tiers and from positions and suppose that is a higher tier than and is higher than . Let and . We claim that at most one of the following holds: or . Without loss of generality, suppose that . This implies that at some iteration of the Algorithm 2, the algorithm considers tiers and in Algorithm 2. Hence, all vertices of should be already removed, which implies .
Now, consider applicant and position that are not matched to each other. We prove that is not a blocking pair. Let and be the matches of and , respectively. Also, let and be the tiers that include and , respectively. We use to show tier is preferable to tier . If such that , then this pair is not a blocking pair since . A similar argument works for . By the above argument, and does not belong to tiers and such that and . Hence, either or both and . Thus, and are once considered at the same time in one iteration in Algorithm 2 of Algorithm 2.
Without loss of generality, suppose that at iteration , the algorithm considers and in Algorithm 2 and is the short side. By definition, we have . If , then , which implies that . Thus, is not a blocking pair. If , since the algorithm finds a stable matching between and , then is not a blocking pair, which finishes the proof.
Theorem 18.
There exists a non-adaptive algorithm that finds an interim stable matching for tiered ranking agents with high probability such that the number of interviews done by each applicant or position is at most .
Proof.
3.4 Extension
In the aforementioned tiered market scenario, we presume the absence of ex-ante preferences within a tier. However, agents may have distinct individual preferences within tiers. For example, in the medical context preferences can be shaped by various factors, including geographical location, the availability of extracurricular activities, specific research laboratories they aspire to join, etc. To capture this characteristic of the market, we introduce the following model. Applicant ’s utility for position is now defined to be , where if and only if and are in the same tier, and if is in a higher tier; is a small random perturbation of the ex-ante preference which is known before the interview; For now, we assume that s and s are all drawn from the uniform distribution over [-1,1] independently. The same applies to positions.
We first briefly argue that our algorithm works for the extended tiered markets. Consider the first subproblem, Single Tier Structure.
-
Using the same Chernoff bound arguments, we can prove that for each applicant, there is a constant such that of the interviewed position has .
-
In the extended model, the preference lists are still random permutations over the randomness of both and , thus each applicant will be matched to a position with a high rank.
Combine the above facts, we can give a similar argument as Lemma 10 that each applicant gets matched to an interviewed position with . In this way, when we use it as a subroutine, it still satisfies the property that the applicant would never want to switch to positions that are not interviewed in the same tier since .
Consider the second subproblem, One Large Tier v.s. Multiple Singleton Tiers. Using a similar Chernoff argument as above, it is guaranteed that a higher-tier applicant is matched to the most preferred available position that is interviewed with . Thus they have no incentive to switch.
The proof for the extended general case is a combination of the two extended subproblems, and can be modified from the current one by changing the arguments in Lemma 14 from to .
Remark.
Although the boundedness of uniform distribution plays an important role in the analysis, more general distributions work, as long as with constant probability is larger than the maximum of i.i.d. s.
4 Conclusion
The extensive body of work on two-sided matching has given rise to a comprehensive theory and efficient implementations in real-world scenarios. The majority of these studies assume that agents are fully aware of their preferences. This leaves a critical aspect – the interaction period where agents learn about their preferences via mutual interaction – largely under-explored. The current paper ventures into this area, using an algorithmic lens to extend Gale-Shapley’s deferred acceptance to this setting.
We present an algorithmic approach that extends Gale-Shapley’s deferred acceptance to include situations where agents form preferences during the interaction period. We propose two algorithms. The first, an adaptive algorithm, integrates interviews between applicants and positions into Gale-Shapley’s deferred acceptance. Like deferred acceptance, one side sequentially proposes to the other. However, the order of proposals is arranged to increase the chances of achieving an interim stable match. Further, our analysis shows that the number of interviews conducted by each applicant or position is, with high probability, limited to .
We also propose a non-adaptive algorithm for markets where the dynamics consist of an initial interview phase followed by a clearing stage. In these situations, having prearranged interview lists for each position may be necessary. Our non-adaptive algorithm creates these lists simultaneously. It aims to find a stable match in markets where applicants and positions are divided into tiers, and it limits the number of interviews for each applicant or position to no more than .
We point out that the algorithms in this work do not account for the incentives of applicants or positions, i.e., mechanisms induced by the algorithm don’t carry the dominant strategy incentive compatible (DSIC) property as the Gale-Shapley’s algorithm.
References
- [1] Ishan Agarwal and Richard Cole. Stable matching: Choosing which proposals to make. arXiv preprint, 2022. doi:10.48550/arXiv.2204.04162.
- [2] Maxwell Allman and Itai Ashlagi. Interviewing matching in random markets, 2023. arXiv:2305.11350.
- [3] Maxwell Allman, Itai Ashlagi, Amin Saberi, and Sophie H. Yu. Stable matching with interviews, 2024.
- [4] Itai Ashlagi, Mark Braverman, Yash Kanoria, and Peng Shi. Clearing matching markets efficiently: informative signals and match recommendations. Management Science, 66(5):2163–2193, 2020. doi:10.1287/MNSC.2018.3265.
- [5] Itai Ashlagi, Yash Kanoria, and Jacob D Leshno. Unbalanced random matching markets: The stark effect of competition. Journal of Political Economy, 125(1):69–98, 2017.
- [6] Hedyeh Beyhaghi and Éva Tardos. Randomness and fairness in two-sided matching with limited interviews. In 12th Innovations in Theoretical Computer Science Conference (ITCS 2021). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2021.
- [7] Linda Cai and Clayton Thomas. The Short-Side Advantage in Random Matching Markets, pages 257–267. SIAM, 2022. doi:10.1137/1.9781611977066.19.
- [8] J Bryan Carmody, Ilana S Rosman, and John C Carlson. Application fever: reviewing the causes, costs, and cures for residency application inflation. Cureus, 13(3), 2021.
- [9] Peter Coles, John Cawley, Phillip B Levine, Muriel Niederle, Alvin E Roth, and John J Siegfried. The job market for new economists: A market design perspective. Journal of Economic Perspectives, 24(4):187–206, 2010.
- [10] Peter Coles, Alexey Kushnir, and Muriel Niederle. Preference signaling in matching markets. American Economic Journal: Microeconomics, 5(2):99–134, 2013.
- [11] Joanna Drummond and Craig Boutilier. Elicitation and approximately stable matching with partial preferences. In IJCAI, pages 97–105. Citeseer, 2013. URL: http://www.aaai.org/ocs/index.php/IJCAI/IJCAI13/paper/view/6840.
- [12] Joanna Drummond and Craig Boutilier. Preference elicitation and interview minimization in stable matchings. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 28(1), 2014.
- [13] Federico Echenique, Ruy Gonzalez, Alistair J Wilson, and Leeat Yariv. Top of the batch: Interviews and the match. American Economic Review: Insights, 4(2):223–238, 2022.
- [14] David Gale and Lloyd S Shapley. College admissions and the stability of marriage. The American Mathematical Monthly, 69(1):9–15, 1962.
- [15] Trond Viggo Grøntvedt, Mons Bendixen, Ernst O Botnen, and Leif Edward Ottesen Kennair. Hook, line and sinker: Do tinder matches and meet ups lead to one-night stands? Evolutionary Psychological Science, 6(2):109–118, 2020.
- [16] Meena Jagadeesan and Alexander Wei. Varying the number of signals in matching markets. In Web and Internet Economics: 14th International Conference, WINE 2018, Oxford, UK, December 15–17, 2018, Proceedings 14, pages 232–245. Springer, 2018. doi:10.1007/978-3-030-04612-5_16.
- [17] Sangram V Kadam. Interviewing in matching markets with virtual interviews, 2021.
- [18] Robin S Lee and Michael Schwarz. Interviewing in two-sided matching markets. Working Paper 14922, National Bureau of Economic Research, April 2009. doi:10.3386/w14922.
- [19] Soohyung Lee and Muriel Niederle. Propose with a rose? signaling in internet dating markets. Experimental Economics, 18:731–755, 2015.
- [20] Leah E LeFebvre. Swiping me off my feet: Explicating relationship initiation on tinder. Journal of Social and Personal Relationships, 35(9):1205–1229, 2018.
- [21] Qingmin Liu. Stability and bayesian consistency in two-sided markets. American Economic Review, 110(8):2625–2666, 2020.
- [22] Qingmin Liu, George J Mailath, Andrew Postlewaite, and Larry Samuelson. Stable matching with incomplete information. Econometrica, 82(2):541–587, 2014.
- [23] Vikram Manjunath and Thayer Morrill. Interview hoarding, 2021. arXiv:2102.06440.
- [24] Marc L Melcher, Itai Ashlagi, and Irene Wapnir. Reducing the burden of fellowship interviews—reply. JAMA, 321(11):1107–1107, 2019.
- [25] Marc L Melcher, Irene Wapnir, and Itai Ashlagi. May the interview be with you: signal your preferences. Journal of Graduate Medical Education, 11(1):39–40, 2019.
- [26] Helen Kang Morgan, Abigail F Winkel, Taylor Standiford, Rodrigo Muñoz, Eric A Strand, David A Marzano, Tony Ogburn, Carol A Major, Susan Cox, and Maya M Hammoud. The case for capping residency interviews. Journal of surgical education, 78(3):755–762, 2021.
- [27] Boris Pittel. On likely solutions of a stable marriage problem. The Annals of Applied Probability, 2(2):358–401, 1992.
- [28] Steven D Pletcher, CW David Chang, Marc C Thorne, and Sonya Malekzadeh. The otolaryngology residency program preference signaling experience. Academic Medicine, 97(5):664, 2022.
- [29] Baharak Rastegari, Anne Condon, Nicole Immorlica, and Kevin Leyton-Brown. Two-sided matching with partial information. In Proceedings of the fourteenth ACM conference on Electronic Commerce, pages 733–750, 2013. doi:10.1145/2492002.2482607.
- [30] Parsa Salehi, Babak Azizzadeh, and Yan Lee. Preference signaling for competitive residency programs in the nrmp, December 2019. doi:10.4300/JGME-D-19-00695.1.
- [31] Aaron M Secrest, Garrett C Coman, J Michael Swink, and Keith L Duffy. Limiting residency applications to dermatology benefits nearly everyone. The Journal of Clinical and Aesthetic Dermatology, 14(7):30, 2021.
- [32] Erling Skancke. Welfare and strategic externalities in matching markets with interviews. Available at SSRN 3960558, 2021.
- [33] Ryan Sutton, William L. Wang, Walaa Abdelfadeel, Matthew Sherman, Lisa K. Cannada, and Chad A. Krueger. Are orthopedic fellowship programs giving out too many interviews? A retrospective analysis suggests they are. HSS Journal®, 19(2):210–216, 2023. doi:10.1177/15563316221103585.
Appendix A Deferred Proofs
See 8
Proof.
Denote as the indicator random variable of whether , set as the set of positions that applicant is interviewing, and as the total number of interviewed positions with . By definition, we know that ’s are i.i.d. Bernoulli random variables with mean . By Chernoff bound, we have
which finishes the proof.