Matthew Kwan

Publications

No-(k+1)-in-line problem for k ⩾ 3 (with Anubhab Ghosal, Ritesh Goenka, Alex Grebennikov, Peter Keevash and Huy Pham). Submitted.
How many points can be placed in an n×n grid so that every (affine) line contains at most k points? For k = 2 this is the infamous “no-3-in-line” problem of Dudeney. In this paper, we resolve this problem for all other k (and sufficiently large n). Namely, for k ⩾ 3 and sufficiently large n, we show that this maximum is exactly kn. To prove this, our key observation is that in the regime k ⩾ 3, the problem is dominated (in a certain statistical sense) by the influence of a small number of “heavy” lines with many grid points. We also use similar methods to obtain new bounds on the related “no-four-on-a-circle” problem of Erdős and Purdy.

The paper refers to some experimental data on the no-four-on-a-circle problem (obtained using AlphaEvolve), which can be downloaded via this PDF file or this JSON file.
Permanents of random matrices over finite fields (with Zach Hunter and Lisa Sauermann). Submitted.
Fix a finite field 𝔽_q and let A be a uniformly random n×n matrix over 𝔽_q. The asymptotic distribution of the determinant det(A) is well-understood, but the asymptotic distribution of the permanent per(A) is still something of a mystery. In this paper we make a first step in this direction, proving that per(A) is significantly more uniform than det(A).
Edge-statistics beyond 1/e (with Alex Grebennikov). Submitted.
For integers k, l with 0 ⩽ l ⩽ (k choose 2), let ind(k,l) be the maximum proportion of k-vertex subsets of a large graph that induce exactly l edges. The edge-statistics theorem asserts that ind(k,l) ⩽ 1/e + o(1) for k → ∞ and 0 < l < (k choose 2). We investigate the “stability” of this theorem in a few different ways. In particular, the edge-statistics theorem is tight for four specific values of l; one of our main theorems is that the constant “1/e” can be improved for all other l.

One part of the paper is computer assisted; here is an accompanying C++ program.
No-(k+1)-in-line problem for large constant k (with Alex Grebennikov). Submitted.
How many points can be placed in an n×n grid so that every (affine) line contains at most k points? We prove that for n > k > 10³⁷ the maximum number of points is exactly kn.
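For readers who want to experiment with small cases, here is a minimal Python sketch (not taken from the paper; the function name max_collinear and the example point set are illustrative assumptions). It computes the largest number of points of a given set that lie on a common line, so a configuration is valid for the no-(k+1)-in-line problem exactly when this number is at most k.

```python
from collections import Counter
from math import gcd


def max_collinear(points):
    """Largest number of the given lattice points lying on one common line."""
    pts = list(set(points))
    if len(pts) <= 1:
        return len(pts)
    best = 1
    for i, (x0, y0) in enumerate(pts):
        directions = Counter()
        for j, (x1, y1) in enumerate(pts):
            if i == j:
                continue
            dx, dy = x1 - x0, y1 - y0
            g = gcd(dx, dy)  # positive, because the two points are distinct
            dx, dy = dx // g, dy // g
            # Identify opposite directions, so each line through pts[i] gets one key.
            if dx < 0 or (dx == 0 and dy < 0):
                dx, dy = -dx, -dy
            directions[(dx, dy)] += 1
        best = max(best, 1 + max(directions.values()))
    return best


# Six points in the 3x3 grid with no three on a line (the case k = 2, n = 3).
example = [(0, 0), (0, 1), (1, 0), (1, 2), (2, 1), (2, 2)]
print(len(example), max_collinear(example))
```

The upper bound kn is the easy direction: each of the n rows of the grid is itself a line, so it can contain at most k points. The example above attains this bound for k = 2 and n = 3.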
Exponential anticoncentration for the permanent (with Zach Hunter and Lisa Sauermann). Submitted.
Let A be a random n×n matrix with independent entries, and suppose that the entries are “uniformly anticoncentrated” in the sense that there is a constant ε > 0 such that each entry a_ij satisfies sup_z Pr[a_ij = z] ⩽ 1 − ε (for example, A could be a uniformly random n×n matrix with ±1 entries). We prove that the permanent of A is exponentially anticoncentrated, significantly improving previous bounds of Tao and Vu. Our proof also works for the determinant, giving an alternative proof of a classical theorem of Kahn, Komlós and Szemerédi.
Parities in random Latin squares (with Kalina Petrova and Mehtaab Sawhney). Submitted.
In a Latin square, every row can be interpreted as a permutation, and therefore has a parity (even or odd). We prove that in a uniformly random n×n Latin square, the n row parities are very well approximated by a sequence of n independent unbiased coin flips: for example, the total variation error of this approximation tends to zero as n → ∞. This resolves a conjecture of Cameron.

Along the way, we introduce several general techniques for the study of random Latin squares, including a new re-randomisation technique via “stable intercalate switchings”, and a new approximation theorem comparing random Latin squares with a certain independent model.
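To make the notion of row parity concrete, here is a small Python sketch (an illustration only, not code from the paper; the names permutation_parity and row_parities are assumptions). It reads each row of a Latin square on the symbols 0,...,n−1 as a permutation and computes its parity from its cycle structure, using the fact that a permutation of n elements with c cycles has parity (n − c) mod 2.

```python
def permutation_parity(perm):
    """Return 0 if perm (each of 0..n-1 exactly once) is even, and 1 if it is odd."""
    n = len(perm)
    seen = [False] * n
    cycles = 0
    for start in range(n):
        if not seen[start]:
            cycles += 1
            v = start
            while not seen[v]:
                seen[v] = True
                v = perm[v]
    # A cycle of length m is a product of m - 1 transpositions.
    return (n - cycles) % 2


def row_parities(square):
    """Parity of every row of a Latin square, each row read as a permutation."""
    return [permutation_parity(row) for row in square]


# The cyclic Latin square of order 4 (addition table of the integers mod 4).
cyclic4 = [[(i + j) % 4 for j in range(4)] for i in range(4)]
print(row_parities(cyclic4))  # [0, 1, 0, 1]
```

Highly structured squares like this one have very non-random parity sequences; the theorem concerns uniformly random Latin squares.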
Geometric Littlewood–Offord problems via lattice point counting (with Alex Grebennikov). Submitted.
Consider nonzero vectors a₁,...,a_n ∈ ℝ^k, a random ±1 sequence (ξ₁,...,ξ_n), and a set S ⊆ ℝ^k. What upper bounds can we prove on the probability that the random sum a₁ξ₁+...+a_nξ_n lies in S? We develop a general framework which allows us to reduce problems of this type to counting lattice points in S. We apply this framework with known results from Diophantine geometry, to prove various bounds when S is a set of points in convex position, an algebraic variety, or a semialgebraic set. In particular, this resolves some conjectures on polynomial anticoncentration in the special case where the so-called Chow rank is bounded.
Algebraic aspects of the polynomial Littlewood–Offord problem (with Zhihan Jin, Lisa Sauermann and Yiting Wang). Submitted.
Consider a degree-d polynomial f of independent Bernoulli random variables ξ₁,...,ξ_n. To what extent can f(ξ₁,...,ξ_n) concentrate on a single value? This is the so-called polynomial Littlewood–Offord problem. A nearly optimal bound (up to sub-polynomial factors) was proved by Meka, Nguyen and Vu: the point probabilities are always at most about n^{−1/2}, unless f is “close to the zero polynomial”. In this paper we prove several results supporting the general philosophy that the Meka–Nguyen–Vu bound can be significantly improved unless f is “close to a polynomial with special algebraic structure”, drawing some comparisons to phenomena in analytic number theory. In particular, one of our results is a corrected version of a conjecture of Costello on multilinear forms (in an appendix with Ashwin Sah and Mehtaab Sawhney, we disprove Costello's original conjecture).
Colouring random Hasse diagrams and box-Delaunay graphs (with Zhihan Jin and Lyuben Lichev). Submitted.
Let G be the Hasse diagram of a random d-dimensional partial order on n elements. We show that the chromatic number of G is typically (log n)^{d−1+o(1)}, and the independence number of G is typically n / (log n)^{d−1+o(1)}. We also obtain sharper results for d = 2, and analogous results when G is a random Delaunay graph with respect to axis-parallel boxes. These results extend and sharpen previous work by Chen, Pach, Szegedy and Tardos, and they provide new bounds on the largest possible chromatic number (and lowest possible independence number) of a d-dimensional box-Delaunay graph or Hasse diagram, in particular resolving a conjecture of Tomon.
Smoothed Analysis for Graph Isomorphism (with Michael Anastos and Benjamin Moore). Submitted. A conference version appeared at STOC 2025.
There is no known polynomial-time algorithm for graph isomorphism testing, but elementary combinatorial “refinement” algorithms seem to be very efficient in practice. Some philosophical justification for this situation is provided by a classical theorem of Babai, Erdős and Selkow, who showed that the simplest possible refinement-based algorithm is very effective on random graphs: for a uniformly random graph G on n vertices, naïve refinement provides a canonical labelling scheme which can be used to distinguish G from all other graphs (with probability tending to 1 as n tends to infinity).

We extend the Babai–Erdős–Selkow theorem to sparse random graphs (of any density), improving previous results of Bollobás, Czajka–Pandurangan and Linial–Moshieff. We also extend it to the smoothed analysis framework of Spielman and Teng, improving previous results of Gaudio–Rácz–Sridhar. In particular, one of our results is that for any n-vertex graph G, if we perturb G by adding and removing about n random edges, then refinement-based canonical labelling schemes typically become very efficient.
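As an illustration of what a refinement algorithm does, here is a minimal Python sketch of the standard “colour refinement” procedure (this is a generic textbook version, not necessarily the exact variant analysed in the paper; the function name colour_refinement and the example graphs are assumptions). Every vertex starts with the same colour, and vertices are repeatedly recoloured according to their own colour together with the multiset of their neighbours' colours, until the partition into colour classes stops changing.

```python
def colour_refinement(adj):
    """Stable colouring of a graph given as a dict: vertex -> list of neighbours."""
    colour = {v: 0 for v in adj}
    while True:
        # Signature: own colour plus the sorted multiset of neighbour colours.
        signature = {
            v: (colour[v], tuple(sorted(colour[u] for u in adj[v]))) for v in adj
        }
        palette = {sig: i for i, sig in enumerate(sorted(set(signature.values())))}
        refined = {v: palette[signature[v]] for v in adj}
        # Each round can only split classes, so an unchanged count means stability.
        if len(set(refined.values())) == len(set(colour.values())):
            return refined
        colour = refined


# A tree with no nontrivial symmetries: a centre c with legs of lengths 1, 2 and 3.
spider = {
    "c": ["a", "b1", "d1"],
    "a": ["c"],
    "b1": ["c", "b2"],
    "b2": ["b1"],
    "d1": ["c", "d2"],
    "d2": ["d1", "d3"],
    "d3": ["d2"],
}
# A 6-cycle: every vertex looks the same, so refinement learns nothing.
cycle6 = {i: [(i - 1) % 6, (i + 1) % 6] for i in range(6)}

print(len(set(colour_refinement(spider).values())))  # 7
print(len(set(colour_refinement(cycle6).values())))  # 1
```

When every vertex ends up with its own colour (as for the tree above), sorting the vertices by colour gives a canonical labelling. Regular graphs such as the 6-cycle are the classical examples where refinement alone gives no information.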
Entangled states are typically incomparable (with Vishesh Jain and Marcus Michelen). Communications in Mathematical Physics, to appear.
Consider a bipartite quantum system, where Alice and Bob jointly possess a pure state |ψ⟩. Using local quantum operations on their respective subsystems, and classical communication, Alice and Bob may be able to transform |ψ⟩ into another state |φ⟩. In this paper we prove a conjecture of Nielsen, that in the limit of large dimensionality, for almost all pairs of states |ψ⟩, |φ⟩ (according to the natural unitary invariant measure) such a transformation is not possible. That is to say, typical pairs of quantum states are entangled in fundamentally different ways, that cannot be locally converted to each other. This turns out to be equivalent to a statement about majorisation of spectra of certain random matrices.
On random matrices with large corank (with Zach Hunter, Lisa Sauermann and Mehtaab Sawhney). International Mathematics Research Notices, to appear.
Let M be a uniformly random n×n matrix with ±1 entries. We prove a large deviation inequality for the rank of M: there is an absolute constant c > 0 such that Pr[rank(M) ⩽ n − k] ⩽ exp(−cnk) for all k ⩽ n. This is optimal up to the value of c, and improves on previous work of Rudelson (who proved the same for k ⩽ n^{1/2}).
The exact rank of sparse random graphs (with Margalit Glasgow, Ashwin Sah and Mehtaab Sawhney). Journal of the European Mathematical Society, to appear.
Very sparse random graphs (and random bipartite graphs) are known to typically be singular (i.e., have singular adjacency matrix), due to the presence of “low-degree dependencies” such as isolated vertices and pairs of degree-1 vertices with the same neighbourhood. In this paper we give a combinatorial description of the rank of a sparse random graph 𝔾(n, c/n) or 𝔾(n, n, c/n), in terms of such local dependencies, for all constants c ≠ e (and we present some evidence that the situation is very different for c = e). This gives an essentially complete answer to a question of Vu, and has a number of applications including a central limit theorem for the rank of a sparse random graph.
Counting Perfect Matchings in Dirac Hypergraphs (with Roodabeh Safavi and Yiting Wang). Combinatorica 46.1 (2026).
For 1 ⩽ d < k and n divisible by k, let m_d(k, n) be the minimum d-degree ensuring the existence of a perfect matching in an n-vertex k-uniform hypergraph. Generalising a result of Cuckler and Kahn to the hypergraph setting, we prove that if an n-vertex k-uniform hypergraph G has minimum d-degree at least (1+ε) m_d(k, n) (for any constant ε > 0), then the number of perfect matchings in G is controlled by an entropy-like parameter of G.
The edge-statistics conjecture for hypergraphs (with Vishesh Jain, Dhruv Mubayi and Tuan Tran). International Mathematics Research Notices 2025.18 (2025).
Consider integers r, k, l such that 0 < l < (k choose r), and given a large r-uniform hypergraph G, consider the fraction of k-vertex subsets of G which span exactly l edges. In this paper we prove an essentially optimal bound on how large this fraction can be, answering a question of Alon, Hefetz, Krivelevich and Tyomkyn (namely, they suggested this as a hypergraph generalisation of their edge-statistics conjecture). We also prove a much stronger bound when l is far from zero and (k choose r).
Resolution of the quadratic Littlewood–Offord problem (with Lisa Sauermann). Compositio Mathematica 161.12 (2026), 3089–3139.
Consider a quadratic polynomial Q(ξ₁,...,ξ_n) of independent Rademacher random variables. The quadratic Littlewood–Offord problem asks: to what extent can Q(ξ₁,...,ξ_n) concentrate on a single value? In this paper, we obtain an essentially optimal bound for this problem, as conjectured by Nguyen and Vu. Specifically, if there is no way to pin down the value of Q(ξ₁,...,ξ_n) by fixing values for fewer than m of the variables ξ_i, then we have Pr[Q(ξ₁,...,ξ_n) = 0] ⩽ O(m^{−1/2}). A key ingredient in our proof is a new inductive decoupling scheme that reduces quadratic anticoncentration problems to high-dimensional linear anticoncentration problems.
A central limit theorem for the matching number of a sparse random graph (with Margalit Glasgow, Ashwin Sah and Mehtaab Sawhney). Journal of the London Mathematical Society 111.4:e70101 (2025).
In this paper we prove a central limit theorem for the matching number of a sparse random graph, as conjectured by Aronson, Frieze and Pittel. This had recently been proved in the so-called supercritical regime (according to an algorithmic phase transition first observed by Karp and Sipser), using a stochastic generalisation of the differential equations method; we build on this method and introduce new ideas to handle certain degeneracies present in the subcritical and critical regimes.

Here is a Mathematica notebook (PDF printout) for some of the more tedious calculations in the paper.
Partitioning problems via random processes (with Michael Anastos, Oliver Cooley and Mihyun Kang). Journal of the London Mathematical Society 110.6:e70010 (2024).
There are a number of well-known problems and conjectures about partitioning graphs to satisfy local constraints. For example, the majority colouring conjecture of Kreutzer, Oum, Seymour, van der Zypen and Wood states that every directed graph has a 3-colouring such that for every vertex v, at most half of the out-neighbours of v have the same colour as v. As another example, the internal partition conjecture, due to DeVos and to Ban and Linial, states that for every d, all but finitely many d-regular graphs have a partition into two nonempty parts such that for every vertex v, at least half of the neighbours of v lie in the same part as v. We prove several results in this spirit: in particular, two of our results are that the majority colouring conjecture holds for Erdős–Rényi random directed graphs (of any density), and that the internal partition conjecture holds if we permit a tiny number of “exceptional vertices”. Our proofs involve a variety of techniques, including several different methods to analyse random recolouring processes.
Books, hallways and social butterflies: a note on sliding block puzzles (with Florestan Brunck). The Mathematical Intelligencer 47.1 (2025), 52–65.
Recall the classical “15 puzzle”, consisting of 15 sliding blocks in a 4×4 grid. The configuration space of this puzzle consists of two connected components, corresponding to the odd and even permutations of the symmetric group S₁₅. Generalising results of Wilson, we study the connectedness of the configuration space of a puzzle consisting of an arbitrary number of sliding blocks on an arbitrary graph.
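The two components of the classical 15 puzzle can be told apart by a simple parity invariant, sketched below in Python (an illustration under stated assumptions, not code from the note: boards are given row by row with 0 for the empty cell, the solved state has the empty cell in the bottom-right corner, and the names inversion_parity and puzzle_invariant are hypothetical). Each legal move swaps the empty cell with an adjacent block, which flips the parity of the permutation and also changes the grid distance from the empty cell to its home corner by one, so the sum of the two parities never changes.

```python
def inversion_parity(seq):
    """Parity (0 or 1) of the number of inversions of seq, i.e. of the permutation."""
    inversions = sum(
        1
        for i in range(len(seq))
        for j in range(i + 1, len(seq))
        if seq[i] > seq[j]
    )
    return inversions % 2


def puzzle_invariant(board, width=4):
    """0 for boards in the same component as the solved state, 1 otherwise."""
    blank_label = len(board)  # treat the empty cell as the largest block
    labels = [blank_label if t == 0 else t for t in board]
    row, col = divmod(board.index(0), width)
    blank_distance = (width - 1 - row) + (width - 1 - col)
    return (inversion_parity(labels) + blank_distance) % 2


solved = tuple(range(1, 16)) + (0,)
swapped = tuple(range(1, 14)) + (15, 14, 0)  # blocks 14 and 15 exchanged
one_move = tuple(range(1, 15)) + (0, 15)     # block 15 slid one step to the right

print(puzzle_invariant(solved), puzzle_invariant(swapped), puzzle_invariant(one_move))
```

For general graphs there is no such grid distance to use, which is one reason the general connectedness question needs different arguments.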
Extremal, enumerative and probabilistic results on ordered hypergraph matchings (with Michael Anastos, Zhihan Jin and Benny Sudakov). Forum of Mathematics, Sigma 13:E55 (2025).
An ordered r-matching is an r-uniform hypergraph matching equipped with an ordering on its vertices. The theory of ordered 2-matchings is well-developed, but in the case r ⩾ 3 much less is known, largely due to a lack of powerful bijective tools. Recently, Dudek, Grytczuk and Ruciński made some first steps towards a general theory of ordered r-matchings; in this paper we substantially improve several of their results and introduce some new directions of study.
The Cost of Maintaining Keys in Dynamic Groups with Applications to Multicast Encryption and Group Messaging (with Michael Anastos, Benedikt Auerbach, Mirza Ahad Baig, Miguel Cueto Noval, Guillermo Pascual-Perez and Krzysztof Pietrzak). Proceedings of the Theory of Cryptography Conference (TCC) 2024 (Lecture Notes in Computer Science vol. 15364), 413–443.
This paper is a collaboration with a team of cryptographers. We were able to use techniques from extremal combinatorics to study the efficiency of key-sharing schemes (which are used for secure communication between members of a group). I was only really involved in the combinatorics part!
High-girth Steiner triple systems (with Ashwin Sah, Mehtaab Sawhney and Michael Simkin). Annals of Mathematics 200.3 (2024), 1059–1156.
For any constant g, we show that for sufficiently large n congruent to 1 or 3 (mod 6) there is an order-n Steiner triple system with girth at least g. This resolves an old conjecture of Erdős.
The inertia bound is far from tight (with Yuval Wigderson). Bulletin of the London Mathematical Society 56.10 (2024), 3196–3208.
The inertia bound is a powerful inequality in spectral graph theory, stating that for any graph G, and any weighted adjacency matrix A of G, the independence number of G is at most the number of non-negative eigenvalues of A. In general, it is a highly non-trivial matter to identify the best weighted adjacency matrix with which to apply this bound, and the limits of the inertia bound are therefore rather unclear. In fact, only recently Sinkovic found the first example of a graph for which the inertia bound is not tight. In this paper we obtain some new results showing that the inertia bound can be extremely far from tight.
Exponentially many graphs are determined by their spectrum (with Illya Koval). Quarterly Journal of Mathematics 75.3 (2024), 869–899.
As a discrete analogue of Kac's celebrated question on “hearing the shape of a drum”, and towards a practical graph isomorphism test, it is of interest to understand which graphs are determined up to isomorphism by their spectrum (of their adjacency matrix). A striking conjecture in this area, due to van Dam and Haemers, is that “almost all graphs are determined by their spectrum”, meaning that the fraction of unlabelled n-vertex graphs which are determined by their spectrum converges to 1 as n → ∞. In this paper we make a step towards this conjecture, showing that there are exponentially many n-vertex graphs which are determined by their spectrum.
Substructures in Latin squares (with Ashwin Sah, Mehtaab Sawhney and Michael Simkin). Israel Journal of Mathematics 256.2 (2023), 363–416.
We prove several theorems about substructures in Latin squares. First, we prove the first nontrivial lower bound on the number of n×n Latin squares which have no intercalates (2×2 Latin subsquares). We also prove an essentially best-possible bound on the upper tail for intercalates in random Latin squares. Then, we prove a conjecture of Linial on the existence of Latin squares with high girth. Finally, we study the number of cuboctahedra in a random Latin square (this is a measure of “how associative” the quasigroup associated with the Latin square is).
Anticoncentration in Ramsey graphs and a proof of the Erdős–McKay conjecture (with Ashwin Sah, Lisa Sauermann and Mehtaab Sawhney). Forum of Mathematics, Pi 11:E21 (2023).
An n-vertex graph is called C-Ramsey if it has no clique or independent set of size C log n (i.e., if it has near-optimal Ramsey behavior). In this paper we study edge-statistics in Ramsey graphs, obtaining very precise control of the distribution of the number of edges in a random vertex subset of a C-Ramsey graph. One of the consequences of our result is the resolution of an old conjecture of Erdős and McKay. The proof proceeds via an “additive structure” dichotomy, and involves a wide range of different tools from Fourier analysis, random matrix theory, the theory of Boolean functions, probabilistic combinatorics, and low-rank approximation.
Singularity of the k-core of a random graph (with Asaf Ferber, Ashwin Sah and Mehtaab Sawhney). Duke Mathematical Journal 172.7 (2023), 1293–1332.
Very sparse random graphs are known to typically be singular (i.e., have singular adjacency matrix), due to the presence of “low-degree dependencies” such as isolated vertices and pairs of degree-1 vertices with the same neighbour. We prove that these kinds of dependencies are in some sense the only causes of singularity: for any fixed k ⩾ 3 and c > 0, a random graph with n vertices and edge probability c/n typically has the property that its k-core (its maximal subgraph with minimum degree at least k) is nonsingular. This resolves a conjecture of Vu, and adds to a very short list of nonsingularity theorems for “extremely sparse” random matrices with density O(1/n). Note: a previous version of this paper included a remark that our proof also yields a hitting-time result. We have since discovered that modifying our proof to this setting is not as trivial as we thought, and this should still be viewed as an open problem.
Geometric and o-minimal Littlewood–Offord problems (with Jacob Fox and Hunter Spink). Annals of Probability 51.1 (2023), 101–126.
The Littlewood–Offord problem is concerned with upper bounds on probabilities of events of the form a₁ξ₁+...+a_nξ_n = x, where (ξ₁,...,ξ_n) is a random ±1 sequence and a₁,...,a_n,x ∈ ℝ^d are vectors. In this paper we consider some geometric variants of the Littlewood–Offord problem. In particular, we are able to obtain near-optimal bounds on probabilities of events of the form a₁ξ₁+...+a_nξ_n ∈ S, when S ⊆ ℝ^d is definable with respect to an o-minimal structure.
Enumerating Matroids and Linear Spaces (with Ashwin Sah and Mehtaab Sawhney). Comptes Rendus Mathématique 361 (2023), 565–575.
We show that the number of linear spaces on a set of n points and the number of rank-3 matroids on a ground set of size n are both of the form (cn + o(n))^{n²/6}, for an explicit constant c ≈ 0.162. This is the final piece of the puzzle for enumerating fixed-rank matroids at this level of accuracy: the numbers of rank-1 and rank-2 matroids on a ground set of size n have exact representations in terms of well-known combinatorial functions, and it was recently proved by van der Hofstad, Pendavingh and van der Pol that for constant r ⩾ 4 there are (e^{1−r} n + o(n))^{n^{r−1}/r!} rank-r matroids on a ground set of size n.
Friendly bisections of random graphs (with Asaf Ferber, Bhargav Narayanan, Ashwin Sah and Mehtaab Sawhney). Communications of the American Mathematical Society 2 (2022), 380–416.
Resolving an old conjecture of Füredi, we prove that, for n even, almost every n-vertex graph admits a partition of its vertex set into two parts of equal size in which almost all vertices have more neighbours on their own side than across. The proof relies on new methods to study stochastic processes driven by degree information in random graphs; in particular, we combine enumeration techniques with an abstract second moment argument.
Dirac-type theorems in random hypergraphs (with Asaf Ferber). Journal of Combinatorial Theory, Series B 155 (2022), 318–357.
For 1 ⩽ d < k and n divisible by k, let m_d(k, n) be the minimum d-degree ensuring the existence of a perfect matching in an n-vertex k-uniform hypergraph. In general, our understanding of the values of m_d(k, n) is very limited, and it is an active topic of research to determine or approximate these values. In this paper we prove a “transference” theorem for random hypergraphs. Specifically, for any 1 ⩽ d < k, any ε > 0 and any “not too small” p, we prove that a random k-uniform hypergraph G with n vertices and edge probability p typically has the property that every spanning subgraph of G with minimum degree at least (1+ε) m_d(k, n) p has a perfect matching. One interesting aspect of our proof is a “non-constructive” application of the absorbing method, which allows us to prove a bound in terms of m_d(k, n) without actually knowing its value.
Large deviations in random Latin squares (with Ashwin Sah and Mehtaab Sawhney). Bulletin of the London Mathematical Society 54.4 (2022), 1420–1438.
In this note, we study large deviations of the number of intercalates (2×2 Latin subsquares) in a random n×n Latin square. We obtain near-optimal tail bounds, and as a consequence prove that a typical n×n Latin square has (1 + o(1)) n²/4 intercalates, resolving an old conjecture of McKay and Wanless.
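For concreteness, here is a short Python sketch that counts intercalates by brute force (an illustration only; the function name count_intercalates and the two example squares are assumptions, not taken from the paper). An intercalate is a choice of two rows and two columns whose four entries use only two symbols, which in a Latin square is equivalent to the two diagonal entries agreeing and the two anti-diagonal entries agreeing.

```python
from itertools import combinations


def count_intercalates(square):
    """Number of 2x2 Latin subsquares of a Latin square given as a list of rows."""
    n = len(square)
    count = 0
    for r1, r2 in combinations(range(n), 2):
        for c1, c2 in combinations(range(n), 2):
            if (
                square[r1][c1] == square[r2][c2]
                and square[r1][c2] == square[r2][c1]
            ):
                count += 1
    return count


# Addition table of Z_2 x Z_2 (bitwise XOR on 0..3) and of the integers mod 3.
klein = [[i ^ j for j in range(4)] for i in range(4)]
cyclic3 = [[(i + j) % 3 for j in range(3)] for i in range(3)]

print(count_intercalates(klein), count_intercalates(cyclic3))  # 12 0
```

These two highly structured squares sit at opposite extremes. The estimate (1 + o(1)) n²/4 is an asymptotic statement about typical squares, so small examples like these should not be expected to match it.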
Extension complexity of low-dimensional polytopes (with Lisa Sauermann and Yufei Zhao). Transactions of the American Mathematical Society 375.6 (2022), 4209–4250.
Sometimes, it is possible to represent a complicated polytope as a “shadow” of a much simpler polytope. To quantify this phenomenon, the extension complexity of a polytope P is defined to be the minimum number of facets in a (possibly higher-dimensional) polytope from which P can be obtained as a linear projection. It is an important question to understand the extent to which the extension complexity of a polytope is controlled by its dimension, and in this paper we prove three different results along these lines. First, we show that there exists an n^{o(1)}-dimensional polytope with at most n facets and extension complexity n^{1−o(1)}. Second, we obtain optimal bounds for the extension complexity of random d-polytopes, and third, we obtain an optimal upper bound for the extension complexity of cyclic polygons (all of whose vertices lie on a common circle).
List-decodability with large radius for Reed–Solomon codes (with Asaf Ferber and Lisa Sauermann). IEEE Transactions on Information Theory 68.6 (2022), 3823–3828. A conference version appeared at FOCS 2021.
We prove that there exist Reed–Solomon codes which are list-decodable with radius 1 − ε and have rate Ω(ε) (which is the best possible for any code, by the list decoding capacity theorem). This improves a result of Guo, Li, Shangguan, Tamo and Wootters, and resolves the motivating question of their work.
On the permanent of a random symmetric matrix (with Lisa Sauermann). Selecta Mathematica 8.15 (2022).
Resolving a conjecture of Vu, we prove that the permanent of a uniformly random symmetric n×n matrix with ±1 entries typically has magnitude n^{n/2+o(n)}. Our result can be extended to very general models of symmetric random matrices.
Singularity of sparse random matrices: simple proofs (with Asaf Ferber and Lisa Sauermann). Combinatorics, Probability and Computing 31.1 (2022), 21–28.
Consider a random n×n zero-one matrix with “density” p, sampled according to one of the following two models: either every entry is independently set to one with probability p (the “Bernoulli” model), or each row is independently uniformly sampled from the set of all length-n zero-one vectors with exactly pn ones (the “combinatorial” model). We give simple proofs of the (essentially best-possible) fact that in both cases, if min(p,1 − p) ⩾ (1+ε) log n / n for any constant ε > 0, then our random matrix is nonsingular with probability 1 − o(1). In the Bernoulli case this fact was already well-known, but in the combinatorial case this resolves a conjecture of Aigner-Horev and Person.
Combinatorial anti-concentration inequalities, with applications (with Jacob Fox and Lisa Sauermann). Mathematical Proceedings of the Cambridge Philosophical Society 171.2 (2021), 227–248.
We prove several different anti-concentration inequalities for functions of independent Bernoulli-distributed random variables. First, we prove some “Poisson-type” anti-concentration theorems that give bounds of the form 1/e + o(1) for the point probabilities of certain polynomials. Second, we prove an anti-concentration inequality for polynomials with nonnegative coefficients which extends the classical Erdős–Littlewood–Offord theorem and improves a theorem of Meka, Nguyen and Vu for polynomials of this type. As an application, we prove some new anti-concentration bounds for subgraph counts in random graphs.
Lower bounds for superpatterns and universal sequences (with Zachary Chroman and Mihir Singhal). Journal of Combinatorial Theory, Series A 156:105467 (2021).
A permutation is said to be k-universal or a k-superpattern if it contains every permutation of length k. A simple counting argument shows that every k-superpattern has length at least (1/e² + o(1)) k², and Arratia conjectured that this lower bound is best-possible. Disproving Arratia's conjecture, we improve this trivial bound by a small constant factor. We accomplish this by designing an efficient encoding scheme for the patterns that appear in a permutation. This approach is quite flexible and is applicable to other universality-type problems; for example, we also improve a bound by Engen and Vatter on a problem concerning (k+1)-ary sequences which contain all k-permutations.
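The definition of a k-superpattern can be checked directly for small examples, as in the following Python sketch (brute force over all k-element subsequences, so only practical for tiny inputs; the names pattern_of, contained_patterns and is_k_universal are illustrative assumptions and have nothing to do with the encoding scheme in the paper).

```python
from itertools import combinations
from math import factorial


def pattern_of(values):
    """Relative order of distinct values, as a tuple of ranks 1..k."""
    order = sorted(values)
    return tuple(order.index(v) + 1 for v in values)


def contained_patterns(perm, k):
    """All length-k patterns appearing as (not necessarily consecutive) subsequences."""
    # combinations() keeps the entries in their original left-to-right order.
    return {pattern_of(sub) for sub in combinations(perm, k)}


def is_k_universal(perm, k):
    """True if perm contains every one of the k! patterns of length k."""
    return len(contained_patterns(perm, k)) == factorial(k)


print(is_k_universal((2, 5, 3, 1, 4), 3))  # True
print(is_k_universal((1, 2, 3, 4, 5), 3))  # False
```

The first example has length 5 and contains all six patterns of length 3. The increasing permutation contains only the pattern 123.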
Anticoncentration for subgraph counts in random graphs (with Jacob Fox and Lisa Sauermann). Annals of Probability 49.3 (2021), 1515–1553.
Fix a graph H and some 0 < p < 1, and let X be the number of copies of H in a random graph G(n, p). In this paper we study the anticoncentration behaviour of X: how likely can it be that X falls in some small interval or is equal to some particular value? We prove the almost-optimal result that if H is connected then for any integer x we have Pr(X = x) ⩽ n^{1−v(H)+o(1)}. Our proof proceeds by iteratively breaking X into different components which fluctuate at “different scales”, and relies on a new anticoncentration inequality for random vectors that behave “almost linearly”.
Acyclic subgraphs of tournaments with high chromatic number (with Jacob Fox and Benny Sudakov). Bulletin of the London Mathematical Society 53.2 (2021), 619–630.
It is a simple fact that if an oriented graph G has chromatic number k², then it has an acyclic subgraph with chromatic number at least k, and it was recently observed that improvements to this bound would imply new bounds for an old conjecture of Burr. In this paper we consider the special case where G is a tournament, and improve some previous bounds by Nassar and Yuster, resolving one of their conjectures. Along the way, we prove a lemma showing that tournaments with many transitive subtournaments have a large almost-transitive subtournament; this may be of independent interest.
Almost all Steiner triple systems are almost resolvable (with Asaf Ferber). Forum of Mathematics, Sigma 8:E39 (2020).
We show that for any n congruent to 3 (mod 6), almost all order-n Steiner triple systems admit a decomposition of almost all their triples into disjoint perfect matchings. That is, almost all Steiner triple systems are almost resolvable.
Halfway to Rota's basis conjecture (with Matija Bucic, Alexey Pokrovskiy and Benny Sudakov). International Mathematics Research Notices 2020.21 (2020), 8007–8026.
Given n bases B₁,...,B_n in an n-dimensional vector space V, a transversal basis is a basis of V containing exactly one element from each B_i. Rota's basis conjecture posits that it is always possible to find n disjoint transversal bases. In this paper we prove the partial result that one can always find (1/2 − o(1)) n disjoint transversal bases, improving on the previous record of Ω(n/log n). Our result generalises to the setting of matroids. See also our companion note adapting our methods to the setting of a related conjecture due to Kahn.
Universality of random permutations (with Xiaoyu He). Bulletin of the London Mathematical Society 52.3 (2020), 515–529.
It is a classical fact that for any ε > 0, a random permutation of length n = (1/4+ε) k² typically contains an increasing subsequence of length k. As a far-reaching generalisation, Alon conjectured that a random permutation of this same length n is typically k-universal, meaning that it simultaneously contains every pattern of length k. He also made the simple observation that some n = O(k² log k) suffices. In this paper we make the first asymptotic improvement to this bound: if n = 2000 k² log log k, then a random permutation of order n is typically k-universal.
Nearly-linear monotone paths in edge-ordered graphs (with Matija Bucic, Alexey Pokrovskiy, Benny Sudakov, Tuan Tran and Adam Zsolt Wagner). Israel Journal of Mathematics 238.2 (2020), 663–685.
How long a monotone path can one always find in any edge-ordering of the n-vertex complete graph? This question was first asked by Chvátal and Komlós. The prevailing conjecture is that one can always find a monotone path of length Ω(n), but until now the best known lower bound was n^{2/3−o(1)}. In this paper we prove that in any edge-ordering of the complete graph, there is a monotone path of length n^{1−o(1)}. Our proof involves a “regularisation” lemma which may be of independent interest.
An algebraic inverse theorem for the quadratic Littlewood–Offord problem, and an application to Ramsey graphs (with Lisa Sauermann). Discrete Analysis 2020:12 (2020).
Consider a quadratic polynomial f(ξ₁,...,ξ_n) of independent Bernoulli random variables. What can be said about the concentration of f on any single value? It is known that the point probabilities of f can be as large as about n^{−1/2}, but still poorly understood is the “inverse” question of understanding how algebraic and arithmetic features of f affect its point probabilities. In this paper we prove some results of an algebraic flavour, showing that if f has point probabilities much larger than 1/n then it must be close to a quadratic form with low rank. We also give an application to Ramsey graphs.
Almost all Steiner triple systems have perfect matchings. Proceedings of the London Mathematical Society 121.6 (2020), 1468–1495.
We show that for any n divisible by 3, almost all order-n Steiner triple systems have a perfect matching (also known as a parallel class). In fact, we prove a general upper bound on the number of perfect matchings in a Steiner triple system and show that almost all Steiner triple systems essentially attain this maximum. We accomplish this via a general theorem comparing a uniformly random Steiner triple system to the outcome of the triangle removal process, which we hope will be useful for other problems. See also the companion note with Ashwin Sah and Mehtaab Sawhney, where we record and prove some analogues of the lemmas in this paper for random Latin squares.
Dense induced bipartite subgraphs in triangle-free graphs (with Shoham Letzter, Benny Sudakov and Tuan Tran). Combinatorica 40.2 (2020), 283–305.
We show that for any fixed t, every K_t-free graph with minimum degree d contains an induced bipartite subgraph with minimum degree Ω(log d / log log d). This resolves some conjectures of Esperet, Kang, and Thomassé. We also obtain some further results concerning large induced bipartite subgraphs in triangle-free graphs, one of which answers a question of Erdős, Janson, Łuczak and Spencer.
Ramsey graphs induce subgraphs of quadratically many sizes (with Benny Sudakov). International Mathematics Research Notices 2020.6 (2020), 1621–1638.
An n-vertex graph is called C-Ramsey if it has no clique or independent set of size C log n. All known constructions of Ramsey graphs involve randomness in an essential way, and there is a line of research towards showing that in fact all Ramsey graphs must obey certain “richness” properties characteristic of random graphs. In this paper we prove such a result: for any fixed C, every n-vertex C-Ramsey graph induces subgraphs of Θ(n²) different sizes. This resolves a conjecture of Narayanan, Sahasrabudhe and Tomon, motivated by an old problem of Erdős and McKay. Note: a minor oversight in the definition of “(δ,ε)-richness” has been corrected since publication of this paper. The corrected version can be found here or on the arXiv.
Hypergraph cuts above the average (with David Conlon, Jacob Fox and Benny Sudakov). Israel Journal of Mathematics 233.1 (2019), 67–111.
An r-cut of a k-uniform hypergraph (k-graph) H is a partition of the vertex set into r parts, and the size of such a cut is the number of edges which have a vertex from every part. The max-r-cut of H is the maximum size of an r-cut of H. We prove some new bounds on the max-r-cut of a k-graph, for fixed r ⩽ k, above the trivial “average” bound obtainable from a uniformly random cut. In particular, in contrast to the situation for max-cut in graphs and max-2-cut in 3-graphs, we show that if k ⩾ 4 or r ⩾ 3 then the worst-case behaviour is not governed by the standard deviation of a uniformly random cut.
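To make the definitions above concrete, here is a minimal brute-force sketch that evaluates the size of an r-cut and the max-r-cut of a small hypergraph. The function names and the representation (edges as tuples of vertices, a cut as a dictionary from vertices to part indices 0, ..., r−1) are assumptions made for this example. Exhaustive search is only feasible for very small instances and is unrelated to the paper's proofs.

```python
from itertools import product


def cut_size(edges, part_of, r):
    """Number of edges that contain at least one vertex from each of the r parts."""
    all_parts = set(range(r))
    return sum(1 for edge in edges if {part_of[v] for v in edge} == all_parts)


def max_r_cut(vertices, edges, r):
    """Brute-force max-r-cut: try every assignment of the vertices to r parts."""
    vertices = list(vertices)
    best = 0
    for labels in product(range(r), repeat=len(vertices)):
        part_of = dict(zip(vertices, labels))
        best = max(best, cut_size(edges, part_of, r))
    return best


# Complete 3-uniform hypergraph on 4 vertices (all 4 triples).
complete_3_graph = [(0, 1, 2), (0, 1, 3), (0, 2, 3), (1, 2, 3)]
print(max_r_cut(range(4), complete_3_graph, 2))
print(max_r_cut(range(4), complete_3_graph, 3))
```

Assignments that leave a part empty are also enumerated; they contribute cuts of size 0, so they do not affect the maximum.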
Proof of a conjecture on induced subgraphs of Ramsey graphs (with Benny Sudakov). Transactions of the American Mathematical Society 372.8 (2019), 5571–5594.
An n-vertex graph is called C-Ramsey if it has no clique or independent set of size C log n. All known constructions of Ramsey graphs involve randomness in an essential way, and there is a line of research towards showing that in fact all Ramsey graphs must obey certain “richness” properties characteristic of random graphs. In this paper we prove an old conjecture of Erdős, Faudree and Sós that in any n-vertex C-Ramsey graph, there are Ω(n^(5/2)) induced subgraphs, no pair of which have the same numbers of edges and vertices. Note: a minor oversight in the definition of “(δ,ε)-richness” has been corrected since publication of this paper. The corrected version can be found here or on the arXiv.
Anticoncentration for subgraph statistics (with Benny Sudakov and Tuan Tran). Journal of the London Mathematical Society 99.3 (2019), 757–777.
Consider integers k, l such that 0 ⩽ l ⩽ (k choose 2). Given a large graph G, what is the fraction of k-vertex subsets of G which span exactly l edges? When l is zero or (k choose 2), this fraction can be exactly 1. On the other hand, with Ramsey's theorem in mind, if l is far from these extreme values we might expect that this fraction must always be substantially smaller than 1. We prove an almost-best-possible theorem to this effect, improving on results of Alon, Hefetz, Krivelevich and Tyomkyn. We also make some first steps towards some analogous questions for hypergraphs. Our proofs take a probabilistic point of view, and involve polynomial anticoncentration inequalities, hypercontractivity, and a coupling trick for random variables defined on a “slice” of the Boolean hypercube.
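The quantity in question can be computed directly for small graphs. The sketch below is a plain enumeration (nothing like the probabilistic arguments of the paper); the function name and the graph representation, with vertices 0, ..., n−1 and a list of edges, are assumptions made for the example.

```python
from fractions import Fraction
from itertools import combinations


def induced_edge_fraction(n, edges, k, l):
    """Fraction of k-vertex subsets of {0, ..., n-1} that span exactly l edges."""
    edge_set = {frozenset(e) for e in edges}
    total = 0
    hits = 0
    for subset in combinations(range(n), k):
        total += 1
        spanned = sum(1 for pair in combinations(subset, 2) if frozenset(pair) in edge_set)
        if spanned == l:
            hits += 1
    return Fraction(hits, total)  # requires k <= n so that total > 0


# The 5-cycle: half of the 3-vertex subsets span one edge, half span two.
five_cycle = [(i, (i + 1) % 5) for i in range(5)]
print(induced_edge_fraction(5, five_cycle, 3, 1))
```

For the empty graph and l = 0 the function returns 1, matching the extreme case mentioned in the abstract.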
The random k-matching-free process (with Michael Krivelevich, Po-Shen Loh and Benny Sudakov). Random Structures and Algorithms 53.4 (2018), 692–716.
We study the k-matching-free process, where one starts with the empty n-vertex graph and adds edges one-by-one, each chosen uniformly at random subject to the constraint that no k-matching is created (for some k potentially depending on n). This appears to be the first analysis of an H-free process for non-fixed H. In our main theorems, we identify the range of k for which the process results in an extremal k-matching-free graph, as characterised by Erdős and Gallai. One of the proofs involves some interesting coupling arguments for tracking the formation of augmenting paths.
Counting Hamilton cycles in sparse random directed graphs (with Asaf Ferber and Benny Sudakov). Random Structures and Algorithms 53.4 (2018), 592–603.
Frieze showed that a random directed graph with n vertices and m = n log n + ω(n) edges typically has a directed Hamilton cycle (this is best possible). Using Frieze's machinery, permanent estimates, and some elementary facts about random permutations, we give a short proof that such random digraphs typically have n! (m/n² (1+o(1)))ⁿ Hamilton cycles, improving previous results that held only for denser random digraphs. We also prove a hitting time version of our theorem.
Non-trivially intersecting multi-part families (with Benny Sudakov and Pedro Vieira). Journal of Combinatorial Theory, Series A 156 (2018), 44–60.
The classical Erdős–Ko–Rado theorem gives the maximum size of a k-uniform intersecting family, and the Hilton–Milner theorem gives the maximum size of such a family that is not trivially intersecting (this means that there is no element x which appears in each set of the family). Frankl introduced and solved a certain natural “multi-part” generalisation of the Erdős–Ko–Rado problem; in this paper we study the corresponding question for non-trivially intersecting families. We solve this problem asymptotically, disproving a conjecture of Alon and Katona.
Intercalates and discrepancy in random Latin squares (with Benny Sudakov). Random Structures and Algorithms 52.2 (2018), 181–196.
An intercalate in a Latin square is a 2×2 Latin subsquare. We show that a random n×n Latin square typically has about n² intercalates, significantly improving the previous best lower and upper bounds. In addition, we show that in a certain natural sense a random Latin square has relatively low discrepancy. The primary tools in our proofs are the so-called “switching” method and permanent estimates.
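For a concrete Latin square, intercalates can be counted by checking every pair of rows and every pair of columns. The sketch below does exactly that; the function names and the two example squares (the addition table of the integers mod n, and the 4×4 XOR table) are choices made for this illustration, and these squares are deterministic rather than random.

```python
from itertools import combinations


def count_intercalates(square):
    """Count 2x2 Latin subsquares of a Latin square given as a list of rows."""
    n = len(square)
    count = 0
    for r1, r2 in combinations(range(n), 2):
        for c1, c2 in combinations(range(n), 2):
            # The four cells form a Latin subsquare when the diagonal entries
            # agree and the antidiagonal entries agree.
            if square[r1][c1] == square[r2][c2] and square[r1][c2] == square[r2][c1]:
                count += 1
    return count


def cyclic_square(n):
    """Addition table of the integers mod n."""
    return [[(i + j) % n for j in range(n)] for i in range(n)]


xor_square = [[i ^ j for j in range(4)] for i in range(4)]

print(count_intercalates(cyclic_square(5)))
print(count_intercalates(cyclic_square(4)))
print(count_intercalates(xor_square))
```

The check takes on the order of n⁴ steps, which is fine for small examples.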
Resilience for the Littlewood–Offord Problem (with Afonso Bandeira and Asaf Ferber). Advances in Mathematics 319 (2017), 292–312.
Fix a sequence of nonzero real numbers a = (a₁,...,a_n), consider a random ±1 sequence ξ = (ξ₁,...,ξ_n), and let X = a₁ξ₁+...+a_nξ_n. The Erdős–Littlewood–Offord theorem shows that, regardless of a, for any x the event X = x is unlikely (that is, X is anti-concentrated). In this paper, motivated by some questions about random matrices, we study the “resilience” of this anti-concentration. For a given x, how many coordinates of ξ can we allow an adversary to change before they can force X = x? The answer is quite surprising, and its proof involves an interesting connection to combinatorial number theory.
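The two quantities in this abstract can be explored by brute force for short integer sequences. The sketch below computes the exact distribution of X and the fewest sign changes an adversary needs to force X = x. The function names are hypothetical, integer coefficients are assumed so that all arithmetic is exact, and the paper's precise definition of resilience may differ from the count returned here.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations


def distribution(a):
    """Exact distribution of X = a_1 xi_1 + ... + a_n xi_n for uniform random signs."""
    ways = Counter({0: 1})  # number of sign patterns producing each partial sum
    for coeff in a:
        nxt = Counter()
        for value, count in ways.items():
            nxt[value + coeff] += count
            nxt[value - coeff] += count
        ways = nxt
    total = 2 ** len(a)
    return {value: Fraction(count, total) for value, count in ways.items()}


def max_point_probability(a):
    """Largest probability of any single value of X."""
    return max(distribution(a).values())


def flips_needed(a, xi, x):
    """Fewest coordinates of xi to flip so that the sum equals x (None if impossible)."""
    n = len(a)
    for t in range(n + 1):
        for idx in combinations(range(n), t):
            flipped = set(idx)
            total = sum(-c * s if i in flipped else c * s
                        for i, (c, s) in enumerate(zip(a, xi)))
            if total == x:
                return t
    return None


print(max_point_probability([1, 1, 1, 1]))
print(flips_needed([1, 1, 1, 1], [1, 1, 1, 1], 0))
```

Both functions are exponential in n in the worst case, so they are meant only for experimenting with small examples.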
The average number of spanning trees in sparse graphs with given degrees (with Catherine Greenhill, Mikhail Isaev and Brendan McKay). European Journal of Combinatorics 63 (2017), 6–25.
We find the asymptotic expected number of spanning trees in a random graph conditioned on a fixed “sparse” degree sequence. In particular this gives the expected number of spanning trees in a random d-regular graph on n vertices, where d can grow modestly with n. An interesting part of the proof is a concentration result proved using a martingale based on the Prüfer code algorithm.
Bounded-degree spanning trees in randomly perturbed graphs (with Michael Krivelevich and Benny Sudakov). SIAM Journal on Discrete Mathematics 31.1 (2017), 155–171.
We show that randomly changing linearly many edges in a dense graph is typically enough to ensure the existence of a copy of any given bounded-degree spanning tree.
Cycles and matchings in randomly perturbed digraphs and hypergraphs (with Michael Krivelevich and Benny Sudakov). Combinatorics, Probability and Computing 25.6 (2016), 909–927.
In many situations, “typical” structures have certain properties, but there are worst-case extremal examples which do not. In these situations one can often show that the extremal examples are “fragile” in that after a modest random perturbation our desired property will typically appear. We prove several results of this flavour, concerning perfect matchings and Hamilton cycles in digraphs and hypergraphs. The proof of one of our results involves an unusual application of Szemerédi's regularity lemma to “beat the union bound”, which may be of independent interest.
On the number of spanning trees in random regular graphs (with Catherine Greenhill and David Wind). Electronic Journal of Combinatorics 21:P1.45 (2014).
We study the number of spanning trees τ(G) in a uniformly random d-regular graph G on n vertices (for fixed d and large n). We find the asymptotic expected value of τ(G), and we find the limiting distribution of τ(G) for d = 3. The proof uses the method of small subgraph conditioning: we estimate τ(G) via its expectation conditioned on the short cycle counts. The estimates are rather more difficult than usual, and involve complex-analytic methods.
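For a specific graph, τ(G) can be computed exactly with Kirchhoff's matrix-tree theorem: it equals any cofactor of the graph Laplacian. The sketch below is a standard way to evaluate τ(G) on small examples and is not the method of the paper. The function name is hypothetical, and the graph is assumed to be simple and loopless with vertices 0, ..., n−1.

```python
from fractions import Fraction


def count_spanning_trees(n, edges):
    """tau(G) via the matrix-tree theorem, using exact rational arithmetic."""
    if n == 1:
        return 1
    # Build the Laplacian L = D - A.
    lap = [[Fraction(0)] * n for _ in range(n)]
    for u, v in edges:
        lap[u][u] += 1
        lap[v][v] += 1
        lap[u][v] -= 1
        lap[v][u] -= 1
    # Delete the last row and column, then take the determinant
    # of the remaining minor by Gaussian elimination.
    m = [row[:-1] for row in lap[:-1]]
    size = n - 1
    det = Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if m[r][col] != 0), None)
        if pivot is None:
            return 0  # singular minor: the graph is disconnected
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for c in range(col, size):
                m[r][c] -= factor * m[col][c]
    return int(det)


# K_{3,3} is a 3-regular graph on 6 vertices.
k33 = [(i, j) for i in range(3) for j in range(3, 6)]
print(count_spanning_trees(6, k33))
```

Exact fractions avoid the rounding issues of a floating-point determinant, at the cost of speed on larger graphs.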