Designed and built with care, filled with creative elements

Top
Image Alt

Publications

  /    /  Publications

Nos 50 dernières publications

  • 11 November 2025 hal-05359228

    Transformers are deep architectures that define "in-context mappings" which enable predicting new tokens based on a given set of tokens (such as a prompt in NLP applications or a set of patches for a vision transformer). In this work, we study in particular the ability of these architectures to handle an arbitrarily large number of context tokens. To mathematically, uniformly address their expressivity, we consider the case that the mappings are conditioned on a context represented by a probability distribution of tokens which becomes discrete for a finite number of these. The relevant notion of smoothness then corresponds to continuity in terms of the Wasserstein distance between these contexts. We demonstrate that deep transformers are universal and can approximate continuous in-context mappings to arbitrary precision, uniformly over compact token domains. A key aspect of our results, compared to existing findings, is that for a fixed precision, a single transformer can operate on an arbitrary (even infinite) number of tokens. Additionally, it operates with a fixed embedding dimension of tokens (this dimension does not increase with precision) and a fixed number of heads (proportional to the dimension). The use of MLPs between multi-head attention layers is also explicitly controlled. We consider both unmasked attentions (as used for the vision transformer) and masked causal attentions (as used for NLP and time series applications). We tackle the causal setting leveraging a space-time lifting to analyze causal attention as a mapping over probability distributions of tokens.

    Takashi Furuya, Maarten V. de Hoop, Gabriel Peyré

  • 11 November 2025 hal-05359222

    Causal Transformers are trained to predict the next token for a given context. While it is widely accepted that self-attention is crucial for encoding the causal structure of sequences, the precise underlying mechanism behind this in-context autoregressive learning ability remains unclear. In this paper, we take a step towards understanding this phenomenon by studying the approximation ability of Transformers for next-token prediction. Specifically, we explore the capacity of causal Transformers to predict the next token xt+1 given an autoregressive sequence (x1,…,xt) as a prompt, where xt+1=f(xt), and f is a context-dependent function that varies with each sequence. On the theoretical side, we focus on specific instances, namely when f is linear or when (xt)t≥1 is periodic. We explicitly construct a Transformer (with linear, exponential, or softmax attention) that learns the mapping f in-context through a causal kernel descent method. The causal kernel descent method we propose provably estimates xt+1 based solely on past and current observations (x1,…,xt), with connections to the Kaczmarz algorithm in Hilbert spaces. We present experimental results that validate our theoretical findings and suggest their applicability to more general mappings f.

    Michael E. Sander, Gabriel Peyré

  • 11 November 2025 hal-05359214

    Super-resolution of pointwise sources is of utmost importance in various areas of imaging sciences. Specific instances of this problem arise in single molecule fluorescence, spike sorting in neuroscience, astrophysical imaging, radar imaging, and nuclear resonance imaging. In all these applications, the Lasso method (also known as Basis Pursuit or \( \ell^1 \)-regularization) is the de facto baseline method for recovering sparse vectors from low-resolution measurements. This approach requires discretization of the domain, which leads to quantization artifacts and consequently, an overestimation of the number of sources. While grid-less methods, such as Prony-type methods or non-convex optimization over the source position, can mitigate this, the Lasso remains a strong baseline due to its versatility and simplicity. In this work, we introduce a simple extension of the Lasso, termed ``super-resolved Lasso" (SR-Lasso). Inspired by the Continuous Basis Pursuit (C-BP) method, our approach introduces an extra parameter to account for the shift of the sources between grid locations. Our method is more comprehensive than C-BP, accommodating both arbitrary real-valued or complex-valued sources. Furthermore, it can be solved similarly to the Lasso as it boils down to solving a group-Lasso problem. A notable advantage of SR-Lasso is its theoretical properties, akin to grid-less methods. Given a separation condition on the sources and a restriction on the shift magnitude outside the grid, SR-Lasso precisely estimates the correct number of sources.

    Clarice Poon, Gabriel Peyré

  • 11 November 2025 hal-05358734

    46 pages

    Gaëtan Chenevier, Wee Teck Gan

  • 5 November 2025 hal-05349214

    We have reached a point where many bio foundation models exist across 4 different scales, from molecules to molecular chains, cells, and tissues. However, while related in many ways, these models do not yet bridge these scales. We present a framework and architecture called Xpressor that enables cross-scale learning by (1) using a novel cross-attention mechanism to compress high-dimensional gene representations into lower-dimensional cell-state vectors, and (2) implementing a multi-scale fine-tuning approach that allows cell models to leverage and adapt protein-level representations. Using a cell Foundation Model as an example, we demonstrate that our architecture improves model performance across multiple tasks, including cell-type prediction (+12%) and embedding quality (+8%). Together, these advances represent first steps toward models that can understad and bridge different scales of biological organization.

    Jeremie Kalfon, Laura Cantini, Gabriel Peyre

  • 2 November 2025 hal-05342519

    Uniform attachment with freezing is an extension of the classical model of random recursive trees, in which trees are recursively built by attaching new vertices to old ones. In the model of uniform attachment with freezing, vertices are allowed to freeze, in the sense that new vertices cannot be attached to already frozen ones. We study the impact of removing attachment and/or freezing steps on the height of the trees. We show in particular that removing an attachment step can increase the expected height, and that freezing cannot substantially decrease the height of random recursive trees. Our methods are based on coupling arguments.

    Anna Brandenberger, Simon Briend, Hannah Cairns, Robin Khanfir, Igor Kortchemski

  • 2 November 2025 hal-05342518

    We are interested in the geometry of the ``infection tree'' in a stochastic SIR (Susceptible-Infectious-Recovered) model, starting with a single infectious individual. This tree is constructed by drawing an edge between two individuals when one infects the other. We focus on the regime where the infectious period before recovery follows an exponential distribution with rate 1, and infections occur at a rate λn∼λ/n where n is the initial number of healthy individuals with λ>1. We show that provided that the infection does not quickly die out, the height of the infection tree is asymptotically κ(λ)logn as n→∞, where κ(λ) is a continuous function in λ that undergoes a second-order phase transition at λc≃1.8038. Our main tools include a connection with the model of uniform attachment trees with freezing and the application of martingale techniques to control profiles of random trees.

    Igor Kortchemski, Emmanuel Kammerer, Delphin Sénizergues

  • 2 November 2025 hal-05342515

    We are interested in the geometry of the ``infection tree'' in a stochastic SIR (Susceptible-Infectious-Recovered) model, starting with a single infectious individual. This tree is constructed by drawing an edge between two individuals when one infects the other. We focus on the regime where the infectious period before recovery follows an exponential distribution with rate 1, and infections occur at a rate λn∼λ/n where n is the initial number of healthy individuals with λ>1. We show that provided that the infection does not quickly die out, the height of the infection tree is asymptotically κ(λ)logn as n→∞, where κ(λ) is a continuous function in λ that undergoes a second-order phase transition at λc≃1.8038. Our main tools include a connection with the model of uniform attachment trees with freezing and the application of martingale techniques to control profiles of random trees.

    Igor Kortchemski, Leonard Vetter

  • 23 October 2025 hal-05328223

    Girko matrices have independent and identically distributed entries of mean zero and unit variance. In this note, we consider the random matrix model formed by the ratio of two independent Girko matrices, its entries are dependent and heavy-tailed. Our main message is that divided by the square root of the dimension, the spectral radius of the ratio converges in distribution, when the dimension tends to infinity, to a universal heavy-tailed distribution. We provide a mathematical proof of this high-dimensional phenomenon, under a fourth moment matching with a Gaussian case known as the complex Ginibre ensemble. In this Gaussian case, the model is known as the spherical ensemble, and its spectrum is a determinantal planar Coulomb gas. Its image by the inverse stereographic projection is a rotationally invariant gas on the two-sphere. A crucial observation is the invariance in law of the model under inversion, related to its spherical symmetry, and that makes, in a sense, edge and bulk equivalent. Our approach involves Girko Hermitization, local law estimates for Wigner matrices, lower bound estimates on the smallest singular value, and convergence of kernels of determinantal point processes. The universality of the high-dimensional fluctuation of the spectral radius of the ratio of Girko matrices turns out to be remarkably more accessible mathematically than for a single Girko matrix!

    Djalil Chafai, David García-Zelada, Yuan Yuan Xu

  • 23 October 2025 hal-05327710

    We prove that, for any Morse function on a compact manifold and any adapted gradient satisfying the Morse-Smale condition, there is a homotopically unique complex-valued symplectic Lefschetz fibration on the cotangent bundle whose restriction to the zero-section is the given function, whose imaginary part is the evaluation of covectors on the gradient, and which is equivariant under the actions of the fiberwise antipodal involution and the complex conjugation. Then we study the topology and symplectic geometry of the regular fibers of this fibration, which are well-defined Weinstein manifolds.

    Emmanuel Giroux

  • 17 October 2025 hal-05320169

    Given a compact Riemannian surface $M$, with Laplace-Beltrami operator $\Delta$, for $\lambda > 0$, let $P_{\lambda,\lambda^{-\frac{1}{3}}}$ be the spectral projector on the bandwidth $[\lambda-\lambda^{-\frac{1}{3}}, \lambda + \lambda^{\frac{1}{3}}]$ associated to $\sqrt{-\Delta}$. We prove a polynomial improvement on the $L^2 \to L^{\infty}$ norm of $P_{\lambda,\lambda^{-\frac{1}{3}}}$ for generic simple spheres of revolution (away from the poles and the equator) and for the Euclidean disk away from its center but up to the boundary. We use the Quantum Integrability of those surfaces to express the norm in terms of a joint basis of eigenfunctions for $\left(\sqrt{-\Delta}, \frac{1}{i}\frac{\partial}{\partial \theta}\right)$. Then, we use that those eigenfunctions are asymptotically Lagrangian oscillatory functions, each supported on a Lagrangian torus with fold-type caustic. Thus, studying the distribution of the caustics, and using BKW decay away from the caustics, we are able to reduce the problem to counting estimates.

    Ambre Chabert, Yves Colin de Verdìère

  • 14 October 2025 hal-05314896

    We consider a model for a gas of N confined particles subject to a two-body repulsive interaction, namely the one-dimensional log or Riesz gas. We are interested in the so-called high temperature regime, ie where the inverse temperature β_N scales as Nβ_N → 2P>0. We establish, in the log case, a large deviation (LD) principle and moderate deviations estimates for the largest particle x_max when appropriately rescaled . Our result is an extension of [BADG01, Pak20] where such estimates were shown for the largest particle of the β-ensemble respectively at fixed β_N=β>0 and β_N≫N -1 . We show that the corresponding rate function is the same as in the case of iid particles. We also provide LD estimates in the Riesz case. Additionally, we consider related models of symmetric tridiagonal random matrices with independent entries having Gaussian tails; for which we establish the LD principle for the top eigenvalue. In a certain specialization of the entries, we recover the result for the largest particle of the log-gas. We show that LD are created by a few entries taking abnormally large values.

    Charlie Dworaczek Guera, Ronan Memin

  • 30 September 2025 hal-05288954

    We prove the existence and uniqueness of strong solutions to the equation $u u_x - u_{yy} = f$ in the vicinity of the linear shear flow, subject to perturbations of the source term and lateral boundary conditions. Since the solutions we consider have opposite signs in the lower and upper half of the domain, this is a quasilinear forward-backward parabolic problem, which changes type across a critical curved line within the domain. In particular, lateral boundary conditions can be imposed only where the characteristics are inwards. There are several difficulties associated with this problem. First, the forward-backward geometry depends on the solution itself. This requires to be quite careful with the approximation procedure used to construct solutions. Second, and more importantly, the linearized equations solved at each step of the iterative scheme admit a finite number of singular solutions, of which we provide an explicit construction. This is similar to well-known phenomena in elliptic problems in nonsmooth domains. Hence, the solutions to the equation are regular if and only if the source terms satisfy a finite number of orthogonality conditions. A key difficulty of this work is to cope with these orthogonality conditions during the nonlinear fixed-point scheme. In particular, we are led to prove their stability with respect to the underlying base flow. To tackle this deceivingly simple problem, we develop a methodology which we believe to be both quite natural and adaptable to other situations in which one wishes to prove the existence of regular solutions to a nonlinear problem for suitable data despite the existence of singular solutions at the linear level. This paper is a shorter version of [3].

    Anne-Laure Dalibard, Frédéric Marbach, Jean Rax

  • 23 September 2025 hal-03364744

    We introduce twisted triple crossing diagram maps, collections of points in projective space associated to bipartite graphs on the cylinder, and use them to provide geometric realizations of the cluster integrable systems of Goncharov and Kenyon constructed from toric dimer models. Using this notion, we provide geometric proofs that the pentagram map and the cross-ratio dynamics integrable systems are cluster integrable systems. We show that in appropriate coordinates, cross-ratio dynamics is described by geometric R-matrices, which solves the open question of finding a cluster algebra structure describing cross-ratio dynamics.

    Niklas Affolter, Terrence George, Sanjay Ramassamy

  • 23 September 2025 hal-05272270

    We study the convergence to equilibrium in high dimensions, focusing on explicit bounds on mixing times and the emergence of the cutoff phenomenon for Dyson-Laguerre processes. These are interacting particle systems with non-constant diffusion coefficients, arising naturally in the context of sample covariance matrices. The infinitesimal generator of the process admits generalized Laguerre orthogonal polynomials as eigenfunctions.

    Our analysis relies on several distances and divergences, including an intrinsic Wasserstein distance adapted to the non-Euclidean geometry of the process. Within this framework, we employ tools from Riemannian geometry and functional inequalities. In particular, we establish exponential decay and derive a regularization inequality for the intrinsic Wasserstein distance via comparison with relative entropy.

    Samuel Chan-Ashing

  • 22 September 2025 tel-05273265

    Here are seemingly unrelated problems: computing rational homotopy groups of spheres in rational homotopy theory, purity in algebraic geometry, Koszul duality for the category of a reductive group in representation theory, splitting Drinfeld space's de Rham complex in the p-adic Langlands program, deformation quantization of Poisson manifolds in mathematical physics. And yet, all of them boil down to the same question: formality. A differential graded algebraic structure A (e.g. an associative algebra, a Lie algebra, a Pre-Calabi-Yau algebra, etc.) is formal if it is related to its homology H(A) by a zig-zag of quasi-isomorphisms preserving the algebraic structure. This thesis develops obstruction classes allowing to prove formality results. On the one hand, it incorporates aforementioned results into a single theory. On the other hand, it provides tools to study these questions in cases little studied hitherto: over any coefficient ring and for algebraic structures with several outputs: algebras encoded by properads.

    Coline Emprin

  • 22 September 2025 hal-05273226

    We develop a general obstruction theory to the formality of algebraic structures over any commutative ground ring. It relies on the construction of Kaledin obstruction classes that faithfully detect the formality of differential graded algebras over operads or properads, possibly colored in groupoids. The present treatment generalizes the previous obstruction classes in two directions: beyond characteristic zero and including a wider range of algebraic structures. This enables us to establish novel formality criteria, including formality descent with torsion coefficients, formality in families, intrinsic formality, and criteria in terms of chain-level lifts of homology automorphism.

    Coline Emprin

  • 18 September 2025 hal-01788066

    We develop a general obstruction theory to the formality of algebraic structures over any commutative ground ring. It relies on the construction of Kaledin obstruction classes that faithfully detect the formality of differential graded algebras over operads or properads, possibly colored in groupoids. The present treatment generalizes the previous obstruction classes in two directions: beyond characteristic zero and including a wider range of algebraic structures. This enables us to establish novel formality criteria, including formality descent with torsion coefficients, formality in families, intrinsic formality, and criteria in terms of chain-level lifts of homology automorphism.

    Lionel Velly, Vincent Perlbarg, Thomas Boulier, Nicolas Adam, Sebastien Delphine, Charles-Edouard Luyt, Valentine Battisti, Gregory Torkomian, Charlotte Arbelot, Russell Chabanne, Betty Jean, Carol Di Perri, Steven Laureys, Giuseppe Citerio, Alessia Vargiolu, Benjamin Rohaut, Nicolas Bruder, Nadine Girard, Stein Silva, Vincent Cottenceau, Thomas Tourdias, Olivier Coulon, Bruno Riou, Lionel Naccache, Rajiv Gupta, Habib Benali, Damien Galanaud, Louis Puybasset, Jean Constantin, Jean Chastre, Julien Amour, Corine Vezinet, Jean-Jacques Rouby, Mathieu Raux, Olivier Langeron, Vincent Degos, Francis Bolgert, Nicolas Weiss, Thomas Similowski, Alexandre Demoule, Alexandre Duguet, Eléonore Tollard, Benoit Veber, Jean-Albert Lotterie, Paola Sanchez-Pena, Michèle Genestal, Mirko Patassini, Delphine Meng, Galanaud Md, Torkomian Meng, N Adam

  • 9 September 2025 hal-05245897

    We study the small-time local controllability (STLC) of a bilinear Schrödinger equation with Neumann boundary conditions near its ground state. We focus on the degenerate case where the linearized system is not controllable, necessitating a second-order analysis. We prove two complementary results. The negative result provides a new PDE instance of Sussmann's classical quadratic obstruction, corresponding to a non-vanishing Lie bracket. The positive result appears to be the first to establish STLC at the quadratic order for a physical PDE with a single scalar control. Both proofs rely on a Fourier-based approach, which is crucial because the integral kernel of the second-order term lacks the regularity required by standard integration-by-parts arguments. Along the way, we develop tools valid in a more general setting to analyze such quadratic forms. In particular, we prove results that allow for the multiplication of a kernel by a modulation function

    Karine Beauchard, Frédéric Marbach, Thomas Perrin

  • 3 September 2025 hal-05234663

    We study a multi--particle model including a kinetic energy and a non linear local self-interaction, both in the bosonic and fermionic cases. In both cases, we prove that the model is well-posed if the number of particles is large enough. In particular, we show that there is a nonlinearity for which the model with $N=2$ particles is well-posed, while the model with $N=1$ is not.

    David Gontier, Salma Lahbabi, Simona Rota Nodari

  • 2 September 2025 tel-05236078

    En continuation des travaux de Hrushovski sur les corps pseudo-finis avec un caractère additif, nous étudions la théorie des modèles des corps aux différences (en caractéristique 0) avec un caractère additif sur le corps fixe ajouté comme prédicat au sens de la logique continue. Leur théorie possède un modèle-compagnon, la théorie ACFA+. Elle admet l’élimination des quantificateurs aux quantificateurs algébriquement bornés prés. Nous montrons que ACFA+ est la théorie asymptotique en caractéristique 0 de la clôture algébrique des corps finis munis du Frobenius et d’un caractère additif sur le corps fixe. ACFA+ est simple, mais en général on n’a pas la 3-amalgamation sur des ensembles aclσ-clos. En suivant une direction de recherche suggérée par Hrushovski, nous donnons une caractérisation complète de ce phénomène. Cela nous permet de déterminer la composante connexe du groupe de Kim-Pillay en tant que groupe topologique. En particulier, nous pouvons en déduire, comme attendu par Hrushovski, que le groupe est abélien. De plus, nous obtenons une caractérisation des imaginaires (en logique continue) dans ACFA+. Nous étudions ensuite l’amalgamation en dimension supérieure. Contrairement aux théories ACFA et PF+, nous pouvons construire un contre-exemple à la 4-amalgamation sur un ensemble pour lequel on a la 3-amalgamation. Néanmoins, nous montrons qu’on a la n-amalgamation sur tous les modèles pour tout n ∈ N. Dans le dernier chapitre, nous généralisons les résultats de Hrushovski dans une direction différente. Motivés par des exemples naturels provenant de la théorie des nombres, nous introduisons la théorie PF+,× des corps pseudo-finis avec un caractère additif et un caractère multiplicatif (d’ordre infini). Nous montrons l’élimination des quantificateurs dans une extension naturelle du langage. Ensuite nous obtenons que PF+,× est la théorie asymptotique en caractéristique 0 des corps finis avec un caractère additif non-trivial et un caractère multiplicatif suffisamment générique. Ultérieurement nous montrons que l’intégration des prédicats définissables par rapport à la mesure de comptage de Chatzidakis-Macintyre-van den Dries est uniformément définissable en fonction des paramètres.

    Stefan Marian Ludwig

  • 2 September 2025 hal-05234239

    This note reviews a recent contribution about the Fisher information for the space-homogeneous Boltzmann equation by L. Silvestre, C. Villani and the author (arXiv, 2024 ). This classical functional from information theory is shown to be nonincreasing along the flow of the non-linear PDE for all physically relevant particle interactions. The proof consists in establishing a new functional inequality on the sphere of Log-Sobolev type. This new a priori estimate on solutions yields global-in-time well posedness of the equation, in particular in the case of very singular interactions, a left open question up to this work.

    Cyril Imbert

  • 1 September 2025 hal-04642697

    We study a class of parabolic quasilinear systems, in which the diffusion matrix is not uniformly elliptic, but satisfies a Petrovskii condition of positivity of the real part of the eigenvalues. Local wellposedness is known since the work of Amann in the 90s, by a semi-group method. We first revisit these results in the context of Sobolev spaces modelled on L^2 and then explore the endpoint Besov case B_{p,1}^{d/p}. We also exemplify our method on the SKT system, showing the existence of local, non-negative, strong solutions.

    Isabelle Gallagher, Ayman Moussa

  • 18 August 2025 hal-05213680

    We study the geodesic convexity of various energy and entropy functionals restricted to (non-geodesically convex) submanifolds of Wasserstein spaces with their induced geometry. We prove a variety of convexity results by means of a simple general principle, which holds in the metric space setting, and which crucially requires no knowledge of the structure of geodesics in the submanifold: If the EVI gradient flow of a functional exists and leaves the submanifold invariant, then the restriction of the functional to the submanifold is geodesically convex. This leads to short new proofs of several known results, such as one of Carlen and Gangbo on strong convexity of entropy on sphere-like submanifolds, and several new results, such as the λ-convexity of entropy on the space of couplings of λ-log-concave marginals. Along the way, we develop sufficient conditions for existence of geodesics in Wasserstein submanifolds. Submanifold convexity results lead systematically to improvements of Talagrand and HWI inequalities which we speculate to be closely related to concentration of measure estimates for conditioned empirical measures, and we prove one rigorous result in this direction in the Carlen-Gangbo setting.

    Louis-Pierre Chaintron, Daniel Lacker

  • 14 August 2025 hal-04075154

    This article studies large and local large deviations for sums of i.i.d. real-valued random variables in the domain of attraction of an $\alpha$-stable law, $\alpha\in (0,2]$, with emphasis on the case $\alpha=2$. There are two different scenarios: either the deviation is realised via a collective behaviour with all summands contributing to the deviation (a Gaussian scenario), or a single summand is atypically large and contributes to the deviation (a one-big-jump scenario). Such results are known when $\alpha \in (0,2)$ (large deviations always follow a one big-jump scenario) or when the random variables admit a moment of order $2+\delta$ for some $\delta>0$. We extend these results, including in particular the case where the right tail is regularly varying with index $-2$ (treating cases with infinite variance in the domain of attraction of the normal law). We identify the threshold for the transition between the Gaussian and the one-big-jump regimes; it is slightly larger when considering local large deviations compared to integral large deviations. Additionally, we complement our results by describing the behaviour of the sum and of the largest summand conditionally on a (local) large deviation, for any $\alpha\in (0,2]$, both in the Gaussian and in the one-big-jump regimes. As an application, we show how our results can be used in the study of condensation phenomenon in the zero-range process at the critical density, extending the range of parameters previously considered in the literature.

    Quentin Berger, Matthias Birkner, Linglong Yuan

  • 10 July 2025 hal-00004949

    We introduce a new method of symmetrization of mappings on the $n$-sphere ($n\geq 2$). They are applied to estimate solutions of quasilinear elliptic partial differential equations of $p$-Laplacian type, with combinations of Dirac measures on the right-hand side. The case $p=n$ is reduced to a problem on the sphere, using a conformal transformation. The cases when $1n$ are considered more briefly, full details being available in other papers of the author.

    Satyanad Kichenassamy

  • 9 July 2025 hal-05152340

    We establish minimax convergence rates for score-based generative models (SGMs) under the $1$-Wasserstein distance. Assuming the target density $p^\star$ lies in a nonparametric $\beta$-smooth H\"older class with either compact support or subGaussian tails on $\mathbb{R}^d$, we prove that neural network-based score estimators trained via denoising score matching yield generative models achieving rate $n^{-(\beta+1)/(2\beta+d)}$ up to polylogarithmic factors. Our unified analysis handles arbitrary smoothness $\beta > 0$, supports both deterministic and stochastic samplers, and leverages shape constraints on $p^\star$ to induce regularity of the score. The resulting proofs are more concise, and grounded in generic stability of diffusions and standard approximation theory.

    Arthur Stéphanovitch, Eddie Aamari, Clément Levrard

  • 4 July 2025 hal-00004948

    Le perimètre d'une partie mesurable de $\mathbb R^N$ est la variation totale de sa fonction caractéristique. On en donne une généralisation au cas d'une partie $E$ d'une variété riemannienne compacte orientée. On montre que ce périmètre est la limite des variations totales des régularisées de $\chi_E$ par le noyau de la chaleur. On en déduit une inégalité isopérimétrique, et une formule de type Fleming-Rishel. On applique ensuite ces résultats à un problème quasilinéaire elliptique dans $\mathbb R^N$, pour lequel les méthodes usuelles de symétrisation dans $\mathbb R^N$ achoppent, mais que l'on pourra traiter en introduisant une méthode de symétrisation sur $S^N$.

    Satyanad Kichenassamy

  • 4 July 2025 hal-05144000

    A classical approach to the Calderón problem is to estimate the unknown conductivity by solving a nonlinear least-squares problem. It is generally believed that it leads to a nonconvex optimization problem which is riddled with bad local minimums. This has motivated the development of reconstruction methods based on convex optimization, one recent contribution being the nonlinear convex semidefinite programming approach of Harrach (2023). In this work, we investigate the computational viability of this convex approach in a simple setting where the conductivities are piecewise constant and radial. We implement this convex reconstruction method and compare it extensively to the least squares approach. Our experiments suggest that this convex programming approach only allows to accurately estimate the unknown for problems with a very small size. Moreover, surprisingly, it is consistently outperformed by Newton-type least squares solvers, which are also faster and require less measurements. We revisit the issue of nonconvexity in this piecewise constant radial setting and prove that, contrary to previous claims, there are no local minimums in the case of two scalar unknowns with no measurement noise. We also provide a partial proof of this result in the general setting which holds under a numerically verifiable assumption.

    Giovanni S Alberti, Romain Petit, Clarice Poon

  • 2 July 2025 hal-00002667

    We give an expression for the minimum energy of a map between a three-dimensional manifold and a surface, with prescribed vortex-like singularities. We find that the topological type of the target needs to be restricted for the problem to be meaningful. We extend results of Brezis-Coron-Lieb for maps from 3-space to the standard 2-sphere. This is made possible by the use of Fermi-Walker transport.

    Haïm Brezis, Satyanad Kichenassamy

  • 2 July 2025 hal-05139923

    The Calderón problem consists in recovering an unknown coefficient of a partial differential equation from boundary measurements of its solution. These measurements give rise to a highly nonlinear forward operator. As a consequence, the development of reconstruction methods for this inverse problem is challenging, as they usually suffer from the problem of local convergence. To circumvent this issue, we propose an alternative approach based on lifting and convex relaxation techniques, that have been successfully developed for solving finite-dimensional quadratic inverse problems. This leads to a convex optimization problem whose solution coincides with the sought-after coefficient, provided that a nondegenerate source condition holds. We demonstrate the validity of our approach on a toy model where the solution of the partial differential equation is known everywhere in the domain. In this simplified setting, we verify that the non-degenerate source condition holds under certain assumptions on the unknown coefficient. We leave the investigation of its validity in the Calderón setting for future works.

    Giovanni S Alberti, Romain Petit, Simone Sanna

  • 1 July 2025 hal-05136299

    In the interest of reproducible research, this is exactly the version of the code used to generate the figures in the paper "Opening the Black Box: Reverse-Engineering of Sparse Neural Networks" by the same authors.

    Valérie Castin, Rémi Gribonval

  • 30 June 2025 hal-05136482

    Dans le contexte de la compression de réseaux de neurones, la distillation consiste à entraîner un petit réseau élève (si possible parcimonieux) à partir d'échantillons générés par un réseau enseignant. Malgré le succès empirique de ces approches on connaît mal leurs conditions de succès. En guise de preuve de concept nous introduisons une nouvelle famille de réseaux parcimonieux structurés dits perceptrons multiarbre, et un algorithme d'estimation des paramètres de tels réseaux à partir d'échantillons de la fonction correspondante et de sa Jacobienne. L'algorithme en question n'est pas basé sur la descente de gradient.

    Valérie Castin, Rémi Gribonval

  • 18 June 2025 hal-05119142

    We consider a drift-diffusion process of N stochastic particles and show that its empirical measure converges, as N → ∞, to the solution of the Landau equation. We work in the regime of very soft potentials, which had never been covered before, using a tightness/uniqueness method. To claim uniqueness, we need high integrability estimates that we obtain by crucially exploiting the dissipation of the Fisher information at the level of the particle system. To be able to exploit these estimates as N → ∞, we prove the affinity in infinite dimension of the entropy production and Fisher information dissipation (and general higher-order versions of the Fisher information through an abstract theorem), results which were up to now only known for the entropy and the Fisher information.

    Côme Tabary

  • 17 June 2025 hal-05005367

    We study the discretization, convergence, and numerical implementation of recent reformulations of the quadratic porous medium equation (multidimensional and anisotropic) and Burgers' equation (one-dimensional, with optional viscosity), as forward in time variants of the Benamou-Brenier formulation of optimal transport. This approach turns those evolution problems into global optimization problems in time and space, of which we introduce a discretization, one of whose originalities lies in the harmonic interpolation of the densities involved. We prove that the resulting schemes are unconditionally stable w.r.t. the space and time steps, and we establish a quadratic convergence rate for the dual PDE solution, under suitable assumptions. We also show that the schemes can be efficiently solved numerically using a proximal splitting method and a global space-time fast Fourier transform, and we illustrate our results with numerical experiments.

    Jean-Marie Mirebeau, Erwan Stampfli

  • 16 June 2025 hal-03812909

    The purpose of this paper is to investigate the well-posedness of several linear and nonlinear equations with a parabolic forward-backward structure, and to highlight the similarities and differences between them. The epitomal linear example will be the stationary Kolmogorov equation $y\partial_x u -\partial_{yy} u=f$ in a rectangle. We first prove that this equation admits a finite number of singular solutions, of which we provide an explicit construction. Hence, the solutions to the Kolmogorov equation associated with a smooth source term are regular if and only if $f$ satisfies a finite number of orthogonality conditions. This is similar to well-known phenomena in elliptic problems in polygonal domains. We then extend this theory to a Vlasov--Poisson--Fokker--Planck system, and to two quasilinear equations: the Burgers type equation $u \partial_x u - \partial_{yy} u = f$ in the vicinity of the linear shear flow, and the Prandtl system in the vicinity of a recirculating solution, close to the line where the horizontal velocity changes sign. We therefore revisit part of a recent work by Iyer and Masmoudi. For the two latter quasilinear equations, we introduce a geometric change of variables which simplifies the analysis. In these new variables, the linear differential operator is very close to the Kolmogorov operator $y\partial_x -\partial_{yy}$. Stepping on the linear theory, we prove existence and uniqueness of regular solutions for data within a manifold of finite codimension, corresponding to some nonlinear orthogonality conditions.

    Anne-Laure Dalibard, Frédéric Marbach, Jean Rax

  • 11 June 2025 hal-05108418

    While conservation laws in gradient flow training dynamics are well understood for (mostly shallow) ReLU and linear networks, their study remains largely unexplored for more practical architectures. This paper bridges this gap by deriving and analyzing conservation laws for modern architectures, with a focus on convolutional ResNets and Transformer networks. For this, we first show that basic building blocks such as ReLU (or linear) shallow networks, with or without convolution, have easily expressed conservation laws, and no more than the known ones. In the case of a single attention layer, we also completely describe all conservation laws, and we show that residual blocks have the same conservation laws as the same block without skip connection. We then introduce the notion of conservation laws that depend only on a subset of parameters (corresponding e.g. to a pair of consecutive layers, to a residual block, or to an attention layer). We demonstrate that the characterization of such laws can be reduced to the analysis of the corresponding building block in isolation. Finally, we examine how these newly discovered conservation principles, initially established in the continuous gradient flow regime, persist under discrete optimization dynamics, particularly in the context of Stochastic Gradient Descent (SGD).

    Sibylle Marcotte, Rémi Gribonval, Gabriel Peyré

  • 27 May 2025 hal-05086703

    Guidance is a cornerstone of modern diffusion models, playing a pivotal role in conditional generation and enhancing the quality of unconditional samples. However, current approaches to guidance scheduling--determining the appropriate guidance weight--are largely heuristic and lack a solid theoretical foundation. This work addresses these limitations on two fronts. First, we provide a theoretical formalization that precisely characterizes the relationship between guidance strength and classifier confidence. Second, building on this insight, we introduce a stochastic optimal control framework that casts guidance scheduling as an adaptive optimization problem. In this formulation, guidance strength is not fixed but dynamically selected based on time, the current sample, and the conditioning class, either independently or in combination. By solving the resulting control problem, we establish a principled foundation for more effective guidance in diffusion models.

    Iskander Azangulov, Peter Potaptchik, Qinyu Li, Eddie Aamari, George Deligiannidis, Judith Rousseau

  • 23 May 2025 hal-05077442

    We derive a weak-strong uniqueness and stability principle for the Landau equation in the soft potentials case (including Coulomb interactions). The distance between two solutions is measured by their relative entropy, which to our knowledge was never used before in stability estimates. The logarithm of the strong solution is required to have polynomial growth while the weak solution can be any H-solution with sufficiently many moments at initial time. Since we require a substantial amount of regularity on the strong solution, we also provide an example of sufficient conditions on the initial data that ensure this regularity in the Coulomb (and very soft potentials) case

    Côme Tabary

  • 11 May 2025 hal-04143518

    We consider the complex Elliptic Ginibre Ensemble, a family of random matrix models introduced by Girko that interpolates between the Ginibre Ensemble and the Gaussian Unitary Ensemble and such that its empirical spectral measure converges to the uniform measure on an ellipse. We show the convergence in law of its normalised characteristic polynomial outside of this ellipse. Our proof contains two main steps. We first show the tightness of the normalised characteristic polynomial using the link between the Elliptic Ginibre Ensemble and Hermite polynomials. This part relies on the uniform control of the Hermite kernel which is derived from the recent work of Akemann, Duits and Molag. In the second step, we identify the limiting object as the exponential of a Gaussian analytic function. The limit expression is derived from the convergence of traces of Chebyshev polynomials of random matrices by the method of moments. These traces of Chebyshev polynomials appear naturally as a kind of centered version, or normal ordering, of the traces of the monomials. This work answers the interpolation problem raised in the work of Bordenave, Chafaï and the second author of this paper for the integrable case of the Elliptic Ginibre Ensemble and is therefore a fist step towards the conjectured universality of this result.

    Quentin François, David García-Zelada

  • 24 April 2025 hal-04166694

    A Boolean network is a discrete dynamical system operating on vectors of Boolean variables. The action of a Boolean network can be conveniently expressed as a system of Boolean update functions, computing the new values for each component of the Boolean vector as a function of the other components. Boolean networks are widely used in modeling biological systems that can be seen as consisting of entities which can be activated or deactivated, expressed or inhibited, on or off. P systems on the other hand are classically introduced as a model of hierarchical multiset rewriting. However, over the years the community has proposed a wide range of P system variants including diverse ingredients suited for various needs. In this work, we propose a new variant—Boolean P systems—specifically designed for reasoning about sequential controllability of Boolean networks, and use it to first establish a crisp formalization of the problem, and then to prove that the problem of sequential controllability is PSPACE-complete. We further claim that Boolean P systems are a demonstration of how P systems can be used to construct ad hoc formalisms, custom-tailored for reasoning about specific problems, and providing new advantageous points of view.

    Artiom Alhazov, Vincent Ferrari-Dominguez, Rudolf Freund, Nicolas Glade, Sergiu Ivanov

  • 16 April 2025 tel-05036943

    This thesis intends to make a contribution to the theories of algebraic cycles and moduli spaces over the real numbers. In the study of the subvarieties of a projective algebraic variety, smooth over the field of real numbers, the cycle class map between the Chow ring and the equivariant cohomology ring plays an important role. The image of the cycle class map remains difficult to describe in general; we study this group in detail in the case of real abelian varieties. To do so, we construct integral Fourier transforms on Chow rings of abelian varieties over any field. They allow us to prove the integral Hodge conjecture for one-cycles on complex Jacobian varieties, and the real integral Hodge conjecture modulo torsion for real abelian threefolds. For the theory of real algebraic cycles, and for several other purposes in real algebraic geometry, it is useful to have moduli spaces of real varieties to our disposal. Insight in the topology of a real moduli space provides insight in the geometry of a real variety that defines a point in it, and the other way around. In the moduli space of real abelian varieties, as well as in the Torelli locus contained in it, we prove density of the set of moduli points attached to abelian varieties containing an abelian subvariety of fixed dimension. Moreover, we provide the moduli space of stable real binary quintics with a hyperbolic orbifold structure, compatible with the period map on the locus of smooth quintics. This structure identifies the moduli space of stable real binary quintics with a non-arithmetic ball quotient.

    Olivier De Gaay Fortman

  • 3 April 2025 tel-05018660

    Many tasks in Machine Learning, or other fields such as physics or economics, amount to the comparison between a data distribution and a model, called density fitting. This problem is formulated as the minimization of a distance between these two measures, and optimal transport distances (OT) appear as a powerful tool for this task, since they leverage an underlying geometric information in the data. However, their practical application is challenging, because they are computationally intensive to solve, sensitive to noisy data and can only compare probabilities defined on the same space. To cope with each one of these issues, variants of OT called entropically regularized, unbalanced and Gromov-Wasserstein have been proposed in the literature. The scope of this thesis is to develop combinations of those variants so as to add priors of robustness to noise and/or an ability to handle change of spaces when comparing measures, while remaining computationally tractable. Thus the contributions of this thesis focus on providing algorithms and implementations with guarantees that the computational burden of OT variants is alleviated, and proving metric properties to ensure the comparison of distributions with respect to those variants remains insightful.

    Thibault Séjourné

  • 1 April 2025 hal-05015621

    We show the convergence of the characteristic polynomial for random permutation matrices sampled from the generalized Ewens distribution. Under this distribution, the measure of a given permutation depends only on its cycle structure, according to certain weights assigned to each cycle length. The proof is based on uniform control of the characteristic polynomial using results from the singularity analysis of generating functions, together with the convergence of traces to explicit random variables expressed via a Poisson family. The limit function is the exponential of a Poisson series which has already appeared in the case of uniform permutation matrices. It is the Poisson analog of the Gaussian Holomorphic Chaos, related to the limit of characteristic polynomials for other matrix models such as Circular Ensembles, i.i.d. matrices, and Gaussian elliptic matrices.

    Quentin François

  • 25 March 2025 hal-05004709

    We study the second Huber Theorem in dimensions 2 and 4. In dimension 2, we prove the optimal regularity for the conformal factor using Coulomb frames. In dimension 4, we introduce another Coulomb-type condition which is similar to the case of Yang--Mills connections. We obtain a generalization of the two-dimensional case that can be applied to study the singularities of Bach-flat metrics.

    Paul Laurain, Dorian Martino

  • 20 March 2025 hal-04999502

    We study the martingale property and moment explosions of a signature volatility model, where the volatility process of the log-price is given by a linear form of the signature of a time-extended Brownian motion. Excluding trivial cases, we demonstrate that the price process is a true martingale if and only if the order of the linear form is odd and a correlation parameter is negative. The proof involves a fine analysis of the explosion time of a signature stochastic differential equation. This result is of key practical relevance, as it highlights that, when used for approximation purposes, the linear combination of signature elements must be taken of odd order to preserve the martingale property. Once martingality is established, we also characterize the existence of higher moments of the price process in terms of a condition on a correlation parameter.

    Eduardo Abi Jaber, Paul Gassiat, Dimitri Sotnikov

  • 19 March 2025 tel-03905418

    This thesis studies the persistent homology of real-valued continuous functions f on compact topological spaces X. The introduction of homological indices and homological dimensions allows us to link persistence theory to metric quantities of the compact space X, such as its upper-box dimension. These quantities give a precise framework to the Wasserstein p-stability results known in the literature, but also extend them to Hölder functions on more general spaces (including all compact Riemannian manifolds) with explicit constants and whose regime for p is optimal. In degree zero of homology, a more in-depth study can be made using trees associated to f, which generalize the merge trees which are definable when f is Morse. It is possible to link the dimension of these trees to the persistence index of f and to its barcode. We apply these deterministic results to the stochastic setting to draw consequences about the barcodes of random functions of prescribed regularity. These consequences also allow us to develop distributional discrimination tests for the processes, of which we present a particular example. Finally, we define the zeta-functions associated with a stochastic process and compute these functions and other related quantities for several processes in dimension one, including the Brownian motion and the alpha-stable Lévy processes.

    Daniel Perez

  • 18 March 2025 hal-04996372

    This thesis studies the persistent homology of real-valued continuous functions f on compact topological spaces X. The introduction of homological indices and homological dimensions allows us to link persistence theory to metric quantities of the compact space X, such as its upper-box dimension. These quantities give a precise framework to the Wasserstein p-stability results known in the literature, but also extend them to Hölder functions on more general spaces (including all compact Riemannian manifolds) with explicit constants and whose regime for p is optimal. In degree zero of homology, a more in-depth study can be made using trees associated to f, which generalize the merge trees which are definable when f is Morse. It is possible to link the dimension of these trees to the persistence index of f and to its barcode. We apply these deterministic results to the stochastic setting to draw consequences about the barcodes of random functions of prescribed regularity. These consequences also allow us to develop distributional discrimination tests for the processes, of which we present a particular example. Finally, we define the zeta-functions associated with a stochastic process and compute these functions and other related quantities for several processes in dimension one, including the Brownian motion and the alpha-stable Lévy processes.

    Tony Jin, João Ferreira, Michel Bauer, Michele Filippone, Thierry Giamarchi

  • 18 March 2025 hal-04994930

    The cutoff phenomenon, conceptualized at the origin for finite Markov chains, states that for a parametric family of evolution equations, started from a point, the distance towards a long time equilibrium may become more and more abrupt for certain choices of initial conditions, when the parameter tends to infinity. This threshold phenomenon can be seen as a critical competition between trend to equilibrium and worst initial condition. In this note, we investigate this phenomenon beyond stochastic processes, in the context of the analysis of nonlinear partial differential equations, by proving cutoff for the fast diffusion and porous medium Fokker-Planck equations on the Euclidean space, when the dimension tends to infinity. We formulate the phenomenon using quadratic Wasserstein distance, as well as using specific relative entropy and Fisher information. Our high dimensional asymptotic analysis uses the exact solvability of the model involving Barenblatt profiles. It includes the Ornstein-Uhlenbeck dynamics as a special linear case.

    Djalil Chafaï, Max Fathi, Nikita Simonov

  • 12 March 2025 hal-04974699

    Ce recueil regroupe les traductions en français des textes officiels de présentation des sept problèmes mathématiques du millénaire. Il donne donc un petit aperçu des mathématiques contemporaines.

    Andrew Wiles, Pierre Deligne, Charles Fefferman, John Milnor, Stephen Cook, Enrico Bombieri, Arthur Jaffe, Edward Witten, Nicolas Bacaër, Michel Balazard, Gérard Besson, Jean-Louis Colliot-Thélène, Isabelle Gallagher, Catherine Goldstein, Thibaut Lemoine, Sylvain Perifel, Claire Voisin