344
Banff International Research Station Proceedings 2017

Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Banff InternationalResearch StationProceedings 2017

Page 2: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial
Page 3: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Contents

Five-day Workshop Reports 1

1 Transport in Unsteady Flows: from Deterministic Structures to Stochastic Models and Back Again(17w5048) 3

2 Combinatorial Reconfiguration (17w5066) 14

3 Brain Dynamics and Statistics: Simulation versus Data (17w5036) 24

4 Optimization and Inference for Physical Flows on Networks (17w5165) 38

5 New trends in arithmetic and geometry of algebraic surfaces (17w5146) 50

6 Generated Jacobian equations: from Geometric Optics to Economics (17w5078) 60

7 Quantum Field Framework for Structured Light Interactions (17w5079) 73

8 Phase Transitions Models (17w5110) 82

9 Algebraic Combinatorixx 2 (17w5012) 98

10 Recent Advances in Discrete and Analytic Aspects of Convexity (17w5074) 109

11 Arithmetic Aspects of Explicit Moduli Problems (17w5065) 120

12 Mathematical Analysis of Biological Interaction Networks (17w5099) 132

13 Nilpotent Fundamental Groups (17w5112) 145

14 Diophantine Approximation and Algebraic Curves (17w5045) 156

15 Women in Control: New Trends in Infinite Dimensions (17w5123) 167

16 Latest Advances in the Theory and Applications of Design and Analysis of Experiments (17w5007) 178

17 Women in Numbers 4 (17w5083) 188

18 Future targets in the classification program for amenable C∗-algebras (17w5127) 204

19 Lattice walks at the Interface of Algebra, Analysis and Combinatorics (17w5090) 215

20 Symmetries of Surfaces, Maps and Dessins (17w5162) 229

21 p-adic cohomology and arithmetic applications (17w5118) 239

i

Page 4: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

ii

22 New Perspectives in Representation Theory of Finite Groups (17w5003) 248

23 Automorphic Forms, Mock Modular Forms and String Theory (17w5097) 259

24 Forest and Wildland Fire Management: a Risk Management Perspective (17w5145) 268

25 Approximation algorithms and the hardness of approximation (17w5133) 277

26 Nonlinear and Stochastic Problems in Atmospheric and Oceanic Prediction (17w5061) 287

Two-day Workshop Reports 293

27 Special Western Canada Linear Algebra Meeting (17w2668) 295

28 Modeling and Simulation: Practical Engineering Applications (17w2689) 299

29 Transdisciplinary Approaches to Integrating Policy and Science for Sustainability (17w2688) 305

30 Retreat for Young Researchers in Stochastics (17w2695) 309

Focused Research Group Reports 313

31 Extensions of index theory inspired by quantum scattering theory (17frg668) 315

32 Stochastic lattice differential equations and applications (17frg671) 321

Research in Teams Reports 327

33 Derivative Free and Black Box Optimization (17rit681) 329

34 The Structure of Finite Algebras and the Constraint Satisfaction Problem (17rit685) 332

35 Nikol’skii inequalities and their applications (17rit679) 336

Page 5: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Five-dayWorkshop

Reports

Page 6: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

2 Five-day Workshop Reports

Page 7: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 1

Transport in Unsteady Flows: fromDeterministic Structures to StochasticModels and Back Again (17w5048)

January 15 - 20, 2017Organizer(s): Sanjeeva Balasuriya (University of Adelaide), Daan Crommelin (CWI Amster-dam), Gary Froyland (University of New South Wales), Adam Monahan (University of Victo-ria), Nicholas Ouellette (Stanford University), Laure Zanna (University of Oxford)

Background and Rationale for WorkshopTransport in unsteady turbulent flows is an issue of central importance in many geophysical and engineering

problems. The problem is challenging from a mathematical perspective because of the broad range of scales ofmotions present in such flows. Additionally, these flows tend to self-organize into coherent structures, such assmall scale filaments and large scale jets. The multiscale and nonlinear interactions across these spatial scales hasa profound influence on the transport of energy, enstrophy, momentum, heat, plankton, or pollutants and chemicalsin the oceans and atmosphere. In recent years, there has been considerable interest and progress in identifyinglarger scale time-varying structures (such as mesoscale oceanic eddies, the circumpolar Antarctic vortex, oceanicjets and fluid interfaces) from observational and experimental data using methods based on rigorous mathematicsrather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23,38]), identify crucial dynamical barriers in the flow and characterize transport and mixing. At the same time, aparallel line of research has been pursued investigating stochastic dynamical representation of transport by small,unresolved scales of flow [16, 36]. This question is complicated by the two-way nature of the interaction: thelarger, resolved scales organize the smaller, unresolved ones; but these then feed back on the resolved scalesthrough nonlinear interactions. Processes that have been approached from this stochastic dynamical perspectiveinclude transport by mesoscale ocean eddies in ocean models [10, 50, 51, 27], stochastic backscatter of energyin numerical weather prediction models [43], and the mixing of scalar fields or chemical species in engineeringdevices [40, 14].

The dominant diagnostic methods for determining LCSs typically use velocity data to impute where fluidparticles go using deterministic advection [23, 6, 17, 18, 42, 38, 32, 21]. The velocity field at each position isassumed to be known precisely for all time, permitting the numerical advection of fluid particles from one timeinstance to another. For real unsteady flows, however, the full velocity field at future times and across all relevantspatial scales is typically not known. Stochastic modelling approaches have been developed to accommodate thelack of information about flow structures on some scales. To date, however, these approaches have made limiteduse of the insights regarding transport afforded by LCS for either the development of stochastic parameterizationsor the study of how these small-scale transports are modulated by larger, resolved coherent structures.

3

Page 8: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

4 Five-day Workshop Reports

The presence of unresolved small-scale variability limits the genuinely predictive capabilities of deterministiccoherent structure identification. However, coherent structures provide a low-dimensional yet meaningful descrip-tion of the flow dynamics and so developing ways to use them for flow prediction would be tremendously valuablefor both geophysical and engineering applications. The natural question that then emerges is how the stochas-tic dynamics and LCS perspectives can be brought together for a fluid dynamical analysis of the (probabilistic)predictability of transport in unsteady flows. One approach is through finite-time coherent sets for advection-diffusion equations and other stochastic processes [17]. There are also promising tools to approach this problemhave recently emerged from interesting new theoretical approaches in random dynamical systems [5, 13], whichfor example indicate that classical deterministic bifurcation behavior of invariant entities is qualitatively differentunder stochastic perturbations.

In the past year or so, there has been an emerging tentative dialogue between researchers from the stochasticanalysis and the fluid dynamics LCS communities. This workshop was designed to strengthen these nascentinteractions and promote their development towards an active, fruitful research dialogue by bringing togetherresearchers with complementary approaches to a topical problem. The hope was to push deterministic flow modelstowards being more realistic, and stochastic approaches towards being more applicable in fluid dynamics settings.This workshop therefore focussed on the interplay of stochastic effects and coherent flow structures on transportin unsteady flows, with a longer-term goal of addressing predictability, for which a probabilistic interpretationseems the most appropriate approach. By bringing together participants with expertise in theory, modeling andexperiment, the workshop planned to coax theory from dynamical systems, fluid dynamics and stochastic analysisto work together. The presence of experimental and observational researchers, whose insight helped guide modelsand theory in directions relevant for applications, was an important aspect of the workshop. Several early careerresearchers in modeling, theory and experiments were also among the invitees, with the goal of ensuring that theimpact of this workshop lasts into the future.

The workshop’s stated goals were:

1. Assess from experiment and observation the predictive utility of LCS and transport estimates arising fromdeterministic models of real world systems;

2. Investigate how LCS-based perspectives on transport can inform the development of stochastic models ofsubgrid-scale transport;

3. Harmonize the deterministic and stochastic perspectives on transport, and discover which objects are analo-gous to LCS for stochastic models.

Workshop StructureGiven the diverse nature of the research areas of the participants, we designed this workshop in a very specific

way to elicit interactions between the participants, in order to work towards the goals of the workshop. Thus, westructured the workshop around several distinct components:

1. Perspectives: Eleven participants, selected because of their broad knowledge in their subject area, theirdissemination skills, and their perceived potential to reach beyond their area, were invited to present ‘Per-spective’ talks on the first and part of the second days of the workshop. These talks were limited to 15minutes, with 15 minutes discussion thereafter. The Perspectives were intended to introduce certain areasof research to participants from other areas, and the presenters were asked specifically to pose questionsand issues that would spill over into related research areas. This highly challenging task, limited as it wasto 15 minutes, was taken up with avidity by all our presenters. Illuminating and thought-provoking Per-spectives were presented by Stephen Griffies (ocean mesoscales), Michael Ghil (variability in the oceanand atmosphere), Adam Monahan (atmospheric transport), Judith Berner (prediction), Jonathan Lilly (oceanobservations and turbulence), Cecile Penland (limitations of models), Guido Boffetta (bio-organisms underturbulence), Nicholas Ouellette (extracting coherence from experimental data), Gary Froyland (set-orientedtechniques for transport and coherence), Oliver Junge (computational methods for coherence using transferoperators), and Jason Frank (numerical discretization impacts on preserving coherence). Several presentersidentified well-defined issues or potential links to other areas, which led to rich discussions, often involvingmany of the participants.

Page 9: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Transport in Unsteady Flows: from Deterministic Structures to Stochastic Models and Back Again 5

2. Issue identification: The Perspectives and subsequent discussions brought into focus many issues for futurediscussion, and potential multidisciplinary collaborations between the participants. A listing of these issueswas maintained, and on day two of the workshop, participants were asked to vote on which of these issuesthey would like to focus. By a process of elimination, several subject areas were identified. After muchdiscussion, issues were amalgamated into three specific areas:

1. Lagrangian Coherent Structures: What Good Are They?

2. Stochastic Parameterization.

3. Prediction.

Thus, the issues to be discussed in detail emerged organically from the structure of the workshop.

3. Problem-solving groups: Significant time was devoted to addressing specific issues under the three groupsthat were identified above. The time allocated was roughly 1.5 hours on the second day, 5 hours on thefourth day, and 1 hour on the fifth day. Outlines of the discussions from each of these groups appears inlater sections of this report. On the whole, the discussions in these groups was lively and constructive, withthe participants taking considerable effort to reach across disciplinary barriers. The organizers wish to thankShane Keating and Jonathan Lilly in particular for help in coordinating the discussions, and setting up onlineresources to help document them. The discussions in general helped in bridging the terminology divide, inhelping understand why certain issues were considered important or difficult in various research areas, inseeking overlapping interests, and in formulating issues that could lead to future collaborations.

4. Standard talks: The workshop also had ongoing talks that were of the more standard variety. In keepingwith the goals of the workshop, efforts were made to give as much exposure as possible to early-careerresearchers to present their research. One way in which this was achieved was by limiting standard talksalso to 15 minutes. This resulted in well-thought out and interesting talks from both junior and senior par-ticipants, which also elicited much discussion. Talks were presented by Cecilia Gonzalez-Tokman, HusseinAluie, Alexis Tantet, Thomas Peacock, Valerio Lucarini, Jeroen Wouters, Kathrin Padberg-Gehle, JacquesVanneste, Daniel Karrasch, Philippe Miron, Nikki Vercauteren, Ryan Abernathey, Irina Rypina, GeorgGottwald, Peter Koltai and Shane Keating. The talks cut across many of the research areas representedin the workshop, and included aspects of ergodic theory, fluid experiments, turbulence approaches, transferoperators in the presence of noise, coral reef connectivity, multistability in climate, subgrid-scale parame-terization for finite time-scale separation, coherence obtained from trajectories, dispersion in large deviationregimes, coherence as thought of in terms of a heat equation, Gulf of Mexico geometry analysis from floatdata, atmospheric boundary layer flow structures, Eulerian eddy fluxes and Lagrangian eddies, encounternumber as a new diagnostic of mixing, statistical consistency of numerical integrators, space-time charac-terization of coherence, and stochastic turbulent modeling based on satellite observations of the ocean.

5. Poster session: A poster session was held in the evening of the second day. In contrast possibly with manyother workshops or conferences, the majority of the poster presenters were senior participants. Posterswere presented by Michael Allshouse, Sanjeeva Balasuriya, Daan Crommelin, Amber Holdsworth, DouglasKelley, Bjorn Schmalfuß and Marek Stastna.

Group on “Lagrangian Coherent Structures: What GoodAre They?”

As the central topic of the workshop, this group was the largest of the three and included many participantsfrom the oceanographic community and the mathematics/dynamics community. Several participants from theatmospheric science community also attended some of these sessions. Because of the open question asked ofthe group (“What good are Lagrangian coherent structures”), the first two sessions were mainly wide-rangingdiscussions about which questions are of scientific significance.

These sessions were also crucially an open and highly informative learning and knowledge-transfer envi-ronment, where the mathematicians/dynamicists were able to communicate directly with mathematically savvyoceanographers. In one direction, this served to explain the features and limitations of current mathematical and

Page 10: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

6 Five-day Workshop Reports

computational approaches for analysing dynamics with coherent structures, for example, sampling issues [28]. Inthe other direction, among several aspects, this provided an explanation of key aspects of ocean models, how thephysics of fluid flow might better connect with the relevant mathematics, and an appreciation of the important,unknown aspects of the lifecycle of coherent structures. Several participants remarked that these open discussionsessions provided a truly unique opportunity to engage with different communities in a very collegial atmosphere,and were in many ways the highlight and most valuable part of the workshop.

The question “What good are coherent structures?” was accompanied by the related questions “What is therole of coherent structures in dynamics?” and “What sort of transport is useful in ocean and atmospheric science?(at which scales, of what quantities, etc...).” Physical manifestations of coherent structures include hurricanes,western boundary currents, and jets. Eddies/vortices at all scales are another type of coherent structure occurringfrequently in both the atmosphere [45] and the oceans [48]. Many aspects related to the mixing and transportassociated with all of these coherent entities were highlighted in the discussions [28, 44, 45, 2, 48]. In addition tothe usual impacts on transport in the ocean, these features also play a role in coral reef connectivity and search andrescue efforts. Submesoscale coherent vortices are very important to fisheries. There is a hypothesis within thefisheries community, for example, that they act as “nurseries” for larval fish for periods of time that are relevantfor larval development; that is, the time it takes for a fish larva to grow large enough to swim. Thus, it wouldbe interesting to know how much exchange occurs across barriers. Coherent structures may also be useful fordimension reduction, as touched on by the “Predictability” group, and identifying regions across which it wouldmake sense to compute fluxes more generally. The dimension reduction in this case might, for example, be inextracting “modes” corresponding to each coherent structure.

There was a substantial discussion concerning coherent structures and the physical dynamics that must beobeyed in the oceanographic and atmospheric contexts. One could consider coherent structures as emergent be-havior, and it was noted that they arise in “maximal entropy” solutions. There is for example a strong sense inthe atmospheric community that coherent structures emerge due to baroclinic instabilities, and their geometry isassociated with wavenumbers that are preferentially chosen by the dynamics. Most current work with coherentstructures does not take into account such dynamics, but instead works directly with a given velocity field. Is itpossible to incorporate the governing dynamics into standard coherent structure analysis? This broad theme wasidentified as requiring additional investigation, and an area in which there could be further interaction between the(mathematical) coherent structures analysts, and the (physical law abiding) geophysical scientists.

Another aspect that arose in the discussions was that coherent structure identification could also be used as acriterion of “fitness” of global circulation models. The quality of global circulation models could be assessed bythe coherent structures they give rise to, and comparing these to observations.

This was strong interest from the oceanographic community in identifying coherent structures directly fromobservations, and it was noted that recent methods have been developed to do this, particularly in the case wherethe data is sparse, scattered, and possibly incomplete [3, 20, 22, 8, 19, 41]. Over the past few years, the coherentstructure community has gradually been developing tools that are targeted for explicitly this situation, and it ishoped that an ongoing development—consistent with the limitations of the observational data—will continue tooccur. The dialogue established at this workshop is expected to help in this endeavor, as the needs of the practi-tioners are becoming more apparent to the theorists, and the theoretical limitations of conclusions more obvious tothe geophysicists.

In addition to coherent structures acting as efficient transporters of heat, salt, nutrients, etc., there is also amodeling interest in better understanding their formation, interaction with surrounding fluid and each other, anddeath. A brief discussion on this issue did not lead to any obvious conclusions, but could potentially be an aspectfor future investigation.

The group was conscious of having to formulate some concrete problems for work in the immediate future. Thefollowing were identified as having the potential for such work, possibly among the participants of the workshop:

• Downscale GCM (Global Climate Model) model/drifter data and quantify sensitivity and uncertainty indiagnosis of coherent structures.

• Look at the dynamical role or physical significance of coherent structures in simple (but realistic) turbulencemodels, e.g., the recent data set [1] generated from the quasigeostrophic model, and others [44, 2, 39].

Page 11: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Transport in Unsteady Flows: from Deterministic Structures to Stochastic Models and Back Again 7

• Consider the lifecycle of coherent structures, the role of geometry in interacting / merging structures [4, 49],and the role of “entrained periphery” of the coherent core.

• Work on theoretical and applied approaches to the issue of incorporating stochasticity into deterministiccoherent structure viewpoints. (There is current progress in this direction [7] as a result of discussions at theworkshop, with several follow-up analyses currently in preparation.)

Group on “Stochastic Parameterization”The discussion sessions on stochastic parameterization were attended by participants from the atmospheric

science community, mathematicians involved in stochastic modeling, and oceanographers.Initially, much of the discussion focused on past experiences and results from stochastic parameterizations for

atmospheric processes. This was perhaps not surprising, as stochastic methods for parameterizing atmosphericprocesses (for example convection) have been under development for over 15 years [37, 11], whereas explorationof these methods for ocean modeling is a more recent development [39, 29, 51, 27, 50]. However, later on topicsmore specific to oceanography were also addressed in the discussions.

A marked difference in what is expected from stochastic parameterization in ocean and atmosphere modelingis that in current atmospheric GCMs, most of the spectrum of eddies are explicitly resolved, whereas in oceanGCMs, they are not. This is directly related to the difference in Rossby radius between atmosphere and ocean. Inthe atmosphere, it is on the order of 1000 km, a scale that is well resolved in current atmospheric GCMs. As aconsequence, atmospheric eddies exist primarily on synoptic scales and these are well resolved. By contrast, theoceanic Rossby radius becomes as small asO(10 km) at high latitudes, so that oceanic eddies are not well resolvedin most ocean GCMs. For ocean modeling, an important driver for the interest in stochastic parameterization comesfrom the intent to use these as a way of representing mesoscale eddies. These can be seen as part of the adiabaticprocesses of ocean flow, hence it is of interest that parameterizations do as little as possible to disturb the energy-conserving nature of the adiabatic processes. In the atmosphere, on the other hand, much of the research onstochastic parameterization is targeting the representation of diabatic processes (notably, convection).

A few of the issues and questions that were brought up during the discussions were:

• Are there guidelines for knowing when a stochastic parameterization will be useful or effective in the at-mosphere/ocean/GCMs? Before engaging in a search for the optimal method or approach for stochasticparameterization, one would like to be able to assess in what situations and under what conditions a stochas-tic parameterization can be useful at all. Two important aspects were discussed in this context: (i) Is theretemporal scale separation between the processes that are resolved and the processes that must be parameter-ized? This is to be distinguished from the question of whether or not there is spatial scale separation. (ii) Ifthe resolved scales of a system receive a large amount of random “kicks” (perturbations) from the unresolvedscales, so that the system is approaching (but has not reached) a kind of thermodynamic limit, a stochasticparameterization of these kicks can be appropriate.

• It was discussed that there are different goals that researchers have in mind for stochastic parameterizations.Representing sub-gridscale coherent structures and partially resolved coherent structures is one such goal. Arelevant distinction here relating to coherent structures and transport is between transport of coherent struc-tures and transport by coherent structures. Other aims that were stated in the discussion were (i) capturingnon-diffusive behavior (neglected in traditional diffusive eddy parameterizations), (ii) increasing variabilityon small scales, and (iii) improving the numerical stability of models.

• An issue that was raised several times during discussions is the absence of broadly agreed-on performancemeasures for stochastic parameterization. Should there be a “skill-score” for parameterizations, similar inspirit to the skill scores that are used to evaluate performance of numerical weather prediction models?This was considered useful, while it was also acknowledged that the different aims and expectations thatresearchers have for including stochastic parameterizations in their models make it difficult to identify per-formance measures that will be widely regarded as useful and important.

• Two further topics addressed in the discussion were (i) methodology and best-practices, and the advantagesand disadvantages of methodologies based on first principles versus more empirical approaches; and (ii)

Page 12: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

8 Five-day Workshop Reports

the usefulness of intermediate-complexity models for developing and testing stochastic parameterizationsmethods. This is a class of models that is no longer state of the art. However, testing methods in thesemodels can be very useful as an intermediate step between highly idealized models on the one hand andstate of the art models on the other.

During the week, a subgroup formed to discuss stochastic parameterization issues specifically regarding theatmospheric boundary layer (ABL). The primary question of interest was the simulation of the observed multipleregimes of the stably stratified ABL (SBL) and transitions between them [31, 33, 46, 47]. Under conditionsof stable stratification, the ABL is observed to exist in two states: the weakly stable boundary layer (WSBL)characterized by strong flow, a weak temperature inversion, and sustained turbulence; and the very stable boundarylayer (VSBL) characterized by weak flow, a strong temperature inversion, and weak and intermittent turbulence.These features are quite shallow (typically within a few tens of metres of the surface) and are therefore poorlyresolved by standard regional and global models of the atmosphere. Transitions from the WSBL to the VSBLtypically occur shortly after sunset during periods of strong radiative surface cooling; this process is expected tobe captured reasonably well by models. In contrast, the VSBL to WSBL transition is not well understood. Itappears to be related to a collection of intermittent turbulent processes of diverse origin (collectively denoted assubmesoscale, many of which processes are coherent structures) that has been described as a ”stochastic mix” [31].Some of these processes, such as instabilities associated with the resolved shear, may be simulated by regional andglobal models. Others, such as density currents and breaking gravity waves, will not be. Representation of the SBLhas been a persistent bias of weather and climate models, with consequent biases in simulation and prediction ofnear-surface temperature and wind fields of interest in applications ranging from agriculture to renewable energy.

The subgroup first discussed the problem in broad terms, considering which phenomena physically relevant toSBL dynamics can potentially be captured by existing parameterizations and which likely cannot. Initial attentionwas focused on the results of [9], which showed that even in high-resolution (1/3 km horizontal) mesoscale modelssuch as Weather Research and Forecasting (WRF), parameterized eddy diffusion strongly suppresses submesoscalemotion. Attention then turned to the fact that even those processes that can in principle be captured require finerthan standard vertical resolution. The following research plan was developed: targeted case studies will be carriedout using WRF for the time period and location corresponding to the well-instrumented CASES-99 field cam-paign. The initial simulations will vary vertical resolution, horizontal resolution, and the strength of parameterizeddiffusion to assess the effect of these factors on the simulation of SBL regimes and transitions between them. Hav-ing assessed this deterministic “baseline” of simulation quality, a new state-dependent stochastic representation ofsubmesoscale vertical turbulent transport will be introduced. The starting point for this parameterization will bethe statistical analyses of [47] and the proof-of-concept stochastic parameterization of [24]. By driving the localWRF simulation with the large-scale forecast ensemble produced for that time, we will also be able to assess howthe factors of increased resolution and stochastic physics improve ensemble spread in boundary layer quantities.

Some ideas were discussed for the next-generation of ocean parameterizations. Stochastic instead of deter-ministic methods were regarded as an interesting and promising approach, however other promising ideas wereconsidered as well. Anisotropic diffusion is drawing quite some interest in the oceanography community [15].Another promising approach is to adopt methods from non-Newtonian fluid mechanics [39, 4, 25].

Group on “Prediction”For much of their history of study, there has been a recognized link between coherent structures and prediction

in complex flows. By construction, coherent structures capture parts of the flow that maintain their structure forlong times relative to the rest of the field. Thus, their dynamics may be in some sense “simpler” than the rest ofthe flow, which could potentially be modeled in a purely stochastic way [35]. Lagrangian structures, like those thatwere the focus of this workshop, typically contain much more information and are much more stringently coherentthan their Eulerian counterparts, since they explicitly account for fluid advection. Thus, when they were firstintroduced, there was significant excitement about using them as new predictive tools. Unfortunately, Lagrangianstructures have not yet lived up to their promise in this regard, in part because methods for computing them typicallyrequire knowledge of the future evolution of the flow. To discuss whether this and other issues can be surmountedor whether Lagrangian structures will only ever be useful as a post-hoc analysis tool, one of the working groups atthis workshop focused on potential strategies for using Lagrangian structures for flow prediction.

One of the first issues the group faced is the question of what precisely is meant by ‘prediction’; that is, what

Page 13: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Transport in Unsteady Flows: from Deterministic Structures to Stochastic Models and Back Again 9

do we want to predict, and over what time horizon? In many cases of practical interest, one wants to know wheresome transported quantity will go; in the Deepwater Horizon oil spill, for example, the most important quantity tobe predicted is the future distribution of the oil. But in some other situations, such as in predicting the evolutionof a turbulent flow, one wants to know about the evolution of the flow properties themselves. It is possible thatdifferent strategies may have to be employed to handle these different kinds of predictions. It was also pointedout that, given the finite-time and aperiodic nature of the flows of interest, prediction is never going to be exactor mathematically rigorous, in part because the coherent structures themselves do not last forever in these kindsof flows. The best we may be able to do is to assign likelihoods to possible future scenarios based on the currentcoherent structures.

To do so, the group recognized that an important gap in our current understanding of coherent structures isthat we do not know much about their life cycle. That is, given the locations and some properties of coherentstructures now, and potentially at times in the past, we have little sense of how they will evolve in the future. Partof the reason for this lack of understanding is that current methods for locating Lagrangian structures have beendeveloped simply by considering the structure of the Lagrangian evolution equation x = u, where x is position,the dot signifies the time-derivative along a Lagrangian trajectory, and u is the velocity field. Typically, the velocityfield is assumed to be given, and no characteristics of its dynamics are assumed. In reality, however, the evolutionof the velocity field, and therefore the coherent structures themselves, is constrained by physics: its dynamics mustsatisfy conservation of mass, momentum, and energy, the laws of thermodynamics, and the like. Often, the velocityfield will be a solution of the Navier–Stokes equations, which bring their own complications; but in some cases,such as in data-assimilated large-scale ocean models, it will come from a somewhat simpler system. Thus, onepotential path forward for making coherent structures more predictive is to learn how to constrain their possiblefuture evolution by the properties of the velocity field that they arise from.

A second strategy the group discussed for making coherent structures more predictive is to model their evolu-tion by an ordinary differential equation. The (approximate) flow evolution could then be captured by a finite (andhopefully small) set of such ODEs, one per structure, that are coupled, but that would be easier to solve than the fullNavier-Stokes equations. This idea is reminiscent of how the utility of coherent structures was originally conceivedof by the turbulence community [35]. Developing such an ODE model, however, may be quite challenging, sinceLagrangian structures are potentially complex, spatially extended objects. It may thus be necessary to simplify thestructures themselves to be able to model them appropriately. It was noted that this approach is now being taken inthe context of Koopman operators, by approximating them in terms of only a few eigenmodes. Something similarmay be possible to do with transfer operators and almost-invariant sets, and then their evolution may potentiallybe modeled by an ODE (which may potentially have to be stochastic to compensate for the information removedby the spectral truncation). Much work remains to be done to make these ideas viable in practice for the unsteadyflows of interest in many applications, since these eigenmode decompositions have been only been studied so farfor steady or periodic cases. But they do represent a promising strategy going forward.

The final topic discussed by the group was what appropriate test cases are for developing strategies for predic-tion using coherent structures. Clearly, steady or time-periodic flows are not sufficient; in those cases, predictionis anomalously easy, since the time dependence is simple. Thus, some of the classic test cases for studying La-grangian structures, such as the double gyre, may not be appropriate for looking at their connection to prediction.Looking to observational data from, for example, the ocean is likely not the best choice either, due to inherentfinite spatiotemporal resolution and numerous physical processes that will affect the evolution of structures outsideof pure fluid advection. The group suggested that an appropriate case of intermediate difficulty might be a wellcontrolled laboratory experiment or direct numerical simulation of Navier–Stokes, where the flow can be forced tobe statistically stationary and will have known length, time, and velocity scales.

Thus, although much work remains to be done, the discussion group on prediction via coherent structures wasable to identify concrete paths forward that should be explored in future research.

Outcomes and the FutureThe topic of this workshop was highly multidisciplinary, and its structure was designed to initiate dialogues

between researchers from many areas. The general sentiment from participants and organizers was that this pur-pose had been achieved to a large degree. The highly collegial atmosphere in which all discussions were heldled to a greater understanding of each others’ research areas, and bodes well for future interactions between theparticipants.

Page 14: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

10 Five-day Workshop Reports

The groups discussed many different ways in which statistical models, coherent structures, and applicationsin oceanography, atmospheric sciences and turbulence overlapped. Many nascent intersections were identified, ashave been detailed in the separate reports from each of the groups. (These will not be separately enumerated here.)It is expected that some of these will coalesce into coherent transdisciplinary research projects in the near future.

Additionally, there were many areas/topics that saw lively debates but that did not necessarily result in a well-defined identification of a problem or an approach. It is hoped that the seeds planted at this workshop will in themedium term result in the crystallization of these into cogent research questions and/or grant proposals. A numberof the participants discussed initial plans to organize a follow-up meeting within a couple of years.

ParticipantsAbernathey, Ryan (Lamont Doherty Earth Observatory of Columbia University)Allshouse, Michael (Northeastern University)Aluie, Hussein (University of Rochester)Balasuriya, Sanjeeva (University of Adelaide)Berner, Judith (National Center for Atmospheric Research)Boffetta, Guido (Universita degli Studi di Torino)Braverman, Elena (University of Calgary)Crommelin, Daan (CWI Amsterdam)Frank, Jason (Utrecht University)Froyland, Gary (University of New South Wales)Ghil, Michael (University of California at Los Angeles)Gonzlez Tokman, Cecilia (The University of Queensland)Gottwald, Georg (The University of Sydney)Griffies, Stephen (National Oceanic & Atmospheric Administration)Holdsworth, Amber (University of Victoria)Junge, Oliver (Technical University of Munich)Karrasch, Daniel (Technische University Munchen)Keating, Shane (University of New South Wales)Kelley, Douglas (University of Rochester)Koltai, Peter (Freie Universitat Berlin)Lilly, Jonathan (NorthWest Research Associates)Lucarini, Valerio (University Hamburg)Miron, Philippe (University of Miami)Monahan, Adam (University of Victoria)Ouellette, Nicholas (Stanford University)Padberg-Gehle, Kathrin (TU Dresden)Peacock, Thomas (Massachusetts Institute of Technology)Penland, Cecile (NOAA/Earth Systems Research Lab/Physica Science Div.)Rypina, Irina (Woods Hole Oceanographic Institution)Schmalfuss, Bjorn (Friedrich-Schiller-University Jena)Stastna, Marek (University of Waterloo)Tantet, Alexis (Utrecht University)Tarshish, Nathaniel (Princeton University)Thiffeault, Jean-Luc (University of Wisconsin)Vanneste, Jacques (University of Edinburgh)Vercauteren, Nikki (Freie Universitat Berlin)Wouters, Jeroen (University of Sydney)

Page 15: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] R. Abernathey, C.B. Rocha, F.J. Poulin, M. Jansen and J. Penn, PYQG: Python Quasigeostrophic Model(2016). pyqg: v0.2.0 [Data set]. Zenodo. http://doi.org/10.5281/zenodo.50569.

[2] R. P. Abernathey and P. Cessi, Topographic enhancement of eddy efficiency in baroclinic equilibration. J. Phys.Oceanogr. 44 (2014) 2107-2126.

[3] M. Allshouse and J.-L. Thiffeault, Detecting coherent structures using braids, Physica D 241 (2012) 95-105.

[4] J. Anstey and L. Zanna, A deformation-based parametrization of ocean mesoscale eddy Reynolds stresses,Ocean Modelling 112 (2017) 99-111.

[5] L. Arnold, Random Dynamical Systems, Springer-Verlag, NY, 1998.

[6] S. Balasuriya, Barriers and transport in unsteady flows: A Melnikov approach, Series on Mathematical Mod-eling and Computation, SIAM Press, Philadelphia, 2016.

[7] S. Balasuriya, Stochastic uncertainty of advected curves in finite-time unsteady flows, Phys. Rev. E (2017),submitted.

[8] R. Banisch and P. Koltai, Understanding the geometry of transport: Diffusion maps for Lagrangian trajectorydata unravel coherent sets, Chaos 27 (2017), 035804.

[9] D. Belusic and I. Guttler, Can mesoscale models reproduce meandering motions, Q. J. R. Meterol. Soc. 136(2010), 553-565.

[10] P.S. Berloff, Random-forcing model of the mesoscale oceanic eddies. J. Fluid Mech. 529 (2005), 71–95.

[11] J. Berner et al., Stochastic parameterization: towards a new view of weather and climate models. Bull. Amer.Meteorol. Soc., 98 (2017), 565-588.

[12] A. E. BozorgMagham, S. D. Ross, and D. G. Schmale III, Real-time prediction of atmospheric Lagrangiancoherent structures based on forecast data: An application and error analysis, Physica D 258 (2013), 47-60.

[13] M.D. Chekroun, E. Simonnet and M. Ghil, Stochastic climate dynamics: Random attractors and time-dependent measures. Physica D 240 (2011), 1685-1700.

[14] P. Dimotakis, Turbulent mixing, Annu. Rev. Fluid Mech. 37 (2005), 329–356.

[15] B. Fox-Kemper, R. Lumpkin, and F. O. Bryan, Lateral transport in the ocean interior. In Ocean Circulation andClimate: A 21st century perspective (G. Siedler, S. M. Griffies, J. Gould, and J. A. Church, Eds.), InternationalGeophysics Series, 103, pages 185-209. Academic Press, 2013.

[16] J.S. Frederiksen and A.G. Davies, Eddy viscosity and stochastic backscatter parameterizations on the spherefor atmospheric circulation models. J. Atmos. Sci. 54 (1997), 2475–2492.

[17] G. Froyland, An analytic framework for identifying finite-time coherent sets in time-dependent dynamicalsystems, Physica D 250 (2013), 1–19.

11

Page 16: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

12 Five-day Workshop Reports

[18] G. Froyland, Dynamic isoperimetry and the geometry of Lagrangian coherent structures, Nonlinearity 28(2015), 3587.

[19] G. Froyland and O. Junge, Robust FEM-based extraction of finite-time coherent sets in using scattered, sparse,and incomplete trajectories. Arxiv 2017.

[20] G. Froyland and K. Padberg-Gehle. A rough-and-ready cluster-based approach for extracting finite-time co-herent sets from sparse and incomplete trajectory data. Chaos 25 (2015), 087406.

[21] A. Hadjighasem, M. Farazmand, D. Blazevski, G. Froyland and G. Haller, A critical comparison of La-grangian methods for coherent structure detection, Chaos (2017), in press.

[22] A. Hadjighasem, D. Karrasch, H. Teramoto, and G. Haller, Spectral clustering approach to Lagrangian vortexdetection, Phys Rev E 93 (2016), 063107.

[23] G. Haller, Lagrangian Coherent Structures, Annual Review of Fluid Mechanics 47 (2015), 137–162.

[24] Y. He, A.H. Monahan and N.A. McFarlane, The influence of bounary layer processes on the diurnal variationof the climatological near-surface wind speed probability distribution over land, J. Climate 25 (2012), 6441-6458.

[25] D. D. Holm and B. A. Wingate, Baroclinic instabilities of the two-layer quasigeostrophic alpha model. J.Phys. Oceanogr. 35 (2005), 1287-1296.

[26] A.A.M. Holtslag, G. Svensson, P. Baas, S. Basu, B. Beare, A.C.M. Beljaars, F.C. Bosveld, J. Cuxart, J.Lindvall, G.J. Steeneveld, M. Tjernstrom and B.J.J. van de Wiel, Stable atmospheric boundary layers and diurnalcycles - challenges for weather and climate models, Bull. Amer. Meteor. Soc. 11 (2013), 1691-1706.

[27] S. Juricke, T. N. Palmer, and L. Zanna. Stochastic perturbations to sub-grid scale ocean mixing: Impacts onlow frequency variability. Journal of Climate doi=10.1175/JCLI-D-16-0539.1 (2017).

[28] S. Keating, K. Smith, and P. Kramer, Diagnosing lateral mixing in the upper ocean with virtual tracers: spatialand temporal resolution dependence. J. Phys. Oceanogr. 41 (2011), 1512-1534.

[29] D. Kondrashov and P. Berloff, Stochastic modeling of decadal variability in ocean gyres. GeophysicalResearch Letters, 42 (2015), 1543-1553.

[30] F. Lekien, C. Coulliette, A. J. Mariano, E. H. Ryan, L. K. Shay, G. Haller, and J. Marsden, Pollution releasetied to invariant manifolds: A case study for the coast of Florida, Physica D 210 (2005), 1-20.

[31] L. Mahrt, Stably stratified atmospheric boundary layers, Annu. Rev. Fluid Mech. 46 (2014), 23-45.

[32] I. Mezic, S.Loire, V. Fonoberov and P. Hogan, A new mixing diagnostic and Gulf oil spill movement, Science330 (2010), 486–489.

[33] A.H. Monahan, T. Rees, Y. He and N. McFarlane, Multiple regimes of wind, stratification, and turbulence inthe stable boundary layer, J. Atmos. Sci. 72 (2015), 3178-3198.

[34] M. J. Olascoaga and G. Haller, Forecasting sudden changes in environmental pollution patterns, Proceedingsof the National Academy of Sciences of the USA 109 (2012), 4738-4743.

[35] N. T. Ouellette, On the dynamical role of coherent structures in turbulence, Comptes Rendus Physique 13(2012), 866-877.

[36] T.N. Palmer, A nonlocal dynamical perspective on model error: a proposal for nonlocal stochastic-dynamicparametrization in weather and climate prediction models. Q. J. R. Meteorol. Soc. 127 (2001), 279–304.

[37] T. Palmer and P. Williams (Eds.), Stochastic physics and climate modelling, Cambridge University Press,2010.

Page 17: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Transport in Unsteady Flows: from Deterministic Structures to Stochastic Models and Back Again 13

[38] T. Peacock and J. Dabiri, Introduction to focus issue: Lagrangian coherent structures, Chaos 20 (2010),017501.

[39] P. Porta Mana and L. Zanna, Toward a stochastic parameterization of ocean mesoscale eddies, OceanModelling, 79 (2014), 1-20.

[40] B. Sawford, Turbulent relative dispersion, Annu. Rev. Fluid Mech. 33 (2001), 289–317.

[41] K. Schlueter-Kuck and J. Dabiri, Coherent structure coloring: identification of coherent structures from sparsedata using graph theory, Journal of Fluid Mechanics 811 (2017), 468-486.

[42] S.C. Shadden, Lagrangian Coherent Structures. In: Transport and Mixing in Laminar Flows: FromMicrofluidics to Oceanic Currents (R. Grigoriev, ed), Wiley-VCH, 2011.

[43] G. J. Shutts, A kinetic energy backscatter algorithm for use in ensemble prediction systems. Q.J.R. Meteorol.Soc. 131, (2005), 3079–3102.

[44] A. Thompson and W. Young, Scaling baroclinic eddy fluxes: Vortices and energy balance. J. Phys. Oceanogr.36 (2006), 720-738.

[45] C.D. Thorncroft, B.J. Hoskins and M.E. McIntyre, Two paradigms of baroclinic-wave life-cycle behaviour.Quart. J. R. Meteorol. Soc. 119 (1993), 17-55.

[46] I.G.S. van Hooijdonk, J.M.M. Donda, H.J.H. Clercx, F.C. Bosveld and B.J.H. van de Wiel, Shear capacity asa prognostic for nocturnal boundary layer regimes, J. Atmos. Sci. 72 (2015), 1518-1532.

[47] N. Vercauteren, L. Mahrt and R. Klein, Investigation of interactions between scales of motion in the stableboundary layer, Q.J.R. Meterol. Soc. 142 (2016), 2424-2433.

[48] Y. Wang, F. J. Beron-Vera and M. J. Olascoaga. The life cycle of a coherent Lagrangian Agulhas ring, J.Geophys. Res. Oceans 121 (2016) 3944-3954.

[49] S. Waterman and J. M. Lilly, Geometric decomposition of eddy feedbacks in barotropic systems, J. Phys.Oceanogr. 45 (2015), 1009–1024.

[50] P. D. Williams, N. J. Howe, J. M. Gregory, R. S. Smith, and M. M. Joshi, Improved climate simulationsthrough a stochastic parameterization of ocean eddies, J. Climate 29 (2016), 8763–8781.

[51] L. Zanna, P. Porta Mana, J. Anstey, T. David and T. Bolton. Scale-aware deterministic and stochasticparametrizations of eddy-mean flow interaction. Ocean Modelling 111 (2017), 66-80.

Page 18: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 2

Combinatorial Reconfiguration (17w5066)

January 22 - 27, 2017Organizer(s): Naomi Nishimura (University of Waterloo), Takehiro Ito (Tohoku University),Amer Mouawad (University of Bergen)

Overview of the FieldReconfiguration is the study of relationships among solutions to an instance of a problem, and in particular the

step-by-step transformation from one solution into another such that the intermediate result of each step is also asolution. As a concrete example of a reconfiguration problem, we consider the assignment of customers to powerstations such that each customer obtains as much power as is required without exceeding the capacity of what eachstation can produce. Each such assignment can be viewed as a solution. When a station needs to be repaired, itmay be necessary to change to another assignment, moving a single customer at each step in order to minimizedisruptions. Commonly-asked questions in the area of reconfiguration include both structural questions concerningthe solution space and algorithmic questions about the complexity of solving various related problems.

For any combinatorial problem, there are multiple ways of defining the solution space, and for each suchdefinition, there are a variety of related problems. A reconfiguration graph for an instance of a problem can bedefined as a set of nodes, one for each solution to the problem (where there may be constraints with respect tosolution size), and a set of edges, where there is an edge between any two nodes that are adjacent (for somedefinition of adjacency). The definition of adjacency gives rise to the notion of a reconfiguration step, an operationby which a solution is transformed into an adjacent solution. The sequence of solutions traversed in a sequence ofreconfiguration steps, or equivalently, a path in the reconfiguration graph through the nodes associated with thosesolutions, is called a reconfiguration sequence.

Typical questions that arise include the following:

• Given two solutions, does there exist a reconfiguration sequence transforming one into the other? This isknown as the reachability problem.

• Given two solutions, what is the length of the shortest reconfiguration sequence transforming one into theother? This is the decision version of the shortest transformation problem.

• Given two solutions, produce a reconfiguration sequence (or a shortest reconfiguration sequence). This isthe search version of the shortest transformation problem.

• Given a problem, under what additional constraints (if any) is it guaranteed that for any instance the recon-figuration graph is connected? This is the connectivity problem.

• Given a problem, under what additional constraints (if any) is it guaranteed that for any instance the recon-figuration graph has a bound on the diameter?

14

Page 19: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Combinatorial Reconfiguration 15

For a given problem, there may be more than one natural definition of adjacency (and, corresponding to eachsuch definition, the notion of a reconfiguration step). For the problem of independent set, where the solution is asubset of the vertices such that there is no edge between any of the vertices in the set, we can view a reconfigurationstep as changing the position of a token, where there is a token on each vertex in the independent set. Thisimmediately implies at least three possible ways of moving tokens: token jumping (TJ) [14], in which a token ismoved from a vertex to any other vertex in the graph, token sliding (TS) [8, 14], in which a token is moved froma vertex along an edge to a neighbouring vertex, and token addition/removal (TAR) [11, 14], where in each stepa token is added or removed. For the problem of k-colouring, where each vertex in a graph is assigned a coloursuch that for each edge the endpoints have different colours, a reconfiguration step could consist of changing thecolour of a single vertex or swapping two colour classes. In yet another example of adjacency, for the problem ofsatisfiability of Boolean formulas, a reconfiguration step might consist of the flipping of a single variable in thetruth assignment (from true to false or from false to true).

For the reconfiguration steps of token jumping and token sliding, all feasible solutions have the same numberof tokens. In other situations, it may be necessary to give a bound on the size of solutions under consideration.As an example problem, we consider the problem of vertex cover, where the goal is to determine a subset of thevertices of a graph such that each edge in the graph has at least one endpoint in the set. If we take as our feasiblesolutions all vertex covers, of any size, then it is easy to see that we can find a reconfiguration sequence from anyvertex cover to any other: at each step we add one of the vertices in the target solution (adding a vertex to a solutiongives another solution) and after all have been added, removing a vertex that is not in the target solution (whichis still a solution, as it contains all the vertices in the target solution). A more interesting problem results fromrestricting the size of solutions to be at most k or k + 1, where k is the size of a minimum solution.

In addition to characterizing the solution spaces of classical computational problems (as well as solutions topuzzles), reconfiguration is related to problems in a variety of areas. In the area of network security, a network canbe represented as a graph where each vertex corresponds to a server and each edge to a communication link; a step-by-step change in firewalls corresponds to the reconfiguration of a cut. In the Frequency Assignment Problem, thegoal is to assign frequencies to users of a wireless network such that the interference between them is minimizedand the range of used frequencies is as small as possible. Frequencies can be viewed as colours, giving rise to acolouring problem. Due to the frequent changes in demand and addition of new transmitters as well as the difficultyof finding optimal assignments, there is a need to change between assignments. Reconfiguration gives a methodof doing so without interrupting service for all customers. As a final example, the moving of objects on a plane(modeled by moving tokens on a graph) has applications in the 3D-printing industry, where a “head” follows apath in laying down a layer of material. The use of multiple heads to reduce printing time necessitates ways to planmovement without collisions and, ideally, minimizing the distance traveled.

Recent Developments and Open ProblemsAs the field is quite young, all progress in the field qualifies as recent developments. Below are listed various

types of results along with related open problems.Defining AdjacencyRelations among different types of reconfiguration steps have been demonstrated; for independent set recon-

figuration, TJ and TAR are equivalent [14], and for clique reconfiguration, TJ, TAR, and TS are all equivalent [12].In this context, problems are equivalent if an instance is solvable using one type of reconfiguration step if and onlyif it is solvable using another type of reconfiguration step.

Open problems in this area include defining adjacency for problems with solutions that are, for example, givenas sequences rather than sets, or for areas such as computational geometry.

Structural ResultsStructural results focus on properties of the reconfiguration graph; most results in the category specify when

the reconfiguration graph is connected. Most of the results of this type focus on the problems of k-colouring anddominating set. In the latter problem, the goal is to find a set of vertices such that each vertex outside the set has aneighbour in the set.

There are many open problems with respect to not only connectivity of the reconfiguration graph, but alsoother properties, such as k-connectivity, properties of isolated vertices, and characteristics of various connectedcomponents.

Hardness Results

Page 20: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

16 Five-day Workshop Reports

In many cases, there is a correspondence between the complexity of a classical problem and the complexityof the related reachability reconfiguration problem: often both can be solved in polynomial time, or both are in-tractable (NP-complete and PSPACE-complete, respectively). Many of the PSPACE-hardness proofs follow thechain of reductions used for NP-hardness reductions, starting with the PSPACE-completeness of 3SAT reconfigu-ration [5].

There are, however, a few exceptions to this general pattern. Although the problem of 3-colouring is NP-complete, its reachability reconfiguration problem can be solved in polynomial time [13]. Conversely, the short-est path problem can be solved in polynomial time, but its reachability reconfiguration problem is PSPACE-complete [1]. Although exceptions to the general rules are known, currently there are no results explaining thereasons for such exceptions.

AlgorithmsGreedy algorithms have been developed for matching, independent set, and minimum spanning tree, and dy-

namic programming algorithms for list colouring, shortest path, and k-colouring. In addition, the complexity ofproblems has been considered using the framework of parameterized complexity, where the goal is to find algo-rithms with running times that are polynomial in the size of the input but possibly exponential (or worse) in termsof one or more parameters of the problem. Various results consider as parameters the sizes of solutions, the lengthsof reconfiguration sequences, and other attributes of inputs and problems [16].

One area with many open problems is that of finding shortest transformations between solutions. Among thefew known results are algorithms for satisfiability [15] and independent set using token-sliding on caterpillars [19].

Presentation HighlightsIn order to facilitate as much collaboration as possible, the workshop time was divided among technical talks,

open problem presentations, and follow-up discussions. The first two types of activities are summarized here, witha selection of completed work and work in progress detailed in Section 2.

Both the high quality of the contributions and the engagement of the participants led to exciting new collabo-rations.

Technical TalksTakehiro Ito (Tokohu University, Japan): Invitation to Combinatorial Reconfiguration The workshop started

with this introductory talk. It is possible to view the field of reconfiguration as middle ground between standardsearch problems, which ask for one solution to an instance of a problem, and enumeration problems, which ask forall solutions to an instance of a problem. It is important to note that since it is a decision problem, a reachabilityproblem does not require the generation of an actual reconfiguration sequence, which may be super-polynomial inlength.

The history of combinatorial reconfiguration was presented, showing both how the field was developed and inwhat new directions it might go. Most of the content can be found in Sections 2 and 2.

Nicolas Bousquet (Institut Polytechnique de Grenoble, France): Token Sliding on Chordal Graphs(Joint work with Bonamy)The reconfiguration of independent sets using token sliding was first considered by Hearn and Demaine [8],

who showed that the reachability problem is PSPACE-complete on planar graphs. In later results, polynomial-timealgorithms have been found for trees, cographs, claw-free graphs, and bipartite permutation graphs.

In new work, the authors show that both the reachability and connectivity problems can be solved in polynomial-time on interval graphs. The algorithm makes use of the geometric representation of an interval graph as an inter-section graph of intervals on the line, which can be obtained in polynomial time. Using this representation, onecan define the leftmost independent set. Although a naive approach may not yield a polynomial-time algorithm,dynamic programming can be employed to obtain algorithms for both problems.

In addition, the authors show that on split graphs (where the vertex set can be partitioned in two sets, oneinducing a clique and the other inducing a set of isolated vertices), the connectivity problem is co-NP-hard andco-W[2]-hard. In particular, they use the fact that k-Dominating Set is NP-hard and W[2]-hard and then show thatthe result still holds when there is no blocking set of size at most k + 1, where in a blocking set no vertex hasa private neighbour. Using a similar construction, the same hardness results can be derived for the connectivityproblem on bipartite graphs.

Akira Suzuki (Tohoku University, Japan): Reduction Tools on NCL

Page 21: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Combinatorial Reconfiguration 17

Nondeterministic Constraint Logic was first introduced by Hearn and Demaine [8] to show the hardness ofsliding block puzzles. In a constraint graph, the directed edges have weights of 1 or 2 and are either legal or not.In the legal direction, each vertex is the head of at least two weight 2 edges. The NCL problem is to determinewhether a target edge can be reversed by a sequence of legal moves, where in a legal move an edge is reversedsuch that the resulting direction is legal.

The problem NCL is difficult even if each vertex is either an AND vertex (the output edge is outgoing if bothinput edges are incoming) or an OR vertex (the output edge is outgoing if at least one input edge is incoming).Using this fact, PSPACE reductions are possible using NCL with just AND and OR gadgets. Demonstrations weregiven for sliding blocks and for edge colouring.

In addition, NCL can be generalized by introducing the notion of a neutral, undirected edge. This new versioncan accomplish all that can be accomplished using NCL, but often results in fewer gadgets. One of the new typesof gadgets (in addition to AND and OR gadgets) is a link gadget.

In new work [18], it can be shown that reconfiguration of list edge-colouring and edge-colouring are PSPACE-complete using AND, OR, and link gadgets. This result completes the classification of the complexity of thereconfiguration of list edge-colouring by showing that the problem is in P for k ≤ 3 and PSPACE-complete other-wise. For edge-colouring, the complexity is still open for k = 4, though this result proves PSPACE-completenessfor k ≥ 5.

The talk ended with some open questions about the complexity of more restricted constraint graphs.Karen Seyffarth (University of Calgary, Canada): Reconfiguring Vertex Colourings of 2-Trees(Joint work with Cavers)Recent work was presented in the area of the connectivity problem for k-colouring for various classes of

graphs. In 2008, Cereceda, van den Heuvel, and Johnson [3] proved that the reconfiguration graph for k-colouringis connected for all k ≥ col(H) + 1, where col(H) = maxδ(G) | G ⊆ H+ 1 is the colouring number of H .

More recently, Choo and MacGillivray [4] proved a connection between colouring numbers and Hamiltonic-ity, namely, that k0(H) ≤ col(H) + 2, for k0(H) the Gray code number of H , the least integer such that thereconfiguration graph for k-colouring has a Hamiltonian cycle for all k ≥ k0(H). In the same paper, values of k0

were shown for complete graphs, trees, and cycles. Later, Gray code numbers were found for complete bipartitegraphs [2].

The presentation gave a proof of the Gray code numbers for 2-trees. Except for certain cases in which k0(H) =5, the Gray code number for 2-trees is 4.

Matthew Johnson (Durham University, UK): Kempe Equivalence in Regular Graphs(Joint work with Bonamy, Bousquet, Feghali, and Paulusma)For a k-colouring, an (a, b)-component is a maximal connected subgraph whose vertices are coloured a or

b; such components are called Kempe chains. A Kempe change is the exchanging of the colours a and b of thevertices in an (a, b)-component, resulting in another proper colouring. A Kempe class consists of k-colourings thatare Kempe equivalent (one can be obtained from another by a sequence of Kempe changes).

In 1981, Las Vergnas and Meyniel showed that the set of k-colourings of a d-degenerate graph form a Kempeclass, k > d, where a graph is d-degenerate if every induced subgraph has a vertex of degree at most d. Moharconjectured in 2007 that for k ≥ 3, the k-colourings of a k-regular non-complete graph form a Kempe class, butin 2013 van den Heuvel found a counterexample for k = 3 [10].

Bonamy, Bousquet, Feghali, Johnson, and Paulusma have proved that for k ≥ 3, if G is a connected k-regulargraph that is neither complete nor the triangular prism, then the k-colourings of G form a Kempe class. In theirproof, they make use of the clique cutset lemma of Las Vergnas and Meyniel, and consider separately k-regulargraphs that are not 3-connected and k-regular graphs that are 3-connected. In the latter case, they make use of amatching lemma, and consider k-regular 3-connected graphs of diameter 2.

The talk ended with several open problems, including the question of whether the 5-colourings of a toroidaltriangular lattice form a Kempe class (which would have an application in physics, proving the validity of theWSK algorithm for simulating the antiferromagnetic Potts model) and what can be determined about the numberof Kempe changes needed to transform one colouring into another.

Jan van den Heuvel (London School of Economics and Political Science, UK): Token-Sliding Problems(Joint work with Brightwell and Trakultraipruk)

Page 22: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

18 Five-day Workshop Reports

The classic 15-puzzle can be interpreted as a problem of moving tokens on a graph, which can then be gener-alized to consider different graphs. A result by Wilson in 1974 shows that the reconfiguration graph for a slidingblock puzzle is connected except if the puzzle graph is a cycle on n ≥ 4 vertices, is bipartite and not a cycle, orthe exceptional graph Θ0 on 7 vertices.

To further generalize the problem, one can ask what would happen if there were more than a single uncoveredvertex and/or if not all tokens were considered to be identical. The authors show that the reconfiguration graph isconnected except if the puzzle graph is not connected, a path with at least two token labels, a cycle with at leasttwo token labels, a cycle with two token labels, with at least two of the same label, a 2-connected bipartite graphwith n − 1 distinct tokens, the graph Θ0 with one of four possible labelings of tokens, or has connectivity 1 withat least two token labels and a separating path preventing tokens from moving between blocks.

A related problem is that of finding the shortest reconfiguration sequence, which Goldreich showed to be NP-complete for cases with n − 1 different tokens and van den Heuvel and Trakultriapruk (2014) showed to be in Pwhen all tokens are the same, but NP-complete when at least one token is special and all others are identical. Theproof is based on a result for robot motion by Papadimitriou, Raghavan, Sudan, and Tamaki (1994).

Moritz Muhlenthaler (Technische Universitat Dortmund, Germany): Reconfiguration in Matroids RevisitedReconfiguration properties of matroids have been studied, in different guises, at least since the 1970s. The talk

touched very briefly on several interesting results from the past 40 years in this area and put them in the currentunified setting of reconfiguration problems [11, 10]. Furthermore, some new aspects of reconfiguration in matroidswere investigated, namely the complexity of reconfiguring common independent sets of two or more partitionmatroids. In particular, it was shown that reachability of two common independent sets of two partition matroidscan be decided in polynomial time, while for three or more matroids, the task becomes PSPACE-complete. Sincemany combinatorial optimization problems can be phrased in terms of finding maximum common independentsets of partition matroids, there are many interesting applications of these results.

Ruth Haas (University of Hawaii, USA): Reconfiguration of Dominating Sets(Joint work with Seyffarth)A subset S of the vertices of a graph is a dominating set if and only if every other vertex is adjacent to a

vertex of S. The domination number, γ(G), is the minimum cardinality of a dominating set of G, and the upperdomination number, Γ(G), is the maximum cardinality of a minimal dominating set of G. Various models ofdomination reconfiguration have been studied, e.g. by Subramaniam, Sridharan, and Fricke and by Hedetniemi,Hedetniemi, and Hutson; in addition, the γ-graph uses only γ sets and token jumping.

The goal is to find d0(G), the smallest value of k such that the reconfiguration graph is connected for allk ≥ d0(G). The authors showed in 2014 that do(G) = Γ(G) + 1 for bipartite graphs and chordal graphs [6], withfurther results obtained by Suzuki, Mouawad, and Nishimura. Alikhani, Fatehi, and Klavzar considered whichgraphs might be reconfiguration graphs.

In the talk, the following new results were presented:

• All independent dominating sets are in the same connected component of the reconfiguration graph fork = Γ(G) + 1, where an independent dominating set is a maximal independent set.

• If G is both perfect and irredundant perfect, then d0(G) = Γ(G) + 1, where a set is irredundant if everyvertex in the set has a private neighbour, and precise definitions of the terms relate sizes of independent setsand clique cover numbers.

• For certain classes of well-covered graphs, d0(G) = Γ(G) + 1, where a graph is well-covered if everymaximal independent set has the same cardinality.

Open Problem PresentationsHenning Fernau (Universitat Trier, Germany) Various extensions and generalizations of reconfiguration were

proposed, as detailed below.Building on ideas on generalizing the notion of adjacency, Fernau proposed approaching problems by deter-

mining what definition of adjacency would result in a connected reconfiguration graph. It was noted that for manyproblems there is a trivial solution, such as in the case of vertex cover, when all intermediate solutions are vertexcovers of any size (a vertex cover V1 can be reconfigured to a vertex cover V2 by adding all vertices in V2 and thendeleting all vertices in V1).

Page 23: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Combinatorial Reconfiguration 19

Another proposal was to consider the relationship between reconfiguration and reoptimization, where the goalis to determine a solution for an instance to a problem given a “similar” instance and its solution. Combining andgeneralizing the concepts could lead to the notion of a sequence of steps that changes both the instance and thesolution.

It was further observed that although many reconfiguration problems turn out to be computationally difficult,it may be beneficial to focus on results in areas where hardness is a positive attribute. One example could be takenfrom computational social choice, considering how easy it is for an election to be rigged.

Other possible topics include the reconfiguration of problems in computational geometry as well as that ofstring editing.

Tatsuhiko Hatanaka (Tokohu University, Japan): Optimizing a Colouring via a Reconfiguration Sequence Inthis presentation, a new variant of reconfiguration was introduced. In the optimization variant, the goal is to findthe optimal solution over all solutions reachable from a given solution. In particular, for the problem of colouringreconfiguration, an optimal solution is one that minimizes the number of colours used.

Preliminary results include a proof that the problem is NP-hard when the number of colours is at least five, anda polynomial-time algorithm for numbers of colours no greater than three.

Jan van den Heuvel (London School of Economics and Political Science, UK) The focus of this presentationwas the k-colouring reconfiguration problem, considering both the reachability problem and the connectivity prob-lem. The reachability problem can be solved in polynomial time when the number of colours, k, is at most three,and is PSPACE-complete otherwise (as a consequence of [8]).

For the connectivity problem, the answer is always no when k = 2 and is co-NP-complete when k = 3.The complexity remains open for k ≥ 4. There followed a discussion of the co-NP-completeness result, and inparticular the folding operation, where a vertex has two nonadjacent neighbours that can be folded together. It canbe shown that a bipartite instance G is a no-instance of the connectivity problem for k = 3 if and only if G can befolded to a cycle on six vertices. One might wish to try to generalize this to k = 4 by showing that G can be foldedto a cube; however, this is not the case. Further work is required on this problem, perhaps using another technique,and also perhaps by considering techniques for nonbipartite graphs.

Nicolas Bousquet (Institut Polytechnique de Grenoble, France): Graph Recolouring on Sparse Classes ofGraphs Based on a conjecture by Cereceda in 2007, one possible approach to characterizing the reconfigurationgraph for k-colouring is by considering the degeneracy of the graph and using that to determine the diameter of thereconfiguration graph. In particular, the conjecture states that the reconfiguration graph is of quadratic diameterwhen the number of colours is two greater than the degeneracy of the input. The conjecture has been shown to betrue for k-regular graphs, trees, chordal graphs, bounded treewidth graphs, and distance-hereditary graphs.

So far, all the results have used nothing more than a tree decomposition of the input graph, raising the questionas to whether other techniques might be applied.

Another question concerns the relationship between mixing time and the diameter of the reconfiguration graph,and in particular a characterization of classes of graphs for which the diameter of the reconfiguration graph is linear.Results are known for trees and k-degenerate graphs.

Kunihiro Wasa (National Institute of Informatics, Japan)This talk focused on the reconfiguration of optimal amidakujis. An amidakuji, also known as a ladder lottery,

is a way of generating a permutation by creating a series of vertical and horizontal bars. The number of verticalbars is the number of items to be permuted. The placement of horizontal bars joining adjacent vertical bars dictatesthe permutation, where from the top of each vertical bar, a path is traced following all horizontal bars encountered.An amidakuji is optimal if it uses the minimum number of bars to achieve the particular permutation.

The talk presented an algorithm that can be used to determine the shortest reconfiguration sequence betweenoptimal amidakujis, and presented open problems on various variants.

Ryuhei Uehara (Japan Advanced Institute of Science and Technology (JAIST), Japan)Starting with a general overview of the complexity of various games and puzzles, it was observed that most

one-player games are NP-complete whereas most two-player games are PSPACE-complete or EXP-complete [7].As an exception to this general rule, many sliding block puzzles, in which it is possible to return to the same state,are PSPACE-complete.

Uehara is the director of the JAIST Gallery, which houses NOB’s Puzzle Collection, a collection of about10,000 puzzles from around the world. Although he did not bring the entire collection to share with us, he did

Page 24: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

20 Five-day Workshop Reports

share his latest purchase, the Qubigon, which is a generalization of the well-studied slide-block 15-puzzle. Theobservation that the puzzle makes use of 18 of the 20 possible locations at once led to questions about the rela-tionship between the number of tokens and the number of locations more generally. (This was later addressed in apresentation by van den Heuvel.)

An additional open problem that was discussed related to token sliding for independent sets.

Scientific Progress MadeMatching Shortest TransformationMany techniques have been developed in the last few years in order to design polynomial time algorithms to

determine the existence of a transformation between two configurations (reachability). On the other hand, veryfew of them have been proposed to compute shortest transformations between two configurations. One of the mainreasons is that the shortest transformation problem seems to be much harder than the reachability problem evenif only little evidence has been found so far in that direction (i.e. we need a very simple reconfiguration problemin order to obtain a polynomial-time algorithm for the shortest transformation problem). One of our motivationswas to find problems on which the shortest transformation problem is hard while the reachability problem is verysimple. Another motivation consisted in developing new tools to prove that the shortest transformation problem ishard.

During the workshop, Nicolas Bousquet, Tatsuhiko Hatanaka, Takehiro Ito and Moritz Muehlenthaler foundpreliminary results on the hardness of shortest transformation for the matching problem on graphs of maximumdegree 4. In particular it implies that the shortest transformation is hard for the reconfiguration of independentsets on the token jumping and token sliding variants on line graphs. It answers a question of Ryuhei Uehara onthe complexity of the TJ problem on claw-free graphs raised during the workshop. The proof technique might begeneralized in order to show that there is no constant factor approximation algorithm for the Matching ShortestTransformation problem.

Equitable Colouring ReconfigurationA colouring of a graph is an assignment of colours to its vertices so that no two adjacent vertices have the

same colour, whereas an equitable colouring of a graph is a colouring where the difference between the numberof vertices coloured by any two colours is at most one. Equitable colourings have many real-world applications inscheduling, load-balancing, and timetabling, and has been the subject of much research interest. The question ofwhether one colouring can be transformed to another colouring by changing the colour of only one vertex at eachstep has also been studied in the field of reconfiguration problems and has been proved to be PSPACE-complete.

Tatsuhiko Hatanaka, Haruka Mizuta, and Krishna Vaidyanathan considered the reconfiguration of equitablecolourings, showing that the problem is PSPACE-complete, even for planar graphs using only four colours.

String Editing ReconfigurationDiscussions among Henning Fernau, Ruth Haas, Matthew Johnson, Naomi Nishimura, and Karen Seyffarth

focused on the issues surrounding solution spaces in which each solution consists of a sequence (in this case,a sequence of edit operations). Preliminary observations included the fact that a reconfiguration sequence is it-self a solution in the form of a sequence, allowing for the discussion of the metaproblem of reconfiguration ofreconfiguration.

Additional observations found connections to work on permutations, including Kunihiro Wasa’s open problempresentation on amidakujis.

Homomorphism ReconfigurationVijay Subramanya and Ben Moore considered the homomorphism reconfiguration problem for graphs on at

most four vertices. They showed that H-reconfiguration is in P when H is C4 or the diamond graph.Furthermore, they conjectured that if H1- and H2-reconfiguration are in P, then H-reconfiguration is also in

P, where H is obtained by joining H1 and H2 at a vertex. They proved the conjecture when H1 and H2 are bothequal to C3. This result extends to the case where H is a series of C3’s.

NCL and MatroidsTwo open problems presented by Akira Suzuki about restricted variants of nondeterministic constraint logic

can be rephrased in terms of connectivity of common independent sets of two partition matroids. Since NCL is atool which has been developed for proving hardness of puzzles, this is quite an unexpected connection.

Outcome of the Meeting

Page 25: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Combinatorial Reconfiguration 21

The objectives of the workshop were all met, as elaborated below; further outcomes are listed in Section 2.Providing an Opportunity for Joint Discussion by Researchers in Reconfiguration from All over the

WorldThe workshop was very successful in bringing together researchers from different countries and different re-

search groups. Unfortunately, funding and visa issues resulted in last-minute cancellations by researchers fromIndia and Lebanon.

In the interests of maintaining our world-wide inclusiveness, in our business meeting we decided to try toalternate the continents on which future workshops will be held, with the 2019 workshop to be held in France.

Identifying Future Research DirectionsThe many open problem sessions and follow-up discussions generated many future research directions. As

detailed in Section 2, progress has already been made in several of these directions.Deepening the Area by Establishing a Set of Common Methods and Algorithmic TechniquesAt the business meeting we discussed possible approaches to collecting and displaying known results. The

prototype of a website was shown and discussed.Broadening the Area by Making Connections to Related Areas and ProblemsSeveral of the presentations introduced the audience to new applications of reconfiguration, such as the appli-

cation of colouring to the Potts model (Section 2). In other talks, unexpected connections were discovered, suchas between ladder lotteries, string editing, and permutations (Sections 2 and 2) and between NCL and matroids(Section 2).

Additional OutcomesIn addition to the planned outcomes, to further the research community we are planning to hold a minisympo-

sium on reconfiguration at the 6th biennial Canadian Discrete and Algorithmic Mathematics Conference in 2017,and to have a special issue on reconfiguration in the journal Algorithms.

ParticipantsBousquet, Nicolas (Ecole Centrale de Lyon)Fernau, Henning (University Trier)Haas, Ruth (Smith College)Hatanaka, Tatsuhiko (Tohoku University)Ito, Takehiro (Tohoku University)Johnson, Matthew (Durham University)Mizuta, Haruka (Tohoku University)Moore, Benjamin (Simon Fraser University)Muehlenthaler, Moritz (University of Erlangen-Nuremberg)Nishimura, Naomi (University of Waterloo)Ono, Hirotaka (Kyushu University)Otachi, Yota (Japan Advanced Institute of Science and Technology)Seyffarth, Karen (University of Calgary)Subramanya, Vijay (University of Waterloo)Suzuki, Akira (Tohoku University)Uehara, Ryuhei (Japan Advanced Institute of Science and Technology)Vaidyanathan, Krishna (University of Waterloo)van den Heuvel, Jan (The London School of Economics and Political Science)Wasa, Kunihiro (National Institute of Informatics)

Page 26: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] P. Bonsma, The complexity of rerouting shortest paths, In Proceedings of the 37th International Symposiumon Mathematical Foundations of Computer Science, 2012, 222–233.

[2] M. Celaya, K. Choo, G. MacGillivray, and K. Seyffarth, Reconfiguring k-colourings of complete bipartitegraphs, Kyungpook Math. J. 56 (2016), 647–655.

[3] L. Cereceda, J. van den Heuvel, and M. Johnson, Connectedness of the graph of vertex colourings, DiscreteMath. 308 (2008), 913–919.

[4] K. Choo and G. MacGillivary, Gray code numbers for graphs, Ars Math. Contemp. 4 (2011), 125–139.

[5] P. Gopalan, P. G. Kolaitis, E. N. Maneva, and C. H. Papadimitriou, The connectivity of Boolean satisfiability:computational and structural dichotomies, SIAM Journal on Computing 38(6) (2009), 2330–2355.

[6] R. Haas and K. Seyffarth, The k-dominating graph, Graphs and Combinatorics, 30(3) (2014), 609–617.

[7] R. A. Hearn and E. D. Demaine, Games, Puzzles, and Computation, A K Peters, Wellesley Massachusetts,2009.

[8] R. A. Hearn and E. D. Demaine, PSPACE-completeness of sliding-block puzzles and other problems throughthe nondeterministic constraint logic model of computation, Theoretical Computer Science, 343(1–2) (2005),72–96

[9] R. Haas and K. Seyffarth, The k-dominating graph, Graphs and Combinatorics, 30(3) (2014), 609–617.

[10] J. van den Heuvel, The complexity of change, In Surveys in Combinatorics, Cambridge University Press(2013), 127–160.

[11] T. Ito, E. D. Demaine, N. J. A. Harvey, C. H. Papadimitriou, M. Sideri, R. Uehara, and Y. Uno On thecomplexity of reconfiguration problems, Theoretical Computer Science, 412(12-14) (2011), 1054–1065.

[12] T. Ito, H. Ono, and Y. Otachi, Reconfigurations of cliques in a graph, In Proceedings of the 12th AnnualConference on Theory and Applications of Models of Computation, 2015, 212–223.

[13] M. Johnson, D. Kratsch, S. Kratsch, V. Patel, and D. Paulusma, Finding shortest paths between graph colour-ings, In Proceedings of the 9th International Symposium on Parameterized and Exact Computation, 2014, 221-233.

[14] M. Kaminski, P. Medvedev, and M. Milanic, Complexity of independent set reconfigurability problems,Theoretical Computer Science, 439 (2012), 9–15.

[15] A. E. Mouawad, N. Nishimura, V. Pathak, and V. Raman, Shortest reconfiguration paths in the solutionspace of Boolean formulas, In Proceedings of 42nd International Colloquium on Automata, Languages, andProgramming, 2015, 985–996.

22

Page 27: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Combinatorial Reconfiguration 23

[16] A. E. Mouawad, N. Nishimura, V. Raman, N. Simjour, and A. Suzuki, On the parameterized complexityof reconfiguration problems, In Proceedings of the 8th International Symposium on Parameterized and ExactComputation, 2013, 281–294.

[17] A. E. Mouawad, N. Nishimura, V. Raman, and M. Wrochna, Reconfiguration over tree decompositions, InProceedings of the 9th International Symposium on Parameterized and Exact Computation, 2014, 246–257.

[18] H. Osawa, A. Suzuki, T. Ito, and X. Zhou, The complexity of (list) edge-coloring reconfiguration problem, InProceedings of the 11th International Conference and Workshops on Algorithms and Computation, to appear,2017.

[19] T. Yamada and R. Uehara, Shortest reconfiguration of sliding tokens on a caterpillar, In Proceedings of the10th International Workshop on Algorithms and Computation, 2016, 236–248.

Page 28: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 3

Brain Dynamics and Statistics: Simulationversus Data (17w5036)

February 27- March 3, 2017Organizer(s): Susanne Ditlevsen (University of Copenhagen), Priscilla Greenwood (Universityof British Colombia), Andre Longtin (University of Ottawa)

MotivationThe human brain is arguably the most complex biological structure. Making sense of its inner workings poses

an enormous challenge. Different components of its activity evolve over periods ranging from one thousandth ofa second to years, and over distances of one millionth of a meter to the size of the skull. Experiments perturband probe activity over these scales of time and space, yielding reams of data. Our cohort of scientists is leadingthe effort to organize these data into a coherent picture using state-of-the-art mathematical modelling techniques.This picture encompasses deterministic and random components. This is a consequence of the fact that part of thebrain’s activity seems to evolve in a predictable structured manner, while other parts are unpredictable, even whenresponding to the same stimulus or initiating the same action.

The goal of this workshop was to develop methods to characterize these components, how they interact witheach other, and how this interaction changes when stimuli impinge on the brain or the brain acts upon its envi-ronment. The techniques and advances that were discussed are at the forefront of two fields that seldom cometogether, namely dynamics and statistics, which respectively address the predictable and unpredictable aspects ofthe brain’s behaviour. The rising and established experts brought together by this workshop aim to gain a deeperunderstanding of the brain’s operational principles in the presence of variability, an understanding grounded inquantifiable ”statistical” certainty. The group is also driven by potential novel directions in mathematics that theintersection of concepts from dynamics and statistics will offer.

Overview of the topicThe primary goal of the workshop was to inspire collaboration among researchers in theoretical and experi-

mental neuroscience, by discussing statistical methods, stochastic modeling tools, and experimental possibilities.The data come from intra- and extracellular recordings of single neurons, revealing both spiking and subthresh-

old patterns, local field potential data, spatio-temporal measurements which show us spatial patterns with the extradimension of time, EEG and fMRI data. Specific questions are: What are the important modes of informationtransmission in the brain? Does it occur via spiking patterns, or time averages of patterns? Are various rhythmsalso important? Surprisingly little is known. We know about some simple aspects of the operation of primaryvisual cortex, such as how single cells code for edges or movement at different angles; but the representation ofthe many features that make up a stimulus across different spatio-temporal scales of activity is an unsolved puzzle.

In order to make progress we needed to bring together people who gather neural data and understand lab work,people who have an extensive but usually specialized current knowledge about the brain, people who work in the

24

Page 29: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Brain Dynamics and Statistics: Simulation versus Data 25

mathematics of dynamical systems, particularly stochastic dynamical systems, and people who have facility in thestatistics of stochastic processes. The common requirement is that these people are devoted to understanding thedynamics of the brain.

Despite a considerable recent body of literature on statistical inference and stochastic modeling in neuroscience,there remain substantial unsolved problems and challenges, some of which we addressed during the workshop.Examples are: understanding the role of stochasticity in the brain, developing efficient Monte Carlo methods forinference, understanding the relationship between system behavior (i.e. bifurcations and their stochastic counter-parts) and statistical properties of estimates based on data from these systems, and reconstructing the structure ofnetwork models from partially observed data.

The workshop discussed the way data are processed and understood in the biological sciences, from a dy-namical stochastic approach. Neuroscience is experiencing an imbalance between prowess in data collection anddeficiency in data interpretation. The improved models for data processing are expected to be of significant the-oretical interest from biophysical, mathematical and statistical viewpoints. Our work contributes to all parts ofneuroscience, from single cell analysis to the understanding of neuronal network dynamics, and potentially be-yond.

Main subjects of the workshop• The role of stochastic dynamics in the Brain

– Single neuron models

– Mean field models

– Network models

– Hypoelliptic models and the statistical challenges

• Information transmission, encoding and decoding

• Applications of neural modeling to brain physiology

– Control of Oscillatory Brain Dynamics

– EEG data during anaesthesia in humans

– Artifical neural network, biological instantiation of processing

– The weakly electrical fish

– Red-eared turtle

Page 30: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

26 Five-day Workshop Reports

Presentation HighlightsWe next give an overview of the presentations within the overall headings above, together with abstracts.

The role of stochastic dynamics in the BrainPeter Thomas: Noise in the Brain: Statistical and Dynamical Perspectives There is growing interest inapplying statistical estimation methods to dynamical systems arising in neuroscience. The discipline of statisticsprovides an intellectual framework for quantifying and managing uncertainty. For statistical methods to apply, onemust consider a system with some variability. Depending on where one locates the source of variability, differentmethods suggest themselves. The talk will discuss some challenges and opportunities in linking statistical andmathematical perspectives in theoretical neuroscience, including the problem of quantifying ”phase resetting” instochastic oscillators, the problem of identifying the most significant sources of noise in a finite state Markovmodel, the problem of inferring control mechanisms in brain-body motor control systems, and the problem ofparameter identification in noisy conductance-based models.

Adeline Samson: Hypoelliptic stochastic FitzHugh-Nagumo neuronal model: mixing, up-crossing and esti-mation of the spike rate Joint work with J.R. Leon (Universidad Central de Venezuela). In this presentation,we will introduce the hypoelliptic stochastic neuronal model, as diffusion approximation of intra-cellular or extra-cellular models. Then we will specifically focus on the FitzHugh-Nagumo model, and detail some of its properties:stationary distribution, definition of spikes, estimation of spiking rate.

Eva Locherbach: Memory and hypoellipticity in neuronal models I will discuss the effect of memory in twoclassical models of neuroscience, first the stochastic Hodgkin-Huxley model, considered in a series of papers by R.Hopfner, M. Thieullen and myself, and second, a model of interacting Hawkes processes in high dimension whichhas been studied recently by S. Ditlevsen and myself. I will explain why memory leads to hypoellipticity and thendiscuss the probabilistic and statistical consequences of this fact.

Antoni Guillamon: At the crossroad between invariant manifolds and the role of noise I will drift alongseveral topics in which we are currently involved. They all contain some treatment of recent tools from dynamicalsystems, mainly related to invariant manifolds, intertwined with the role of stochasticity. We aim at sharing ourresearch interests to boost discussion and eventual collaborations. The first example will be on bistable percep-tion where we are exploring up to which extent noise is necessary to explain data obtained from psycophysicalexperiments, as opposed to the information provided assuming quasi-periodic forcing. Second, we will focus onphase response curves (PRC) for oscillating neurons, a well-known tool to study the effectiveness of informationtransmission between cells; we have extended the notion of PRC to that of phase response field/function (PRF),also valid away from pure periodic behaviour. Stochastic PRCs have been already studied but PRFs constitute amore natural setting when neurons are subject to noisy inputs; we have neither explored the impact of noise onPRFs nor the extension of the concept to stochastic processes yet and, moreover, we also seek for experimentaltests of our theoretical findings. Therefore, we think it can be interesting to address these open questions in this au-dience. If time allows, we will comment on some other issues (estimation of conductances and short-term synapticdepression) that combine concepts of dynamical systems with the presence of noise.

Zachary Kilpatrick: Maintaining spatial working memory across time in stochastic bump attractor mod-els We discuss various network mechanisms capable of making spatial working memory more robust to noiseperturbation and error. The canonical example we begin with arises from classic oculomotor delayed responsetasks whereby a subject must maintain the memory of a location around a circle over the period of a few seconds.Asymptotic methods are used to reduce the dynamics of a bump attractor to a stochastic differential equation whosedynamics are governed by a potential that reflects spatial heterogeneity in the network connectivity. Heterogeneitycan serve to reduce the degradation of memory overtime, ultimately increasing the transfer of information forwardin time. We also show that connectivity between multiple layers of a working memory can further serve to stabi-lize memory, especially if they possess propagation delays. We conclude by discussing recent work, where we are

Page 31: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Brain Dynamics and Statistics: Simulation versus Data 27

modeling the phenomenon whereby a previous trials response attracts the current trials response, sometimes calledrepetition bias.

Lawrence Ward: Pattern formation via stochastic neural field equations The formation of pattern in bio-logical systems may be modeled by a set of reaction-diffusion equations. A form of diffusion-type coupling termbiologically significant in neuroscience is a difference of Gaussian functions used as a space-convolution kernel.Here we study the simplest reaction-diffusion system with this type of coupling. Instead of the deterministic formof the model, we are interested in a stochastic neural field equation, a space-time stochastic differential-integralequation. We explore, quantitatively, how the parameters of our model that measure the shape of the couplingkernel, coupling strength, and aspects of the spatially-smoothed space-time noise, control the pattern in the result-ing evolving random field. We find that a spatial pattern that is damped in time in a deterministic system may besustained and amplified by stochasticity, most strikingly at an optimal space-time noise level.

Laura Sacerdote: Integrate and Fire like models with stable distribution for the Interspike Intervals In1964, Gernstein and Mandelbrot [1] proposed the Integrate and Fire model to account for the observed stablebehavior of the Interspike Interval distribution. Their study of histograms of ISIs revealed the stable property andthey suggested modeling the membrane potential through a Wiener process in order to get the inverse Gaussianas first passage time distribution, i.e. a stable distribution. Holden (1975) [2] observed that stable distributionsdetermine a simple transmission pathway. Later many variants of the original model appeared with the aim toimprove its realism but meanwhile researches forgot the initial clue for the model. The Leaky Integrate and Firemodel that has not stable FPT distribution gives an example. The same holds for many other variants of thismodel. More recently Persi et al. (2004) [3] studying synchronization patterns, proposed a time non homogeneousintegrate and fire model accounting for heavy tail distributions. The existence of heavy tails, typical of stabledistributions is well recognized in the literature (see for example Tsubo et al [4] , Gal and Morom [5] and referencescited therin) Signals from different neurons are summed up during the elaboration. Different ISIs distributionswould determine an incredible variety of firing distributions as the information progresses in the network. However,it seems unrealistic to admit ISIs that cannot reproduce the same distribution when summed. This suggest thedevelopment of Integrate and Fire models using stable distributions. Furthermore, the stable ISIs paradigm givesrise to a more robust transmission algorithm since a possible lack of detection of some spike from the surroundingneurons does not change the nature of the final distribution. Here we rethink to the problem, taking advantage of themathematical progresses on Levy processes [6]. Hence, we propose to start the model formulation from the mainproperty, i.e. the stable nature of the ISIs distribution. We follow the Integrate and Fire paradigm but we modelthe membrane potential through a randomized random walk whose jumps are separated by intertimes with stabledistribution (or in the domain of attraction of such distribution). Observing that the supremum of the modeledmembrane potential results to be the inverse of a stable subordinator allows to determine the Laplace transform ofthe ISIs distribution. This is a preliminary contribution since we limit ourselves to some aspects of the modellingproposal ignoring any attempt to fit data. We are conscious that these are preliminary results and some furthermathematical study will be necessary to obtain models fitting data. In particular we expect an important role oftempered stable distributions [7]. [1] Gerstein G.L., Mandelbrot B. (1964) Random walk models for the activityof a single neuron. Biophys. J. 4: 41-68. [2] Holden, A.V. (1975) A Note on Convolution and Stable Distributionsin the Nervous System. Biol. Cybern. 20: 171-173. [3] Persi E., Horn D., Volman V., Segev R. and Ben-JacobE. (2004) Modeling of Synchronized Bursting Events: the importance of Inhomogeneity. Neural Computation 16:2577-2595. [4] Tsubo Y., Isomura Y. and Fukai T. (2012) Power-Law Inter-Spike Interval Distributions infer aConditional Maximization of Entropy in Cortical Neurons PLOS Computational Biology 8, 4: e1002461. [5] GalA., and Marom S. (2013) Entrainment of the Intrinsic Dynamics of Single Isola Neurons by Natural-Like Input TheJournal of Neurosciences 33(18), 7912-7918. [6] Kyprianou, A. (2014) Fluctuations of LA c©vy Processes withApplications. Springer Verlag, Berlin/Heidelberg. [7] Rosinski, J. (2007) Tempering stable processes. StochasticProcesses and their Applications. 117, 6: 677-707

Jonathan Touboul: Noise in large-scale neuronal networks, brain rhythms and neural avalanches It is nowfolklore that intracellular membrane potential recordings of neurons are highly noisy, owing to a variety of random

Page 32: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

28 Five-day Workshop Reports

microscopic processes contributing to maintenance. At macroscopic scales, reliable, fast and accurate responses tostimuli emerge, and are experimentally described through various quantities. I will focus in particular on mathe-matical models of large neuronal networks can account for the type of experimental data observed in synchronizedoscillations ( sources of brain rhythms) or neural avalanches. These two quantities are relevant in that rhythmsreportedly support important functions such as memory and attention, while distributions of avalanches were re-ported to reveal that the brain operates at criticality where it maximizes its information processing capacities. Iwill show that a simple theory of large-scale dynamics allows understanding under a common framework bothphenomena. In particular, I will show the relatively paradoxical phenomenon that noise can contribute to synchro-nization of large neural assemblies, and that the experimentally reported heavy-tailed distributions of avalanchedurations and sizes may be in fact related to Boltzmanns molecular chaos that naturally emerges in limits of largeinteracting networks with noise. These works were developed with Alain Destexhe and Bard Ermentrout.

Janet Best: Variability and regularity in neurotransmitter systems In this talk I will discuss a couple of recentthemes in my work: (1) Population models of deterministic neural systems enable one to capture the biologicalvariability in individuals. These models can explain how the same circuits can operate differently in differentindividuals or can change in time in individuals because of dynamic changes in gene expression. They can be usedto discover the characteristics of subpopulations in which drugs are efficacious or deleterious and are thereforeuseful for clinical trials; (2) In volume transmission neurons don’t engage in one on one transmission but projectchanges in biochemistry over long distances in the brain. Thus, the nuclei containing the cell bodies (like the dorsalraphe nucleus for serotonin) are basically acting as endocrine organs. Recent work with Sean Lawley shows whyvolume transmission works, that is why concentrations of neuromodulator in the extracellular space are very evendespite the fact that terminals and varicosities are distributed unevenly and firing may be random.

Shigeru Shinomoto: Emergence of cascades in the linear and nonlinear Hawkes processes The self-excitingsystems as represented by neural networks are known to exhibit catastrophic chain reaction if the internal inter-action exceeds an epidemic threshold. Recently, we have shown that the same systems may exhibit nonstationaryfluctuations already in the subthreshold regime [1,2]; cascades of events or spikes may emerge in the Hawkes pro-cess even in the absence of external forcing. In a practical situation, however, systems are subject to time-varyingenvironment in addition to internal interaction between elements. Here we attempt to estimate the degree in whichthe nonstationary fluctuations occurring in the system are induced by external forcing or internal interaction. [1]T. Onaga and S. Shinomoto, Bursting transition in a linear self-exciting point process. Physical Review E (2014)89:042817. [2] T. Onaga and S. Shinomoto, Emergence of event cascades in inhomogeneous networks. ScientificReports (2016) 6:33321.

Romain Veltz: Quasi-Synchronisation in a stochastic spiking neural network I will discuss a recent meanfield (from E. Locherbach and collaborators, R. Robert and J.Touboul) of a stochastic spiking neural networkfrom a dynamical systems perspective. More precisely, I will present some recent results concerning the quasi-synchronisation of the neurons as function of the different parameters of the network (gap junction conductances,synaptic strength...).

Information transmission, encoding and decodingMassimiliano Tamborrino: Novel manifestation of the noise-aided signal enhancement We discuss varioustype of noise-aided signal enhancements. Since not all stimulus levels can be decoded with the same accuracy,it is of paramount interest to determine which stimulus intensities can be discriminated most precisely. It is wellknown that the presence of noise corrupts signal transmission in linear systems. Nevertheless, noise may have apositive effect on signal processing in nonlinear systems, as confirmed by the stochastic resonance phenomenon.Stochastic resonance is typically observed in systems with a threshold in presence of a weak signal. However, thesubthreshold regime is not a necessary condition when considering more than one neuron, since, for example, asuprathreshold signal may be also enhanced by noise in a network of threshold devices. Other phenomena wherenoise enhances the signal are for example coherence resonance and firing-rate resonance. We present a study of

Page 33: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Brain Dynamics and Statistics: Simulation versus Data 29

the decoding accuracy of the stimulus level based on either the first-spike latency coding or the rate coding (fromthe exact spike counting distribution) in a neuronal model as simple as the perfect integrate-and-fire model. Wereport counter-intuitive results, representing a novel manifestation of the noise-aided signal enhancement whichdiffers fundamentally from the usual kinds reported on.

Benjamin Lindner: Spontaneous activity and information transmission in neural populations In my talk Iwill first review features of the spontaneous activity of nerve cells in neural populations and in recurrent networks,ranging from effects of cellular properties (e.g. noise and leak currents) and slow external noise (up/down transi-tions in a driving population) to the slow fluctuations that can build up due to the recurrent connectivity. I will thendiscuss how these features affect information transmission and stimulus detection, for instance, enable or suppressinformation filtering, benefit overall population coding, or lead to the detection of a short single cell stimulation ina large recurrent network.

Richard Naud: Burst ensemble multiplexing: connecting dendritic spikes with cortical inhibition Twodistinct types of inputs impinge on different spatial compartments of pyramidal neurons of the neocortex. A popularview holds that the input impinging on the distal dendrites modulates the gain of the somatic input encoding.This gain modulation is thought to participate in top-down processes such as attention, sensory predictions andreward expectation. Here we use computational and theoretical analyses to determine how the two input streamsare represented simultaneously in a neural ensemble. We find that dendritic calcium spikes in the distal dendritesallows multiplexing of the distal and somatic input streams by modifying the proportion of burst and singlet events.Two ensemble-average quantities encode the distal and somatic streams independently: the event rate and the burstprobability, respectively. Simulations based on a two-compartment model reveal that this novel neural code canmore than double the rate of information transfer over a large frequency bandwidth. To corroborate these findings,we determined analytically the parameters regulating mutual information in a point process model of bursting.Secondly, we find that an inhibitory microcircuitry combining short-term facilitation and short-term depressioncan decode the distal and somatic streams independently. These results suggest a novel functional role of bothactive dendrites and the stereotypical patterns with which inhibitory cell types interconnect in the neocortex. Burstensemble multiplexing, we suggest, is a general code used by the neural system to flexibly combine two distinctstreams of information.

Applications of neural modeling to brain physiologyJeremie Lefebvre: State-Dependent Control of Oscillatory Brain Dynamics Numerous studies have shownthat periodic electrical stimulation can be used not only to interfere with the activity of isolated neurons, but alsoto engage population-scale synchrony and collective rhythms. These findings have raised the fascinating prospectof manipulating emergent brain oscillations in a controlled manner, engaging neural circuits at a functional level toboost information processing, manipulate cognition and treat neurobiological disorders (so called oscillopathies).Capitalizing on this, it has been shown that brain stimulation can be tuned to alter perception and task performance.Rhythmic brain stimulation forms the basis of a control paradigm in which one can manipulate the intrinsic oscilla-tory properties of cortical networks via a plurality of input-driven mechanisms such as resonance, entrainment andnon-linear acceleration. But the brain is not a passive receiver: outcomes of brain stimulation, either intracranialor non-invasive, are highly sensitive to ongoing brain dynamics, interfering and combining with internal fluctu-ations in non-trivial ways. Exogenous control on brain dynamics has indeed been shown to be gated by neuralexcitability, where effects of brain stimulation are both state-dependent and highly sensitive to stimulation param-eters. To understand this phenomenon, we here used computational approach to study the role of ongoing state onthe entrainment of cortical neurons. We examined whether state-dependent changes in thalamo-cortical dynamicscould implement a gain control mechanism regulating cortical susceptibility to stimulation. We found that theresulting increase in irregular fluctuations during task states enables a greater susceptibility of cortical neurons toentrainment, and that this phenomenon can be explained by a passage through a bifurcation combined to stochas-tic resonance. We also investigated the relationship between the stimulation parameters, such as amplitude andfrequency, on entrainment regimes for different levels of sensory input. Taken together, our results provide newinsights about the state-dependent interaction between rhythmic stimulation and cortical activity, accelerating the

Page 34: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

30 Five-day Workshop Reports

development of new paradigms to interrogate neural circuits and restore cognitive functions based on the selectivemanipulation of brain rhythms.

Rune W Berg: Neuronal population activity involved in motor patterns of the spinal cord: spiking regimesand skewed involvement Motor patterns such as chewing, breathing, walking and scratching are primarily pro-duced by neuronal circuits within the brainstem or spinal cord. These activities are produced by concerted neuronalactivity, but little is known about the degree of participation of the individual neurons. Here, we use multi-channelrecording (256 channels) in turtles performing scratch motor pattern to investigate the distribution of spike ratesacross neurons. We found that the shape of the distribution is skewed and can be described as log-normal-like,i.e. normally shaped on logarithmic frequency-axis. Such distributions have been observed in other parts of thenervous system and been suggested to implicate a fluctuation driven regime (Roxin et al J. Neurosci. 2011). Thisis due to an expansive nonlinearity of the neuronal input-output function when the membrane potential is lurkingin sub-threshold region. We further test this hypothesis by quantifying the irregularity of spiking across time andacross the population as well as via intra-cellular recordings. We find that the population moves between supra-and sub-threshold regimes, but the largest fraction of neurons spent most time in the sub-threshold, i.e. fluctuationdriven regime. Read more about this work here: Peter C Petersen, Rune W Berg ”Lognormal firing rate distributionreveals prominent fluctuation driven regime in spinal motor networks” eLife 2016;5:e18805

Axel Hutt: Model and prediction of anaesthetic-induced EEG The monitoring of patients under generalanaesthesia is an essential part during surgery. To interpret correctly measured brain activity, such as electroen-cephalogram (EEG), it is essential to first understand the possible origin of EEG and hence classify the physio-logical state of the patient correctly. Moreover, it would be advantageous to even predict the development of thebrain activity to anticipate severe changes of the physiological state. The talk presents a novel model explainingmajor EEG features under light anaesthesia, such as the spectral smile effect, by a denoising of brain dynamics. Ina second part, the talk shows how to predict EEG under anaesthesia by applying a data assimilation technique.

Mark McDonnell: What can we learn from deep-learning? Models and validation of neurobiological learn-ing inspired by modern deep artificial neural networks In the field of machine learning, deep-learning hasbecome spectacularly successful very rapidly, and now frequently achieves better-than-human performance on dif-ficult pattern recognition tasks. It seems that the decades-old theoretical potential of artificial neural networks(ANNs) is finally being realized. For computer vision problems, convolutional ANNs are used, and are often char-acterized as biologically inspired. This is due to the hierarchy of layers of nonlinear processing units and poolingstages, and learnt spatial filters resembling simple and complex cells. However, this resemblance is superficial. Anopen challenge for computational neuroscience is to identify whether the spectacular success of deep-learning canoffer insights for realistic models of neurobiological learning that are constrained by known anatomy and phys-iology. I will discuss this challenge and argue that we need to validate proposed neurobiological learning rulesusing challenging real data sets like those used in deep-learning, and ensure their learning capability is compa-rable to that of deep ANNs. To illustrate this approach, in this talk I will show mathematically how a standardcost-function used for supervised training of ANNs can be decomposed into an unsupervised decorrelation stageand a supervised Hebbian-like stage. With this insight, I argue that this form of learning is feasible as a neurobi-ological learning mechanism in recurrently-connected layer 2/3 and layer 4 cortical neurons. I will further showthat the model can learn to very effectively classify patterns (e.g. images of handwritten digits from the MNISTbenchmark); error rates are comparable to state of the art deep-learning algorithms, i.e. less than 1.

Jonathan D. Victor: How high-order image statistics shape cortical visual processing Several decades ofwork have suggested that Barlow’s principle of efficient coding is a powerful framework for understanding reti-nal design principles. Whether a similar notion extends to cortical visual processing is less clear, as there is no”bottleneck” comparable to the optic nerve, and much redundancy has already been removed. Here, we presentconvergent psychophysical and physiological evidence that regularities of high-order image statistics are indeedexploited by central visual processing, and at a surprising level of detail. The starting point is a study of naturalimage statistics (Tkacic et al., 2010), in which we showed that high-order correlations in certain specific spatial

Page 35: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Brain Dynamics and Statistics: Simulation versus Data 31

configurations are informative, while high-order correlations in other spatial configurations are not: they can beaccurately guessed from lower-order ones. We then construct artificial images (visual textures) composed eitherof informative or uninformative correlations. We find that informative high-order correlations are visually salient,while the uninformative correlations are nearly imperceptible. Physiological studies in macaque visual cortexidentify the locus of the underlying computations. First, neuronal responses in macaque V1 and V2 mirror thepsychophysical findings, in that many neurons respond differentially to the informative statistics, while few re-spond to the uninformative ones. Moreover, the differential responses largely arise in the supragranular layers,indicating that the computations are the result of intracortical processing. We then consider low- and high-orderlocal image statistics together, and apply a dimension-reduction (binarization) to cast them into a 10-dimensionalspace. We determine the perceptual isodiscrimination surfaces within this space. These are well-approximated byellipsoids, and the principal axes of the ellipsoids correspond to the distribution of the local statistics in naturalimages. Interestingly, this correspondence differs in specific ways from the predictions of a model that implementsefficient coding in an unrestricted manner. These deviations provide insights into the strategies that underlie therepresentation of image statistics.

Volker Hofmann: Population coding in electric sensing: origin and function of noise correlations In manycases, great knowledge regarding single neuron activity during the encoding of sensory signals or the generationof behavioral outputs has been achieved. On another scale, however, we still lack detailed information of themechanisms of concerted neuronal activity, i.e. population coding. This is crucial to understand the neuronal codeand remains a central problem in neuroscience. Extrapolating the knowledge of single unit activity to the scale ofa neuronal population is often complicated by the fact that the activities of neurons are typically correlated ratherthan being independent. Such correlations, which can arise in terms of the average responses to different stimulias well as in terms of trial to trial variability, were shown to substantially impact the efficacy of population codes.To investigate the sources and function of correlations we use the weakly electric fish Apteronotus leptorhynchusas a model system, due to the wealth of physiological and anatomical knowledge that is available with regard toelectrosensory processing. These fish sense electric fields with an array of electroreceptors that project to threeparallel segments of the medullary electrosensory lateral line lobe (ELL). Previous studies established, that the sizeand the organization of receptive fields differs substantially between pyramidal neurons in the different segments,which should consequently result in very different amounts of correlations in each segment. In contrast to this,our experimental recordings revealed very similar levels of correlation magnitudes. To explain this surprisingresult, we investigated the differential receptive field interactions using a modeling approach. Considering theantagonistic center-surround organisation of receptive fields, we were able to show that very different receptivefield organization can give rise to very similar amounts of correlations. After establishing the presence of noisecorrelations in the ELL, we assessed their potential impact and function for the encoding of electrosensory stimuli.Our preliminary results suggest that ELL noise correlations encode stimuli independent of classical measures ofneuronal activity (i.e. firing rate). The stimulus dependent changes in correlation levels are potentially modulatedvia the recurrent ELL connectivity which will be a focus of our future investigations to unravel the mechanismsmediating this independent additional channel of information transmission in the brain.

PostersEach poster presenter had 5 minutes to introduce their poster to the full workshop. During our 45 minutes

breaks, the posters were discussed in smaller groups. Having the posters up for the full meeting gave ample timeto discuss and study the posters.

• Catalina Vich: Different strategies to estimate synaptic conductances

• Timothy Whalen: Pallidostriatal Projections Promote Beta Oscillations in a Biophysical Model of the Parkin-sonian Basal Ganglia

• Jacob Østergaard: Capturing spike variability in noisy Izhikevich neurons using point process GLMs

• Peter Rowat: Stochastic network thinking applied to firing patterns of stellate neurons

• Wilhelm Braun: Spike-triggered neuronal adaptation as an iterated first-passage time problem

Page 36: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

32 Five-day Workshop Reports

• Mareile Grosse Ruse: Modeling with Stochastic Differential Equations and Mixed Effects

• Kang Li: Mathematical neural models for visual attention

• Pietro Quaglio: SPADE: Spike Pattern Detection and Evaluation in Massively Parallel Spike Trains

• Alexandre Rene: Dimensionality reduction of stochastic differential equations with distributed delays

Impact Reports from ParticipantsParticipants were asked to write a few lines telling about what they learned. Below are individual reports, also

giving hints at outstanding problems in the field, and new directions for future research.

Laura Sacerdote, Torino: The workshop was a great occasion to meet very interesting people in a wonder-ful environment. We had contributions from theoricians and experimentalists with a challenging alternance thatallowed each one to learn from the others. I already knew some of the scientists attending to the workshop but thediscussions after each talk, as well as the long coffee breaks, gave me the opportunity to know new people and topropose new scientific links for my research. The presence of a very mixed group during my talk, gave me hintsfor interesting improvements of my work. The size of the workshop was perfect, in a small group it becomes easyto start to talk with the others, learning from posters and talks but also from many private discussions.

Benjamin Lindner, Berlin: The schedule was much better than that of many other workshops I have attended:long talks with sufficient time for questions during and after the talks, long breaks for even more informal discus-sions. The relaxed schedule, the number and kind of participants, the way the sessions were chaired - all of thiswas encouraging for a creative atmosphere during this workshop that I enjoyed a lot.

I met old friends and colleagues (Susanne Ditlevsen, Axel Hutt, Jeremie Lefebvre, Andre Longtin, MarkMcDonnell, Richard Naud, Laura Sacerdote, and Peter Thomas) and talked with them about old times and newresearch projects. I was happy to make new acquaintances (Wilhelm Braun, Cindy Greenwood, Volker Hofmann,Zachary Kilpatrick, Eva Locherbach, Massimiliano Tamborrino, Jonathan Touboul, Romain Veltz, Jonathan Victor,Lawrence Ward), to learn about exciting new research directions, and to get a bit of feedback on my own researchefforts too.

I was especially inspired by the mixture of presentations on fundamental math problems related to stochasticprocesses in neuroscience (stochastic dynamics of excitable systems) and talks that were devoted to more data-driven and biology-centered topics - I think this was an excellent mixture that met many of my research interests.I know from many conversations that I was not the only one who felt like that.

Thank you again for organizing this marvelous workshop.

Mariele Groβe Ruse, Copenhagen, Denmark: What I appreciated a lot was the atmosphere among our groupand that I all the time felt very welcome and included. No matter the ”academic status”, all participants were open,happy to talk/discuss, happy to provide inputs/feedbacks/ideas and also to chat about topics beyond research. Igot very good inputs regarding applicability of my model to different kinds of input, which is useful for my PhDproject. I talked to people with various background, which I always perceive as very interesting. Definitely, Ilearned a lot, although there was as much I did not understand, due to me having not really a background inneuroscience.

Mark McDonnell, Australia: I enjoyed discussions with a lot of slightly more junior researchers who werefamiliar with my work on stochastic resonance from the time of my PhD and just after. This was an enjoyableexperience to see the new directions these researchers are following in that field, and have the opportunity todiscuss with them. I also enjoyed many discussions with people about their experiences with machine learning andhow it might benefit their future research.

A potential collaboration is in progress with Jonathan Touboul, whom I met at the workshop for the first time.And of course the workshop was an opportunity to continue work on existing collaborations with Cindy Green-

wood and Lawrence Ward.

Page 37: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Brain Dynamics and Statistics: Simulation versus Data 33

Zachary Kilpatrick, Boulder Colorado: Thanks so much for organizing this workshop! It was wonder-ful. My current work and future plans were impacted in several different ways from my experience at the BIRSworkshop. One of the first conversations I had at the conference was over dinner with Andre Longtin, and heshared his experience in balancing research, teaching, and service with me, providing some useful ideas for howto use my time wisely in my career. Peter Thomas provided some excellent feedback after my talk on differentinformation theoretic measures and error quantification I could use to better think about the way the brain mightoptimize working memory systems. Furthermore, he was gracious enough to give me extensive details on hisrecent work on asymptotic phase for stochastic systems, taking me through some special cases. I learned a lot!Other conversations with Jonathan Touboul, Benjamin Lindner, and Axel Hutt gave me excellent ideas for futureprojects. Wilhelm Braun took the time to ask me some of my own advice about how to proceed at his stage (as anearly postdoc) in view of getting an academic job in the future. Furthermore, I had an excellent conversation withSusanne Ditlevsen about planning current and future meetings for the International Conference on MathematicalNeuroscience.

Peter Thomas, Case Western University: Benjamin Lindner and I, who had been working together previ-ously, were able to complete and finally submit a paper to the Journal of Statistical Physics while we were at Banfftogether, and began work on our next paper. I also learned a great deal from the workshop talks. Adeline Sam-son and Eva Locherbach’s talks on Hypoelliptic stochastic systems gave me ideas for how to show that a certainkind of nerve cell ”resets” (in a stochastic sense) each time it fires, which is something my biological collaboratorhas conjectured repeatedly but which we did not know how to formulate. Antoni Guillamon’s talk give me newideas about how to represent the phase and amplitude of stochastic oscillators, a long standing interest of mine.Rune Berg’s talk gave me new ideas for a project with a different collaborator involving mathematical modelingof motor control systems. I benefitted from side conversations with many participants, including the three orga-nizers (Greenwood, Ditlevsen, and Longtin). I particularly appreciated the chance to discuss data assimilationtechniques with an expert in this area, Axel Hutt. And snowshoeing up the back side of Mount Sulphur in subzerotemperatures with Zach and Wilhelm was a real treat.

Jonathan Victor, Cornell: First, Romain Veltz: following my presentation, we had a substantive discussionabout new approaches to understanding cortical dynamics, which may provide a way to account for some of thepuzzling features of the data I had discussed. (The basic idea here is that patches of visual cortex don’t representjust an orientation (as is traditionally thought), but rather, three parameters – an orientation, a dispersion, and ascale – effectively a 2x2 symmetric matrix. The implications are that cortical dynamics don’t live on a torus, butrather, on a stack of Poincare disks.) Second, Volker Hofmann: with whom I had a series of discussions on activesensation, and the parallels and contrasts between the electric sense in weakly electric fish, and vision.

But most importantly, I really want to thank you for inviting me to this very stimulating event.

Timothy Whalen, Carnegie Mellon University: As a younger PhD student, it was a great learning experienceto meet with experts in neural data analysis from across the experimental-mathematical spectrum. Several of thetalks and discussions in the poster hall inspired ideas on how to address difficult, quantitative questions in mydata and better relate these to my simulations. The Banff Center proved to be a fantastic venue for this purpose,streamlining everything so that we attendees could focus on the content of the workshop. The fellow researchersI’ve connected with and the new ideas I’ve learned from the workshop will not soon be forgotten!

Lawrence Ward, UBC, Vancouver: I was very happy to meet and converse with Axel Hutt and JonathanTouboul, having previously known of their work in this field. I of course learned many new things about braindynamics from the talks, but also from my conversations with their authors. Again, the work of Hutt, Touboul,Lindner, Kilpatrick, Veltz and Thomas was very revealing, and in some cases new to me (esp. Hutt on effectsof anesthesia, Kilpatrick on memory and bumps). All of this will stimulate my future work on stochastic neuronmodels as well as on brain dynamics.

Axel Hutt, INRIA, France: The workshop gathered most of the top-level researchers working in the researchfield. This allowed to learn directly the primary current hot topics in the field and who works on what. The

Page 38: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

34 Five-day Workshop Reports

rather informal meeting made it possible to chat with researchers at a beer on general and specific science whatwas elucidating in several cases. I would say the workshop was quite valuable scientifically and definitely worthmaking the long trip from Europe.

Kang Li, Copenhagen: It was a great experience for me, in terms of both academic studies and mountainviewing. I brought a poster and there were some who came by and discussed with me. Among them were Eva,Adeline, Laura, and some others but I can’t remember their names. I presented my work and future ideas andreceived interesting and useful feedbacks. The posters and talks from participants are brilliant and inspiring, fromparameter estimations of point & diffusion processes, to the synchronization and coherence of spike trains, and tothe emerging application of deep neural networks, etc. I was trained as a statistician and have been focusing onmodel inference, but there are obviously many different areas beyond mine which I really learned a lot from.

Eva Locherbach, Paris: Personally, I found it very interesting to meet Peter Thomas; in particular his questionhow to distinguish between variability that is intrinsically generated versus the one that is extrinsically generated isa problem that I would like to study in models such as Hawkes processes in high dimension, subject to mean-fieldinteractions within a spatially structured system of neurons which are arranged in populations. I also would liketo study phase response curves in such models - and this is certainly inspired by some of the talks presented at theconference. The talk of Adeline Samson was also very interesting for me, in particular her question how to decidewhether there is presence or not of channel noise for example (can we write a real statistical test for this?). Othertalks that inspired me a lot were those of Zachary Kilpatrick, of Shigeru Shinomoto and certainly also BenjaminLindner. Finally, let me say that the posters that were presented were very interesting and that I really appreciatedthe way those posters were shortly presented during the conference.

Adeline Samson, Grenoble: This conference was very productive for me, starting a new project with Massi-miliano Tamborrino around some good beers in the cafe!

Toni Guillamon, Barcelona: The facilities provided by BIRS and the Banff Center are excellent; staying thereis so easy and confortable that you can keep your mind occupied with the scientific problems and, at the same time,you forget you are working. The information provided along the whole process is very clear and all is prepared toarrive at Banff and work efficiently. The lodging quality is excellent, as well as the food in the restaurant, and theleisure activities. Among them, I will remember our walk on the Tunnel Mountain, the excursion to the Sulphuremountain, and the chats during the meals and around a glass of wine (sorry for the alcoholic reference) at the”bistrot”.

I find also worth to mention the job made by the organizers of the workshop, mainly in selecting the lecturesand the attendants. They were able to put together a group of excellent persons, with common interests but non-coincident backgrounds, and with a sincere desire to understand each other’s work. More personally speaking, Icome from dynamical systems theory and I’ve been working mainly on deterministic treatment of brain dynamicswith an increasing interest in statistics and stochastic processes. In this workshop, it has been very rewarding tosee how my research could link to the interests of other participants, most of them experts on aspects that I do notknow but I do need! At the end, I came back home with a long list of stimulating suggestions as well as a group ofcolleagues which whom I can eventually interact in the future. I wish I could say the same of every meeting I’mattending!

In virtue of my sincere words in the above paragraph, thanks again for organizing it.

Volker Hofmann, McGill University: 1. I met researchers that work in my immediate field (weakly electricfish) and approach common question with employing different methods (i.e. by theoretical means). I am happyto have met Andre Longin and Benjamin Lindner at this work shop that allowed me to discuss a few thingsregarding recent and coming publications in the field. In addition to that I have had conversations with a number ofresearchers that share interests in certain more general research questions and use different animal models and/ormethods. Also seeing how these people combine methodological expertise which I share, with further methodsand techniques was of particular interest and will most probably help me to develop my own profile further downthe road. Particularly my conversations with Rune Berg and Jonathan Victor were interesting in this respect.

Page 39: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Brain Dynamics and Statistics: Simulation versus Data 35

2. To me, who is doing his very first steps in computational modeling this workshop was a jump start intothe field and gave me a good overview of the aspects that are ”cutting edge problems” in neuronal modeling and”mathematical neuroscience”. While I will probably not be working on any of these problems soon, I think thisbroad overview is helpful for me to see where the field of computational modeling is at the moment and where itis going in the near future. I actually learned about some problems that I didn’t know were problems.

3. The very positive and accommodating atmosphere at this workshop has positively reinforced my own work.I have become some good feedback after the presentation of my talk and this feedback has led to ideas that I amtesting at the moment in the revision of one of my submitted publications.

Massimiliano Tamborrino, Linz: I have been honoured of being invited as a speaker to the workshop ”BrainDynamics and Statistics: Simulation versus Data”. Without any doubt I can state that it has been the most inspiringand successful workshop I have ever attended. First of all, it gave me the opportunity of presenting myself andmy research to a group of eminent professors and researchers, half of them I never met before. The organizersmade an excellent job selecting the workshop participants and I believe that was one of the reason of the success ofthe workshop. All participants share an interest towards neuroscience, coming from different background though,e.g. statistics, mathematics, physics, experimental neuroscience, etc. More importantly, each participant waswilling to listen, share and discuss ideas and open problems, making the workshop the perfect spot for starting newcollaborations and being inspired by new topics. I wish there could be more of these workshops.

Outcome of the MeetingThis workshop brought together researchers from several different disciplines to work on the problem of un-

derstanding stochastic brain dynamics and statistics. This involved statisticians, probabilists, physicists, mathe-maticians and experimentalists. The experimental neuroscientists attending the workshop have been selected, inpart, because of their interest in neuronal modeling and understanding stochastic effects in the brain. Their inputwas essential in our discussion.

The workshop has enabled us to cristalize and update the current directions and main goals in this area. Wehave forged links among people working in various directions, all being important for the advancement of thefield. We have obtained a more focussed view of the current state of the field, and become aware of directions andactivities of the various groups, the latest results in diverse areas, and where to find expertise within subtopics.

Our field has had a slow development. It is a rather complicated topic, having many modeling and physiologicalaspects, as well as statistical challenges. Treatments of this subject have generally been focussed on one or anotherof these aspects. In this meeting we have had a group of people with different emphasis of research. Several peoplejoin biological research with modeling, and have presented creative methods to address the statistical challengesof this particular field.

The workshop integrated graduate students and postdocs. All of the participants presented their work duringthe meeting, either with a talk or a short presentation and subsequent discussion of their poster in smaller groups. Itis important to note that these young people were excellent in their ability to communicate their ideas and researchto the diverse audience attending the workshop.

Informal feedback from workshop participants has been very positive with many new ideas emerging and theparticipants have all learned a great deal. All participants heartily thank BIRS and its staff for providing trulyexceptional facilities and organization to support the meeting.

ParticipantsBerg, Rune W (University of Copenhagen)Best, Janet (Ohio State University)Braun, Wilhelm (University of Ottawa)Ditlevsen, Susanne (University of Copenhagen)Greenwood, Priscilla (University of British Colombia)Grosse Ruse, Mareile (University of Copenhagen)Guillamon, Antoni (Universitat Politchnica de Calatunya)Hofmann, Volker (McGill University)

Page 40: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

36 Five-day Workshop Reports

Hutt, Axel (German Weather Service)Kilpatrick, Zachary (University of Colorado)Lefebvre, Jeremie (Krembil Research Institute)Li, Kang (University of Copenhagen)Lindner, Benjamin (Humboldt-Universitat zu Berlin)Locherbach, Eva (Universite de Cergy-Pontoise)Longtin, Andre (University of Ottawa)McDonnell, Mark (University of South Australia)Naud, Richard (University of Ottawa)Orlandi, Javier (University of Calgary)Østergaard, Jacob (University of Copenhagen)Quaglio, Pietro (Juelich Research Centre)Ren, Alexandre (Forschungszentrum caesar)Rowat, Peter (University of California San Diego)Sacerdote, Laura (University of Torino)Samson, Adeline (Universite Grenoble Alpes)Shinomoto, Shigeru (Kyoto University)Tamborrino, Massimiliano (Johannes Kepler University Linz)Thomas, Peter (Case Western Reserve University)Touboul, Jonathan (College de France & Inria)Veltz, Romain (Inria Sophia Antipolis Mediterranee)Vich, Catalina (Universitat de les Illes Balears)Victor, Jonathan D. (Weill Cornell Medical College)Ward, Lawrence (University of British Columbia)Whalen, Timothy (Carnegie Mellon University)

Page 41: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] P.E. Greenwood and L.M. Ward, Stochastic Neuron Models, Springer Mathematical Biosciences InstituteLecture Series 1.5, Springer-Verlag, Heidelberg, 2016.

[2] S. Ditlevsen and A. Samson: Parameter estimation in neuronal stochastic differential equation models fromintracellular recordings of membrane potentials in single neurons: a Review. Journal de la Societe Francaise deStatistique, 157(1), 6-16, 2016.

[3] Stochastic Biomathematical Models with applications to Neuronal Modeling. Eds: M. Bachar, J. Batzel andS. Ditlevsen. Volume 2058 of the Lecture Notes in Mathematics series (Biosciences subseries), 2012.

[4] B. Lindner, J. Garcia-Ojalvo, A. Neiman, and L. Schimansky-Geier. Effects of noise in excitable systems,Phys. Rep. 392, 321, 2004.

37

Page 42: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 4

Optimization and Inference for PhysicalFlows on Networks (17w5165)

March 5 - 10, 2017Organizer(s): Michael Chertkov (Los Alamos National Laboratory), Sidhant Misra (Los AlamosNational Laboratory), Marc Vuffray (Los Alamos National Laboratory), Anatloy Zlotnik (LosAlamos National Laboratory)

Overview of the FieldMathematical models that describe the flow of fluids, movement of particles, or transfer of information over anetwork of channels appear in a wide range of fields of theoretical and scientific study, as well as importantengineering applications. Particular interest is devoted to models for which the network structure is representedusing graphs where flows on edges and conditions at nodes are governed by sets of physical laws, which maybe steady or dynamic, and deterministic or stochastic. Important examples of such physics-governed networkflow systems include the electric power grid, natural gas transmission systems, and traffic flows of air or landvehicles. While these large-scale engineered systems have been the subject of extensive study, recent developmentsin applied mathematics and computation promise a breakthrough in our ability to analyze them and develop newmethods for efficient and reliable operations through optimization and control.

Numerous problems of practical interest in applications of flow networks can be mathematically posed as opti-mization and control. and inference and learning problems. For example, the Optimal Power Flow (OPF) problemin electric transmission networks is an optimization problem that seeks to minimize the cost of power generationamong distributed sources subject to the physical constraints of transmission networks. Alternatively, given acontinuous stream of noisy/uncertain phase measurements throughout the power grid, we may infer congestion inthe network. The relation between these problems may be used to actively control reactive power generation atdifferent locations or change the network topology. As in the case of the power grid, the infrastructure networks ofinterest may be very large, and may consist of sub-networks or components characterized by qualitatively differentphysics and/or dynamics on multiple spatial and temporal scales. Therefore, the scaling properties for the math-ematics and computational algorithms used to address these problems require careful investigation. Moreover,these problems tend to have intrinsic features such as non-linearities and non-convexities that are dictated by thephysical laws governing flows through the network. Such properties make the design of scalable algorithms forthese problems a challenge.

The theoretical sides of the relevant mathematical fields, which include optimization, graph theory, statisticalinference, dynamical systems, partial differential equations (PDEs), among others, have each given rise to sophis-ticated computational techniques. However, problems of optimization and inference for physical flow networks

38

Page 43: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Optimization and Inference for Physical Flows on Networks 39

require methods from each of these disciplines to be further developed in coordination with one another. Forinstance, a very accurate and efficient method for simulation of flow dynamics governed by PDEs may be unsuit-able for use within an optimization algorithm. Instead, appropriate models that are tractable for optimization, yetaccurately represent the physics involved, may be used to develop optimal control algorithms with significantlyimproved scalability and stability. Similarly, inference algorithms for stochastic state estimation on networks, andthe related marginal probabilities, need to be designed in a way that they provide accurate input for stochastic opti-mization problems. The objective of this workshop was to bridge these mathematical and methodological gaps inthe context of large-scale networks. by bringing together an interdisciplinary community of researchers working inthe fields of network science, optimization, dynamical systems, optimal control, machine learning, and statisticalphysics to provide an environment of interaction at the interfaces of these fields. The purpose was to exchangeand combine new and emerging ideas with particular focus on inference, learning and optimization for physicalnetworks characterized by dynamics and uncertainty.

Recent Developments and Open Problems

Recently, significant progress progress has been made in mathematical techniques for Optimization, Optimal Con-trol, and algorithms for learning and inference in Graphical models. At the same time need for new methods inthe aforementioned topics have exploded in many application areas. The growing use of renewable energy forpower generation and need for improved economic and reliable operation of the power grid requires methods fornon-linear non-convex optimization and optimization under uncertainty in large networks. Due to the increase inthe use of gas generators for balancing out fluctuations in the electric power grid, the gas transmission systems arewitnessing an unprecedented level of fluctuation and stochasticity, creating a need for new and efficient methods foroptimal and robust optimal control that can be applied to the non-linear PDE governed gas network physics. Theadvent of large data, need for state estimation in infrastructure networks require fast and sample optimal learningalgorithms for graphical models, as well as new methods for inference.

While several methods and algorithms have been developed recently for these application areas, there arestill several open problems and unmet requirements that can be potentially resolved by interaction between theapplied mathematics and algorithms community and the engineering community. Several convex relaxations andapproximation methods such as the Lasserre’s hierarchy have been successfully used to solve the so-called OptimalPower Flow Problem, but further specialization is required to make the algorithms scalable. Optimization underuncertainty has been extensively studied recently for the linear DC approximation of the power flow equations, butmethods for uncertainty quantification and optimization under uncertainty for the full non-linear AC power flowphysics remain unexplored.

Traditional high fidelity simulation methods perform poorly to solve optimal control problems in network flowsgoverned by non-linear partial differential equations. Recently it has been shown that a lower fidelity representationof the non-linear network dynamics that is optimization friendly can solve the same problem much more efficiently.The extention of these methods to accommodate uncertainty, i.e., to solve a robust optimal control problem remainsopen. In addition, there is still much to be explored in the field of PDE constrained network optimal controlproblems.

Graphical Models have received a lot of attention due to their ability to naturally model network problems bycapturing the network structure and the corresponding dependency between variables succinctly via the underlyinggraph. Recent work has demonstrated successfully the use of Belief Propagation on tree networks to solve theOptimal Power Flow problem. However, the case of loopy networks, and general methods to handle continuousGraphical Models remains open. In the field of learning structure/parameters of a Graphical Model from data,numerous methods based on regularized M-estimators that exploit convex optimization have been proposed. How-ever, with huge advances in mixed integer programming and the availability of powerful commercial solvers allowthe use of learning algorithms based on non-convex optimization, which is an active area of research with severalopen ends.

This workshop aimed at generating new ideas to solve the above-mentioned open problems by bringing togethera multi-disciplinary community from both the applied mathematics and the engineering applications side. The nextsection summarizes the various presentations in the three broad topics mentioned above.

Page 44: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

40 Five-day Workshop Reports

Presentation HighlightsThe proposed schedule involved surveys of mathematical fields and motivating applications so that all partici-pants learned the fundamental assumptions and perspectives of each field. In addition, leaders in each area havedelivered focused technical presentations of recent ideas, techniques, and research results that could be adoptedby participants from other fields and disciplines. Significant focus was placed on inference, learning and opti-mization problems involving physical processes that occur over network systems. Some of the problems that weconsidered, such as optimization of electric power systems, are themselves mature fields under active investigation,while others are in an exploratory stage and have not yet been addressed by the mathematically oriented researchcommunity. All of the problems have important common features. They are (a) stated on large graphs/networks;(b) constrained by physics-related constraints that are expressed through algebraic equations, ordinary or partialdifferential equations, or a mix; (c) require re-formulation to facilitate optimization; (d) depend on practical algo-rithmic solutions such as approximations or relaxations with analytic or numerical guarantees. The presentationscan be categorized into three broad categories: (i) Optimization methods in Power Systems, (ii) Optimal Controlin Power Systems and Gas Networks, and (iii) Graphical Models - inference, learning and optimization.

Optimization methods in Power Systems

Miles Lubin and Line Roald: Chance Constraints for Improving Security of AC Optimal Power Flow In thetwo linked talks the speakers presented a new method to account for uncertainty in the AC optimal power flowproblem. Using a linearized model around an operating point, the presenters showed how to solve a convex opti-mization problem that simulates real-time proportional response throughout the system and uses chance constraintsto enforce physical limits. For the chance constraints enforcing line flow, an approximation based on two-sidedchance constraints was developed. The speakers have also discussed experimental part where they reported thatsolution to this convex problem can be corrected back to AC feasibility and is significantly more robust than thesolution to the deterministic AC-OPF problem [19].

Andy Sun: Optimal Transmission Switching with DC and AC Power Flows: New Formulations and StrongRelaxations In this talk, the problem of optimal transmission switching (OTS) in a power grid to reduce operatingcost and improving reliability was discussed. The author explained study of OTS problems under both DC andAC power flow models. For the DC-OTS problem, a cycle-based reformulation and characterize the convex hullof the cycle-induced relaxation was proposed; for AC-OTS, an MISOCP relaxation and strengthen it with SDPand McCormick-based disjunctions were proposed. It was shown that the proposed reformulations and algorithmsefficiently solved IEEE and NESTA instances and lead to significant cost benefits with provably tight bounds [8].

Cedric Josz: Application of complex polynomial optimization to optimal power flow Multivariate poly-nomial optimization where variables and data are complex numbers is a non-deterministic polynomial-time hardproblem that arises in various applications such as electric power systems, signal processing, imaging science,automatic control, and quantum mechanics. Complex numbers are typically used to model oscillatory phenom-ena which are omnipresent in physical systems. A complex moment/sum-of-squares hierarchy of semidefiniteprograms to find global solutions with reduced computational burden compared with the Lasserre hierarchy forreal polynomial optimization was proposed in this talk. The authors applied the approach to large-scale sectionsof the European HV electricity transmission grid. Thanks to an algorithm for exploiting sparsity, instances withseveral thousand variables and constraints can be solved to global optimality. In discussions after the presenta-tion the author has mentioned, that other approaches for solving the deterministic AC OPF were presented at theworkshop. These include SOCP and SDP relaxations proposed by Andy Sun, as well as the graphical modelsapproach by the team at Los Alamos National Laboratory. In addition, more complicated problem were presented,such a chance-constrained AC OPF. In the following discussions it was also noted that link between the momentapproach and graphical models need to be elucidated in the future. Synergies between both approaches could leadto advances in power systems optimization and to future collaborations between researchers in different areas ofapplied mathematics (statistics, information theory on one hand, and numerical optimization on the other hand) [7].

Page 45: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Optimization and Inference for Physical Flows on Networks 41

Dvijotham Krishnamurthy: Robust feasibility analysis and robust optimization for infrastructure net-works The presenter addressed the problem of insuring network security under uncertainties, caused by renewableenergy sources, and non-traditional energy demands. He presented a fast and reliable tool for constructing innerapproximations of steady state voltage stability regions in multidimensional injection space such that every pointin our constructed region is guaranteed to be solvable. The method is supported by numerical simulations thatdemonstrate that this approach outperforms all existing inner approximation methods in most cases. The stabilityregions were also shown to cover substantial fractions of the true voltage stability region. This technique has im-portant applications, such as fast screening for viable injection changes, constructing an effective solvability indexand rigorously certified loadability limits. [14].

Dongchan Lee: Reachability analysis of transient stability with linear programming relaxation Transientsecurity assessment of power system remains highly challenging task due to the nonlinear and complex nature ofthe system. The current practices in the industry are mostly based on time-domain simulations, which are compu-tationally expensive especially under uncertainties. In order to overcome the limitations of the existing techniques,the author proposed a reachability analysis approach. He first described a generator model with a set of algebraicequation with the Euler’s method. Then the author considered evolution of the system states in a polytope template,which can be solved with linear programming relaxation at every time step. This approach allows to consider po-tentially any high order generator model. The effect of uncertainty was discussed which, as the workshop showed,is a growing theme in the community. The author has looked at this problem from the prospectives of transientstability and has reported development of algorithms to efficiently assess the security of the power grid. In theresult of the follow up discussions the presenter has stated that the workshop was a great opportunity to talk toother researchers and think about future directions. Thanks to the workshop the presenter got the opportunity tomeet researchers from Los Alamos National Lab, whom he is planning to collaborate with in the summer of 2017[10].

Jean Bernard Lasserre: A moment SOS approach for some chance constraints issues on semi-algebraicsets In the talk the author has described a general framework to handle chance-constraints. It consists of posingthe problem as an infinite-dimensional LP in an appropriate space of measures. Then when data are polynomialsand semi-algebraic sets one may invoke the moment-SOS approach. For instance it allows to approximate thefeasible set associated with a chance constraint by the superlevel set x: p(x) >=0 of some polynomial of degree”d” whose coefficients are optimal solutions of an SDP. The larger ”d” the better is the approximation. The presen-ter has mentioned in the discussion period that he was particularly interested by the various papers related to theOPF. The presenter have discussed with Sidhant Misra extensively about how to handle chance constraints via themoment-SOS approach. This could be further developed and extended during the internship of the presenter PhDstudent Tillman at LANL in the summer of 2017 [9].

Daniel Molzahn: Error Bounds on Power Flow Linearizations: A Convex Relaxation Approach Thepower flow equations model the relationship between the voltages phasors and power flows in an electric powersystem. The nonlinearity of the power flow equations results in algorithmic and theoretical challenges, includingnon-convex feasible spaces for optimization problems constrained by these equations. Accordingly, many practicalapproaches for solving power system optimization and control problems employ linearizations of the power flowequations. By leveraging developments in convex relaxation techniques, this presentation describes recent progressregarding a method for bounding the worst-case errors resulting from power flow linearizations. Specifically, witha focus on the DC power flow approximation, this presentation characterized the worst-case error in the line flowsover a specified range of operational conditions. In terms of links to other presentations and attendees, the pre-senter noticed in the follow up discussions that the audience was exactly the right group of people for my work.For instance, the approach the presenter used to develop worst-case error bounds on power flow linearizationsapplies the complex hierarchy of moment/sum-of-squares relaxations presented by Cedric Josz. This hierarchy isalso closely related to a hierarchy of relaxations proposed by Jean Bernard Lasserre. The inner approximationsproposed by Dj (Krishnamurthy Dvijotham) are also closely related in that the convex relaxations develop outerapproximations that complement Dj’s inner approximations of the feasible spaces of power system optimizationproblems.The presenter has also discussed an application of the worst-case error bounds to the topology identifi-

Page 46: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

42 Five-day Workshop Reports

cation algorithm presented by Deepjyoti Deka. Regarding the development of new work and extending existingideas, the presenter specifically had discussions with Cedric Josz and Jean Bernard Lasserre on new applications ofthe hierarchies of convex relaxations, with Dj, Florian Dorfler, Savario Bolognani, and Deepjyoti Deka on exten-sions and applications of the worst-case error bounds I presented, and with Sidhant Misra and Line Roald on issuesrelated to chance constraints in power system optimization problems with AC power flow models. The presenterhas emphasized that he had enjoyed discussions with a variety of other researchers on their work [4].

Control in Power Systems and Gas Networks

Florian Dorfler: Control of Low-Inertia Power Systems: Naive & Foundational Approaches A major transi-tion in the operation of electric power grids is the replacement of bulk generation based on synchronous machinesby distributed generation based on low inertia power electronics sources. The accompanying “loss of rotational in-ertia” and fluctuations by renewable sources jeopardize the system stability, as testified by the ever-growing numberof frequency incidents. As a remedy, numerous studies demonstrate how virtual inertia can be emulated throughvarious devices, but few of them address the question of “where” to place this inertia. It is however stronglybelieved that the placement of virtual inertia hugely impacts system efficiency, as demonstrated by recent casestudies. A major transition in the operation of electric power grids is the replacement of synchronous machines bydistributed generation connected via power electronic converters. The accompanying “loss of rotational inertia”and the fluctuations by renewable sources jeopardize the system stability. The problem of inertia allocation hasbeen hinted at before, where the grid is modeled by the linearized swing equations, and eigenvalue damping ratiosas well as transient overshoots (estimated from the system modes) are chosen as optimization criterion for placingvirtual inertia and damping. The resulting problem is highly non-convex, but a sequence of approximations led tosome insightful results. The presented work focused on network coherency as an alternative performance metric,that is, the amplification of stochastic or impulsive disturbances to a quadratic performance index measured by theH2 norm. As performance index, a classic coherency criterion was chosen that penalizes angular differences andabsolute frequencies, which has recently been popularized for consensus and synchronization studies as well asin power system analysis and control. A foundational analysis was presented for model reduction for low-inertiapower systems, and a coherency metric was proposed. A system-theoretic methodology for optimal inertia allo-cation was presented, which included theorems on performance bounds and optimal resource allocation given theassumption of uniform disturbance-damping. Finally, a tractable computational method for the general case ofoptimal inertia allocation was presented [15].

Saverio Bolognani: Networked Feedback Control for a Smart Power Distribution Grid Emerging chal-lenges in distribution grids were discussed, including distributed microgenerators, high spatio-temporal electricmobility, the constraints of grid congestion, and unsustainable grid reinforcement. Tractable methodologies forvirtual grid reinforcement using feedback control was then presented, based on distributed “model-free” controland centralized chance-constrained decision-making. A fundamental control system model for power distributiongrids was stated with underdetermined sensing using meters for voltage, line currents, and transformer loading,and underactuated control using tap changers, reactive power compensators, and active power management. Thecollection of hard, soft, and chance- constraints used in the optimal power flow (OPF) problem were reviewed, anddisturbance attenuation for distribution grid dynamics was developed using a linearization approach on the tangentspace to the solution manifold [2].

Michael Herty: Modeling, Simulation and Optimization of Gas Networks There has recently been anintense discussion on physical phenomena on graphs and in particular in the context of gas pipeline network dy-namics. An overview of the physics and engineering modeling typically used in the academic literature was given.A new model for representing gas flow dynamics in pipeline networks by asymptotic analysis was presented. Themodel is derived from the isothermal Euler equations using phenomenological approximations. The derivationof the model was presented as well as numerical results to illustrate order of accuracy, validity, and properties ofthe approximation. A comparison of the new model with existing models from the mathematical and engineeringliterature was given, in particular with respect to solution of the full-physics Euler equations. Concepts for nodalcontrol of gas networks using the developed model were discussed [6].

Page 47: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Optimization and Inference for Physical Flows on Networks 43

Anatoly Zlotnik: Discretization for optimal control of physical flows on networks A fundamental appli-cation for optimal control of physical flows on networks is the operation of large-scale natural gas transmissionpipelines. A control system model was outlined for the distributed dynamics of compressible gas flow throughlarge-scale pipeline networks with time-varying injections, withdrawals, and control actions of compressors andregulators. The gas dynamics PDE equations over the pipelines, together with boundary conditions at junctions,were shown to be reducible using lumped elements to a sparse nonlinear ODE system with compact expression ingraph theoretic notation. This system was developed as a consistent discretization of the PDE equations for gasflow that can be used to represent the dynamic constraints for optimal control problems for pipeline systems withknown time-varying withdrawals and injections and gas pressure limits throughout the network. A problem formu-lation for intra-day economic optimal control of large-scale pipelines was presented, and an example of tractable,validated solution was discussed [23].

Anders Rantzer: H-infinity optimal control on networks Classical control theory does not scale well forlarge systems like traffic networks, power networks and chemical reaction networks. However, in this lecture wewill present a class of networked control problems for which scalable distributed controllers can be proved toachieve the same performance as the best centralized ones. The control objective is stated in terms of frequencyweighted H-infinity norms. This makes it possible to combine disturbance rejection at low frequencies with ro-bustness to high frequency measurement noise and model errors. An optimal controller is given in the form of amulti-variable PI (proportional integrating) controller, which is distributed in the sense that control action along agiven network edge is entirely determined by states at nodes connected by that edge. Fundamental bounds on theachievable performance are given in terms of the algebraic connectivity of the network graph [16].

Mihailo Jovanovic: Low-complexity modeling of partially available second-order statistics: theory andan efficient matrix completion algorithm State statistics of linear systems satisfy certain structural constraintsthat arise from the underlying dynamics and the directionality of input disturbances. The problem of complet-ing partially known state statistics was described. The aim of this work is to develop tools that can be used inthe context of control-oriented modeling of large-scale dynamical systems. For the type of applications of in-terest, the dynamical interaction between state variables is known while the directionality and dynamics of inputexcitation is often uncertain. Thus, the goal of the mathematical problem that we formulate is to identify the dy-namics and directionality of input excitation in order to explain and complete observed sample statistics. Morespecifically, we seek to explain correlation data with the least number of possible input disturbance channels. Weformulate this inverse problem as rank minimization, and for its solution, we employ a convex relaxation based onthe nuclear norm. The resulting optimization problem is cast as a semidefinite program and can be solved usinggeneral-purpose solvers. For problem sizes that these solvers cannot handle, we develop a customized alternatingminimization algorithm (AMA). We interpret AMA as a proximal gradient for the dual problem and prove sub-linear convergence for the algorithm with fixed step-size. We conclude with an example that illustrates the utilityof our modeling and optimization framework and draw contrast between AMA and the commonly used alternatingdirection method of multipliers (ADMM) algorithm [22].

Dennice Gayme: Evaluating performance in linear oscillator networks: from vehicle platoons to powergrids Two measures of synchronization performance for networks of coupled linear oscillators were defined andcharacterized. The first is the aggregate steady state variance of the system due to a disturbance at a single node.The second is the steady state variance of the phase difference between a given pair of nodes due to distributeddisturbances. Both metrics are quantified in terms of the H2-norm of an appropriately defined linear system. A sys-tematic framework was presented that allows computation of these metrics for networks over graphs with arbitrarystructure. The framework was used to derive relationships between both measures of synchronization performanceand the effective resistance between particular nodes in the graphs underlying the oscillator networks. Interpreta-tions were given of the second performance metric as both a local and global measure of network performance.Finally, the theory was applied to two applications; the evaluation of the transient real power losses in power gridsand the characterization of coherence for a certain class of vehicle platoons with relative and absolute velocityfeedback [5].

Page 48: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

44 Five-day Workshop Reports

Graphical Models

Nicholas Ruozzi: Continuous Graphical Models Computing the mode or MAP assignment in a probabilisticgraphical models is generally intractable. As a result, for discrete graphical models, the MAP problem is often ap-proximated using linear programming relaxations. When much of the research has focused on characterizing whenthese relaxations are tight, only a few results are known for their continuous analog. The presenter focused on thelatter problem showing that one can use graph covers to provide necessary and sufficient conditions for continuousMAP relaxations to be tight. In particular this characterization give simple proofs that the relaxation is tight forlog-concave decomposable and log supermodular decomposable models. The presenter concludes by exploringthe relationship between these two seemingly distinct classes of functions and provides specific conditions underwhich the MAP relaxation can and cannot be tight [18].

The presenter asked other participants for feedback and discussions about other classes of continuous graphicalmodels where such questions could be ask and answered. In particular if it could be relevant and possible to findthe mode of the probability distribution with a message-passing technique beyond Gaussian graphical models.

Patrick Rebeschini: Min-sum and network flows This talk was about message-passing algorithms for solv-ing systems of linear equations in the Laplacian matrices of graphs and to compute electric flows. These two prob-lems are fundamental primitives that arise in several domains such as computer science, electrical engineering,operations research, and machine learning. Formers algorithms that have been proposed are typically centralizedand involve multiple graph theoretic constructions or sampling mechanisms that make them difficult to implementand analyze. On the other hand, message-passing routines are distributed, simple, and easy to implement. Thepresenter establishes a framework to analyze message-passing algorithms to solve voltage and flow problems. Thepresenter shows that the convergence of the algorithms is controlled by the total variation distance between thedistributions of non-backtracking random walks that start from neighbor nodes. More broadly, his analysis ofmessage-passing introduces new insights to address generic optimization problems with constraints [17].

This talk connected the community of graphical modeling, optimization and power systems by combiningtechniques of the earlier to solve problems for the latter in an efficient and decentralized fashion. In follows updiscussions both theoretical aspects and applicability to concrete problems were discussed.

Sungsoo Ahn: Optimizing Gauge transformation for inference in graphical model Computing partitionfunction is the most important inference task arising in applications of Graphical Models (GM). Since it is compu-tationally intractable, approximate algorithms are used to tackle the problem. In this talk, the presenter introducedthe technique, coined gauge transformation, modifying GM factors such that the partition function stay the same(invariant), and propose two optimization formulations which generalize the Bethe Free Energy, Belief Propa-gation approach. Then the optimizations are solved efficiently by Alternating Direction Method of Multipliers(ADMM) algorithms. The first algorithm provides deterministic lower bounds of the partition function. The al-gorithm is exact for GMs over a single loop with a special structure, even though the popular Belief Propagationalgorithm performs badly in this case. The second algorithm is of a randomized, Monte Carlo, type. It lowerssample variance, which can be further reduced with the help of annealed/sequential/adaptive importance sampling.The experiments show that the newly proposed Gauge-ADMM algorithms outperform other known algorithms forthe approximate inference task [1].

The talk shares some common interest to that of Nicholas Ruozzi, ”Continuous Graphical Models”. In thefollow up discussions it was suggested to consider the gauge transformation approach of discrete GMs to theframework to continuous graphical models and analyze its connection to existing algorithms for continuous GMs.

Marc Vuffray: Optimal Learning of Sparse Graphical Models Graphical Models represent multivari-ate probability distributions for which direct dependencies between random variables are captured by a network.Graphical Models are widely used for uncertainty management, inference and model reductions. In many applica-tions the network of a Graphical Models is expected to be sparse i.e. the number of edges is of the same order than

Page 49: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Optimization and Inference for Physical Flows on Networks 45

the number of nodes. As Graphical Models are often not known a priori or cannot always be deduced from firstprinciples, it is of importance to learn Graphical Models from data. In this talk the presenter has focused on efficientmethods for learning Graphical Models over discrete random variables and Gaussian random variables when datatake the form of several independent observation of random variables. Several low-complexity learning-algorithmsthat achieve or quasi-achieve the information theoretical lower-bound on data-requirement were proposed [21, 13,12].

The talk was well received in particular by non-experts in the field of machine learning. The comments anddiscussions undertaken after the presentation focused on being aware of the problem of data acquisition and dataprocessing in a network. In particular, how reconstruction imprecision based on finite data can have an impact inthe control of these networks and what tools can help to overcome this problem. The methods and setting in thistalk connected the fields of Gaussian graphical models and learning or solving inverse problems.

Andrey Lokhov: Reconstructing the power grid dynamic model from sparse measurements The presen-ter discusses the problem of parameter reconstruction in the power grid dynamics from the observations comingfrom the phasor measurement units (PMUs). The presenter assumes that the dynamics is described by a system ofcoupled linearized second-order differential swing equations. The fluctuations on individual nodes are assumed tobe independent and Gaussian. Given the data coming from the sparsely located PMU sensors, the goal is to learnthe inertia and the damping coefficients of the generators using the statistics of the ambient fluctuations. Adoptingthe description of the process as a multivariate stochastic Ornstein-Uhlenbeck process, the presenter discusses thestrengths and the limitations of several approaches to this problem, based on the method of covariance matrices, aswell as the maximum likelihood and least-squares estimators.

The framework presented in this talk interested specialists in control for developing & using better real timeactuation of the system. The presenter engaged in further discussions for possible collaborations on this interdis-ciplinary integration with Pr. Jovanovic & Pr. Gayme.

Deepjoti Deka: Topology Learning in Power Grids from Ambient Dynamics Estimation of the operationaltopology of the power grid is necessary for optimal market settlement and reliable dynamic operation of the grid.The presenter introduces a novel framework for topology estimation for general power grids (loopy or radial) usingtime-series measurements of nodal voltage phase angles that arise from the swing dynamics. The learning methodutilizes multivariate Wiener filtering to unravel the interaction between fluctuations in voltage angles at differentnodes and identifies operational edges by considering the phase response of the elements of the multivariate Wienerfilter. The performance of this learning framework has been demonstrated through simulations on standard IEEEsynthetic test cases [20].

A talk similar in flavor to that of Andrey Lokhov but focused on a concrete application in power systems. Thepresenter engages in discussions afterwards with A. Lokhov and with the same participant in control theory ofpower networks.

Yury Maximov: Statistical Learning with Proxies for Power Flow Feasibility In power systems and in par-ticular in energy transmission systems it is of importance to determine if a state, i.e. a set of power productions andconsumptions, can be realized in a safe and secure fashion. The presenter shown an approach that uses samplingand machine learning to develop a classifier for secure and insecure states. This statistical machine learning canprovide probabilistic guarantees for the quality of the classifier, i.e., it guarantees that we do not treat an unsolvablestate as a solvable one. This algorithm approximate the solvability region, which enables in particular to providepolyhedral approximations of the solvability region, as well as more sophisticated convex approximations.

This talk was of interested and well received by scientists from the power system community present at theconference with many implementations related questions asked after the talk. This approach for finding the feasi-bility set is an original and novel approach which combines knowledge from machine learning and power systems.

Page 50: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

46 Five-day Workshop Reports

Other TopicsVijay Subramanian: Mean field Games: Incentive and Lottery Mechanisms Motivated by systems with alarge number of strategic players, such as in Internet marketplaces, the speaker explored the use of incentive andlottery mechanisms in mean-field games.

First, he considered real-time streaming of video to co-located wireless devices where cooperation among thedevices would lead to greater system efficiency. Based on ideas drawn from truth-telling auctions, a mechanismthat achieves this cooperation via appropriate transfers (monetary payments or rebates) in a setting with a largenumber of devices, and with peer arrivals and departures was designed. Furthermore, the complexity of calculatingthe best responses under this regime is low. This allowed to implement the proposed system on an Android testbed,and illustrate its efficient performance using real world experiments. The tools developed here easily generalize toother resource allocation problems.

Second, the general problem of resource sharing in societal networks, consisting of interconnected communica-tion, transportation, energy and other networks important to the functioning of society was considered. Participantsin such network need to take decisions daily, both on the quantity of resources to use as well as the periods of us-age. With this in mind, the presenter discussed the problem of incentivizing users to behave in such a way thatsociety as a whole benefits, specifically by rewarding users with lottery tickets based on good behavior, and pe-riodically conducting a lottery to translate these tickets into real rewards. The user decision problem was posedas a mean field game (MFG), and the incentives question as one of trying to select a good mean field equilibrium(MFE). The existence of such an MFE under different settings was shown, and it was also illustrated how to choosean attractive equilibrium using as an example demand-response in energy networks [11].

The two topics discussed are based on the joint work of the speaker with Srinivas Shakkottai (TAMU) and JianLi (UMass Amherst), with the first also with Rajarshi Bhattacharyya (TAMU) and Suman Paul (TAMU), and thesecond also with Le Xie (TAMU), Bainan Xia (TAMU), Xinbo Geng (TAMU), Hao Ming (TAMU).

In the following discussions the presenter stated his intent to look into Belief Propagation with mixed integerand real variables as well as the LP-BP versions of it.

Gunnar Flotterod (KTH) and Carolina Osorio (MIT): Stochastic network link transmission model Stochas-tic network link-transmission model

In this presentation based on a joint work with Carolina Osorio (MIT) – the presenter has described the linktransmission model which has recently gained popularity as an efficient yet exact model of kinematic waves onroad networks. The presenter formulated a stochastic instance of this model, where the number of vehicles any-where in the network is a distributed quantity. The model approximates network-wide stochastic dependencies.Multiple questions related to connections between the model setting and other physical network flow problemsdiscussed during the workshop was raised. The subject requires further work and investigation.

Philippe Jacquod: Multistability and topologically protected loop flows in meshed planar networks Thepresenter shown that the number of stable fixed points of locally coupled Kuramoto models depends on cycles inmeshed networks. In particular one should expect that the number of fixed points increases for meshed networkswith more cycles and with longer cycles. The presenter shows an upper-bound for the number of stable fixed pointin planar networks which support this intuition and has identified network topologies carrying stable fixed pointswith angle differences larger than Π/2. Compared to earlier approaches this bounds is lower and hence muchcloser to the true number of stable fixed points. [3].

Outcome of the MeetingThe workshop helped to highlight and appreciate new connections between the mathematical areas of dynamicalsystems, network science, control theory, optimization, and applied probability. Bringing together the theory andapplications community helped emphasize the important modeling aspects in each application, problems of impor-tance. Developing efficient algorithms for optimization under uncertainty in networks invariably needs additional

Page 51: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Optimization and Inference for Physical Flows on Networks 47

structures, assumptions and simplifications, and domain specific knowledge from the applications communityhelped identify what aspects of the system modeling are mandatory, and in what parts simplification is accept-able. Presentations from the theory/methods community brought to attention potential new directions to solvechallenging problems in the application areas of infrastructure networks.

The schedule involved an entire day for discussions. Several new problems, solution methods, and new collab-orations were conceived during these discussions. Some examples of new ideas include using Lasserre’s hierarchyto solve stochastic optimization problems in non-linear networks, direct approximations to non-linear networkphysics around an operating point that allow for efficient optimization and uncertainty quantification.

The workshop has received very positive informal feedback, and we anticipate the new ideas and collaborationsto lead to new and exciting advancements in the field. All of this was made possible by the BIRS through itsexcellent organization, facilities and support staff.

ParticipantsAhn, Sung-Soo (KAIST)Bolognani, Saverio (Swiss Federal Institute of Technology)Chertkov, Michael (Los Alamos National Laboratory)Deka, Deepjoti (Los Alamos National Lab)Dorfler, Florian (Swiss Federal Institute of Technology)Flotterod, Gunnar (KTH Royal Institute of Technology)Gayme, Dennice (Johns Hopkins University)Gorodnitskii, Oleg (Skoltech)Herty, Michael (RWTH Aachen University)Jacquod, Philippe (U of Applied Sciences of Western Switzerland)Josz, Cedric (LAAS CNRS)Jovanovic, Mihailo (University of Minnesota)Krishnamurthy, Dvijotham (California Institute of Technology)Lasserre, Jean-Bernard (LAAS-CNRS 7, Toulouse)Lee, Dongchan (MIT)Lokhov, Andrey (Los Alamos National Laboratory)Lubin, Miles (Massachussetts Institute of Technology)Maximov, Yury (Skolkovo Institute of Science and Technology)Misra, Sidhant (Los Alamos National Laboratory)Molzahn, Daniel (Argonne National Laboratory)Rantzer, Anders (Lund University)Rebeschini, Patrick (Yale University)Roald, Line (ETH Zurich)Ruozzi, Nicholas (UT Dallas)Savla, Ketan (University of Southern California)Scaglione, Anna (Arizona State University)Subramanian, Vijay (University of Michigan, Ann Arbor)Sun, X. Andy (Georgia Institute of Technology)talukdar, saurav (University of Minnesota)Vuffray, Marc (Los Alamos National Laboratory)Yang, Shuoguang (Columbia University)Yu, Suhyoun (MIT)Zlotnik, Anatoly (Los Alamos National Laboratory)

Page 52: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] Sungsoo Ahn, Michael Chertkov, and Jinwoo Shin. Gauge optimization via admm for approximate in-ference. arXiv preprint arXiv:1703.01056, 2017. [2] Saverio Bolognani, Ruggero Carli, Guido Cavraro, andSandro Zampieri. Distributed reactive power feedback control for voltage regulation and loss minimization.IEEE Transactions on Automatic Control, 60(4):966–981, 2015. [3] Robin Delabays, Tommaso Coletta, andPhilippe Jacquod. Multistability of phase-locking in equalfrequency kuramoto models on planar graphs. Jour-nal of Mathematical Physics, 58(3):032703, 2017. [4] Krishnamurthy Dvijotham and Daniel K Molzahn. Errorbounds on the dc power flow approximation: A convex relaxation approach. In Decision and Control (CDC),2016 IEEE 55th Conference on, pages 2411–2418. IEEE, 2016. [5] T. W. Grunberg and D. F. Gayme. Perfor-mance measures for linear oscillator networks over arbitrary graphs. IEEE Transactions on Control of NetworkSystems, PP(99):1–1, 2016. [6] Michael Herty. Multi-scale modeling and nodal control for gas transporta-tion networks. In Decision and Control (CDC), 2015 IEEE 54th Annual Conference on, pages 4585–4590.IEEE, 2015. [7] Cedric Josz. Application of polynomial optimization to electricity transmission networks.arXiv preprint arXiv:1608.03871, 2016. [8] Burak Kocuk, Santanu S Dey, and Xu Sun. New formulation andstrong misocp relaxations for ac optimal transmission switching problem. IEEE Transactions on Power Sys-tems, 2017. [9] Jean B Lasserre. Representation of chance-constraints with strong asymptotic guarantees. IEEEControl Systems Letters, 1(1):50–55, 2017. [10] Dongchan Lee and Konstantin Turitsyn. Dynamic polytopictemplate approach to robust transient stability assessment. arXiv preprint arXiv:1705.01189, 2017. [11] Jian Li,Rajarshi Bhattacharyya, Suman Paul, Srinivas Shakkottai, and Vijay Subramanian. Incentivizing sharing in real-time d2d streaming networks: A mean field game perspective. IEEE/ACM Transactions on Networking (TON),25(1):3–17, 2017. [12] Andrey Y Lokhov, Marc Vuffray, Sidhant Misra, and Michael Chertkov. Optimal struc-ture and parameter learning of ising models. arXiv preprint arXiv:1612.05024, 2016. [13] Sidhant Misra, MarcVuffray, Andrey Y Lokhov, and Michael Chertkov. Towards optimal sparse inverse covariance selection throughnon-convex optimization. arXiv preprint arXiv:1703.04886, 2017. [14] Hung D Nguyen, Krishnamurthy Dvi-jotham, Suhyoun Yu, and Konstantin Turitsyn. A framework for robust steady-state voltage stability of distri-bution systems. arXiv preprint arXiv:1705.05774, 2017. [15] Bala Kameshwar Poolla, Saverio Bolognani, andFlorian Dorfler. Optimal placement of virtual inertia in power grids. IEEE Transactions on Automatic Con-trol, 2017. [16] Anders Rantzer, Carolina Lidstrom, and Richard Pates. Structure preserving h-infinity optimalpi control. arXiv preprint arXiv:1612.07824, 2016. [17] Patrick Rebeschini and Sekhar Tatikonda. Decay ofcorrelation in network flow problems. In Information Science and Systems (CISS), 2016 Annual Conferenceon, pages 169–174. IEEE, 2016. [18] Nicholas Ruozzi. Exactness of approximate map inference in continuousmrfs. In Advances in Neural Information Processing Systems, pages 2332–2340, 2015. [19] Kaarthik Sundar,Harsha Nagarajan, Miles Lubin, Line Roald, Sidhant Misra, Russell Bent, and Daniel Bienstock. Unit commit-ment with n-1 security and wind uncertainty. In Power Systems Computation Conference (PSCC), 2016, pages1–7. IEEE, 2016. [20] Saurav Talukdar, Deepjyoti Deka, Blake Lundstrom, Michael Chertkov, and Murti VSalapaka. Learning exact topology of a loopy power grid from ambient dynamics. In Proceedings of the EighthInternational Conference on Future Energy Systems, pages 222–227. ACM, 2017. [21] Marc Vuffray, SidhantMisra, Andrey Lokhov, and Michael Chertkov. Interaction screening: Efficient and sample-optimal learningof ising models. In Advances in Neural Information Processing Systems, pages 2595–2603, 2016. [22] ArminZare, Yongxin Chen, Mihailo R Jovanovic, and Tryphon T Georgiou. Low-complexity modeling of partiallyavailable second-order statistics: theory and an efficient matrix completion algorithm. IEEE Transactions onAutomatic Control, 62(3):1368–1383, 2017. [23] Anatoly Zlotnik, Michael Chertkov, and Scott Backhaus. Op-

48

Page 53: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Optimization and Inference for Physical Flows on Networks 49

timal control of transient flow in natural gas networks. In Decision and Control (CDC), 2015 IEEE 54th AnnualConference on, pages 4563–4570. IEEE, 2015.

Page 54: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 5

New trends in arithmetic and geometry ofalgebraic surfaces (17w5146)

March 12 - 17, 2017Organizer(s): Noam D. Elkies (Harvard University), Keiji Oguiso (University of Tokyo),Matthias Schutt (Universitat Hannover)

OverviewThe interplay of arithmetic and geometry has had a major impact on the recent development of algebraic ge-

ometry and number theory. Over the years, algebraic curves have been a driving force, culminating with ground-breaking results such as Faltings’ finiteness theorem for rational points on algebraic curves of general type, andthe proof of Fermat’s Last Theorem by Wiles and Taylor. With curves much better understood, it seems natural toturn to higher-dimensional varieties, starting with surfaces.

Algebraic surfaces have been at the heart of algebraic geometry ever since the Italian school shaped the subjectat the beginning of the 20th century. There is little doubt that algebraic surfaces support rich arithmetic struc-tures; precise results on their arithmetic properties, however, are still widely conjectural, despite spectacular recentachievements such as the proof of the Tate conjecture for K3 surfaces in many important settings due to MadapusiPera, Maulik, and Charles.

The astounding facets of the rich interplay of arithmetic and geometry, which have featured in many recentdevelopments, were a key motivation to propose a workshop at BIRS. While K3 surfaces were a large focus of theworkshop, important roles were also reserved for Enriques surfaces and certain aspects of general type surfaces.

Recent Developments and Open ProblemsAlgebraic surfaces are two-dimensional varieties (usually considered to be smooth and projective) over some

algebraically closed field. Over the complex numbers, we can identify them with suitable manifolds, to which wecan then apply important principles such as GAGA following Serre’s classical work. To avoid confusion (arisingfrom the fact that varieties over C of complex dimension d are real manifolds of dimension 2d), we emphasizethat Riemann surfaces correspond to algebraic curves, whose properties and structures we take to be fairly wellunderstood (even though in the arithmetic fine print one may argue about this).

Thus turning to algebraic surfaces, we note immediately that they offer much more space for us to manoeuvre.Notably, an algebraic surface admits as subvarieties not only points but also curves, which in fact govern much ofthe structure of the surfaces. Another crucial difference is fibrations with fibers and base both of dimension one.Fibrations have played a central role in the study and classification of algebraic surfaces, as they come with severalcritical advantages. To name just two, they provide an important insight into the surface’s structure — dividingcurves into horizontal and vertical ones, for instance; on the other hand, they allow us to carry over concepts fromcurve theory, most canonically through the scheme-theoretic concept of the generic fiber.

We next detail some of the most recent developments and describe open problems in the area of algebraic and

50

Page 55: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

New trends in arithmetic and geometry of algebraic surfaces 51

arithmetic of surfaces, especially those relevant to the present workshop.Rational pointsGiven an algebraic variety X defined over some field K, one fundamental problem is to determine the set

X(K) of K-rational points on X . Of course, this problem is most interesting (albeit less geometric) when Kis not algebraically closed, especially when K is a number field. Already very basic questions turn out to beoften hard to answer, starting with whether X(K) is finite or infinite, which can be made more complicated (andinteresting) by removing certain algebraic subvarieties, such as those dominated by rational and abelian varieties.For the workshop, this circle of problems was mostly a very important motivation lurking in the background, but aquick discussion will still set the scene for most of what is to come.

It may be instructive to recall the now classical case of curves, where the picture is clear-cut. Let C, then, be asmooth projective curve defined over some number field K, and let g be the genus of C.

If g = 0, then either C has a K-rational point or C(K) = ∅. In the first case, it quickly follows that C isrational (i.e. isomorphic to P1 over K), and in particular C(K) is infinite. In the latter case, the same propertieshold true over any extension of K where C acquires a rational point.

At the other end of the scale, i.e. if C is of general type (which means g > 1), then it was conjectured byMordell in the 1920’s that C(K) would always be finite. After celebrated early contributions, for instance bySiegel, Mordell’s conjecture was finally proved by Faltings in the landmark paper [Fa83].

This leaves the case of genus one. If C(K) 6= ∅, then C is an elliptic curve; this means that C(K) has thestructure of an abelian group, which (for any number field K, but also under mild conditions over function fields)is finitely generated. More precisely, C(K) can contain any abelian finite group of length at most two as torsionsubgroup (although the precise groups that occur are rather limited by the given field K; for instance, over Qthe torsion subgroups run through all cyclic groups until Z/10Z together with Z/12Z, and the non-cyclic groupsZ/2Z× Z/2nZ for n = 1, 2, 3, 4 by Mazur’s famous theorem). As for the rank of C(K), this could also a prioribe any nonnegative intger r, and in fact it is an open problem whether given K, the rank of C(K) for all ellipticcurves C overK is bounded. For instance, the the current record over Q stands at 28, due to Elkies [Elk07] (whoseconstruction uses elliptic fibrations on K3 surfaces!). The rank can be made arbitrarily large by passing to finiteextensions K ′/K.

Note, however, that these considerations have taken us far beyond the original question, as we are alreadyinvestigating the precise structure of C(K) as opposed to its cardinality. Indeed, for the cardinality, the answeris simple: C(K) could be either finite or infinite, but on increasing the base field K by a finite extension K ′,we can first ensure that C is elliptic (over K ′) and then endow it with a rational point of infinite order so thatC(K ′) becomes infinite, and indeed dense in the associated Riemann surface. This property is also called potentialdensity, and it is of great relevance for us because it is geometric: it does not depend anymore on the field K orthe chosen model of C, but only on the isomorphism class of C over C (given that C may be defined over somenumber field).

A similar picture persists for algebraic surfaces — as far as we can see. Here the genus is replaced by theKodaira dimension. Surfaces of Kodaira dimension κ = −∞ are well understood as far as the cardinality ofthe set of rational points, though one can then ask for more subtle properties such as density — or even ask toremove a countable union of rational subvarieties. This leads to Manin’s conjectures for Fano varieties, whichhave seen substantial progress in recent years (in any dimension, in fact). In the opposite direction, for surfacesof general type (i.e. κ = 2), Lang’s conjecture predicts that rational points accumulate on a Zariski-closed propersubset, but this problem still seems hopelessly out of reach. We thus turn to the intermediate case of κ = 0. Hereabelian surfaces can be treated just like elliptic curves, but it is the case of K3 surfaces where new phenomenaarise. Notably, potential density is guaranteed by special structures on the surface. Not too surprisingly, an infiniteautomorphism group suffices for this end; much less trivial is that the same applies to genus one fibrations bya result of Bogomolov and Tschinkel [BT00] (and carries over to Enriques surfaces). This should serve as anindication that one ought to take a closer look at the structure of the curves on a surface (and at fibrations).

Curves on surfacesThe curves on an algebraic surface S generate a huge group of formal linear combinations, the divisor group

Div(S). As with curves, we can retrieve essential information about the surface S after dividing out by suitableequivalence relations; presently these could be linear equivalence, algebraic equivalence or numerical equivalence.All quotients share the same discrete part — up to torsion, which in fact is eliminated by numerical equivalence

Page 56: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

52 Five-day Workshop Reports

so that the resulting group Num(S) turns out to be an integral lattice with the natural intersection pairing. Its rankρ(S) is called the Picard number of S and is one of the most fundamental invariants of an algebraic surface. Weemphasize that the Picard number is in general not preserved by deformations, but in some special instances, asfor abelian or K3 surfaces (or irreducible holomorphic symplectic manifolds), there is a nice compatibility withmoduli theory.

Despite its importance, the problem of computing the Picard number of a given algebraic surface S remainsnotoriously difficult. However, if S is defined over some number field K, there is an arithmetic approach, whichcan serve as a prototypical example for this kind of question. Fix some prime p of K where S has good reduction(denoted by Sp). Then there is an embedding

Num(S) → Num(Sp),

primitive on the level of lattices. This yields the bound

ρ(S) ≤ ρ(Sp). (5.0.1)

It might seem that we have merely reduced one intractable problem to another. But crucially Num(Sp) lends itself,at least in principle, to explicit computations via the `-adic cycle class map

Num(Sp) → H2et(Sp,Q`(1)).

What’s more, the embedding is Galois equivariant, so the image lies in the invariant part under Frobenius. In fact,a famous conjecture of Tate states that this should be an equality after tensoring with Q`:

Conjecture (Tate). In the above setting, one always has

Num(Sp)⊗Q` = H2et(Sp,Q`(1))Frob.

This conjecture, one of the most important and influential in the field, has seen great progress in the last5 years: it has been proved competely for K3 surfaces. The pioneering work of Artin and Swinnerton-Dyer[ASD73] on homogeneous spaces proved the Tate conjecture for elliptic K3 surfaces; Nygaard [Nyg83a] provedit for ordinary K3 surfaces in any characteristic; and the remaining cases were recently settled by work of Maulik[Mau14], Charles [Cha13], Madapusi Pera [Mad15] (for odd characteristic), and Kim–Madapusi Pera [KM15] (forcharacteristic 2).

This paves the way for several applications and new directions. First, along the above lines, one can try tocompute the action of Frobenius on cohomology to obtain upper bounds as in (5.0.1). For parity reasons, thesebounds alone on ρ(S) are never sharp if ρ(S) is odd, but this subtlety can be overcome by varying p and examiningthe lattice structures more closely (see work of van Luijk [vLui07] and Kloosterman [Klo07]). It is another resultof Charles [Cha14] that all this information together will be sufficient to derive a sharp upper bound on ρ(S) inany case (at least in theory, and depending on the Hodge conjecture for the self-product S × S). We note that forall of this, we do not need the full action of Frobenius on H2

et(Sp,Q`(1)), only its eigenvalues with multiplicity.These can be computed, for instance, by extensive point counts (and applying the Lefschetz fixed point formula),or through p-adic cohomological methods and p-adic approximation.

We highlight two further applications of the above ideas. The first concerns very special curves on K3 surfaces,namely rational ones. The problem has seen remarkable progress recently, starting from work of Bogomolov-Hassett-Tschinkel [BHT11] and then greatly extended by Li and Liedtke in [LL12]. The main novel technique isto use Picard jumps upon reduction from S to Sp in (5.0.1), forced for instance by parity, to detect infinitely manyrational curves, first on Sp, but then, using arguments going back to Bogomolov and Mumford, also on the originalsurface S. In particular, this applies to any K3 surface S over Q with ρ(S) odd or at least 5. The second applicationdraws the connection back to rational points. Wondering whether K3 surfaces over number fields satisfy potentialdensity, the key case to consider seems to be Picard number 1. In order to check this, one may combine the abovemethods with elementary deformation theory to exhibit explicit K3 surfaces with Picard number one, say over Q,and use them as test cases.

Good reduction and Honda-Tate

Page 57: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

New trends in arithmetic and geometry of algebraic surfaces 53

The problem of good reduction lies at the heart of arithmetic geometry, and even more so do integral models,say over integer rings in number fields. Prototypica examples are often provided by moduli spaces; especially forShimura varieties integrality questions are one of the key issues for a better understanding, both of the varietiesand the objects parametrized.

The classical examples are provided by abelian varieties, for which we have a uniform treatment thanks to theexistence of Neron models. Indeed, Serre and Tate showed in [ST68] that an abelian variety over a number field hasgood reduction if and only its `-adic cohomology is unramified (for some ` or for all `, and evenH1 suffices). LaterFontaine proved that there cannot be any (positive-dimensional) abelian schemes over Z [Fo85]. However, alreadyin dimension one, i.e. for elliptic curves, it is not clear yet precisely which quadratic fields support everywhereintegral models of some elliptic curve.

A similar picture persists for K3 surfaces (and higher-dimensional Calabi–Yau varieties) where no models overZ may exist as proven independently by Abrashkin [Ab89] and Fontaine [Fo93]. Surprisingly, there really seem tobe no known explicit integral models of K3 surfaces (unless one considers [Ma15] as explicit); but on the theoreticalside there has been substantial progress recently, although part of it depends on the arithmetic analogue of partsof the Minimal Model Program. More precisely, assuming the existence of so-called Kulikov models (a strongform of semi-stable reduction which under certain conditions follows from work of Maulik [Mau14]), Liedtkeand Matsumoto prove potential good reduction for K3 surfaces over a discrete valuation ring, i.e. unramifiedGalois representations imply good reduction over some unramified finite extension (which generally cannot becircumvented, see [LM17]). Very recently, there has also been a crystalline version in [CLL17].

In a similar, but somewhat converse direction, one may also ask whether given a candidate zeta function(satisfying all the standard conditions over some finite field imposed by the Weil conjectures, compatibilities etc.),there is a variety within a given class (say abelian or K3) over the fixed finite field with this exact zeta function.Classically this is known for abelian varieties as the theorem of Honda–Tate (cf. [Hon68]). Taelman proved itrecently for K3 surfaces in [Tae16], again assuming a strong form of semi-stable reduction and possibly after anextension of the finite field (which, for instance, makes the Galois action on Num(S) trivial). We point out that it isnot known whether the extension can be avoided in general, or under certain conditions; but experiments conductedby Kedlaya and Sutherland provide rich test data for quartic surfaces over F2 [KS16].

Automorphisms of Enriques surfacesEnriques surfaces probably form the most mysterious surfaces of Kodaira dimension zero, because they are

governed by their universal covers, namely by K3 surface, and thus by lattice theory, but yet they sometimes showa completely different behaviour. For instance, a very general complex K3 surface has finite automorphism group(or even trivial or of order two), but a general Enriques surface has infinite automorphism group by [2]. In fact,Enriques surfaces with finite automorphism group are very rare; over C, they have been classified completely intoseven types by Kondo [Kon86]. In (small) positive characteristics, different situations may persist; recently, theodd characteristic case (and part of characteristic two, see below) was solved completely by Martin [Ma17]. Incharacteristic two, however, the moduli space of Enriques surfaces decomposes into two 10-dimensional compo-nents, corresponding to singular (or ordinary) Enriques surfaces on the one hand and classical Enriques surfaceson the other, as was shown by Liedtke in [Li15]. The singular Enriques surfaces still admit a smooth universal K3cover; in practice this means that they lend themselves to a treatment very similar to all other characteristics, whichwas exploited with great success in [KK15] and [Ma17]. The classical Enriques surfaces, however, behave ratherdifferently: much of their geometry seems to be closer to the complex counterpart (for instance genus one fibra-tions still have two ramified fibers), but the universal covers are no longer smooth K3 surfaces, but only K3-like.The Enriques surfaces at the intersection of the singular and the classical component, the so-called supersingularEnriques surfaces, might be considered even more mysterious.

In a different direction, one may ask for finite subgroups of the (usually infinite) automorphism group. Forcomplex K3 surfaces, this problem leads to now classical work of Mukai [Muk88] which classifies finite groupsacting symplectically (i.e. leaving the regular 2-form invariant) in terms of the Mathieu group M23. RecentlyMukai and Ohashi started to apply this approach to complex Enriques surfaces and M12 in [MO15].

Other aspectsOf course, there are many other open problems on algebraic surfaces which we cannot mention in great detail

here. They range from very classical problems, such as the geography of surfaces of general type, through moderntopics such as the ubiquitous moduli theory (especially for K3 surfaces, Enriques surfaces, but also for irreducible

Page 58: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

54 Five-day Workshop Reports

holomorphic symplectic manifolds) to the most recent advances in connection with dynamics, with emphasis onrational and K3 surfaces.

Presentation HighlightsThe workshop featured a great number of very interesting talks. There were numerous experienced leaders

as speakers, but we made an effort to schedule a good portion of talks by junior participants, one even beforereceiving his Master’s degree. Throughout the audience was very appreciative and took an active role in the talks,with intensive discussions regularly continuing during the breaks. Below we lay out some of the presentationhighlights.

Anthony Varilly-Alvarado (Rice University): A conjecture on Brauer groups of K3 surfacesTorsion points govern much of the theory of elliptic curves (as indicated in 5). Varilly-Alvarado reported on

recent attempts to find a good replacement for torsion points on K3 surfaces, ideally lending itself to uniformbounds such as Merel’s 1996 results for elliptic curves [Mer96]. Recent findings suggest that the Brauer groupsmight be a good candidate yielding analogous statements for K3 surfaces. Growing evidence was lately providedby works of several researchers, including Cadoret–Charles, Orr–Skorobogatov [OS17], and the speaker withcollaborators.

Lenny Taelman (University of Amsterdam): Equivariant Witt groups and zeta functionsTaelman presented work that should have a high impact on the Honda–Tate problem for K3 surfaces (cf. 5).

The talk itself (which has in the meantime resulted in a joint paper with Bayer-Fluckiger [BFT17]) was almostpurely concerned with quadratic forms, but it shed a completely new light on them by working out necessary andsufficient criteria for the following problem: Given a discrete valuation ring R with field of fractions K, considera symmetric bilinear space V over K. Assume that some group G acts on V by isometries. Does V contain aunimodular lattice stabilized by G?

Taelman’s result also provides restrictions on the possible characteristic polynomials of Frobenius on the mid-dle cohomology of a smooth projective variety of even dimension over a finite field, thus generalizing a theoremof Elsenhans and Jahnel [EJ15] (in a rather conceptual way).

Kazuhiro Ito (Kyoto University): On the construction of K3 surfaces over finite fields with given L-function

Ito improved on Taelman’s result on Honda-Tate for K3 surfaces [Tae16] (cf. 5) by giving a construction ofK3 surfaces over finite fields with given L-function, independent of the previously crucial good reduction criterion[It16]. The proof combines Taelman’s approach with showing the existence of an elliptic fibration with a section,or of an ample line bundle of low degree. It still involves a potential finite extension of the base field, and a mildcondition on the characteristic (either p ≥ 7, or p = 5 plus one of three extra conditions, e.g. ρ ≥ 4).

Yuya Matsumoto (Nagoya University): Degeneration of K3 surfaces and automorphismsMatsumoto’s talk provided a good reduction criterion for K3 surfaces (as in 5) that works not only for isolated

examples but in full families [Ma16]. Again assuming the existence of the so-called Kulikov models, his methodsrequire only the existence of an automorphism (of finite or infinite order) that acts on the regular 2-form by aprimitive root of unity of order m = 5 or m ≥ 7. To this end, the author studies the weight filtration of the `-adiccohomology groups. Then the compatibility with the induced action of the automorphism group lets him excludecertain types of degeneration.

Curtis McMullen (Harvard University): Algebraic integers and surfaces dynamicsMcMullen discussed reverse engineering at the interface of algebraic geometry and dynamical systems. In

detail, he explained how to create dynamical systems on varieties starting with an algebraic integer (a Salemnumber, to be precise, which will serve as eigenvalue of some automorphism on cohomology). Explicit dynamicalsystems of minimal entropy on K3 surfaces, rational surfaces and complex tori were described in detail.

Simon Brandhorst (Leibniz Universitat Hannover): On the dynamical spectrum of projective K3 sur-faces

Brandhorst focused on the above mentioned problem of realizing a Salem number λ ∈ C as entropy of anautomorphism of a complex projective K3 surface. He gave a affirmative answer, except that his methods onlycover some power of λ. Afterwards, the techniques were extended to Enriques surfaces and 2-dimensional tori,both projective and non-projective [Bra17].

Francois Charles (Universite Paris-Sud): Arithmetic ampleness and an arithmetic Bertini theorem

Page 59: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

New trends in arithmetic and geometry of algebraic surfaces 55

Charles discussed properties of certain ample line bundles in arithmetic geometry, aiming for analogues ofwell-known geometric results in the arithmetic setting [Cha17]. The key case consisted in an ample hermitianline bundle L on a projective arithmetic variety X of dimension at least 2 where the proportion of the effectivesections of L⊗n defining irreducible divisors on X was shown to tend to 1 as n tends to∞. Applications includedrestriction theorems and an arithmetic analogue of the Bertini irreducibility theorem, improving on the results from[CP16] (while at the same time building on them).

Asher Auel (Yale University): Decomposition of the diagonal and phantom categories on surfacesAuel surveyed recent developments on the problems of rationality and stable rationality, combining perspec-

tives from unramified cohomology and zero-cycles as well as derived categories and semiorthogonal decomposi-tions. In particular, he reported on joint work with Bernardara (cf. [AB17]), concerning phantom subcategoriesof the derived category of an algebraic surface. He showed how these can be viewed as a stronger measure ofrationality than the existence of a decomposition of the diagonal.

Jorg Jahnel (University of Siegen): On the distribution of the Picard ranks of the reductions of a K3surface

Jahnel reported on joint work with Costa and Elsenhans [CEJ16] on the distribution of the Picard ranks of aK3 surface under the reduction map (5.0.1) as the prime p varies. To this end, he considered the determinant ofthe Galois representation on the transcendental cycles. After a Tate twist, this gives either a trivial or a quadraticcharacter. Whenever this character evaluates to −1, he showed that the Picard number jumps. What’s more, thecharacter can be related to the discriminant of the K3 surface. This paves the way to a sufficient criterion for theexistence of infinitely many rational curves on K3 surfaces not covered by the cases laid out in 5.

Toshiyuki Katsura (Hosei University): Classification of Enriques surfaces with finite automorphismgroups in characteristic 2

Katsura’s talk concerned the problem of finite automorphism groups of those Enriques surfaces in characteristictwo whose universal cover is not a (smooth) K3 surface; that is, classical or supersingular Enriques surfaces –exactly those which were not covered in 5. In joint work with Kondo and Martin [KKM17], they use genus onefibrations (much like Kondo did in [Kon86], but now also including quasi-elliptic fibrations) and the classificationof conductrices from [ESB04] to establish the full classification of the configurations of smooth rational curves(which happen to form a finite set) on Enriques surfaces with finite automorphism group. In particular, they provideexamples for each configuration, some supporting different finite groups of automorphisms. This leaves open onlythe determination of the underlying moduli spaces.

Outcome of the MeetingThe workshop brought together experts from all over the world working on algebraic surfaces, ranging from

leaders in the field to rising stars and newcomers. It featured exciting talks about the latest developments, whichwere extremely well received and led to lively discussions. The workshop strengthened existing interactions andat the same time initiated several new collaborations and directions.

ParticipantsAuel, Asher (Yale University)Bauer, Ingrid (University of Bayreuth)Brandhorst, Simon (Leibniz University Hannover)Brunyate, Adrian (Stony Brook)Casalaina-Martin, Sebastian (University of Colorado at Boulder)Catanese, Fabrizio (University of Bayreuth)Costa, Edgar (Dartmouth College)Dembele, Lassina (Warwick University)Elkies, Noam D. (Harvard University)Elsenhans, Andreas-Stephan (University Paderborn)Garbagnati, Alice (University Statale di Milano)Hulek, Klaus (Fakultaet fuer Mathematik und Physik)Huybrechts, Daniel (University Bonn)

Page 60: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

56 Five-day Workshop Reports

Ito, Kazuhiro (Kyoto University)Jahnel, Joerg (University of Siegen)Katsura, Toshiyuki (Hosei University Japan)Kostiuk, Jordan (University of Alberta)Kuwata, Masato (Chuo University)Laface, Roberto (Leibniz Universitat Hannover)Lieblich, Max (University of Washington)Logan, Adam (Government of Canada)Matsumoto, Yuya (University of Tokyo)McMullen, Curtis (Harvard University)Mukamel, Ronen (Rice University)Nasrecki, Bartosz (University of Bristol)Oguiso, Keiji (The University of Tokyo)Ohashi, Hisanori (Tokyo University of Science)Ouchi, Genki (IPMU, The university of Tokyo)Rams, Slawomir (Jagiellonian University)Roulleau, Xavier (University of Marseille)Sarti, Alessandra (University of Poitiers)Schuett, Matthias (Leibniz University Hannover)Shepherd-Barron, Nicholas (KingOs College London)Shioda, Tetsuji (Rikkyo University)Taelman, Lenny (Universiteit van Amsterdam)Tayou, Salim (Universite Paris-Sud)van Luijk, Ronald (Universiteit Leiden)Varilly-Alvarado, Anthony (Rice University)Veniani, Davide Cesare (University of Mainz)Voight, John (Dartmouth College)Whitcher, Ursula (American Mathematical Society)Yu, Xun (Tianjin University)

Page 61: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[Ab89] Abrashkin, V. A.: Modular representations of the Galois group of a local field, and a generalization of theShafarevich conjecture, Izv. Akad. Nauk SSSR Ser. Mat. 53 (1989), 1135–1182.

[ASD73] Artin, M., Swinnerton-Dyer, H.: The Shafarevich-Tate conjecture for pencils of elliptic curves on K3surfaces, Invent. Math. 20 (1973), 249–266.

[AB17] Auel, A., Bernardara, B.: Cycles, derived categories, and rationality, Surveys on Recent Developmentsin Algebraic Geometry, Proceedings of Symposia in Pure Mathematics 95, Amer. Math. Soc., Providence, RI,2017, 199–266.

[BP83] Barth, W., Peters, C.: Automorphisms of Enriques surfaces, Invent. Math. 73 (1983), 383–411.

[BFT17] Bayer-Fluckiger, E., Taelman, L.: Automorphisms of even unimodular lattices and equivariant Wittgroups, preprint (2017), arXiv: 1708.05540.

[BHT11] Bogomolov, F. A., Hassett, B., Tschinkel, Y.: Constructing rational curves on K3 surfaces, DukeMath. J. 157 (2011), 535–550.

[BT00] Bogomolov, F. A., Tschinkel, Y.: Density of rational points on elliptic K3 surfaces, Asian J. Math. 4(2000), no. 2, 351–368.

[Bra17] Brandhorst, S.: On the stable dynamical spectrum of complex surfaces, preprint (2017), arXiv:1708.06169.

[Cha13] Charles, F.: The Tate conjecture for K3 surfaces over finite fields, Invent. Math. 194 (2013), 119–145.Erratum: Invent. Math. 202 (2015), 481–485.

[Cha14] Charles, F.: On the Picard number of K3 surfaces over number fields, Algebra & Number Theory 8(2014), 1–17.

[Cha17] Charles, F.: Arithmetic ampleness and an arithmetic Bertini theorem, preprint (2017), arXiv: 1703.02481.

[CP16] Charles, F, Poonen, B.: Bertini irreducibility theorems over finite fields, J. Amer. Math. Soc. 29 (2016),81–94.

[CLL17] Chiarellotto, B., Lazda, C., Liedtke, C.: A Neron-Ogg-Shafarevich critersion for K3 Surfaces, preprint(2017), arXiv: 1701.02945.

[CEJ16] Costa, E., Elsenhans, A.-S., Jahnel, J.: On the distribution of the Picard ranks of the reductions of a K3surface, preprint (2016), arXiv: 1610.07823.

[ESB04] Ekedahl, T., Shepherd-Barron, N. I.: On exceptional Enriques surfaces, preprint (2004), arXiv:math/0405510.

[Elk07] Elkies, N. D.: Three lectures on elliptic surfaces and curves of high rank, preprint (2007), arXiv:0709.2908.

57

Page 62: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

58 Five-day Workshop Reports

[EJ15] Elsenhans, A.-S., Jahnel, J.: On the characteristic polynomial of the Frobenius on etale cohomology, DukeMath. J. 164 (2015), 2161–2184.

[Fa83] Faltings, G.: Endlichkeitssatze fur abelsche Varietaten uber Zahlkorpern, Invent. Math. 73, 349–366(1983).

[Fo85] Fontaine, J.-M.: Il n’y a pas de variete abelienne sur Z, Invent. Math. 81 (1985), 515–538.

[Fo93] Fontaine, J.-M.: Schemas propres et lisses sur Z, Proceedings of the Indo-French Conference on Geometry(Bombay, 1989), 43–56, Hindustan Book Agency (1993).

[Hon68] Honda, T.: Isogeny classes of abelian varieties over finite fields, J. Math. Soc. Japan 20 (1968), 83–95.

[It16] Ito, K.: Unconditional construction of K3 surfaces over finite fields with given L-function in largecharacteristic, preprint (2016), arXiv: 1612.05382.

[KK15] Katsura, T., Kondo, S.: On Enriques surfaces in characteristic 2 with a finite group of automorphisms, toappear in J. Alg. Geom., preprint (2015), arXiv:1512.06923.

[KKM17] Katsura, T., Kondo, S., Martin, G.: Classification of Enriques surfaces with finite automorphism groupin characteristic 2, preprint (2017), arXiv: 1703.09609.

[KS16] Kedlaya, K., Sutherland, A. V.: A census of zeta functions of quartic K3 surfaces over F2, LMS Journalof Computing 19, special issue A (2016), 1–11.

[KM15] Kim, W., Madapusi Pera, K.: 2-adic integral canonical models and the Tate conjecture in characteristic 2preprint (2015), arXiv: 1512.02540.

[Klo07] Kloosterman, R.: Elliptic K3 surfaces with geometric Mordell-Weil rank 15, Canad. Math. Bull. 50(2007), no. 2, 215–226.

[Kon86] Kondo, S.: Enriques surfaces with finite automorphism groups, Japan. J. Math. 12 (1986), 191–282.

[LL12] Li, J., Liedtke, C.: Rational Curves on K3 Surfaces, Invent. Math. 188 (2012), 713–727.

[Li15] Liedtke, C.: Arithmetic Moduli and Lifting of Enriques Surfaces, J. reine angew. Math. 706 (2015), 35–65.

[LM17] Liedtke, C., Matsumoto, Y.: Good Reduction of K3 Surfaces, to appear in Compos. Math., preprint(2017), arXiv: 1411.4797v3.

[vLui07] van Luijk, R.: K3 surfaces with Picard number one and infinitely many rational points, Algebra NumberTheory 1 (2007), no. 1, 1–15.

[Mad15] Madapusi Pera, K.: The Tate conjecture for K3 surfaces in odd characteristic Invent. Math. 201 (2015),625–668.

[Ma17] Martin, G.: Enriques surfaces with finite automorphism group in positive characteristic, preprint (2017),arXiv: 1703.08419.

[Ma15] Matsumoto, Y.: On good reduction of some K3 surfaces related to abelian surfaces, Tohoku Math. J. 67(2015), 83–104.

[Ma16] Matsumoto, Y.: Degeneration of K3 surfaces with non-symplectic automorphisms, preprint (2016), arXiv:1612.07569.

[Mau14] Maulik, D.: Supersingular K3 surfaces for large primes, Duke Math. J. 163 (2014), 2357–2425.

[Mer96] Merel, L.: Bornes pour la torsion des courbes elliptiques sur les corps de nombres, Invent. Math. 124(1996), 437–449.

Page 63: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

New trends in arithmetic and geometry of algebraic surfaces 59

[Muk88] Mukai, S.: Finite groups of automorphisms of K3 surfaces and the Mathieu group, Invent. Math. 94(1988), 183–221.

[MO15] Mukai, S., Ohashi, H.: Finite groups of automorphisms of Enriques surfaces and the Mathieu groupM12,preprint (2015), arXiv: 1410.7535v2.

[Nyg83a] Nygaard, N. O.: The Tate conjecture for ordinary K3 surfaces over finite fields, Invent. Math. 74 (1983),no. 2, 213–237.

[OS17] Orr, M., Skorobogatov, A. N.: Finiteness theorems for K3 surfaces and abelian varieties of CM typepreprint (2016), arXiv: 1704.01647.

[ST68] Serre, J.-P., Tate, J.: Good reduction of abelian varieties, Ann. of Math. 88 (1968), 492–517.

[Tae16] Taelman, L.: K3 surfaces over finite fields with given L-function, Algebra & Number Theory 10 (2016),1133–1146.

Page 64: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 6

Generated Jacobian equations: fromGeometric Optics to Economics (17w5078)

April 9 - 14, 2017Organizer(s): Nestor Guillen (University of Massachusetts), Jun Kitagawa (Michigan StateUniversity), Robert McCann (University of Toronto)

Overview of the FieldThe field of generated Jacobian equations (GJE) is a nascent field, motivated on one hand by questions in geometricoptics and partial differential equations, and on the other by structures that arise naturally in economics. It maybe cast as an outgrowth of the theory of optimal transport (OT), which has seen a lot of development and progressover the last couple of decades.

As may be gleaned from the fields mentioned above, the span of areas involving the analysis of generatedJacobian equations is vast. This ubiquity of GJE and OT reflects the fact that they are concerned with the study ofthe following common situation: one has mappings (alternatively “changes of variables”, or “matchings”) with aprescribed action on volume and which satisfy an optimality condition. Such situations include measure preservingmappings that minimize a cost functional, optimal matchings between two or more populations, couplings ofrandom variables that minimize covariance, ray tracing maps from one reflecting surface to another, to name a few.

As we are dealing with mappings with a prescribed action on given measures, we are led naturally to the notionof a prescribed Jacobian equation. This is an equation where the unknown is a map T (sometimes, but not alwaysa diffeomorphism) between two spacesX and Y (frequently Riemannian manifolds or subsets of Euclidean space),and the distortion of the volume by this map is prescribed. This leads to equations of the form

detDT (x) = ψ(x, T (x)).

In this generality, this is a nonlinear system of PDEs, however, we are interested in a more special situation. If Tis a gradient map, that is if T (x) = ∇u(x) for some scalar valued potential function u, the equation turns into thefollowing second order scalar PDE, known as the Monge-Ampere equation:

detD2u(x) = ψ(x,∇u(x)).

The most studied class of solutions to this equation is the class of convex u where ψ is nonnegative, in which casethe equation is degenerate elliptic. In the study of more general prescribed Jacobian equations, there is an analogueof “mapping arising from a convex potential” through the introduction of what is known as a generating function

60

Page 65: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Generated Jacobian equations: from Geometric Optics to Economics 61

G. This refers to a real valued function,

G : (x, y, z) ∈ X × Y × R 7→ R,

which determines a duality structure between the spacesX and Y . This generalizes the notion of duality betweena vector space and its dual, which corresponds to the choice of

G(x, y, z) = −〈x, y〉+ z,

(here 〈·, ·〉 is the duality pairing between a vector and covector). The theory of GJE is concerned with prescribedJacobian equations that arise from a generating function. Concretely, they involve an unknown scalar functionu : Ω 7→ R solving a nonlinear PDE of the form

detD(Tu(x)) = ψ(x, u(x),∇u(x)), Tu(x) = T (x, u(x), Du(x)),

where Tu is a generalization of the gradient map from the classical Monge-Ampere case above, which is determinedby solving the following system of equations in T and Z:

DxG(x, T (x, u, p), Z(x, u, p)) = p, G(x, T (x, u, p), Z(x, u, p)) = u.

As with the classical Monge-Ampere equation, it is advantageous to restrict attention to the class of G-convexsolutions u, that is, those functions that can be expressed in terms of some dual function v : Y 7→ R and G via thegeneralized Legendre transform defined by

u(x) = supy∈Y

G(x, y, v(y)). (6.0.1)

Then, the map Tu(x) (under additional conditions) is characterized by

Tu(x) = y, where y = argmaxy∈Y

G(x, y, v(y)). (6.0.2)

The classical Monge-Ampere equation corresponds to the generating function G(x, y, z) = −〈x, y〉+ z on Rn ×Rn × R, in which case u is a convex function and Tu(x) = ∇u(x).

Applications of Generated Jacobian EquationsLet us elaborate further on the ways GJE arise in areas such as geometric optics, differential geometry, mathe-

matical physics, and economics. We could start by pointing out that the wide applicability of GJE may already beinferred from the fact that GJE “contains” Optimal Transport as a “special case”, however, this does not seem likea fair or illustrative approach. A more interesting measure of the merits for GJE is given by the many examples ofproblems in GJE are not treatable by the optimal transport framework.

Let us start with an important class of examples of GJE, namely the “near field” reflector problems in geo-metric optics, which fall outside the scope of OT1. In fact, at a previous BIRS workshop (Workshop # 12w5118),Trudinger posited the concept of GJE and proposed it as a framework that covers both optimal transport as well asnear field reflector problems. The report for this workshop [] remarks that “a natural question is how to extend theresults known up to now for optimal transport maps, to this more general setting, and it seems to give an interestingand challenging line of research”.

One particular example is the point source near field reflector problem. In this problem there is a set ofdirections X ⊂ S2 through which light emanates from a point source at the origin O, and there is a region Y ⊂ Σin a smooth surface Σ. The goal is to illuminate Y by constructing a mirror to catch and reflect the light emitted bythe source. Since the problem will be underdetermined otherwise, there are also two absolutely continuous massdistributions with densities f(x) and g(y) on X and Y respectively, with equal masses, f representing the initialdistribution of light through X and g the desired output pattern on Y .

1“far field” problems, which are a limiting case of the near field regime, can be understood via OT

Page 66: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

62 Five-day Workshop Reports

Then make the choice

G(x, y, z) =z−2 − 1

4 |y|2z−1 − 1

2 (x, y). (6.0.3)

Here, the radial graph of G(·, y, z) over S2 is an ellipsoid of revolution whose foci are at O and y, and whoseeccentricity is determined from z. Then, a G-concave function u (defined as in (6.0.1), but with an inf instead of asup) represents a reflective surface given by envelope of ellipsoids of revolution with one foci at O and the otherat points in Y . Assuming the reflective surface obeys Snell’s law, a ray emanating from O in the direction x willbounce off the reflective surface and eventually hit Σ at a point Tu(x) ∈ Y , where Tu(x) is as in (6.0.2) but withan argmin instead of argmax. Then in order to construct the desired mirror, one should take the radial graph of aG-convex solution u : X 7→ R of the GJE

detDTu(x) = f(x)/g(DTu(x)). (6.0.4)

By considering other source and target domains X and Y , and different G corresponding to other basic reflectivesurfaces (e.g. paraboloids or cartesian ovals) one may treat many different near field geometric optic problems,such as parallel beam sources, or refraction problems.

In another direction, note all of the geometric optic problems above involve determining a surface whose normalvector behaves in such a way so that the ray tracing map sends one measure into another. This situation is verysimilar to that of prescribing the Gauss curvature of a convex surface (the Minkowski problem), which leads us todifferential and convex geometry. A number of problems in convex analysis and convex geometry, generalizing theMinkowski problem, all lead naturally to Monge-Ampere type equations and generating functions. The Minkowskiproblem, which involves finding a closed convex surface with prescribed Gauss curvature, may, be posed as theanalysis of a function of the form

ρ(x) = infyG(x, y, z(y)), G(x, y, z) :=

z

x · y , x, y ∈ S2,

for some strictly positive function z(y) (and G(x, y, z) being defined only for x, y ∈ S2 with x · y > 0). There arealso other related problems of great interest in the study of convex bodies, the Lp- and log-Minkowski problems.It is reasonable to expect that an analogue of this problem when R3 is replaced by a general Riemannian manifoldwill also lead to a GJE.

Alternatively, from the viewpoint of dynamics an important notion –currently limited to OT, with no parallelat the more general level of GJE– is that of displacement convexity and Wasserstein barycenters. Recall thatif (X, d) is a metric space, OT imbues the space P2(X) (probability measures on X with finite second moment)with a metric dMK,2, defined via the Kantorovich problem

dMK,2(γ0, γ1)2 := infπ∈Π(γ0,γ1)

∫X×X

d(x, y)2 dπ(x, y),

where Π(γ0, γ1) is the space of probability measures in X × X whose left and right marginals are γ0 and γ1,respectively. Then dMK,2 makes P2(X) a Polish and/or geodesic space as long as (X, d) is itself a Polish and/orgeodesic space respectively, and thus in the latter case one can consider geodesics via this metric between twoprobability measures on X .

Displacement convexity is a property of functionalsE on P2(X), requiring the convexity ofE along geodesicsin this dMK,2 metric. This notion of convexity for functionals is particularly useful in analyzing gradient flows forfunctionals on P2(X) (such as relative entropy, as a famous example) and in particular for understanding the rateof convergence to equilibrium.

Lastly, let us discuss the emergence of generated Jacobian equations in economics, concretely, in relation toboth matching problems and principal/agent problems. In either case, the set up involves compact metric spacesX and Y , and (in the greatest generality) a generating function φ : X × Y × R 7→ R, which is assumed tobe continuous, strictly decreasing in its third argument, and such that φ(x, y,R) = R for all x, y. Moreover, ψrepresents the inverse generating function. In this context, φ and ψ will represent the utility functions of certainparties.

Page 67: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Generated Jacobian equations: from Geometric Optics to Economics 63

In the matching context, φ(x, y, v) represents the utility that an agent x obtains when matched with agent ywho is receiving a utility of v; the quantity ψ(x, y, u) then corresponds to the utility agent y obtains when matchedwith agent x, who is receiving a utility of u. By contrast, in the principal-agent context φ(x, y, v) corresponds tothe utility earned by an agent of type x who chooses decision y and makes a transfer of v to the principal, andψ(x, y, u) is the transfer which yields a utility of u to an agent of type x who chooses decision y. One is givendistributions for agents x and agents y in the former case, and a utility function for the principal and a distributionfor the agents in the latter one, and the goal is to find a stable matching: an admissible matching between partieswhere no pair has incentive to change their assigned pairing. Utility functions φ are called quasilinear if they areof the form φ(x, y, z) = b(x, y)+z, for some b(x, y), and previous work has been done connecting OT to this caseof quasilinear utility. However, it can be more natural to assume a non-quasilinear utility, which now becomes partof the realm of general GJEs.

Recent Developments and Open Problems

It seems that generated Jacobian equations and the analysis ofG-convex functions first appeared in the literaturein two independent, vastly different contexts. The first is in the PDE literature motivated by near-field geometricoptics problems, and the second in the economics literature in relation to principal-agent and matching problemsin the non-quasilinear setting.

On the PDE side, in [34] Trudinger introduced GJE as a convenient framework covering a broad type ofproblems –motivated by geometric optics problems that are not covered by OT. Up until this point, far-field op-tics problems had been recognized as being covered by the OT framework (for example, [35]). The work ofTrudinger in [34], besides introducing the GJE framework, studies existence of weak solutions subject to thesecond boundary value condition (the natural boundary condition when one prescribes the action of a volumedeforming transformation), and provides (C2 and higher) regularity results for smooth, positive densities undernatural structural conditions on the generating function and domains. One of the conditions is a natural analogueof the Ma-Trudinger-Wang condition which plays a central role in regularity of OT ([28]). Since then, Guillen andKitagawa have developed regularity theory (C1,α) for weak solutions of GJE with possibly discontinuous densities,while Jhaveri has obtained partial regularity results under minimal assumptions (see also discussion of presenta-tions below). In terms of pure geometric optics, Gutierrez and Tournier analyzed the regularity for the near fieldparallel reflector/refractor problem [19] and also studied problems for media with different refractive indices [18].Gutierrez and Sabra also studied models for scattering on free form lenses [17] and aspherical lenses [16]. There isalso work of Karakhanyan and Wang [20] regarding the near field point source problem, which obtains regularityof reflectors for smooth densities under natural structural assumptions, but furthermore, provides an example ofsingular behavior of a nature not exhibited in OT. This brief and incomplete lists illustrates just some of the recentactivity from the PDE side concerning GJEs, and the many geometric optic problems that GJE encompasses.

Independently, from the economics community, Noldeke and Samuelson [32] initiated a systematic studyof generating functions and G-convex analysis in the context of the “implementation duality” in matching andprincipal-agent problems. Previous works [5] and [12] analyzed the quasilinear case, which falls within the realmof OT, but the work of Noldeke and Samuelson deals specifically with utility functions which are not quasilin-ear. Another interesting development is in relation to the second welfare theorem, which in the quasilinear caseessentially indicates that a matching in equilibrium coincides with the optimality conditions arising in OT. In thenon-quasilinear case, non-transferable utilities may make it impossible to characterize equilibrium matchings asoptimal, which has led Galichon to propose “equilibrium” as a replacement for “optimal” in matching problems.This in turn leads to a question that arises naturally and has connections with geometric optics as well: what isthe analogue for GJEs of Kantorovich’s relaxation of OT? This is an unresolved question, in fact, it is not evenclear what a corresponding notion of a transport plan should be within GJE. This issue not withstanding, work byLiu [26] shows there is a variational characterization behind certain reflector antenna problems, and shows thisquestion can form a unifying thread between economics and geometric optics within GJE.

Finally, we mention the great deal of activity relating to numerical schemes for GJEs. There is work in thesemi-discrete setting by Merigot, Meyron, and Thibert [31] as well as by Kitagawa, Merigot, and Thibert [25] foroptimal transport. Monotone approximation schemes for viscosity solutions of the Monge-Ampere equation have

Page 68: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

64 Five-day Workshop Reports

been obtained recently by Froese and Oberman [13], which are notable for their handling of the second boundaryvalue problem. Developing respective numerical schemes for various geometric optic problems is a worthwhileresearch direction that is still in its early stages.

Presentation HighlightsThis workshop brought together mathematicians (notably, experts in differential geometry, PDE, convex analysis),engineers and applied mathematicians (numerical analysis, optimization, mathematical physics), and economists(stable matching theory, principal/agent problems). Our chief goals were to compare each field’s methods, researchdirections, and conjectures, and to lay the ground for future interdisciplinary work.

General Theory of GJEsNestor Guillen began with an overview of generated Jacobian equations, presenting the basic setup and ter-

minology as first introduced by Trudinger in [34]. This included a brief overview of the applications to geometricoptics and economics, a discussion of his recent work in regularity of weak solutions with Kitagawa [15], and keydifferences with the classical Monge-Ampere and optimal transport theory (as shown by Karakhanyan and Wang[20]).

Yash Jhaveri discussed a partial regularity result for GJE, showing that for generating functions that satisfystructural conditions and continuous, bounded data, there are two closed sets of measure zero, outside of which theassociated transport mapping is a C0,α homeomorphism. One significance of this result is that it does not requirean MTW like condition that is usually necessary for full regularity results. The result first appeared in the contextof the real Monge-Ampere equation by Figalli and Kim [11] and OT by De Philippis and Figalli [10].

Brendan Pass presented recent work with McCann on OT problems between spaces of unequal dimension.They have shown that when the dimension of the source domain is larger than that of the target domain, the problemcan be described by a non-local analogue of the Monge-Ampere equation that arises for transport between spacesof equal dimensions. Such problems are of great interest in economic matching problems, as the spaces to bematched (buyers and sellers, employers and employees, etc.) naturally depend on different numbers of parameters.In another recent work with Chiappori and McCann [6], Pass has exhibited some conditions under which one canobtain uniqueness and regularity of transport maps when the target space is one dimensional.

Numerical AnalysisBoris Thibert presented two numerical schemes for the semi-discrete optimal transport problem based on

damped Newton methods. The first is applicable to transportation of an absolutely continuous measure to a discretemeasure subject to a cost function satisfying the MTW condition, under only mild connectivity assumptions onthe source of the source measure (joint with Kitagawa and Merigot [25]). This scheme relies on the geometricimplications of convexity coming from the MTW condition, which was first shown by Loeper ([27]). The secondconsiders the case where the source measure is supported on a simplex soup and the target is discrete (joint withMerigot and Meyron [31]). In this case the convex structure of the MTW case is lost, hence the scheme requiresthe points in the support of the discrete target satisfy a natural non-degeneracy type condition. Both results includerigorous proofs of convergence rates.

Farhan Abedin discussed an iterative method for computing solutions to GJE also in the semi-discrete case.The method is in the vein of Caffarelli, Kochengin, and Oliker’s in the far-field reflector problem [3] and Kitagawa’sfor strongly MTW costs [24], but Abedin’s method is novel in that it does not require any MTW like conditions onthe generating function to obtain termination after a finite number of steps (which is new even in the OT case) andthus applies to an extremely wide variety of generating functions.

Brittany Froese introduced a general framework to construct monotone approximation schemes for Monge-Ampere type equations given a point cloud satisfying certain structural conditions. With this framework and acareful resolution of boundary geometry, given a degenerate elliptic equation with a comparison principle, Froese isable to construct approximation schemes which converge to viscosity solutions of the PDE [14]. Numerical results

Page 69: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Generated Jacobian equations: from Geometric Optics to Economics 65

were also presented for examples including the prescribed Gauss curvature problem with Dirichlet boundary data,the quadratic optimal transport problem, and problems arising from seismic imaging and beam shaping.

GJE and OT in economics

Larry Samuelson’s talk dealt with generalized notions of convexity and how they are a means to understand the“implementability” of assignment maps in natural problems in economics, again in instances where the underlyingutilities may not be quasilinear. A natural concept is that of a profile v : Y 7→ R implementing a pair (u, y), whereu : X 7→ R (another profile) and y : X 7→ X (an assignment), which means that

u(x) = maxy∈Y

φ(x, y, v(y)), y(x) ∈ argminy∈Y φ(x, y, v(y)).

In the matching model, u and v are profiles of utilities for the buyers and sellers. In the principal-agent model, uis a rent function for the agent, giving a utility u(x) for agent x, and v is a tariff function, giving the tariff v(y) atwhich any agent can execute contract y. One result presented by Samuelson was that the set of pairwise, stable, fulloutcomes satisfying a given initial condition (y1, v1) is nonempty and closed in the right topology. Additionally,he mentioned that implementation maps are examples of a (antitone) Galois connection: a pair of order reversingmappings between posets of a set.

Alfred Galichon introduced the framework of equilibrium transportation, which encompasses problems aris-ing naturally in economics which often fall outside the scope of OT. As mentioned in the previous section, thesecond welfare theorem gives the equivalence of stability and optimality in matching problems involving quasilin-ear utilities, but this may not be the case in the non-quasilinear case. However, Galichon posits that it does makesense to talk about equilibrium in matchings in the non-quasilinear case, and perhaps this is the correct notion topursue rather than optimality. Galichon illustrated this situation by introducing a variation on the matching prob-lem involving taxation. In this setting, workers x are paired with firms y and receive a wage w(x, y) while theworker obtains a utility α(x, y) (this may represent aspects different from pay, such as job satisfaction). The firmobtains a utility γ(x, y), hence their total profit is γ(x, y)−w(x, y). However, there is a tax levied on the worker’spay, thus the actual benefit a worker gains is α(x, y) + N(w(x, y)) for some function N . The goals is then tomatch a group of workers with firms in a stable manner, in the case of no taxation (N(w) = w) we recover OT andstability is equivalent to optimality. In the case of taxation, Galichon introduces the notion of feasible sets definedby

Fxy = (u, v) ∈ R2 : u− α(x, y) ≤ N(γ(x, y)− v),

and a triple (P, u, v) where P ∈ P(X × Y ), u, v : X , Y → R is said to be an equilibrium when

(u(x), v(y)) 6∈ Fxy and (x, y) ∈ spt(π)⇒ (u(x), v(y)) ∈ Fxy. (6.0.5)

Finding (π, u, v) such that (6.0.5) holds is known as a Nonlinear Complimentary Problem (NCP). In the continuumcase, if the matching between x and y is given by a smooth map x → y(x), then y(x) is the G-gradient map of aG-convex function u which (using the mass balance) solves a corresponding GJE. The map y(x) and scalar u(x)are examples of an implementation map, as discussed by Noldeke and Samuelson [32].

Shuangjian Zhang discussed a principal-agent problem modeling a pricing problem for a monopolist involv-ing non-quasilinear utility functions, and presented necessary and sufficient conditions for the principal’s problemto be a linear program (joint with McCann, [30]). Here the principal-agent problem is where a principal and agentsattempt to maximize their utilities which arise through the execution of certain contracts, but often with someamount of information asymmetry. In the context of Zhang and McCann’s work the principal is a monopolistmanufacturer and agents are potential buyers, the different contracts to be executed are the purchase of differentmodels of whatever product the principal manufactures. The monopolist then has a (possibly non-quasilinear)profit function π(x, y, v), which is the profit generated by agent x buying the product y at price v. Then the mo-nopolist knows the distribution dµ of different types of buyers in the market and the utility that a given buyer typederives from purchasing a given model of product, and seeks to set the price v(y) for each model y in a manner

Page 70: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

66 Five-day Workshop Reports

which maximizes their profit, i.e. they wish to maximize the functional∫X

π(x, y(x), v(y(x))) dµ(x),

where here y(x) is the choice of model that a buyer x will select based on their own utility and a given pricingscheme. Zhang has shown that under a convexity condition on the agents utility function along certain curves, theprincipal’s problem is a linear program. This convexity condition follows from a strengthening of the analogueof MTW condition for GJEs, which corresponds to the condition of non-negative cross curvature used in previouswork of Figalli, Kim, and McCann ([12]) for the case of quasilinear utilities.

Guillaume Carlier presented a model for equilibrium prices with constraints stemming from work with Eke-land and Galichon. In the model, one has a set Z representing feasible goods, and which is assumed to be acompact metric space. The goal is to determine a price system that clears the market, that is, a price functionp : Z 7→ R which leads to equilibrium.

One is given a set of producers y ∈ Y (assumed also a compact metric space), a cost function c : Y × Z 7→ R(what it costs to y to produce z), and a distribution of producers corresponding to a probability distribution ν overY . Then, given p(z), it’s c transform is pc(y) = maxzc(y, z) − p(z) and it represents what the maximum costto producer y when selling good z in this price system. Consumers are represented by a compact metric space Xwith a probability distribution µ ∈ P(X).

So far, this is standard, the novelty arises in that consumers are modeled to have a budget: each consumer xhas a revenue φ(x), φ ∈ C(X), as well as a utility function given by U : X × Z 7→ R, such that U ∈ C(X × Z).Then, given a price function p(z), we define

Up(x) = maxU(x, z) : z s.t. p(z) ≤ φ(x)

Then, the problem is to determine a price controls p and matching plans for consumers/goods γ ∈ P(X × Z)and producers/goods σ ∈ P(Y × Z) satisfying the following. The prize function is such that p ∈ C(Z,R) andminZ p(z) = minX φ, while πx#γ = µ, πy#σ = ν, and πz#γ = πz#σ. Then, the equilibrium corresponds tothese objects satisfying

pc(y) + p(z) = c(y, z) σ-a.e., while p(z) ≤ φ(x) and U(x, z) = Uρ(x) γ-a.e.

This a case of nontransferable utility and OT methods do not apply. An existence result is obtained for equilibriain the above sense.

Beatrice Acciaio provided an introduction to the theory of causal transport and some of its applications, basedon joint work with Backhoff and Zalashko [1]. In probabilistic language, classical OT corresponds to minimizingthe expectation of the cost function:

infEπ[c(x, y)] : P ∈ Π(µ, ν)

where as usual, Π(µ, ν) denotes the space of probability measures over X × Y with marginals µ and ν. Acciaiointroduced the variant of causal transport. Here one takes filtrations FX and FY over X and Y respectively, thenπ ∈ Π(µ, ν) is said to be a causal plan if for all t and D ∈ FYt the map x 7→ P x(D) is measurable w.r.t. to FXt .Then, the causal transport problem is the minimization problem

infEπ[c(x, y)] : P ∈ Π(µ, ν) such that P is causal,

corresponding to a constrained version of the classical OT problem where time matters, hence the involvementof causality. Attainability for the causal transport problem, as well as the expected duality relations are satisfied.Acciaio also discussed an application of causal transport, to the following question of importance in stochasticintegration: given a space of events Ω imbued with two filtrations F and G, and a probability measure P, thengiven X a semimartingale in (Ω,F ,P) when can we say that X is also a semimartingale in (Ω,G,P)? Withan appropriate choice of cost function, causal transport provides an optimal transport characterization for thoseprocesses B which are a Brownian motion with respect to a given filtration and remain a semimartingale withrespect to an enlarged filtration.

Page 71: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Generated Jacobian equations: from Geometric Optics to Economics 67

Dynamics

Bernhard Schmitzer’s presentation was concerned with the problem of “unbalanced” optimal transport andvarious approaches. This problem, considered in joint work with Benedikt Wirth [33], is relevant to situationswhere the two measures under consideration mays not have the same total mass –e.g. a partial transport prob-lem or an inference problem where one needs to estimate the similarity between two distributions that may havedifferent total mass. Such problems can arise for example in image interpolation between images that are notnormalized (such as medical images). Schmitzer’s talk considered both static as well as dynamic formulations(much in the spirit of work of Benamou and Brenier). Such considerations are important as frequently unbalancedtransport is initially defined via a dynamic formulation, such as in the case of the Wasserstein-Fisher-Rao metric,and a major question is if there is a corresponding static formulation. Schmitzer and Wirth’s work deals with aninterpolation involving the OT metric derived from the distance cost, and not from the distance squared as in thecase of Wasserstein-Fisher-Rao.

Codina Cotar presented recent developments in the connections between multimarginal optimal transport anddensity functional theory. Density functional theory or DFT, is a simplified version of quantum mechanics inwhich an N -particle system (represented by a probability distribution on R3N ) is described by the behavior ofits one-particle marginals on R3, and models electron interactions of molecules as described by the Schrodingerequation.

In recent joint work with Friesecke and Kluppelberg [7], Cotar has investigated an approximate DFT groundstate energy functional where the electron-electron interaction in the Hohenberg-Kohn functional is replaced withone coming from optimal transport. This transport is a multimarginal transport problem with a Coulomb costfunction, of the form c(x1, . . . , xN ) :=

∑i 6=j |xi − xj |

−1. In the above joint work, Cotar has shown that thisoptimal transport problem has a unique solution, the minimal energy in the DFT approximation is always a lowerbound for the true quantum mechanical ground state energy, and in the two particle case this optimal transportenergy gives the semiclassical limit of the Hohenberg-Kohn functional. Finally, it was announced that there is anongoing joint work with Petrache in which they are analyzing higher order corrections to this semiclassical limitin the general N -particle case.

Katy Craig presented recent results concerning existence, uniqueness, and stability of solutions to PDE in-volving both degenerate diffusion and aggregation. Specifically, the focus was on equations of the form

∂tρ = div((∇K ∗ ρ)ρ) + ∆(ρm)

which arise in many contexts such as biological chemotaxis, swarming, and modeling granular media. Suchequations are known to correspond, at least formally, to gradient flows of certain energies under the Wassersteinmetric. Questions such as existence and uniqueness are known for cases where the kernel K is λ-convex for someλ ∈ R, but there are many interesting examples (such as those raised above) that do not fall in this framework.In [8], Craig has shown existence, uniqueness, and quantitative stability estimates (double-exponential bounds intime) for gradient flows of an energy functional that is ω-convex, for some modulus of convexity ω satisfyingthe Osgood criterion. In further joint work with I. Kim and Y. Yao [9], Craig uses this framework to analyze theKeller-Segel equation with a hard height constraint (which can be viewed as a limiting case of the above diffusion-aggregation equation as m→∞).

Micah Warren discussed some new ideas related to a mean curvature flow of the pseudo-Riemannian structuregiven by optimal transport. In 2010, Kim and McCann [21] introduced a pseudo-Riemannian metric defined usingthe cost function in an optimal transport problem, which in particular captures the MTW condition as positivityof certain sectional curvatures under this metric. In an interesting development, Kim, McCann, and Warren [22]showed that minimizers in the optimal transport problem give rise to codimension n, volume maximizing sub-manifolds under the pseudo-Riemannian metric of Kim and McCann (after a conformal change). Warren suggeststhen that a natural line of investigation is to consider the gradient flow of the volume functional, which leads tothe pseudo-Riemannian mean curvature flow. It is noted that this could be a very promising direction, as generallymean curvature flows tend to be better behaved in the pseudo-Riemannian framework compared to the positivedefinite case, as in for example, a result of Li and Salavessa regarding the second fundamental form of space-likemanifolds under mean curvature flow in pseudo-Riemannian spaces.

Page 72: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

68 Five-day Workshop Reports

Geometric opticsCristian Gutierrez presented recent results on free-form lens design. Taking the point source refractor problem

as an example, traditional methods encase the light source in some medium, then cut a ball out centered at the lightsource to create a lens. However, in practice such a model results in very thick, large lenses, which are heavy anddifficult to manufacture. Thus a new goal is to construct a lens sandwiched between two optically active surfacesto focus a light source into some desired energy distribution, which can potentially be made thinner. Gutierrezdiscussed recent joint work with Sabra [16, 17] in which they show that given one optically active surface, they cansolve a system of PDE in order to find a corresponding second surface which will combine with the first to createsuch a desired lens, in the far-field regime. They give conditions under which the representation of this secondsurface is physically realizable, along with estimates that can be given for the thickness of the resulting lens. It isnoted than an important question that remains is to find an appropriate first surface so that the second surface maybe relatively simple to manufacture.

Ahmad Sabra presented results on an alternative framework to construct optical surfaces in the near-fieldregime when the usual convex model may face physical obstructions. Using the near-field parallel, point sourcereflector problem as an example, Sabra discussed how the usual model of constructing a reflecting surface asintersections of ellipsoids may encounter obstructions such as self-blocking of reflected rays by the surface itself.In recent work with Gutierrez, he proposes to instead take unions of ellipsoids which will result in reflectors that areno longer convex. Sabra is able to prove existence of such reflectors when the target energy distribution is discrete,and then pass to a limit in order to recreate absolutely continuous densities. It should be noted that regularity forthis model is still unexplored and falls outside of the usual elliptic framework, in fact it would give rise to a GJEof mixed hyperbolic - elliptic type.

GeometryAram Karakhanyan talked about generalizations of Blashke’s rolling ball theorem. The original theorem

deals with the following geometric situation (which explains the name of the theorem), we are given M and M ′,two smooth and strongly convex surfaces whose second fundamental forms IIM and IIM ′ are such that IIM (x) ≥IIM ′(x′) whenever x ∈ M and x ∈ M ′ are such that the outer unit normal vectors satisfy nM (x) = nM ′(x

′).Blashke’s theorem says that if M and M ′ are internally tangent at one point then M is contained in the convexregion bounded byM ′. In the OT community, this statement is reminiscent of the local-to-global property for costssatisfying the MTW condition.

Karakhanyan presented a generalization of this result where instead, one considers M being strongly c-convexand M ′ a c-hyperplane with respect to a cost function c satisfying certain natural structural assumptions. He alsodemonstrated one important application of this result, to parallel reflector and refractor problems.

Young-Heon Kim presented a new attempt at defining a canonical notion of the barycenter of a measure on ametric measure space, via a process of Wasserstein regularization. In joint work with Pass [23], if µ ∈ P(M) forsome metric measure space (M,d,m), and ε > 0, then they consider the problem of minimizing the functional

ν 7→∫X

∫X

d2(x, y)dµ(x)dν(y) + εW 22 (ν,m)

over ν ∈ P(M), where here W2 is the 2-Wasserstein distance between probability measures on M . For eachε > 0, there exists a unique minimizer µε, and moreover the µε converge weakly to some limiting measureB(µ) as ε 0, which is supported in the set b(µ), the set of barycentric points of µ that minimize the quantity∫Md2(x, ·)µ(dx). Moreover, B(µ) can uniquely be characterized as the minimizer of W2(·, µ) over probability

measures supported on b(µ). Additionally, Kim and Pass have analyzed some of the dynamics that the mappingµ 7→ B(µ) induces over the space P(M) of probability measures, including fixed points and periodic orbits.

Yi Wang presented recent developments on a nonlinear Sobolev trace inequality involving the k-Hessian en-ergy in place of the usual Dirichlet energy. It is a classical result that if u is a function on Ω with u = f on ∂Ω anduf is the harmonic extension of f to Ω, then∫

−u∆u dx+1

|∂Ω|

∫∂Ω

u∂νu σ(dx) ≥ 1

|∂Ω|uf∂ν(uf ) σ(dx)

Page 73: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Generated Jacobian equations: from Geometric Optics to Economics 69

where ν is the outward unit normal and σ the surface measure on ∂Ω. One question that arises is if the first termabove (which corresponds to the 1-Hessian energy) can be replaced by the k-Hessian energy for k > 1, and in thiscase what the appropriate choice of boundary operator to replace ∂ν may be. In joint work with Case [4], Wanghas shown that such a boundary operator exists, along with a corresponding “polarized” version of the inequality,yielding a sharp Sobolev trace inequality for k-admissible functions.

Robin Neumayer discussed a one parameter family of variational problems on the upper half space in Rn,where certain values of the parameter correspond to optimizers of the classical Lp Sobolev inequality with zeroboundary condition, and the Lp Sobolev trace inequality. In joint work with Maggi [29], in the case of n ≥ 2 andp > 1, Neumayer succeeded in characterizing minimizers for all positive values of the parameter, along with adetailed analysis of the properties of the optimal value as a function of the parameter. Additionally when p = 2and n ≥ 3, the variational problem can be interpreted as minimizing certain curvature quantities over a class ofconformal changes of the Euclidean metric, and the aforementioned characterizations of minimizers prove that infor various ranges of the parameter one recovers exactly the spherical, flat, and hyperbolic geometries.

Deane Yang gave an overview of the current state of the Lp Brunn-Minkowski theory. The Lp Minkowskiproblem is, given a Borel measure µ on Sn−1, to find a convex body K such that the Lp surface area measure ofthe body is µ. Here, the Lp-surface area of a convex body K with outer normal νK is a Borel measured over Sn−1

given by

Sp(K,ω) :=

∫x∈νK−1(ω)

(x · νK(x))1−p dHn−1(x), ∀ ω ∈ B(Sn−1).

The case p = 1 corresponds to the classical Minkowski problem of prescribed Gauss curvature, and for p ≥ 1, theproblem can be attacked with a variational approach. Yang presented recent results on a particularly interestingcase which is when p→ 0, which is the logarithmic Minkowski problem of prescribing the cone volume measureof a body. In joint work with Boroczky, Lutwak, and Zhang [2], Yang has shown that under something calledthe subspace concentration inequality, the log Minkowski problem has a solution. Finally, Yang closed with aquestion, asking if the case corresponding to p = −n can be recast as an optimal transport or generated Jacobianequation, which could provide a new tool for this difficult and important case.

Reception of the workshop and participant’s feedback

Given the reception of the workshop, it is safe to say that the goal of bringing several different communitiesof researchers together was achieved successfully. In particular, several new collaborations were initiated at theconference. Katy Craig reports that motivated by the discussions during the conference, she has started a projectwith Micah Warren where they will be exploring connections between the Jordan-Kinderlehrer-Otto scheme fromgradient flows and game theory. Galichon and Carlier indicated that as a result of the workshop they have startedon a new project involving a hedonic model with tight budget constraints. Abedin expressed that as a result of theworkshop he is considering embarking on a project involving regularity for GJE equations under a strong MTWtype condition.

There was also a very successful open problem session which was originally planned to last for one hour, butended up going for twice that. Problems were proposed by Alfred Galichon, Jun Kitagawa, Aram Karakhanyan,Ahmad Sabra, and Alessio Figalli.

The participants’ feedback was extremely positive and enthusiastic, with many highlighting the interdisci-plinary nature of the workshop, here we present quotes from four different participants:

I learned about the fascinating parallel between problems in reflector design in geometric optics and problemson taxation in matching markets.

The connection between generated Jacobian equations and economics seems very intriguing and hopefully willspark some ideas for future research.

Page 74: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

70 Five-day Workshop Reports

My work is focused on economics, and before this conference, I was unaware of both the vast theoreticalliterature in economics as well as applications to optics and other areas. The conference was an invaluable point ofentry into these areas, accomplishing in a few days what would have taken me months or years to do on my own.I will make good use of what I’ve learned in my continuing research.

The introductory lecture was very useful, helping me put many of the subsequent talks into context. The openproblem session was also great.

ParticipantsAbedin, Farhan (Temple University)Acciaio, Beatrice (London School of Economics)Bowles, Malcolm (University of British Columbia)Carlier, Guillaume (Universite Paris Dauphine)Chen, Eric (Princeton University)Cotar, Codina (University College London)Craig, Katy (University of California Santa Barbara)Figalli, Alessio (ETH Zurich)Galichon, Alfred (New York University)Guillen, Nestor (University of Massachusetts Amherst)Gutierrez, Cristian (Temple University)Hamfeldt, Brittany (New Jersey Institute of Technology)Jhaveri, Yash (ETH Zurich)Karakhanyan, Aram (University of Edinburgh)Kim, Young-Heon (University of British Columbia)Kitagawa, Jun (Michigan State University)Mawi, Henok (Howard University)McCann, Robert (University of Toronto)Merigot, Quentin (Universite Paris-Dauphine / CNRS)Neumayer, Robin (University of Texas)Pass, Brendan (University of Alberta)Sabra, Ahmad (University of Warsaw)Saldanha Salvador, Tiago Miguel (Mcgill University)Samuelson, Larry (Yale University)Schmitzer, Bernhard (TU Munich)Thibert, Boris (LJK Universite Grenoble)Wang, Yi (Johns Hopkins University)Warren, Micah (University of Oregon)Yang, Deane (NYU Tandon School of Engineering)Zhang, Shuangjian (University of Toronto)

Page 75: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] B. ACCIAIO, J. BACKHOFF VERAGUAS, AND A. ZALASHKO, Causal optimal transport and its links to enlargement of filtrations and continuous-time stochastic optimization,ArXiv e-prints, (2016).

[2] K. J. BOROCZKY, E. LUTWAK, D. YANG, AND G. ZHANG, The logarithmic Minkowski problem, J. Amer.Math. Soc., 26 (2013), pp. 831–852.

[3] L. A. CAFFARELLI, S. A. KOCHENGIN, AND V. I. OLIKER, On the numerical solution of the problem ofreflector design with given far-field scattering data, in Monge Ampere equation: applications to geometry andoptimization (Deerfield Beach, FL, 1997), vol. 226 of Contemp. Math., Amer. Math. Soc., Providence, RI,1999, pp. 13–32.

[4] J. S. CASE AND Y. WANG, A fully nonlinear Sobolev trace inequality, arXiv:1606.00071, (2016).

[5] P.-A. CHIAPPORI, R. J. MCCANN, AND L. P. NESHEIM, Hedonic price equilibria, stable matching, andoptimal transport: equivalence, topology, and uniqueness, Econom. Theory, 42 (2010), pp. 317–354.

[6] P.-A. CHIAPPORI, R. J. MCCANN, AND B. PASS, Multi- to one-dimensional transportation,arXiv:1510.00717 (To appear in Comm. Pure Appl. Math.), (2015).

[7] C. COTAR, G. FRIESECKE, AND C. KLUPPELBERG, Density functional theory and optimal transportationwith Coulomb cost, Comm. Pure Appl. Math., 66 (2013), pp. 548–599.

[8] K. CRAIG, Nonconvex gradient flow in the Wasserstein metric and applications to constrained nonlocalinteractions, Proceedings of the London Mathematical Society, 114 (2017), pp. 60–102.

[9] K. CRAIG, I. KIM, AND Y. YAO, Congested aggregation via Newtonian interaction, arXiv:1603.03790 (Toappear in Arch. Ration. Mech. Anal.), (2016).

[10] G. DE PHILIPPIS AND A. FIGALLI, Partial regularity for optimal transport maps, Publ. Math. Inst. HautesEtudes Sci., 121 (2015), pp. 81–112.

[11] A. FIGALLI AND Y.-H. KIM, Partial regularity of Brenier solutions of the Monge-Ampere equation, DiscreteContin. Dyn. Syst., 28 (2010), pp. 559–565.

[12] A. FIGALLI, Y.-H. KIM, AND R. J. MCCANN, When is multidimensional screening a convex program?, J.Econom. Theory, 146 (2011), pp. 454–478.

[13] B. D. FROESE AND A. OBERMAN, Convergent finite difference solvers for viscosity solutions of the ellipticMonge–Ampere equation in dimensions two and higher, SIAM Journal on Numerical Analysis, 2011.

[14] B. D. FROESE, Meshfree finite difference approximations for functions of the eigenvalues of the hessian,Numerische Mathematik, (2017).

[15] N. GUILLEN AND J. KITAGAWA, Pointwise estimates and regularity in geometric optics and other generatedjacobian equations, Comm. Pure Appl. Math., 70 (2017), pp. 1146–1220.

71

Page 76: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

72 Five-day Workshop Reports

[16] C. E. GUTIERREZ AND A. SABRA, Aspherical lens design and imaging, SIAM J. Imaging Sci., 9 (2016),pp. 386–411.

[17] C. E. GUTIERREZ AND A. SABRA, Freeform Lens Design for Scattering Data With General Radiant Fields,preprint.

[18] C. E. GUTIERREZ AND F. TOURNIER, Local near field refractors and reflectors, Nonlinear Analysis: Theory,Methods & Applications, 108, 302–311, 2014.

[19] C. E. GUTIERREZ AND F. TOURNIER, Regularity for the near field parallel refractor and reflector problems,Calculus of Variations and Partial Differential Equations, 54, Number 1, 917–949, 2015.

[20] A. KARAKHANYAN AND X.-J. WANG, On the reflector shape design, J. Differential Geom., 84 (2010),pp. 561–610.

[21] Y.-H. KIM AND R. J. MCCANN, Continuity, curvature, and the general covariance of optimal transportation,J. Eur. Math. Soc. (JEMS), 12 (2010), pp. 1009–1040.

[22] Y.-H. KIM, R. J. MCCANN, AND M. WARREN, Pseudo-Riemannian geometry calibrates optimaltransportation, Math. Res. Lett., 17 (2010), pp. 1183–1197.

[23] Y.-H. KIM AND B. PASS, A canonical barycenter via Wasserstein regularization, arXiv:1703.09754, (2017).

[24] J. KITAGAWA, An iterative scheme for solving the optimal transportation problem, Calculus of Variationsand Partial Differential Equations, 51 (2014), pp. 243–263.

[25] J. KITAGAWA, Q. MERIGOT, AND B. THIBERT, A Newton algorithm for semi-discrete optimal transport,arXiv:1603.05579 (To appear in J. Eur. Math. Soc. (JEMS)), (2017).

[26] J. LIU, Light reflection is nonlinear optimization, Calculus of Variations and Partial Differential Equations,1–18, 2013.

[27] G. LOEPER, On the regularity of solutions of optimal transportation problems, Acta Math., 202 (2009),pp. 241–283.

[28] X.-N. MA, N. S. TRUDINGER, AND X.-J. WANG, Regularity of potential functions of the optimaltransportation problem, Arch. Ration. Mech. Anal., 177 (2005), pp. 151–183.

[29] F. MAGGI AND R. NEUMAYER, A bridge between Sobolev and Escobar inequalities and beyond,arXiv:1609.02346, (2016).

[30] R. J. MCCANN AND K. SHUANGJIAN ZHANG, On concavity of the monopolist’s problem facing consumers with nonlinear price preferences,ArXiv e-prints, (2017).

[31] Q. MERIGOT, J. MEYRON, AND B. THIBERT, An algorithm for optimal transport between a simplex soupand a point cloud, preprint.

[32] G. NOLDEKE, AND L. SAMUELSON. The implementation duality, preprint.

[33] B. SCHMITZER AND B. WIRTH, A Framework for Wasserstein-1-Type Metrics, ArXiv e-prints, (2017).

[34] N. S. TRUDINGER, On the local theory of prescribed Jacobian equations, Discrete Contin. Dyn. Syst., 34(2014), pp. 1663–1681.

[35] X.-J. WANG, On the design of a reflector antenna. II, Calc. Var. Partial Differential Equations, 20 (2004),pp. 329–341.

Page 77: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 7

Quantum Field Framework for StructuredLight Interactions (17w5079)

April 23 - 28, 2017Organizer(s): David L. Andrews (University of East Anglia, UK), Robert Boyd (Universityof Ottawa, Canada), Konstantin Y. Bliokh (RIKEN, Japan), Mark R. Dennis (University ofBristol, UK), Alexander Lvovsky (University of Calgary, Canada), Duncan O’Dell (McMasterUniversity, Canada)

Overview

It is widely recognised that structured light, whose theoretical foundations were first laid twenty-five years ago,serves as a near-perfect test case for studying the quantum-classical boundary in physics. The pace of advancesin both theory and experiment, since the original concept was developed, has led to the research communityidentifying a wide range of issues demanding urgent attention, to resolve conflicts between representations, withthe aim of securing a consistent, agreed framework to describe the photonic interactions of such light. Specificproblems had already been identified in connection with canonical and non-canonical forms of operator, coherenceand quantum uncertainty, and the quantized forms of both linear and angular optical momentum.

Such is the backdrop to the initial 2015 proposal for this workshop which, focused on the theory and mathe-matics of the subject, from the outset received enthusiastic international support from all quarters. Although un-connected, the importance of this topical area was substantially underscored by the publication, only three monthsbefore the workshop took place, of a major ‘roadmap’ review of the significance and practical potential for thewhole field of structured light. 1 Six of the participants at this workshop featured in the long list of authors.

It was rewarding to find that only a handful of the experts who had been informally approached were ultimatelyunable to accept formal invitation; shortly before the workshop took place, almost every one of the places availablewas filled. Most participants were theorists, but a handful of especially knowledgeable experimentalists were alsoincluded, to sharpen the focus on issues of key practical significance. A few individuals had to pull out just beforethe conference began, all of them for family or health-related reasons. In the event, thirty-five individuals wereable to participate, representing ten nations: Australia, Canada, Germany, Japan, Kazakhstan, Netherlands, Poland,UK, USA and Singapore.

1H. Rubinsztein-Dunlop, A. Forbes, M. V. Berry, M. R. Dennis, D. L. Andrews, M. Mansuripur, C. Denz, C. Alpmann, P. Banzer, T. Bauer,E. Karimi, L. Marrucci, M. Padgett, M. Ritsch-Marte, N. M. Litchinitser, N. P. Bigelow, C. Rosales-Guzman, A. Belmonte, J. P. Torres, W. N.Tyler, M. Baker, R. Gordon, A. B. Stilgoe, J. Romero, A. G. White, R. Fickler, A. E. Willner, G. Xie, B. McMorran, and A. M. Weiner, J. Opt.19, 013001 (2017).

73

Page 78: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

74 Five-day Workshop Reports

The workshop was structured to provide an opportunity, without obligation, for each member to deliver a shortlecture (some participants needed this explicit opportunity in order to secure independent funding to travel to themeeting). In the event, twenty-four talks were given, as listed in Section 2 below, and several other papers werealso circulated during the meeting. The aim was to allow some preliminary points of discussion to be recognized,prior to full discussion sessions later in the week.

By mid-week the workshop had identified eight different topics, as shown in Section 3, which a show of handsindicated were considered the most worth revisiting in more detailed discussions. Together, these addressed asizeable subset of the issues that had been identified as possible topics, at the planning stage. Not surprisingly,many participants were interested in all of them. Nonetheless, for expeditious discussion in groups of a practicablesize, three groups were formed to pursue, in parallel sessions, loosely cognate sets of topic. Each group wasassigned two members to report back to the reassembled full meeting the essential content and upshot of theirdiscussions, as detailed in Section 4. A summing up session for all attendees was held on the last day of theworkshop.

Immediately following the meeting, the Contact Organizer solicited brief feedback from all participants. Alongwith numerous messages of congratulations and thanks for the meeting, some very significant and promisingnew lines of investigation were identified, many of them involving a prospective collaboration between workshopmembers – and in many cases interactions between individuals who had not worked or even met before. Theyrepresent a substantial legacy, one that can be expected to lead to tangible fruit in the form of new work andpublications in due course.

Lecture schedule1. Iwo Bialynicki-Birula. Do electromagnetic waves with fixed orbital angular momentum exist? Examiningorbital angular momentum and spin angular momentum shows that a relativistic treatment introduces a couplingbetween them. Neither of them satisfies a conservation relation; only the total angular momentum is a conservedquantity.

2. Elliot Leader: The elementary particle angular momentum controversy: lessons from laser optics. Informingparticipants in the field of laser optics about the recent angular momentum controversy in the particle physicscommunity.

3. Ivan Fernandez-Corbaton: An algebraic approach to light-matter interactions. Expounding an S-matrixbased methodology for the study and engineering of light-matter interactions, exhibits the ease with which sym-metries and conservation laws enter the formalism, with an algebraic character that is suitable for computer imple-mentation.

4. Garth Jones: Spherical descriptions of photon fields compared with plane wave descriptions. Typicallywhen describing photonic events, the photon fields are cast as plane wave vector fields. In an alternative approachthe fields are described in terms of spherical waves, expressible in the mathematical language of vector sphericalharmonics. This approach is complementary to that of plane waves, and is more amenable to modelling specifictypes of phenomena, such as isotropic processes and condensed phase energy transfer.

5. Wolfgang Loffler: Perfect darkness in exact solutions of Maxwell’s equations. Possibilities exist to identifynontrivial lines of dark vortices in exact solutions, leading to an examination of the importance of relativisticallyinvariant ‘null’ solutions. One issue with such solutions is their very broad plane-wave frequency spectrum; thereare a number of strategies to narrow this down, that could lead to experimental demonstration.

6. Alexander Lvovsky: Beating the Rayleigh limit.7. Miguel Alonso: Structured Gaussian beams: ray and wave pictures. The analogy between structured Gaus-

sian beams and other systems can be described by two-dimensional harmonic oscillators. A ray-based (semiclas-sical) treatment based on a Poincare sphere reveals the underlying geometry including beam shape and geometric(Berry) and Gouy phases, and new families of structured Gaussian beams emerge.

8. Masud Mansuripur: Electromagnetic energy, force, torque, linear momentum, and angular momentum.Maxwell’s equations, manipulated in different ways, allow both familiar and unfamiliar treatments of electric andmagnetic dipole interactions with the electromagnetic field. Different definitions of the Poynting vector and thestress tensor lead to different expressions for energy, force, torque, linear momentum, and AM. Each approach

Page 79: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Quantum Field Framework for Structured Light Interactions 75

to the classical theory is self-consistent, as well as consistent with conservation laws; each has advantages anddisadvantages. The traditional definition of the Poynting vector in conjunction with the Einstein–Laub stresstensor provides the most reliable and physically sensible approach to the interactions of light and matter.

9. Duncan O’Dell: The Abraham-Minkowski controversy: an ultracold atom perspective. Whilst the momen-tum of light in vacuum is well understood, the complexity of real media means that when light propagates insidea dielectric it is hard to separate the momentum of the electromagnetic field from that of the constituent atoms.A dilute gas of ultracold atoms constitutes a particularly simple dielectric where experiments and theory can beunderstood from first principles. In particular, the well-known experiment by Campbell et al. [Phys. Rev. Lett.94, 170403 (2005)] is usually considered to verify the Minkowski result. But this is wrong: both Abraham andMinkowski give the same result for their particular set-up. Also, a geometric phase is associated with the Abraham-Minkowski problem – an optical analogue of the He-McKellar-Wilkens phase of an electric dipole moving in amagnetic field. This phase can be measured using an atom interferometer.

10. Halina Rubinsztein-Dunlop: Quantum aspects of structured light interactions, and ultracold atomic en-sembles.

11. Mohamed Babiker: The current status and future of structured matter vortex waves. A critical review ofcurrent work focuses on the practical limitations, i.e. the feasibility of experimental work in various types of matterwave, questioning what new physics is expected in each category, and what new applications may be envisaged.

12. Konstantin Bliokh: Edge modes, degeneracies, and topological numbers in non-Hermitian systems. Fun-damental problems arise at the interface between two hot areas of modern physics: (a) non-Hermitian quantummechanics and (b) topological states in wave systems. Non-Hermitian systems can have nontrivial topologicalproperties and new type of chiral edge modes, determined by so-called ‘exceptional points’ (i.e. degeneracies inthe bulk spectrum) and two types of topological numbers.

13. Kobus Kuipers: Using local field topology for controlling quantum transitions. Two topics: firstly, howto measure various vectorial components of both the electric and magnetic fields of light guided in nanophotonicstructures, where phase- and polarization singularities are ubiquitous near the nanostructures. Secondly, it emergesthat C-points and the field topology around them could be used to deterministically direct the emission of quantumtransitions involving a change in orbital angular momentum.

14. Robert Boyd: The promise of orbital angular momentum for quantum communications. Hanbury Brownand Twiss interferometry with twisted light.

15. Robert Fickler: Experimental limitations for large orbital angular momentum quantum numbers. Severalapproaches generate photons with large OAM quantum numbers. One technique involving spin-to-orbit couplinguses tailored q-plates to generate photons with up to 200 quanta of the reduced Planck’s constant, ~. Off-the-shelfspatial light modulators are limited to generating photons with up to 300 quanta. A third, recently establishedspiral phase mirror technique makes outputs with quantum numbers of 10000 possible. The mirrors are cut surfaceinto such that the depth modulation corresponds to the azimuthal phase modulation, on optical reflection.

16. Ebrahim Karimi: High-dimensional intra-city quantum cryptography, and quantum hacking with struc-tured photons.

17. David Andrews: Representations of structured light and their quanta. Questions arise over the extent towhich common figurative representations of structured light and photons are defensible, and the possibility of theirconveying misinformation. A look at quantum uncertainty in the light of a reformulated expression for opticalmode orthogonality suggests implications for the number-phase quantum uncertainty in vortex light.

18. Enrique Galvez: High-dimensional spaces of polarization and spatial mode of photons. There are modesof light that give rise to spatially-variable polarization; this is a topic that touches of the fundamentals of light, andespecially on the information that can be encoded into it.

19. Peter Banzer: Transverse angular momentum of light.20. Martin Lavery: Atmospheric turbulence and the propagation of structured modes.21. Alexander Khanikaev: Photonic topological insulators: from theory to experimental realization.22. Daniel Leykam: Topological phases and topological photonics. Connections are established between

topological photonic systems and structured light. Firstly it can be shown how Bloch band degeneracies such asDirac points display analogies with optical vortices. Secondly, examination of the microscopic structure of opticalfields in topological photonic systems, such as the presence of polarization singularities or optical vortex rings,may be used to diagnose topological phases

Page 80: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

76 Five-day Workshop Reports

23. Akbar Salam: Quantisation of the electromagnetic field in magnetodielectric media. Considering theformulation of electrodynamics in a magnetoelectric medium, from both the classical and quantum mechanicalpoints of view, raises questions regarding the potential utility of employing a macroscopic theory of QED to studystructured light-matter interactions.

24. Nader Engheta: Zero-index materials: some wave physics and quantum optical features.

Headline topicsThe following listing of topics were decided upon as especially worth focusing upon in detailed discussion:

(i) the separability of quantum spin and orbital angular momentum, and the role laser optics might play indetermining their limits of measurability;

(ii) degenerate down-conversion, and the implications of quantum uncertainty on the spatial origin of correlatedphoton pairs;

(iii) defensible forms of the quantum number - phase uncertainty relation, as regards structured modes of light;(iv) the relativistic origin of spin-orbit coupling, and its implications for electron vortices;(v) dispersive effects in the Abraham and Minkowski formulations of linear and angular momentum, for pho-

tons travelling through a dispersive medium;(vi) potential limitations of a fundamental quantum nature on the information content of an individual photon;(vii) the role of local field symmetry and topology in structured light and other chiroptical interactions with

matter;(viii) cold atoms, elementary particles and their connections with quantised orbital angular momentum.

Session discussionsThree discussion groups were subsequently convened to address these issues, with each participant free to attendand participate in whichever proved to interest them most. Two reporters were assigned from within each group:

(A). Spin and orbital angular momentum separability [Konstantin Bliokh (Japan) and Ivan Fernandez-Corbaton(Spain)] (Room 202);

(B). Structured matter waves and quantised angular momentum [Halina Rubinsztein-Dunlop (Australia) andDuncan O’Dell (Canada)] (Room 107);

(C). Quantum uncertainty in structured light and down-conversion [Martin Lavery (United Kingdom) and JorgGotte (China)] (Lecture Theatre).

The following summarizes the outcome of these discussions, as presented to the reconvened full workshop.In Group A, where there were extensive discussions about the separability of orbital and spin angular momen-

tum, it became clear that there are two distinct frameworks, whose individual usefulness depended to some extenton the realm of application.

In Framework 1, helicity is an independent physical quantity related to dual symmetry. The integral valuesof spin and orbital angular momentum (AM) can be separated in the most generic EM fields in free space. How-ever, local spin and orbital AM densities can be separated only in monochromatic optical fields. To obtain thesequantities one can use either (i) canonical operators – which do not satisfy the transversality constraint, but obeycanonical SO(3) commutation relations – or (ii) modified operators compatible with transversality, but obeyingmodified commutation relations. Modified operators with unusual commutation relations are natural in the pres-ence of gauge fields (e.g. the momentum operator of a charged particle in a magnetic field). In the case of light,this is the Berry-connection field in momentum space. Importantly, the densities of the spin AM and momentumin a monochromatic field are directly measurable via radiation torque and force on a point dipole particle. Theyalso satisfy separate local conservation laws (continuity equations).

In Framework 2, only transverse operators are allowed. The operators that are used are connected with symme-try transformations, and this is exploited in applications to experiments. The typical explanations of experimentalresults that are made using spin and orbital AM have a symmetry explanation in this framework. Sometimes, thereare two different symmetry explanations for what would correspond to spin to orbital AM transfer in Framework

Page 81: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Quantum Field Framework for Structured Light Interactions 77

1. Total angular momentum is the only operator considered to be connected with physical rotations. Helicity playsan important role in this framework.

In Group B there were three sessions focused on structured matter waves: 1) Cold atoms and optical angularmomentum; 2) Electron, neutron, and other matter waves; 3) The Abraham-Minkowski controversy.

1) The term ‘cold atoms’ refers to atoms cooled to less than one millionth of one degree above absolute zero,sometimes even 1000 times colder. At these temperatures the de Broglie wavelength becomes long enough toextend between the atoms in a trapped gas so that the gas becomes quantum degenerate. If the atoms are bosonsthey will Bose condense into a single quantum state forming a macroscopic quantum matter wave with a highdegree of coherence. Bose-Einstein condensates (BECs) with orbital angular momentum (OAM) were first madeby stirring a condensate with a laser beam, forming quantized vortices in the superfluid. The meeting discussed amore sophisticated method developed in a series of experiments by Bill Phillip’s group at NIST [M. F. Andersenet al, “Quantized Rotation of Atoms from Photons with Orbital Angular Momentum”, Phys. Rev. Lett. 97,170406 (2006); C. Ryu, “Observation of Persistent Flow of a Bose-Einstein Condensate in a Toroidal Trap”, Phys.Rev. Lett. 99, 260401 (2007)] where a Laguerre–Gaussian laser beam was interfered with a counter-propagatingGaussian beam to form a corkscrew-like interference pattern. The atoms diffract off this intensity pattern (viaa stimulated Raman transition) and in so doing worth of angular momentum is transferred from the light to theexternal motion of each atom. This method can be used to introduce a singly-quantized vortex into a BEC, andalso to rotate a BEC in a ring trap. Related experiments have also been performed in Bigelow’s group [K. C.Wright, L. S. Leslie, and N. P. Bigelow, “Optical control of the internal and external angular momentum of aBose-Einstein condensate”, Phys. Rev. A 77, 041601(R) (2008)].

An interesting basic question that has been tackled by some of the participants is whether optical OAM in-fluences transitions between internal states in atoms. The selection rules for a dipole transition are ∆L = ±1,∆m = 0,±1. In traditional experiments where the laser carries no OAM the change in angular momentum ofthe atom is supplied by the ‘spin angular momentum’ (SAM) of the light: circularly polarized light carries ±~of SAM depending on its handedness. The question is then whether OAM can play a similar role in transitionsas SAM. A theory paper in 2002 [M. Babiker, C. R. Bennett, D. L. Andrews, and L. C. Davila Romero, “OrbitalAngular Momentum Exchange in the Interaction of Twisted Light with Molecules”, Phys. Rev. Lett. 89, 143601]suggested that dipole transitions would be unaffected by OAM and the lowest multipole transition that could beaffected by OAM is the electric quadrupole transition. A very recent experiment has confirmed the first part of thisprediction [F. Giammanco et al., “Influence of the photon orbital angular momentum on electric dipole transitions:negative experimental evidence”, Opt. Lett. 42, 219 (2017)].

Open questions in this area include whether there are efficient methods to generate OAM in beams of atoms (asopposed to the case where the atoms are held in a trap) and whether having cold atoms with OAM would facilitateprecision measurements. In contrast to light, atoms are massive and thus sensitive to gravity, and typically theyhave magnetic moments and are also electrically polarizable, and thus sensitive to material properties. Moreover,in contrast to electrons they are neutral and 1800 times more massive.

2) Electron beams are a very active area of investigation, not least because of their technological applicationssuch as in electron microscopes. Like light and atoms, beams of these particles can be structured, including Airyand vortex beams [J. Harris et al., “Structured quantum waves”, Nature Phys. 11, 629 (2015); H. Larocque and E.Karim, “A New Twist on Relativistic Electron Vortices”, Physics Viewpoint Physics 10, 2 (2017)]. A significantmotivation for studying such structured beams is the possibility of developing probes for magnetic structure at thenanoscale, benefitting from the fact that a vortex is a sub-wavelength structure.

Airy beams are wave-packets with the remarkable property that they appear as though they are accelerating,even though they are propagating in free space [N. Voloch et al., “Generation of Electron Airy Beams”, Nature 494,331 (2013)]. These beams also ‘self-heal’ if an obstacle is placed in their path. Vortex beams of electrons were firstcreated in the lab in 2010 [J. Verbeeck, H. Tian, and P. Schattschneider, “Production and application of electronvortex beams”, Nature 467, 301 (2010)]. The theoretical analysis of electron beams is involved and dependsupon whether one is in the non-relativistic regime (described by the Schrodinger-Pauli equation) or the relativisticregime (described by the Dirac equation). In non-relativistic electron beams OAM and SAM are independentlyconserved [K. Y. Bliokh, Y. P. Bliokh, S. Savel’ev, and F. Nori, “Semiclassical Dynamics of Electron Wave PacketStates with Phase Vortices,” Phys. Rev. Lett. 99, 190404 (2007)]. However, in the relativistic regime these twoangular momenta become coupled [K.Y. Bliokh, M. R. Dennis, and F. Nori, “Relativistic electron vortex beams:

Page 82: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

78 Five-day Workshop Reports

Angular momentum and spin-orbit interaction” Phys. Rev. Lett. 107, 174802 (2011)]. There has recently beensome controversy as to whether a true vortex can exist in an electron beam. In particular, in a theoretical paperby one of the participants [I. Bialynicki-Birula and Z. Bialynicka-Birula, “Relativistic Electron Wave PacketsCarrying Angular Momentum,” Phys. Rev. Lett. 118, 114801 (2017)] it was shown that the SAM and OAMcontributions to the vorticity exactly cancel each other out. A non-zero vorticity is seen as a crucial feature of avortex. Nevertheless, the probability density does coil around the beam in a manner highly reminiscent of a vortex;practical applications are unlikely to be affected by this cancellation.

A fundamental issue with free electrons is that they are hard to polarize; as first realized by Pauli and alsoBohr. The standard Stern-Gerlach method of sorting particles according to their magnetic moment by using aninhomogeneous magnetic field fails for electrons, as the Lorentz force on such light particles leads to transverseforces which completely overwhelm the force on the magnetic moment. However, recent work by some of theparticipants [E. Karimi, L. Marrucci, V. Grillo, E. Santamato, “Spin-to-orbital angular momentum conversionand spin-polarization filtering in electron beams”, Phys. Rev. Lett. 108, 044801 (2012); E. Karimi, V, Grillo,R. W. Boyd, and E. Santamato, “Generation of a spin-polarized electron beam by multipole magnetic fields”,Ultramicroscopy 138, 22 (2014)] has shown that so-called ‘q-filters’, i.e. inhomogeneous magnetic fields with aquadrupole distribution, can give rise to a spin-orbit interaction that couples the OAM and the SAM of the electronbeam. If a beam with the correct OAM is input to the system then a spin-polarized beam is produced.

Questions for future investigation include the further development of efficient methods for producing spin-polarized electrons, and also extending the idea of matter-wave vortex beams to neutrons [C.W. Clarke et al,“Controlling neutron orbital angular momentum”, Nature 525, 504 (2015)]. Concerning this last point, it waspointed out during the meeting that the neutron interferometer used to detect the vortex beam was essentially awhite light (Michelson) style interferometer, due to the very low degree of transverse coherence in the neutronbeam, and that this meant that the results were not very clear cut. The important problem of vortex beams ofneutrons therefore remains ripe for further work. It was also noted that there are intriguing suggestions that self-accelerating beams can be used to prolong the lives of unstable particles [I. Kaminer et al., “Self-accelerating Diracparticles and prolonging the lifetime of relativistic fermions”, Nature Physics 11, 261 (2015)].

3) The Abraham-Minkowski controversy refers to two different expressions for the momentum density oflight inside a dielectric medium derived separately by Abraham (1909) and Minkowski (1908). Although theseare classical effects it is often convenient to express them in terms of the photon momentum: pA = p0/n andpM = p0n, where p0 = ~k is the free space momentum of a single photon and n is the refractive index of thedielectric. This topic was first discussed during and following talks 8 and 9 during the earlier part of the workshop.

The momentum density of light can be extracted from the stress-energy tensor, but it turns out that thereis a considerable degree of arbitrariness in writing down stress-energy tensors. Examples include those due toAbraham, Minkowski and also Einstein and Laub [M. Mansuripur, Electromagnetic Force and Momentum, inRoadmap on Structured Light, edited by H. Rubinsztein-Dunlop, Journal of Optics 19, 013001, 8 (2017)]. Thishas led some authors to prefer to focus on the actual forces involved by calculating the Lorentz force on eachcurrent (bound and unbound) in the material due to the electromagnetic field [J. P. Gordon, “Radiation forces andmomenta in dielectric media”, Phys. Rev. A 8, 14 (1973)].

Experiment is the ultimate arbiter in science, and over the years there have been a large number of experimen-tal investigations that have variously agreed with either Abraham or Minkowski or have come up with their ownexpressions. The essential difficulty lies in separating the total momentum (which all agree must be conserved)into that due to the optical field and that due to the material. In other words, the polarization excitation induced bythe optical field that propagates inside a medium is part atom, part electromagnetic field. Also, the result one findsdepends upon precisely what measurements are made. One school of thought [championed by Loudon and collab-orators: C. Baxter, M. Babiker, and R. Loudon, “Canonical approach to photon pressure”, Phys. Rev. A 47, 1278(1993)] is that the Abraham expression corresponds to kinetic momentum of the light and the Minkowski expres-sion to a canonical momentum. For example, if an experiment could measure the recoil velocity of a transparentblock of dielectric as a pulse of light enters it, then using the concept of the uniform motion of the centre-of-massof energy [a principle emphasized by Einstein: A. Einstein, “The principle of conservation of motion of the centerof gravity and the inertia of energy”, Ann. Phys. 20, 627 (1906)] leads to the expectation that the momentum ofthe light in the block takes the Abraham value [S. M. Barnett and R. Loudon, “The enigma of optical momentumin a medium”, Phil. Trans. R. Soc. A 368, 927 (2010)]. By contrast, in experiments involving diffraction it can be

Page 83: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Quantum Field Framework for Structured Light Interactions 79

argued that it is more likely that the canonical (Minkowski) momentum sill be found because the wavelength, andhence the wave vector, is associated with canonical momentum.

During the discussions, participant Masud Mansuripur gave a tutorial on the classic Balazs thought experimentwhere a pulse of light enters a dielectric block [N. L. Balazs, “The energy-momentum tensor of the electromagneticfield inside matter”, Phys. Rev. 91, 408 (1953)] and using only a knowledge of the Fresnel reflection coefficientsobtained the result that the momentum density is the average of the Abraham and Minkowski results, p = p0(n+n−1)/2 (in contrast to the Abraham momentum density result found using the principle of uniform motion of thecentre-of-mass energy). Mansuripur’s result is also obtained if one instead calculates the Lorentz force on thebound currents and charges in the material [M. Mansuripur, “Radiation pressure and the linear momentum of theelectromagnetic field”, Opt. Express 12, 5375 (2004)]. Mansuripur went on to discuss the case of the recoil ofa mirror immersed in a fluid, investigated in the classic experiment by Jones [R. V. Jones and J. C. S. Richards,“The pressure of radiation in a reflecting medium”, Proc. Roy. Soc. A 221, 480 (1954)] whose results support theMinkowski result, the idea being that the mirror’s recoil is a direct measure of the momentum of light in a medium.One issue that Mansuripur highlighted, yet to be tested experimentally, is the effect of varying the Fresnel reflectioncoefficients of the mirror. This controls the interference fringes that form in the medium between the incident lightand that reflected from the mirror. These intensity fringes lead to forces on the molecules in the fluid as momentumtransfers between light and medium.

A number of suggestions for future investigations were made: firstly, to repeat the Jones experiment as dis-cussed above, with varying mirror reflectivity. Secondly, it was suggested to repeat Ashkin’s surface bulge ex-periment in which the reaction of a fluid surface to an incident beam of light is measured (the surface can eitherbulge outward or be pressed inward) [A. Ashkin and J.M. Dziedzic, “Radiation pressure on a free liquid surfac”,Phys. Rev. Lett. 30, 139 (1973)] as the interpretation of this experiment is controversial. Thirdly, there has been asuggestion [E. A. Hinds and S. M. Barnett, “Momentum Exchange between Light and a Single Atom: Abraham orMinkowski?”, Phys. Rev. Lett. 102, 050403 (2009)] for measuring the recoil of a single atom due to a laser pulse– which could potentially be attempted with cold atomic gases, provided a clever way is found to circumvent theproblems associated with the random recoil due to spontaneous emission.

In Group C discussions centred on: 1) Degenerate down conversion; 2) Number-phase uncertainty; 3) Infor-mation content of photons, and its limits.

1) Discussion was launched by considering the consequence of a recent publication [K. A. Forbes, J. S. Fordand D. L. Andrews “Nonlocalized generation of correlated photon pairs in degenerate down-conversion”, Phys.Rev. Lett. 118, 133602 (2017)]. This suggests that each photon pair produced by spontaneous parametric downconversion (SPDC) can originate from spatially distinct locations. Interest centred on whether this would affectthe information content of the down-converted biphoton, and how any such change of information might manifestitself. It was agreed that considering only plane waves as excitation during SPDC is a limiting case, which led tothe question of whether a superposition of wave-vectors is necessary to explain and measure this effect.

2) Number-phase quantum uncertainty remains a vexed issue, and despite the special significance of phase inconnection with structured light it seems to have received little attention in the literature. It is well known that thereis no simple or completely defensible form for an optical phase operator, partly due to the cyclical nature of itsparameter space. In the course of the discussions on number-phase uncertainty, participant Iwo Bialynicki-Birulagave a brief tutorial on a negentropic representation, cast in terms of limits on azimuthal measurability. This leadsto questions about the phase of a single photon, which it was agreed should be rephrased with regard to some formof reference. Phase is only relevant relative to another photon, or material oscillation, begging the question ofwhether there is an operator which could realise this and how it could best be implemented.

3) This group then discussed that the extent of accessible information of a photon, recognising that it is subjectto boundary condition of the measurement process, so that in a sense every measurement of information relieson a restricted Hilbert space. This in turn should the affect the fundamental uncertainty relation and leads to thequestion whether there is a correction required to account for this. Michael Mazilu pointed to his work on opticaleigenmodes as offering a possible way forward [M. Mazilu et al., “Optical eigenmodes; exploiting the quadraticnature of the energy flux and of scattering interactions”, Opt. Express 19, 933 (2011)].

Other, informal group discussions focused on the C points occurring in photonic crystal waveguide modes, asobserved in experiments by Kopus – a relatively unexplored area of research at the intersection between structuredlight and topological phases of light and matter. This group identified pressing questions that should be addressed

Page 84: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

80 Five-day Workshop Reports

in the near future:1) What is the relation between the C points observed experimentally in real space modal profiles, and momen-

tum space C points occurring in models of photonic topological insulators?2) Present designs for photonic topological insulators are basis on silica-rod photonic crystals. For experi-

mental probes of near-field properties, it would be better to employ a design utilising air-hole photonic crystals.Accordingly, there is a need to investigate whether such a design supports topological edge modes.

3) Once an appropriate design is identified, it would be interesting to study the near field structure of thetopological edge modes, and compare against the occurrence of C points in previously-studied photonic crystalwaveguides.

Outcomes and future plansIn the closing session, in addition to many voices of satisfaction with the meeting there were calls for a follow-upworkshop, perhaps in two or three years’ time. In the meanwhile there will be a follow-up to another suggestion,that proceedings of work initiated by this workshop might be combined with content from this summer’s now-biennial International Conference on Optical Angular Momentum [Capri, 18-22 September 2017] in a special issueof some suitable journal. The organisers of that conference have now been approached, to explore this possibility.

Numerous new collaborations were instigated directly through the workshop; those that have been reportedthus far involve groupings of individuals who together represent more than a third of the workshop participants,specifically including Alonso, Andrews, Bliokh, Fernandez-Corbaton, Galvez, Jones, Karimi, Khanikaev, Leader,Leykam, Loeffler, Mazilu, O’Dell and Salam.

Press releaseAt this early point in the twenty-first century it is already apparent that a paradigm shift is taking place in numer-ous areas of technology, as the new science of photonics increasingly outperforms and displaces twentieth centuryelectronics. Prominent examples are high throughput nanoscale connectivity in IT systems, secure and enhancedtelecommunications, and quantum computing. Some of the latest advances, aiming to exploit the distinctive quan-tum properties of light, convey information in the form of highly unconventional structured beams of light. Thispotentially transformative field is revolutionising many areas of optical physics, yet it has raised numerous ques-tions in urgent need of address at the level of fundamental theory. This workshop brought top experts together inthe aim of resolving these issues, laying new mathematical foundations to secure the ground for further progress.

AcknowledgementThanks to all other organisers and participants: above all, and on behalf of all, hearty thanks to BIRS for makingthis workshop possible, intellectually profitable, and thoroughly enjoyable.

ParticipantsAlonso, Miguel (University of Rochester)Andrews, David (University of East Anglia)Babiker, Mohamed (University of York)Banzer, Peter (MPI for the Science of Light)Bialynicki-Birula, Iwo (Center for Theoretical Physics Polish Academy of Sciences)Bliokh, Konstantin (RIKEN Cluster for Pioneering Research)Boyd, Robert (University of Ottawa)Carmichael, Howard (University of Auckland)Coles, Matt (University of East Anglia)

Page 85: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Quantum Field Framework for Structured Light Interactions 81

Desyatnikov, Anton (Nazarbayev University)Engheta, Nader (University of Pennsylvania)Fernandez-Corbaton, Ivan (Karlsruhe Institute of Technology)Fickler, Robert (University of Ottawa)Galvez, Enrique (Colgate University)Gotte, Jorg (University of Glasgow)Gutierrez-Cuevas, Rodrigo (University of Rochester)Jones, Garth (University of East Anglia)Karimi, Ebrahim (University of Ottawa)Khanikaev, Alexander B. (Queens College of CUNY)Kuipers, Kobus (Kavli institute Delft)Lavery, Martin (University of Glasgow)Leader, Elliot (Imperial College London)Leykam, Daniel (Nanyang Technological University)Loeffler, Wolfgang (Leiden University)Lvovsky, Alexander (University of Calgary)Mansuripur, Masud (University of Arizona)Mazilu, Michael (University of St Andrews)O’Dell, Duncan (McMaster University)Rubinsztein-Dunlop, Halina (University of Queensland)Salam, Akbar (Wake Forest University)Sanders, Barry (University of Calgary)Sang-Nourpour, Nafiseh (U. of Calgary)Tijssen, Teuntje (University of Bristol)van Kruining, Koen (Max Planck Institute for the Physics of Complex Systems)Yao, Alison (University of Strathclyde)

Page 86: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 8

Phase Transitions Models (17w5110)

April 30 - May 5, 2017Organizer(s): Yaniv Almog (Louisiana State University), Petru Mironescu (University ClaudeBernard Lyon 1)

OverviewThe term phase transition (or phase change) is most commonly used to describe transitions between differentstates of matter (solid/liquid/gas, superconducting/normal, ferromagnetic/paramagnetic, isotropic/nematic, etc.).Second-order phase transition models have been in the focus of mathematical research for a number of decades.As opposed to first-order models, where the transition occurs along a sharp interface (e.g. the surface of a droplet),second-order ones, developed under the impulsion of Lev Landau, phenomenologically include a smoothing (gra-dient like) term leading to diffuse interfaces and transition phenomena. Mathematically, one is lead to considerin the models a small parameter (typically, ε > 0) interpreted as the characteristic length of the interface, and toreplace the discrete order parameter u (with e.g. u = 0 or u = 1 in the case of a two-phase fluid) by a continuousone. This order parameter could be either a scalar, as in the discrete case, or a complex u (with |u|2 interpreted asa density) or a probabilistic function of the orientation of molecules (the Q-tensor below).

Mathematical issues within these models are related to the study of the well-posedness of the boundary condi-tions, the behavior of minimizers or critical points of the appropriate energy functional, approximate descriptionof transition layers, description of metastable states, asymptotics as ε → 0, justification of first-order transitionmodels as limits of the second-order one. More specialized directions concern the sound justification of physicalobservation as expulsion of magnetic flux (Meissner effect), existence of critical fields, existence of lattices ofvortices in type II superconductors (Abrikosov lattices), etc.

A very partial list of second-order phase transition models include

1. The van der Waals-Cahn-Hilliard theory of phase transitions in a fluid. In this theory, the total energy of thefluid filling the container Ω ⊂ Rn is assumed to be given by ε2|∇ρ|2 +W (ρ), where ε is a small parameterandW : R→ [0,∞) is a two-well potential, vanishing at a and b. A typical example isW (ρ) = ρ2(1−ρ)2,with a = 0 and b = 1. When ε = 0, minimizers take only the values a and b. In the limit ε→ 0, minimizersor critical points of bounded energy of the rescaled energy

Fε(ρ) =

∫Ω

(1

ε|∇ρ|2 + εW (ρ)

),

are expected to get closer and closer to a and b. In the typical example above, the limit should be thecharacteristic function of a subset A ⊂ Ω. The shape of A depends on the constraint on ρ: mass constraint

82

Page 87: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Phase Transitions Models 83

∫Ω

ρ = m, contact energy with the walls of the container, for example of the form εσ(ρ), etc. In each

case, minimizers approximate a two-phase configuration satisfying a variational principle related to theequilibrium configuration of liquid drops.

Variants of the above functional include vector-valued unknowns ρ : Ω → Rd and adapted potentials W :Rd → [0,∞).

2. The Ginzburg-Landau model. This model and its variant are relevant for the theory of superconductivity andBose-Einstein condensates.

Its simplest version is given by the energy functional

Eε(u) =

∫Ω

(1

2|∇u|2 +

1

4ε2(1− |u|2)2

), (8.0.1)

where u ∈ H1(Ω,C), Ω ⊂ Rn, and ε > 0 is a (small) parameter. The physically relevant quantity hereis |u|2, which plays the role of a density. The unknown function u can be subject to various boundaryconditions (Dirichlet, semi-stiff, etc.).

There are numerous generalizations and modifications for this functional. One can, for instance, discuss theabove model on manifolds, to consider u ∈ H1(Ω,Rn), etc. One can also consider time-dependent gradientflow (with either heat, Schrodinger or wave operator for the time derivatives).

This simplified energy is obtained by neglecting the effect of magnetic field in the Ginzburg-Landau modelof superconductivity. In two dimensions, the full energy functional takes the form

Gε(u) =

∫Ω

(|∇Au|2 + |∇A− hex|2 +

1

2ε2(1− |u|2)2

). (8.0.2)

In the above A ∈ H1(Ω,R2) denotes the magnetic vector potential (∇A is the magnetic field), hex is theapplied magnetic field and u ∈ H1(Ω,C) denotes the superconductivity order parameter. The minimizationof Gε can be carried out without any prescribed boundary conditions, but one can also prescribe boundaryconditions other than the natural ones, on some subset of the boundary.

Another variant is obtained by adding a traping potential, leading to the Bose-Einstein model for conden-sates. The effect of electric current can also be added to the Ginzburg-Landau model of superconductivity, inwhich case the problem is not variational anymore (it cannot be formulated in terms of an energy functionalthat needs to be minimized).

Other variants include spin models for ferromagnetic materials. Mathematically, this involves two unknownwave functions u1, u2, weakly coupled through the potential terms.

3. Oseen-Frank model of nematic and smectic liquid crystals. Liquid crystals are an intermediate state betweenliquids and solid crystals. Their main mathematical feature is the orientation of the molecules. In the simplestmodel, one associates to each molecule in the container Ω ⊂ R2 or R3 an orientation n = n(x) ∈ RP2,

and the energy to minimize is∫

Ω

|∇n|2 (equal elastic constants). One may further simplify by replacing the

physically realistic target RP2 by S2.

A first sophistication of this model consists of introducing a more general dependence of the energy on n.The general form of the Oseen-Frank energy density for a nematic liquid crystal in the absence of externalelectromagnetic fields is given by

FN (n,∇n) = K1|∇ · n|2 +K2|n · (∇n) + τ |2+

+K3|n(∇n)|2 + (K2 +K4)(tr(∇n)2 − |∇ · n|2) . (8.0.3)

In the above, Ki (i = 1, 2, 3, 4) are dimensionless elastic coefficients which depend on the material, and τis a real number representing the chiral pitch in some liquid crystals.

Page 88: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

84 Five-day Workshop Reports

Smectic crystals are liquid crystals disposed in twisted or tilted layers. A typical Oseen-Frank model forsuch crystals is

J(ψ, n) =

∫Ω

(|(i∇+ qn)ψ|2 + µ

(−|ψ|2 +

1

2|ψ|4

)+ FN (n,∇n)

). (8.0.4)

The number q is the characteristic smectic layer number: 2π/q is the characteristic layer thickness. Theconstant µ must be positive. The additional complex order parameter ψ has the following interpretation: |ψ|characterizes the strength of the density modulation, while the (suitable normalized) phase of ψ measuresthe smectic layer displacement relative to that of perfect 1D crystalline order of layer periodicity 2π/q.

The minimization of (8.0.2) is usually carried over all (ψ, n) ∈ H1(Ω,C)H1(Ω,S2).

4. The Landau-de Gennes phase transition models. Oseen-Frank models cannot explain lines of disclinationobserved in 3D liquid crystals. The Landau-de Gennes phenomenological model describes liquid crystalsnot via the director n, but uses instead a 33 symmetric matrix Q(x), interpreted as the (suitably normalized)covariance matrix of the probability distribution of n(x) ∈ RP2.

In this case, the total energy takes the form

Eε(Q) =

∫Ω

(fe(∇Q) + fLdG(Q)) ,

where the elastic energy is given by

fe(∇Q) :=L1

2|∇Q|2 +

L2

2Qij,jQik,k +

L3

2Qik,jQij,k

=

3∑j=1

L1

2|∇Qj |2 +

L2

2(divQj)

2+L3

2∇Qj · ∇QTj

, (8.0.5a)

while the bulk Landau-de Gennes energy density is

fLdG(Q) := a tr(Q2)

+2b

3tr(Q3)

+c

2

(tr(Q2))2

. (8.0.5b)

Temperature and material depending constants a, b,c determine the critical points of the bulk energy, whosenature governs the (meta)stability of the isotropic and nematic phases and the behavior of the liquid crystal.In type II liquid crystals, the molecules will tend to be aligned most of the time, leading to a Q-tensor of theapproximate form Q(x) = n(x) ⊗ n(x) − (1/3)I3. In small regions (diffuse points or lines) Q will differmuch from this form, one of the options (but not the only one) consisting of being close to Q = 0 (as inisotropic liquids).

5. Micromagnetics. In certain asymptotic regimes, the magnetization of a material contained in a thin filmin R3 is described (at the mesoscopic level) by a 3D vector field m : Ω ⊂ R2 taking values into S2. Indimensional reductions procedures, one is then led to the study of the asymptotic behavior of functionals ofthe form

Eε(m) = ε

∫Ω

|∇m|2 + f(ε)

∫Ω

||∇|−1/2 (∇ ·m)|2 ,

where f(ε)→ 0 as ε→ 0; the given f accounts of the characteristics of the asymptotic regime.

6. The Bloch-Torrey model of diffusion-weighted magnetic resonance imaging (DW-MRI). For a magnetiza-tion M and an external magnetic field B the time-dependent equations

dM

dt= MB + h2∆M ,

where h is a parameter. In two dimensions the steady-state problem reduces to a Schrodinger equation withpurely imaginary potential.

Page 89: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Phase Transitions Models 85

Recent DevelopmentsThe problems that have been addressed in the workshop were spanned over a wide area of interest. Some of theproblems that were considered are listed below.

1. Linear stability of trivial solutions. In particular, some attention was devoted to the spectral theory of nonselfadjoint operators, which appear, for instance, when one considers the effect of electric current into theGinzburg-Landau model, or when considering the Bloch-Torrey model.

2. Tunneling effects for either the Robin-Laplacian (or p-Laplacian) or the linearized Ginzbug-Landau model.

3. Weakly nonlinear analysis of either the Ginzburg-Landau model or for the smectic A liquid crystals model.These problems are characterized by boundary layer solution, decaying exponentially fast away from theboundary.

4. Analysis of singularities for the Ginzburg-Landau model. The issues considered include the analysis of localminimizers in the sense of De Giorgi in dimension 3, the interaction between local (singularities) and globalanalysis (genus) in the asymptotic analysis of minimizers.

5. Existence and linear stability of radially symmetric solutions of the Landau-de Gennes model.

6. New models of liquid crystals and their analysis. In this direction, topics include models for the free energyaccounting of the shape the molecules and the role of the penalization terms in the free energy in preventingthe escape of theQ-tensor from the physically realistic region. A related topic is the one of discrete models tonematic configuration, and the justification of the derivation of the continuous model as limit of the discreteone.

7. The transitions from isotropic to uniaxial, from uniaxial to biaxial nematic solutions for the Landau-deGennes model, and the transition from smectic A to smectic C states for the Chen-Lubensky model. Inparticular, the results presented include a rigorous justification of the existence of Saturn rings and/or biaxialtransitions around spherical droplets in nematic liquids.

8. Some general issues of analysis related to phase transition models. In particular, description of an appropriatefunctional space for the analysis of semi-stiff boundary problems, homotopy classes of Sobolev maps withprescribed singularities, lifting problems for Sobolev maps, justification of 3D to 2D dimensional reductionsvia asymptotic analysis of variational inequalities in large cylinders, and the rigorous analysis of variationalproblems under prescribed mass and energy constraints. In a different direction, applications of the ideasdeveloped in relation with phase transition theories to the relaxation of total length minimization Plateautype problems were discussed. Attention was also focused on the use of Allen-Cahn type regularizations inthe modeling of the motility of eukaryotic cells on substrates.

Presentation Highlights

Non selfadjoint operators

Bernard Helffer considered the complex Airy operator −d2/dx2 + ix defined on the domain

D = (u+, u−) ∈ H2(R+)H2(R−) |u′+(0) = u′−(0) = κ[u+(0)− u−(0)] .

The transmission boundary condition satisfied at x = 0 is in frequent use when studying the above-mentioned Bloch-Torrey equations. Among other things, he presented a proof of completeness of theeigenspace, and of the simplicity of the eigenvalues. Some additional results were presented for the Robinrealization of the same operator acting on R+ (i.e., u′(0) = κu(0)) (cf. [29, GH]).

Page 90: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

86 Five-day Workshop Reports

Yaniv Almog considered the Schrodinger operator with a purely imaginary potentialAh = −h2∆ + iV in the semiclassical limit h → 0. Assuming a smooth domain and a smooth poten-tial with no critical points (∇V 6= 0 in Ω) he obtained the left margin of the spectrum ofAh. A lower boundof inf <σ(Ah) have previously been obtained in [6, 30] for the Dirichlet realization of Ah, an upper bound,under some significant assumptions on V , has been obtained in [8] for the Dirichlet realization. Here bothupper bounds and lower bounds have been obtained in much greater generality for a variety of boundaryconditions, including the above-mentioned transmission boundary condition.

Petr Siegl presented a study of the eigenfunctions of operators of the type

−d2/dx2 + |x|β + iV (x) ; |V | . |x|γ as |x| → ∞

acting on the real line, where β ≥ 2 and γ ≥ 0. It has been established that the system of eigenfunctionsis complete. Nevertheless, very often it does not form a basis. The main result for this specific operatoris that whenever γ < β/2 − 1 the eigenfunctions form a Riesz basis. Some negative results for the caseγ > β/2− 1 have been demonstrated as well (cf. [39, 43]).

Surface superconductivitySoeren Fournais considered the 3D Ginzburg-Landau energy functional (8.0.2) in the

limit ε → 0. It is well known that when the applied magnetic field satisfiesε−2 < hex < ε−1/Θ0 (Θ0 ≈ 0.59 is known as de Gennes value) the minimizer decays exponentiallyfast away from the boundary. The energy density function on the surface (the local energy of the minimizerper unit surface area) has been studied. A proof was presented to the fact that this energy density is monotonenon-decreasing with the angle between the applied magnetic field and the local tangent plane.

Ayman Kachmar presented a study of the smectic A energy (8.0.3)–(8.0.4) in the limit where the large elasticconstants force the director field to be in the chiral nematics state, i.e.,

n∗ = (cos(τx3), sin(τx3), 0) ,

where the x3 axis direction is arbitrary. In previous works [5, 31] a reduced functional where n = n∗ (andhence FN = 0) was considered. Kachmar demonstrated that the results obtained for the reduced functioncan be reproduced for the full functional (8.0.3)–(8.0.4), in the limit of large elastic constants (cf. [41]).

Michele Correggi began by reviewing some of the results he has obtained with Rougerie on the behavior of theminimizer of (8.0.2) near the boundary, in the surface superconductivity regime. It has been conjectured byPan [45] that for any critical point of (8.0.2) the solution becomes uniformly distributed along the boundaryas ε → 0. Coreggi and Rougerie proved this conjecture for the global minimizer (cf. [24]). He thendiscussed the effects of curvature and corners on the energy [22, 25].

Tunneling and Weyl asymptoticsNicolas Raymond considered the Robin Laplacian Lh = −h2∆ defined on

D = u ∈ H2(Ω) |h1/2 ∂u

∂ν

∣∣∣∂Ω

= u

in the semiclassical limit h → 0. The leading behavior (as well as higher order terms) of the negativeeigenvalues has been obtained. Assuming two points of maximal curvature on ∂Ω for a 2D symmetricdomain, it is demonstrated that the gap between the first two eigenvalues can be approximated by the gapbetween the first pair of eigenvalues of a simple boundary operator. The presentation also contained someinsights of the Lp spectral theory of magnetic Laplacians. In particular, a localization result, similar in natureto existing L2 results, has been obtained (cf. [40, 26]).

Page 91: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Phase Transitions Models 87

Hynek Kovarık considered the p-Laplace operator with Robin boundary conditions on Euclidean domains withsufficiently regular boundary. In particular, the asymptotic behavior of the first eigenvalue, given by

Λ(Ω, p, α) := infu∈W 1,p(Ω)

u6≡0

∫Ω

|∇u|pdx− α∫∂Ω

|u|pdσ∫Ω

|u|pdx,

when the strength, α, of the (negative) boundary term grows to infinity, depends on the geometry of thedomain. Localization of the corresponding minimizers, near the point of maximal mean curvature, has beenobtained as well [38].

Virginie Bonnaillie-Noel devoted her presentation to the semiclassical analysis of the Neumann realization of themagnetic Laplacian (−ih∇ + A)2 (where ∇A = i3) on a smooth planar domain. As in the presentation ofRaymond a symmetric domain is considered, with precisely two points on the boundary where the maximalcurvature is attained (an ellipse for example). By obtaining WKB expansions of the eigenfunctions a formalderivation of the gap between the first two eigenvalues is achieved (cf. [15]).

Liquid crystalsPeter Pallfy-Muhoray reported a study where the effects of particle shape on the behavior of soft condensed mat-

ter systems were considered. A simple orientational density functional form of the Helmholtz free energyincluding both long-range attractive and short-range repulsive interactions is provided. It is applied to an ex-ample, taking into account nematic order due to both temperature–dependent attractive (Maier-Saupe) andconcentration dependent repulsive (Onsager) interactions. The shape dependence of the attractive interac-tions originates in the polarizability, while the shape dependence of the repulsive interactions arises throughthe excluded volume. The varying phase behavior due to the relative contributions of these two effects hasalso been considered (cf. [46]).

Xiaoyu Zheng continued the presentation of Pallfy-Muhoray, and provided more details as to how the Onsagermodel can be modified to account for hard core repulsion, including the effect of shape-dependent excludedvolume. A detailed derivation of the new model from the Helmholtz free energy was presented. Unlikethe Onsager model where the pressure remains finite for arbitrarily high densities, the new model predictsdivergence of the pressure at a critical density.

Dmitry Golovaty presented a Gamma-convergence result of the Landau-de Gennes (LdG) model (8.0.5) for anematic liquid crystalline film in the limit of vanishing thickness. The film is assumed to be attached to afixed surface, where an anchoring energy of the form

fs(Q, ν) = α [(Qν · ν)− β]2

+ γ |(I3 − ν ⊗ ν)Qν|2 ,is assumed, where ν is a unit normal vector, and α, γ > 0, β ∈ R are given parameters. In the limit ofvanishing thickness, Γ-convergence to a limiting surface energy is proved. Then, the limiting problem wasdiscussed in several parameter regimes (cf. [27]).

Dan Phillips considered a surface stabilized ferroelectric liquid crystal cell which, after being cooled from thesmectic-A to the smectic-C phase forms V-shaped (chevron like) defects. The defects create an energybarrier that prevent (for the Γ-limit) switching between equilibrium patterns. To obtain such a transition, agradient flow for a Chen-Lubensky energy is examined. The flow allows the order parameter to vanish, andconsequently it is possible to overcome the energy barrier (cf. [44]).

Patricia Bauman presented a study of minimizers, on a bounded domain, of the Landau-de Gennes functional(8.0.5) where instead of (8.0.5a) use is made of the Maier-Saupe energy

fms(Q) =

−K|Q|2 + inf

ρ∈AQ

∫S2 ρ(p) ln(ρ(p))dp , if Q ∈M

∞ , otherwise,

Page 92: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

88 Five-day Workshop Reports

whereAQ := ρ ∈ L1(S2;R) : ρ ≥ 0, s

∫S2ρ(p)dp = 1,

Q =

∫S2

(p⊗ p− 1

3I3

)ρ(p)dp ,

(8.0.6)

andM = Q ∈ R33 |Qt = Q ; TrQ = 0 ; σ(Q) ∈ (−1/3, 2/3) .

Note that M is the set of physically acceptable states, in view of the relation betwen Q and ρ given in(8.0.6). Using the fact that the energy density is singular outside the physically realistic range, it is provedthat minimizers are regular and, in several model problems, take on values strictly within the physical range(cf. [9]).

Lia Bronsard provided an instructive illustration of the advantage of the Landau-de Gennes model over the morerigid Oseen-Frank or Ericksen theories. She considered a spherical colloidal particle immersed in a liquidcrystal, satisfying homeotropic weak anchoring at the surface of the colloid (with strength parameter W )and approaching a uniform uniaxial state at infinity. The analysis is performed in the case of the Landau-deGennes model with equal elastic constants, i.e., in (8.0.5) one takes L1 = L2 = L3 := L.

For small balls (i.e., r L1/2) and relatively strong anchoring, she proved the existence of a limitingquadrupolar configuration, with a “Saturn ring” defect, corresponding to an exchange of eigenvalues of theQ-tensor. Relatively strong anchoring means that r is of the order of L/W ; the limiting shape depends on the

limiting ratiorL

W. An interesting feature of this result is the explicit form of the quadrupolar configurations.

The Saturn ring defect appears as a discontinuity in the principal eigenvector of Q(x), the Q-tensor passingthrough a uniaxial state as eigenvalue branches cross via an “eigenvalue exchange” mechanism. In the largeparticle limit, she was able to analyze the strong anchoring (Dirichlet) condition under the extra assumptionthat the minimizer is axially symmetric. In this setting, she obtained that the limiting map when r L1/2

is the unique uniaxial axially symmetric minimizer of the Oseen-Frank energy which is compatible with thebehavior at infinity of the liquid crystal (cf. [4]). Some of the striking features of this analysis: the Oseen-Frank model does not allow line defects as in the Saturn ring, and the numerical simulations based on theEricksen model do not provide the correct radius of the ring.

Arghir Zarnescu gave a general overview of the research project of the group he forms with Radu Ignat, LucNguyen and Valeriy Slastikov, and of some of their main achievements. His presentation served as anintroduction to the one of Luc Nguyen. In particular, he put in perspective the Landau-de Gennes andthe Ginzburg-Landau models, the former one being a higher dimensional, more complex version than thelatter. It turns out that new interesting features appear, connected to new phenomena which are specificto this type of models. A typical example is the uniqueness and the stability of radial solutions. Whileit is well understood for decades for the Ginzburg-Landau model, its resolution for the Landau-de Gennesmodel required new ideas and techniques which are of independent interest, in particular new separation ofvariables and uniqueness techniques. His presentation was based on [33, 34, 35, 36].

Luc Nguyen detailed one of the new features presented in the previous lecture. In Ginzburg-Landau theory, itis known that the hedgehog f(r)eiθ is energy minimizing. In the Landau-de Gennes setting, the hedgehogf(r)

x

|x| is minimizing in certain regimes, but not in all of them. Another way to present this is that even if

the energy functional and the boundary data are invariant under the orthogonal group, the minimizer need nothave this property. A striking result presented is that even for two dimensional liquid crystals, uniquenessis a non-trivial matter. More specifically, if the boundary datum on ∂D has no topological obstruction, theminimizers are “unique and rotationally symmetric” (cf. [36]). As an application, he derived the existenceof multiple non-minimizing rotationally symmetric critical points.

Giacomo Canevari presented recent progress related to nematic shells. These are rigid colloidal particles with atypical dimension in the micrometer scale coated with a thin film of nematic liquid crystal whose molecular

Page 93: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Phase Transitions Models 89

orientation is subjected to a tangential anchoring. The recent interest in their study is related to the possibilityof using them as building blocks of meso-atoms with a controllable valence. Mathematically, they aredescribed as compact boundaryless two dimensional surfaces M embedded into R3, together with a unit-norm tangent vector field n : M → TM . Such a smooth n need not exist, and even in H1 n does not existin a surface of non zero Euler characteristic, by the weak form of the Poincare-Hopf’s theorem (see alsoRobert Jerrard’s presentation). He presented a discrete-to-continuous approach of nematic shells. It consistsof discretizing the surface using triangulations, and to define the tangent vectors only on the vertices of thetriangulation, avoiding in this way all topological obstructions. A discrete energy is defined, mimicking thestandard Dirichlet energy. The main result presented is the Γ-convergence of the discrete energy as ε → 0.The second order expansion involves, at the minimal energy level, the Euler characteristic of the surface anda renormalized energy in the spirit of [13] (cf. [19]). A striking fact about the renormalized energy is that itkeeps trace of the discrete structures used to discretize the surface M around each singularity.

Jinhae Park discussed the nature of minimizers for isotropic-nematic 1D interface problem in case of equal elasticconstants in (8.0.5), i.e., L1 = L2 = L3 := L. In this setting, it is interesting to allow non positive values ofthe parameter L. The matter of discussion is the stability and/or minimality of 1D interfaces, i.e., of entirecritical points of (8.0.5) depending only on x3 and satisfying appropriate anchoring conditions at x3 = ±∞.This issue is investigated under the assumption b2 = 27ac, meaning that the bulk energy of the normal andthe nematic phase are equal. When L = 0, it is proved that the minimizer must be a kink similar to the onefor the Modica-Mortola functional. Assuming Q(−∞) = 0, the anchoring condition at∞ is proved to playa crucial role when L 6= 0: homeotropic, planar and tilt anchoring lead to different analysis. For examplein the case of homeotropic anchoring it is proved that the uniaxial equilibrium is stable when L ≤ 0, butunstable when L > 0 (cf. [47]).

Other aspects of the Ginzburg-Landau theoriesNicolas Rougerie considered a mean-field approximation of multi-particle Hamiltonian, appropriate for ”almost

bosonic” 2D anyons. The order parameter associated with these anyons minimizes the energy functional

Eafβ [u] :=

∫R2

∣∣(−i∇+ βA[|u|2])u∣∣2 + V (r)|u|2

dr, (8.0.7)

where V is a trapping potential andA[λ] := ∇⊥ log |r| ∗ λ .

After providing some physical background, the energy and the density (u) asymptotics in the limit β → ∞were presented [23].

Kirill Samokhin presented the challenges of the study of non centrosymmetric superconductors. One of theirnoticeable features is the possibility of unusual nonuniform superconducting states even without any externalfield. In such situations, the strong spin-orbit coupling of electrons with the crystal lattice makes it necessaryto describe superconductivity in terms of one or more nondegenerate bands characterized by helicity. He alsoexplained how the origin of non uniform superconducting states can be traced in the first-order gradient termsin the Ginzburg-Landau free energy (cf [48]). He gave some insight concerning the topological classificationof the superconducting states using the integer-valued Maurer-Cartan invariants and the Bogoliubov-Wilsonloops, with focus, for a two-dimensional setting, on the corresponding wave function topology.

Etienne Sandier presented a first progress towards the description of local minimizers of the simplified Ginzburg-Landau functional (8.0.1) in 3D. An entire solution of (8.0.1) is locally minimizing in the sense of De Giorgiif Eε(u + φ,BR) ≥ Eε(u,BR) for every R > 0 and φ ∈ C∞c (BR;C). Such solutions are classified in 2D(they are either constant, or, up to an isometry, the unique radial solution of degree 1 at infinity, urad). Theirclassification in dimension ≥ 3 is widely open. He presented the following result: an entire solution whoseaverage energy is sufficiently small, in the sense that

lim infR→∞

E(u,BR)

R lnR< 2π,

Page 94: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

90 Five-day Workshop Reports

must be constant [49]. Here, 2π is precisely the density of the 2D solution urad considered as a functionof three variables. This may open the way for classifying the local minimizers in 3D as constants or, up toisometries, the 2D solution urad.

Robert Jerrard focused on the analysis of the functional

Gε(u) =

∫S

(1

2|Du|2g +

1

4ε2F (|u|2g)

),

with u : S → TS a tangent vector field, and S a (two dimensional) surface. He explained a Γ-convergenceresult at the second order of Gε as ε → 0. In particular, he showed that energy minimizers uε convergeoutside a finite number of singularities to a canonical vector field u∗ of unit length, whose number of singu-larities is |2− 2g|, g being the genus of the (compact) surface S (cf. [32]). In addition, he obtained a secondorder expansion of the energy Gε(uε) involving a renormalized energy in the spirit of the one devised byBethuel, Brezis and Helein for the Dirichlet problem in the plane (cf. [13]). The remarkable aspect of thisanalysis is that the role of the Dirichlet condition is played by the topology of S, more specifically by itsEuler characteristic.

Radu Ignat presented a uniqueness (up to symmetries) asymptotic result for minimizers of the Ginzburg-LandauEε functional for R3 (and not complex) valued unknown functions uε. His starting point is the following

well-known symmetry result for S2-valued harmonic maps: the Dirichlet energy∫D|∇n|2, with D the unit

disc and n : D → S2 subject to the boundary condition n(eiθ) = (ekiθ, 0) has exactly two solutions, onesuch that n3 > 0, the other one such that n3 < 0. These solutions are of the form (f(r)ekiθ,±g(r)), withr := |x|. One expects this uniqueness (up to the ± sign) and special form of the solutions to propagate tominimizers of Eε on D, at least for small ε. This is indeed the case. He presented elements of the proof,with emphasis on the crucial steps and ideas in [37].

Miscelaneous topicsLeonid Berlyand presented a study of two types of models describing the motility of eukaryotic cells on sub-

strates. The first one, a phase-field model, consists of the Allen-Cahn equation for the scalar phase fieldfunction coupled with a vectorial parabolic equation for the orientation of the acting filament network. Inthe sharp interface limit the equation of the motion of the cell boundary has been derived. The existence oftwo distinct regimes of the physical parameters is then established and the existence of traveling waves inone of them is proved. The second model type is a non-linear free boundary problem for a Keller-Segel typesystem of PDEs in 2D with area preservation and curvature entering the boundary conditions. A family ofradially symmetric standing wave solutions (corresponding to a resting cell) are obtained. Then, with the aidof topological tools, traveling wave solutions (describing steady motion) with non-circular shape are shownto bifurcate from the standing wave. The bifurcation analysis explains, how varying a single (physical)parameter allows the cell to switch from rest to motion (cf. [12, 10]).

Gershon Wolansky considered a non-local Liouville equation

±∆ψ =e−λψ

‖e−λψ‖1in Ω ⊂ Rn , ψ|∂Ω = 0 ,

where λ > 0. The physical motivation behind this equation is a variational problem for entropy maximiza-tion under prescribed mass and energy. He presented an unconditional existence and uniqueness proof incase of electrostatic (repulsive) self-interaction, and conditional existence and uniqueness in dimension twoin the case of gravitational (attractive) self-interaction.

Stan Alama considered a a nonlocal isoperimetric functional with a confinement term, derived as the sharp inter-face limit of a variational model for self-assembly of diblock copolymers under confinement by nanoparticleinclusion. This functional appears as the sharp interface limit of a model of diblock copolymer/nanoparticle

Page 95: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Phase Transitions Models 91

blend where a large number static nanoparticles serve as a confinement term, penalizing the energy outsideof a fixed region. In a periodic setting, this energy takes the form

Eγ,σ,µ(u) :=

∫T3

|Du|+ γ‖u−m‖2H−1(T3) + σ

∫T3

(u− 1)2.

Here, µ is a measure with density on T3 and the BV unknown density function u is subject to the mass

constraint∫T3

= m.

The following asymptotic regime naturally appears in connection with the sharp interface limit of the Ohta-Kawasaki functional, modeling the self-assembly of diblock copolymers: m→ 0, γ, σ →∞.

In this regime and after a suitable rescaling, a two terms expansion for the energy without confinementterm was presented, involving a small parameter η (representing, roughly speaking, the radius of the smallparticles). The first term captures the location of droplets in the limit η → 0 (through a sum of Dirac masses);the second one, a Coulomb type interaction between droplets. Much more delicate is the analysis of the casewith confinement. This is obtained in presence of confinement densities of µwhich are spatially variable andattain a nondegenerate maximum. Under such an assumption, it is possible to obtain a two-terms asymptoticexpansion exhibiting a separation of length scales due to competition between the nonlocal repulsive andconfining attractive effects in the energy. Two different asymptotic behaviours may occur, according to thetotal volume of the droplets: either the minority phase splits into several droplets at an intermediate scale, orthe minimizers form a single droplet converging to the maximum of the confinement density (cf [1]).

Michel Chipot presented new aspects of the justification of dimensional reduction for infinitely large cylinders,theory he developed over the last decade [20, 21]. More specifically, he established the existence of mini-mal solutions (local minimizers in the sense of De Giorgi) to some variational elliptic inequalities. Thesesolutions are obtained by appropriate truncations of the unbounded domain. A typical situation is the oneof an infinite (straight) cylinder, but the techniques allow considering more general cylinder like domains,in particular bounded by appropriate graphs. Uniqueness of the minimal solutions is obtained via adaptedcomparison principles.

Peter Sternberg introduced a non-standard isoperimetric problem in the plane associated with a metric having adegenerate conformal factor at two points. More specifically, he considers the minimization problem

inf E(γ) with γ : [0, 1]→ R2, E(γ) :=

∫ 1

0

F (γ)|γ′|.

The competitors must satisfy γ(0) = p−, γ(1) = p+ and the “enclosed area” condition∫γ

ω = const.,

with ω := −p2dp1.

The conformal factor F : R2 → [0,∞) vanishes exactly at p±.

Existence of minimizers is related to the existence of traveling wave solutions to a Hamiltonian systemassociated with the energy functional

H(u) :=

∫ (1

2|∇u|2 +W (u)

), with W (u) := F 2(u).

Minimizer for the one-well problem exist for quadratic potentials F 2, and need not exist for more degenerateones. He explained how to obtain existence for the two-wells problem assuming separate existence for theone-well ones at p± (cf. [3]). This result has implications on the existence of traveling waves. It is alsopossible to derive a limitation for the maximal propagation speed for these traveling waves through anexplicit upper bound depending on the conformal factor F .

Page 96: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

92 Five-day Workshop Reports

Giandomenico Orlandi discussed a relaxation of the Steiner tree problem, consisting of finding a minimal (forvarious metrics) connected graph containing given points. He proposed a variational approximation of thisproblem, as well as of the Gilbert-Steiner irrigation problem for the Euclidean distance in the plane. Therelaxation involves Modica-Mortola or Ginzburg-Landau type functionals. More generally, he consideredthe Plateau problem in 2D with coefficients into a normed group. He presented elements of the proof of thefact that these approximations do indeed Γ converge to the original problems (cf. [14]), and thus offer analternative approach for handling such questions known to be NP hard.

Itai Shafrir presented a natural notion of class within Sobolev spaces of sphere-valued maps, with focus on thespaces W 1,1(Ω;S1) and W 1,2(Ω; S2). More specifically, two maps are in the same class if they have “thesame singularities” (the same distributional Jacobian). The latter functional setting is relevant for liquidcrystals. He introduced the metric and Hausdorff distance between two classes, and gave their precise values(cf. [18, 17, 16]). The former distance can be interpreted as the minimal energy required to move thesingularities from one place to the other, and its exact value involves the Wasserstein distance between theJacobians of maps.

Petru Mironescu presented a functional analytic setting adapted to the study of the semi-stiff problems. A typicalrelevant space in this case is W 1/p,p(S1;S1), with 1 < p <∞. Such a space is critical for lifting and weakconvergence matters: bubbling phenomena appear, and there is no phase control or preservation of thedegree in the weak limit. He described the behavior of weakly convergent sequences, demonstrating thatnon compactness can appear only via the conformal group of the unit disc (profile decomposition cf. [42])and gave applications of the profile decomposition to the existence of critical points for semi-stiff problems[11].

Future Perspectives

The meeting revealed the emergence of new techniques and intermediate problems that in the near future shouldfacilitate the ground for approaching several problems that seemed out of reach before:

1. Study of stability, uniqueness and other qualitative properties of radial solutions of vector-valued radiallysymmetric functionals.

2. Classification of locally minimizing entire solutions of the Ginzburg-Landau functional in dimension ≥ 3.

3. Complete analysis of Landau-de Gennes models with penalization terms confining the Q-tensor within aphysically realistic region.

4. Sharp analysis of the biaxial escape phenomenon in nematics.

5. Rigorous asymptotic analysis of many order parameters Ginzburg-Landau type functionals – beyond thep-waves models.

6. Linear stability analysis of hydrodynamic flow in a tube of arbitrary cross section.

7. Rigorous analysis of tunneling effects for the Neumann magnetic Laplacian and other operators.

8. Determination of the preferred orientation of the helical axis in nearly cholesteric smectic A liquid crystals.

9. Analyzing the behaviour of the superconductivity order the boundary layer in a three-dimensional setting, inthe surface superconductivity parameter regime.

Page 97: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Phase Transitions Models 93

ParticipantsAlama, Stanley (McMaster University)Almog, Yaniv (Ort-Braude College)Bauman, Patricia (Purdue University)Berlyand, Leonid (Pennsylvania State University)Bonnaillie-Noel, Virginie (CNRS, Ecole normale superieure)Bronsard, Lia (McMaster University)Canevari, Giacomo (University of Oxford)Chipot, Michel (University of Zurich)Correggi, Michele (Roma Tre University)Fournais, Søren (University of Aarhus)Golovaty, Dmitry (University of Akron)Helffer, Bernard (Universite de Nantes)Ignat, Radu (Universite Paul Sabatier - Toulouse 3)Jerrard, Robert (University of Toronto)Kachmar, Ayman (Lebanese University)Kovarik, Hynek (Universita degli studi di Brescia)Mironescu, Petru (University Claude Bernard Lyon 1)Nguyen, Luc (University of Oxford)Orlandi, Giandomenico (University of Verona)Palffy-Muhoray, Peter (Kent State University)Park, Jinhae (Chungnam National University)Phillips, Daniel (Purdue University)Raymond, Nicolas (Universite Rennes 1)Rougerie, Nicolas (Universite Grenoble-Alpes and CNRS, LPMMC)Samokhin, Kirill (Brock University)Sandier, Etienne (Universite Paris Est Creteil)Shafrir, Itai (Technion-Israel Institute of Technology)Siegl, Petr (University of Bern)Sternberg, Peter (Indiana University)Wolansky, Gershon (Technion-Israel Institute of Technology)Zarnescu, Arghir (Basque Center for Applied Mathematics)Zheng, Xiaoyu (Kent State University)

Page 98: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] S. ALAMA, L. BRONSARD, R. CHOKSI AND I. TOPALOGLU, Droplet phase in a nonlocal isoperimetricproblem under confinement, arXiv preprint arXiv:1609.03589 (2016).

[2] S. ALAMA, L. BRONSARD, R. CHOKSI AND I. TOPALOGLU, Ground-states for the liquid drop and TFDWmodels with long-range attraction, arXiv preprint arXiv:1707.06674 (2017).

[3] S. ALAMA, L. BRONSARD, A. CONTRERAS, J. DADOK AND P. STERNBERG, A Degenerate IsoperimetricProblem and Traveling Waves to a Bi-stable Hamiltonian System, arXiv preprint arXiv:1504.00423 (2015).

[4] S. ALAMA, L. BRONSARD AND X. LAMY, Minimizers of the Landau-de Gennes energy around a sphericalcolloid particle, arXiv preprint arXiv:1504.00421 (2015).

[5] Y. ALMOG, Thin boundary layers of chiral smectics, Calculus of Variations and Partial Differential Equations,33 (2008), pp. 299–328.

[6] Y. ALMOG, The stability of the normal state of superconductors in the presence of electric currents. SIAM J.Math. Anal. 40 (2) (2008), pp. 824–850.

[7] Y. ALMOG, D. GREBENKOV AND B. HELFFER, On a Schrodinger operator with a purely imaginary potentialin the semiclassical limit, arXiv preprint arXiv:1703.07733 (2017).

[8] Y. ALMOG AND R. HENRY, Spectral analysis of a complex Schrodinger operator in the semiclassical limit.SIAM J. Math. Anal. 44 (2016), pp. 2962–2993.

[9] P. BAUMAN AND D. PHILLIPS, Regularity and the behavior of eigenvalues for minimizers of a constrainedQ, Calculus of Variations and Partial Differential Equations, 55 (2016), pp. 1–22.

[10] L. BERLYAND, J. FUHRMANN AND V. RYBALKO, Bifurcation of traveling waves in a Keller-Segel type freeboundary model of cell motility, arXiv preprint arXiv:1705.10352 (2017).

[11] L. BERLYAND, P. MIRONESCU, V. RYBALKO AND E. SANDIER, Minimax critical points inGinzburg-Landau problems with semi-stiff boundary conditions: existence and bubbling, Comm. Partial Dif-ferential Equations 39(5) (2014), pp. 946-1005.

[12] L. BERLYAND, M. POTOMKIN AND V. RYBALKO, Phase-field model of cell motility: Traveling waves andsharp interface limit, C. R. Math. Acad. Sci. Paris 354 (2016), pp. 986–992.

[13] F. BETHUEL, H. BREZIS AND F. HELEIN, Ginzburg-Landau vortices, Progress in Nonlinear DifferentialEquations and their Applications, 13, Birkhauser Boston Inc., Boston, MA, 1994.

[14] M. BONAFINI, G. ORLANDI AND E. OUDET, Variational approximation of functionals defined on1-dimensional connected sets: the planar case, arXiv preprint arXiv:1610.03839 (2016).

[15] V. BONNAILLIE-NOEL, F. HERAU AND N. RAYMOND, Curvature induced magnetic bound states: towardsthe tunneling effect for the ellipse (2016).

94

Page 99: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Phase Transitions Models 95

[16] H. BREZIS, P. MIRONESCU AND I. SHAFRIR, Distances between classes in W 1,1(Ω; S1), arXiv preprintarXiv:1606.01526 (2016).

[17] H. BREZIS, P. MIRONESCU AND I. SHAFRIR, Distances between homotopy classes of W s,p(SN ;SN ),ESAIM Control Optim. Calc. Var. 22(4) (2016), pp. 1204–1235.

[18] H. BREZIS, P. MIRONESCU AND I. SHAFRIR, Distances between classes of sphere-valued Sobolev maps,C. R. Math. Acad. Sci. Paris 354(7) (2016), pp.677–684.

[19] G. CANEVARI AND A. SEGATTI, Defects in Nematic Shells: a Γ-convergence discrete-to-continuumapproach, arXiv preprint arXiv:1612.07720 (2017).

[20] M. CHIPOT, ` goes to plus infinity, Birkhauser Advanced Texts: Basler Lerhbucher, Birkhauser Verlag, Basel,viii+181 pp, 2002.

[21] M. CHIPOT, Asymptotic issues for some partial differential equations, Imperial College Press, London,ix+252 pp, 2016.

[22] M. CORREGGI AND E. L. GIACOMELLI, Surface superconductivity in presence of corners, Rev. Math. Phys.,29 (2017), 1750005.

[23] M. CORREGGI, D. LUNDHOLM AND N. ROUGERIE, Local density approximation for the almost-bosonicanyon gas, arXiv preprint arXiv:1611.00942 (2016).

[24] M. CORREGGI AND N. ROUGERIE, On the Ginzburg-Landau functional in the surface superconductivityregime, Commun. Math. Phys., 332 (2014), pp. 1297–1343.

[25] M. CORREGGI AND N. ROUGERIE, Effects of boundary curvature on surface superconductivity, Lett. Math.Phys., 106 (2016), 445.

[26] S. FOURNAIS, L. L. TREUST, N. RAYMOND AND J. VAN SCHAFTINGEN, Semiclassical Sobolev constantsfor the electro-magnetic Robin Laplacian, arXiv preprint arXiv:1603.02810 (2016).

[27] D. GOLOVATY, A. MONTERO AND P. STERNBERG, Dimension reduction for the Landau-de Gennes modelon curved nematic thin films, arXiv preprint arXiv:1611.03011 (2016).

[28] D. S. GREBENKOV AND B. HELFFER, On spectral properties of the Bloch-Torrey operator in twodimensions. Submitted. Preprint: http://arxiv.org/abs/1608.01925.

[29] D. S. GREBENKOV, B. HELFFER AND R. HENRY, The complex Airy operator with a semi-permeablebarrier. To appear in SIAM J. Math. Anal (2017). Preprint: http://arxiv.org/abs/1603.06992.

[30] R. HENRY, On the semi-classical analysis of Schrodinger operators with purely imaginary electric potentialsin a bounded domain, arXiv preprint arXiv:1405.6183 (2014).

[31] B. HELFFER AND X.-B. PAN, Reduced Landau–de Gennes functional and surface smectic state of liquidcrystals, Journal of Functional Analysis, 255 (2008), pp. 3008–3069.

[32] R. IGNAT AND R. JERRARD, Interaction energy between vortices of vector fields on Riemannian surfaces,arXiv preprint arXiv:1701.06546 (2017).

[33] R. IGNAT, L. NGUYEN, V. SLASTIKOV AND A. ZARNESCU, Uniqueness results for an ODE related to ageneralized Ginzburg-Landau model for liquid crystals, arXiv preprint arXiv:1404.1724 (2014).

[34] R. IGNAT, L. NGUYEN, V. SLASTIKOV AND A. ZARNESCU, Stability of the melting hedgehog in theLandau-de Gennes theory of nematic liquid crystals, arXiv preprint arXiv:1404.1729 (2014).

[35] R. IGNAT, L. NGUYEN, V. SLASTIKOV AND A. ZARNESCU, Instability of point defects in atwo-dimensional nematic liquid crystal model, arXiv preprint arXiv:1503.03670 (2015).

Page 100: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

96 Five-day Workshop Reports

[36] R. IGNAT, L. NGUYEN, V. SLASTIKOV AND A. ZARNESCU, Stability of point defects of degree ±1

2in a

two-dimensional nematic liquid crystal model, arXiv preprint arXiv:1601.02812 (2016).

[37] R. IGNAT, L. NGUYEN, V. SLASTIKOV AND A. ZARNESCU, On the uniqueness of minimisers ofGinzburg-Landau functionals, arXiv preprint arXiv:1708.05040 (2017).

[38] H. KOVARIK AND K. PANKRASHKIN, On the p-Laplacian with Robin boundary conditions and boundarytrace theorems, Calculus of Variations and Partial Differential Equations, 56 (2017), p. 49.

[39] D. KREJCIRIK AND P. SIEGL, Pseudomodes for Schroedinger operators with complex potentials, arXivpreprint arXiv:1705.01894 (2017).

[40] A. KACHMAR, P. KERAVAL AND N. RAYMOND, Weyl formulae for the Robin Laplacian in the semiclassicallimit, Confluentes Math. (2016).

[41] A. KACHMAR, X.-B. PAN ET AL., Existence of surface smectic states of liquid crystals, arXiv preprintarXiv:1602.06241 (2016).

[42] P. MIRONESCU, Profile decomposition and phase control for circle-valued maps in one dimension, C. R.Math. Acad. Sci. Paris 353(12) (2015), pp. 1087–1092.

[43] B. MITYAGIN AND P. SIEGL, Local form-subordination condition and Riesz basisness of root systems, arXivpreprint arXiv:1608.00224 (2016).

[44] L. MRAD AND D. PHILLIPS, Dynamic analysis of chevron structures in liquid crystal cells, Molecular Crys-tals and Liquid Crystals, 647 (2017), pp. 66–91.

[45] X. B. PAN, Surface superconductivity in applied magnetic fields above HC2, Comm. Math. Phys., 228

(2002), pp. 327–370.

[46] P. PALLFY-MUHORAY, M. Y. PEVNYI, E. G. VIRGA AND X. ZHENG, The Effects of Particle Shape inOrientationally Ordered Soft Materials, IAS/Park City Mathematics Series (to appear).

[47] J. PARK, W. WANG, P. ZHANG, Z. ZHANG, On minimizers for the isotropic-nematic interface problem,Calculus of Variations and Partial Differential Equations 56(2) (2017), pp. 1–15.

[48] K. SAMOKHIN, Symmetry and topology of two-dimensional noncentrosymmetric superconductors, arXivpreprint arXiv:1511.07493 (2015).

[49] E. SANDIER AND I. SHAFRIR, Small energy Ginzburg-Landau minimizers in R3, arXiv preprintarXiv:1612.00821 (2017).

Page 101: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Phase Transitions Models 97

– can’t fix

Page 102: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 9

Algebraic Combinatorixx 2 (17w5012)

May 14 - 19, 2017Organizer(s): Julie Beier (Earlham College), Patricia Muldoon Brown (Armstrong State), RosaOrellana (Dartmouth), Stephanie van Willigenburg (University of British Columbia)

OverviewAlgebraic combinatorics is a large branch of mathematics with strong ties to many areas including representationtheory, computing, knot theory, mathematical physics, symmetric functions and invariant theory. This workshopfocused on the three areas below, inspired by the previous Algebraic Combinatorixx workshop and current trends.

• Algebraic combinatorics and representation theory: The representation theory of Lie algebras, quantumgroups, Hecke algebras / double affine Hecke algebras, diagram algebras, symmetric groups and more utilizealgebraic combinatorics and often take advantage of combinatorial objects such as crystals, Littlemann paths,tableaux, quivers, lattice paths, knots, and alternating sign matrices to discover new results.

• Algebraic combinatorics and geometry: This area includes the understanding of posets, lattices, simpli-cial complexes, CW-complexes, polytopes, and groups or arrangements also associated with these such asreflection groups, braid groups, hyperplane arrangements and descent algebras. This subarea of algebraiccombinatorics connects to tropical geometry, Schubert calculus, Grassmannians and techniques in geometricrepresentation theory.

• Algebraic combinatorics and combinatorial functions: In this category we are referring to symmetric func-tions such as Hall-Littlewood polynomials, Schur functions / k-Schur functions, and Weyl group charactersas well as quasisymmetric functions such as those arising from random walks, P-partitions, 0-Hecke algebrasand Hopf algebras. In addition, the Kazhdan-Lusztig polynomials fit into this category.

These three areas are clearly interconnected and all are of current interest in research. For example, the searchfor an algorithmic description of the coefficients that arise in the inner product of two Schur functions, known asKronecker coefficients, has formed a very active research area over the past twenty years. Most recently this is dueto deep connections with quantum information theory and the central role it plays within geometric complexitytheory, which is an approach that seeks to settle the celebrated P versus NP problem – one of the several 1,000,000Millennium Prize Problems set by the Clay Mathematics Institute. Meanwhile, the search for a combinatorialformula for the product of two Schubert polynomials is another longstanding open problem that has fuelled muchresearch and development in the area due to its connections to algebraic geometry and the resolution of Hilbert’s15th Problem.

98

Page 103: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Algebraic Combinatorixx 2 99

In addition to these mathematical goals, the workshop also sought to foster the community of women in al-gebraic combinatorics, cutting across the false research versus teaching institution divide. Thus, in addition topresentations, work was done during the week in teams; each team focusing on a different problem.

Presentation Highlights

The first day contained eight talks by the team leaders. In these, they presented background and problems that theteams would work on during the week. They are discussed in the following section. On the remaining days weheard a total of 15 excellent talks on current research. It is impossible to do justice to all of these talks but somehighlights are mentioned below.

Helene Barcelo spoke about discrete homotopy theory for graphs, known work of Babson, Barcelo, Kramer, deLongueville, Laubenbacher, Severs, Weaver and White, and argued that this theory is a good analogy to classicalhomotopy theory (cf. [3]). She went on to present a new idea developed by Barcelo, Capraro and White for adiscrete homology on graphs and argued that it gives expected results such as Hurewicz Theorem and a discreteversion of the Eilenbrerg-Steenrod axioms [4]. Moreover, she presents a conjecture that the path homologies ofGrigor’yan, Muranov, and Yau are isomorphic to these discrete homologies for undirected graphs (cf. [10]).

Yui Cai presented work that was joint with Margaret Readdy and published this month on q-stirling numbers[8]. First, she argues that the q-Stirling numbers of the second kind can be understood using weights on allowablerestricted growth words. Next we are presented with a new poset called a Stirling poset of the second kind. Caiwas then able to give a poset explanation and homological interpretation to this work. Finally, we saw that usingrook placements similar techniques could be used to produce the q-stirling numbers of the first kind.

The relationship between claw-contractible-free graphs and the e-positivity of a graph’s chromatic symmetricfunction has been discussed since Stanley’s 1995 paper [16]. Samantha Dahlberg presented results by Dahlberg,Foley and van Willigenburg, just recently posted on the archive, that settle this matter [9]. In this talk, we finallyfind the answer by learning of infinite families of graphs that are not claw-contractible and do not have chromaticsymmetric functions that are e-positive. Moreover, we also saw one such family that is claw free but not e-positive.

Scientific Progress Made

During the week, we were split into eight teams working on eight different projects. Approximately eight hourswere specifically designated for work in teams, and a number of teams met outside of the other workshop activitiesto work farther. In this section we briefly describe each project and summarize the progress made so far.

Chromatic Symmetric Functions

Team Leader: Angele HamelTeam Members: Julie Beier, Samantha Dahlberg, Maria Gillespie, Stephanie van Willigenburg

Consider all of the proper colorings of a graphG. With each coloring,K, associate a monomial xK(v1)xK(v2) . . ..The chromatic polynomial is defined to be the sum of these monomials, and is known to be symmetric. Stanley[and Gasharov] conjecture that all claw free graphs have chromatic polynomials that are Schur positive [17].

We know from the work of Schilling that if one can create a crystal structure on the proper colorings of a graphG then the chromatic polynomial will indeed be Schur positive. We started with a special simple case of claw freegraphs that are already known to be Schur positive, paths on n vertices. We spent the week working on a definitionof Kashiwara operators fi, ei and of φi, εi for path colorings. Subsequently, we started to show that these satisfythe Stembridge axioms, which would give us a crystal structure. At the end of our time, we are still working onrefining our rules so that they will generalize to colorings of non-path graphs, and proving our rules for the pathcase.

Page 104: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

100 Five-day Workshop Reports

Quasisymmetric Analogue of Macdonald PolynomialsTeam Leader: Sarah MasonTeam Members: Cristina Ballantine, Zajj Daugherty, Angela Hicks, Elizabeth Niese

This project was initiated to work toward finding a quasisymmetric analogue to the Macdonald polynomials.It became quickly evident that the naive approach of combining suitable nonsymmetric Macdonald polynomialswould not lead to the desired outcome. Since the initial definition of the Macdonald polynomials is based ona modification of the Hall inner product on the power sum symmetric functions, the team decided to changeapproaches and work toward an appropriate modification of the inner product on QSym and NSym. In order to dothis, a quasisymmetric power sum basis is needed, so this is the task the group spent time on and made significantprogess.

Calculation 1: An explicit calculation of the Cauchy kernel:

∑β

HNβ (X)MQ

β (Y ) =∏j

1−

∑i≥1

xi

yj

−1

Result 1: A quasisymmetric power sum (type 1) PQα such that 〈PQα , PNβ 〉 = δαβzα where zα = zα.

Result 2: A monomial expansion for the quasisymmetric power sum (type 1):

PQα =∑α≤β

cαβMQβ

Conjecture 1: For λ ` n,pλ =

∑α=λ

PQα .

Algebraic Voting Theory & Representations of Sm[Sn]

Team Leader: Kathryn NymanTeam Members: Helene Barcelo, Megan Bernstein, Sarah Bockting-Conrad, Erin McNicholas, Shira Viel

We consider the problem of selecting a committee consisting of one member chosen fromm candidates in eachof n departments. Voters rank the possible committees and a positions scoring method (such as the Borda count)can then be used to select a winning committee. We can view the profile and results spaces of voting information asQSm[Sn] modules, where the wreath product acts on the committees by permuting candidates within a departmentand permuting the departments. Since the positional voting procedure is a QSm[Sn] module homomorphismbetween the spaces, Schur’s Lemma provides information on the voting information lost in the election process aswell as the voting information that determines the outcome.

Our first goal is to decompose the results space R into simple QSm[Sn]-submodules. Let p(m) denote thenumber of partitions of m, and enumerate the set of partitions as (γ1, . . . , γp(m)). These partitions index theirreducible representations of Sm. Note that m of these partitions are hooks; we refer to the others as non-hook.In addition, we define a flat partition of k to be the partition consisting of a single part of size k.

We conjecture R has the following decomposition into simple QSm[Sn]-modules:

R = ⊕(a1+a2+...+ap(m)=nai=0 ∀ non-hook γi

)S(a1,...,apm ) (9.0.1)

where S(a1,...,ap(m)) denotes the simple QSm[Sn]-module indexed by the multi-partition (α1, . . . , αp(m)) of nwhere each αi is the flat partition of |αi| = ai. A proof of Matt Davis shows that Equation (9.0.1) holds in the case

Page 105: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Algebraic Combinatorixx 2 101

when m = 2; we are working on generalizing his proof. Towards that end, our primary result is a computation ofthe character of Vn, the representation of Sm[Sn] corresponding to R.

Let g ∈ Sn[Sm], viewed as a permutation of 11, 12, . . . , 1m, . . . , n1, n2, . . . , nm. (Think of ik as candidatek from department i). Writing g as a product of disjoint cycles, for each i ∈ [n] define the cycle family Fi(g)to be the set of cycles in g containing ik for some k ∈ [m]. By definition of the action of the wreath product,π determines the permutation of the departments. Thus, if ik and jl appear together in some cycle of g, π sendsdepartment i to department j and for all s ∈ [m], is is sent to jt for some t ∈ [m]. We define m(Fi(g)) = #j ∈[n] : Fj(g) = Fi(g), i.e. the number of departments appearing in cycles containing candidates from departmenti. If Fi(g) = Fj(g) we will say j is a member of Fi(g). Thus, m(Fi(g)) is the number of members in Fi(g).Finally, we let F(g) = Fi(g) : i ∈ [n].

Theorem 9.0.1 For each g ∈ Sn[Sm],

χVn(g) =∏

F∈F(g)

# cycles c in g : the length of c is m(F ). (9.0.2)

cd-Index of Eulearian Posets

Team Leader: Margaret ReaddyTeam Members: Yue Cai, Anastasia Chavez, Gizem Karaali, Heather Russell

The Readdy group is currently looking at two questions: how to compute the cd-index of important familiesof Eulerian posets and to discover combinatorial interpretations of the coefficients of the cd-index invariant. A lotof time was spent getting the group to speed on the geometry and coalgebraic structure of flag enumeration forpolytopes. We are in the midst of understanding the cd-index of the family of cyclic polytopes and finding a morestreamlined way to compute the cd-index.

The cyclic polytopes are a family of polytopes central to the Upper Bound Theorem; that is, for fixed dimensionn and vertices f0 the cyclic polytope maximizes the face vector. Further demonstrating this family’s importance,for the cd-index of polytopes, more generally, Eulerian posets, Billera and Ehrenborg showed the cyclic polytopemaximizes each coefficient of the cd-index [5]. In essence, this is an Upper Bound Theorem for the cd-index. TheReaddy group also began discussing a combinatorial interpretation for the cd-index of the permutahedron. Thispolytope will very likely have a more tractable answer as we can use its inherent group structure.

On the Kronecker quasi-polynomials

Team Leader: Mercedes RosasTeam Members: Marni Mishna, Sheila Sundaram

A central result in the representation theory of the symmetric group says that the set of partitions λ of n labelthe irreducible representations Sλ of Sn.

In this setting, a major open problem is to understand the decomposition into irreducibles for the tensor productof representations. The Kronecker coefficients gλµ,ν are the coefficients governing this decomposition,

Sµ ⊗ Sν ∼=⊕µ,ν

gλµ,νSλ

The coefficients gλµ,ν are intriguing and poorly understood, and determining a satisfactory formula for them is oneof the major open questions in algebraic combinatorics.

Our project focuses on a related function. Let Qλµ,ν be the stretching function defined by

Qλµ,ν(n) = gnλnµ,nν

Page 106: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

102 Five-day Workshop Reports

It is known that Qλµ,ν(n) is a quasi–polynomial in n, but little else is known about this function [11]. Recently,Balloni and Vergnes have developed algorithms to compute these polynomials when the length of the partitions issmall [1, 2].

On the other hand, quasi–polynomials appear naturally in the study of dilations of rational polytopes. In thissituation, Ehrhart quasi-polynomials count the number of integer points inside dilations of polytopes. Of course,not all quasi–polynomials correspond to such a counting function.

It is known that Qλµ,ν(n) is not, in general, the counting function for the number of integral points inside thedilations of any polytope, [11, 6].

In our research team we explored the reduced Kronecker coefficients, or equivalently, the Qλµ,ν(n) when µ, νand λ are stable. Here, we refer to a stabilization phenomenon first observed by Murnaghan, [13]. These triplesare important because they include all Littlewood–Richardson coefficients [12], they are themselves Kroneckercoefficients, and the value of the usual Kronecker coefficients can be recovered from them, [7]. Finally, there is awide consensus that the reduced Kronecker coefficients should be easier to understand than the general Kroneckercoefficients.

In our work at Banff, we characterized the Kronecker polytope corresponding to those triples µ, ν and λ with`(µ), `(ν) ≤ 2 and `(λ) ≤ 4, i.e, (2, 2, 4), and programmed our results into MAPLE. This is probably the situationwhere the Kronecker coefficients are best known; it belongs to the case of two-rowed shapes, one of a handful ofcases for which explicit formulas exist [15]. In the following picture there is an example of the resulting polytopeand some of its dilations.

We plan to continue working on this project. Currently we are looking at the following case (3, 3, 9). We areoptimistic that getting a better understanding of this polynomial in this more unknown setting will lead to resultsin the general situation. In particular, we are trying to determine whether the resulting quasi-polynomial is thecounting function for the dilations of a polytope, with the eventual goal of fully describing it.

Minimaj Crystal

Team Leader: Anne SchillingTeam Members: Georgia Benkart, Laura Colmenarejo, Pamela Harris, Rosa Orellana, Greta Panova, Martha Yip,Meesue Yoo

Suppose f is a symmetric function defined on a combinatorial object C, and f has a positive integer expansionin terms of Schur functions. Our team explored the question of whether this implies C has a crystal structure. Asa test case for this investigation, we focused on the particular case that C is the set of ordered multiset partitions of

Page 107: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Algebraic Combinatorixx 2 103

1, 2, . . . , n into k + 1 nonempty blocks. The motivation to consider this particular example was a recent paperby Rhoades, “Ordered Set Partition Statistics and the Delta Conjecture” [14].

We have defined operators on C that we conjecture to be crystal operators and that preserve the minimajstatistic (see the picture below). We have worked out many examples that indicate that the conjecture is correct.We intend to prove that they satisfy Stembridge’s axioms, which will imply that C has a crystal structure. We havea representation theoretic interpretation of C that should serve as a guide.

Minimaj Crystal

Team SchillingGeorgia Benkart, Laura Colmenarejo, Pamela E. Harris,Rosa Orellana, Greta Panova, Martha Yip, Meesue Yoo

Suppose f is a symmetric function defined on a combinatorial object C,and f has a positive integer expansion in terms of Schur functions. Our teamexplored the question if this implies C has a crystal structure. As a test case forthis investigation, we focused on the particular case that C is the set of orderedmultiset partitions of 1, 2, . . . , n into k + 1 nonempty blocks. The motivationto consider this particular example was a recent paper by Rhoades, “OrderedSet Partition Statistics and the Delta Conjecture.”

We have defined operators on C that we conjecture to be crystal operatorsand that preserve the minimaj statistic. We have worked out many examplesthat indicate that the conjecture is correct. We intend to prove that they satisfyStembridge’s axioms, which will imply that C has a crystal structure. We havea representation theoretic interpretation of C that should serve as a guide.

Future directions: Prove the conjecture that this is a crystal by checking theStembridge axioms. To facilitate the work of the team, we plan to split thedi↵erent cases among us. Given the size of the group, we plan to collaboratein smaller subsets to find common times to meet via Skype. Pamela Harris willserve as the team’s communication facilitator.

Step by step plan: Do further big examples to check the crystal operators aredefined correctly. If they are, then try to condense the rules for applying themand reduce the number of cases to consider. If not, try to modify the rules.Relate the minimaj statistic to q-Kostka numbers.

Conjectural fi operator:

1

Reduced Word Bounds

Team Leader: Bridget TennerTeam Members: Susanna Fishel, Elizabeth Milicevic, Rebecca Patrias

Let R(w) denote the set of reduced words for a fixed permutation w in the symmetric group Sn. Identifyingall reduced words which are related by a sequence of commutation relations, one obtains the well-studied set ofcommutation classes for w, defined by C(w) = R(w)/[ij ∼ ji] where |i − j| > 1. Similarly, one defines the setof braid classes for w to be B(w) = R(w)/ [i(i+ 1)i ∼ (i+ 1)i(i+ 1)] , which have received comparatively lessattention in the literature. Our goal for the week at BIRS was to study the braid classes of a fixed permutation, andwe decided to focus on an enumerative problem in particular.

Problem: What is the number |B(w)| of braid classes in the set of all reduced words R(w)?Our strategy for approaching this problem was to simultaneously study the number of commutation and braid

classes. Since R(w) is a finite set, we can organize the elements of R(w) into a table which lists the elementsin each distinct commutation class C1, C2, . . . , Ck across in rows, and the elements in each distinct braid classB1, B2, . . . , Bm in columns. If we only record nonempty classes, there must be at least one element of R(w) ineach row and column. It is straightforward to prove that each entry in this table contains at most one element ofR(w). Combining this observation with the fact that any two reduced words for w are related by a sequence ofcommutation relations and/or braid relations, we obtain sharp upper and lower bounds on the number of reducedwords in terms of the number of braid and commutation classes.

Proposition: For any w ∈ Sn, we have |C(w)|+ |B(w)| − 1 ≤ |R(w)| ≤ |C(w)||B(w)|.Having proved this proposition early in the week, our goal for the remainder of the workshop was to determine

precisely which permutations realize these upper and lower bounds. Since both |R(w)| and |C(w)| have been fairlywell-studied, achieving this goal would then yield closed formulas for |B(w)| for such families of permutations.

Theorem 9.0.2 (Fishel-Milicevic-Patrias-Tenner) There is a classification of the elementsw ∈ Sn which achievethe above upper and lower bounds on |R(w)|:

1. |R(w)| = |C(w)||B(w)| if and only if |C(w)| = 1 or |B(w)| = 1.

2. |R(w)| = |C(w)|+ |B(w)| − 1 if and only if w lies in one of a few special families.

Page 108: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

104 Five-day Workshop Reports

Regarding those elements which achieve the upper bound, the permutations which have a single commutationclass are called fully commutative and can be characterized in terms of pattern avoidance. Similarly, it is possibleto completely describe those permutations which have a single braid class. For the lower bound, in fact we provesomething stronger by listing all permutations such that |R(w)| = |C(w)| + |B(w)| − e, where e ∈ 0, 1. Foreach such permutation, there exists an ordering on the rows and columns in the table discussed above such that theresulting pattern is “serpentine”, and this visualization provided a useful platform upon which to build our analysis.We will continue by writing up these results and submitting them for publication.

Ehrart Theory of Alcoved and Tropical PolytopesTeam Leader: Josephine YuTeam Members: Emily Barnard, Carolina Benedetti, Patricia Brown

Our project focuses on characterizing the Ehrhart polynomials of alcoved and tropical polytopes. To this endwe started our analysis focusing on 2-dimensional alcoved polytopes. It is known that the leading coefficient of theEhrhart polynomial of a lattice polytope P is the (Euclidean) volume of the polytope and that the second leadingcoefficient is half of the boundary volume (normalized so that the standard simplex in a lattice has volume 1).For example, the standard 3-dimensional simplex has Ehrhart polynomial x3 + 3x2 + 3x + 1. We obtained thefollowing lemma whose proof we omit.

Lemma: Let A be the normalized area (twice Euclidean area) and P be the (lattice) perimeter of an alcovedpolygon. Then the pair (A,P ) satisfies the following relations.

1. If the alcoved polygon has no interior lattice points, then A+ 2 = P .

2. If the alcoved polygon contains an interior lattice point and is not three times a smallest simplex, thenA+ 8 ≥ 2P (Scott’s inequality).

3. 6A ≤ P 2, and equality is achieved exactly for regular hexagons.

We know that not every pair (A,P ) satisfying the conditions above can be achieved by an alcoved polygon. Inview of this, we are making steps towards the following problems:

1. Characterize the pairs (A,P ) that come from alcoved polygons. The next section may be useful.

2. Study higher dimensions.

3. Study h∗ vector, which coincides with (in reverse order) the h-vector of the alcoved triangulation (or anyunimodular triangulation) of the polytope.

4. Understand families of alcoved polytopes whose Ehrhart and/or h∗ polynomial are palindromic and uni-modal.

5. Some alcoved polytopes arise from matroids. Using the known Hopf algebras structures on matroids, canwe define similar operations on (families of) alcoved polytopes?

Some of the results above can be extended in the setting of tropical polytopes as follows.

Lemma: For lattice tropical polygons, the area A and perimeter P satisfy 6A ≤ P 2, where the perimeter Pof a pendant line segment is defined as twice its lattice length. The Ehrhart coefficients have the same interpretation.

We can decompose an alcoved polygon into ribbons by slicing it with parallel hyperplanes (lines). An alcovedpolygon is thus determined by the numbers of triangles in the layers. A sequence of positive integers come froman alcoved polygon this way if and only if

1. it is unimodal,

Page 109: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Algebraic Combinatorixx 2 105

2. only the peak can be even, and

3. only the peak can repeat.

Since there are three different hyperplane directions, different sequences may give the same alcoved polygon. Weplan to characterize ”canonical” sequences giving rise to alcoved polytopes.

Each alcoved polytope has an “alcoved triangulation” obtained by slicing with all hyperplanes of the formxi−xj = c where c ∈ Z. The dual graph of an alcoved triangulation is a subgraph of the tiling of Rn/(1, 1, . . . , 1)by regular permutohedra. Recall that all the two dimensional faces of a permutohedron (in any dimension) is eithera square or a hexagon.

Conjecture: A subgraph of the permutohedral tiling is dual to the alcoved triangulation of an alcoved polytopeif and only if

1. it is connected and induced,

2. if three vertices of a square are in the subgraph, then so is the fourth, and

3. if four vertices of a hexagon are in the subgraph, then so are the other two.

This gives us a way to study higher dimensional alcoved polytopes using graphs only.

Open problems.

1. Make connections to reduced words in affine Weyl groups.

2. Generalize to Weyl groups of other types.

Outcomes of the MeetingAs detailed above, all teams made significant progress on their problems. According to our team reports, they allhave concrete plans to continue to work together to finish at least the problem they started at the meeting. Severalteams are planning to submit papers within the next year and some have already scheduled times to meet in personagain. Moreover, many people commented that they learned new mathematics or who to talk to about their work.This is fantastic and was an important goal.

MSRI generously agreed to provide $25,000 of funding for participants from this workshop to meet as teamsthis summer for continued work. We began accepting applications on June 1st and have selected four teams thatwill be meeting at MSRI during July and August of 2017. We are appreciative of the MSRI’s support and believetheir support illustrates the significance of the research completed during this workshop.

We also would like to acknowledge that the Association for Women in Mathematics, AWM, provided fundingfor several of the US based participants to attend this workshop. This was absolutely vital to us creating a commu-nity with such diversity, and we appreciate their support. The financial support came from both their establishedtravel funding program and from the new Advance grant program, NSF-HRD 1500481 - AWM ADVANCE grant.

In addition to the mathematical outcomes, it is clear from the evaluations that the community was enhancedas almost every respondent referred to the value of the networking, mentoring and conversations that occurredduring the week. We had participants from numerous countries, people at many different points in their career, andwomen who work at a wide range of institutions. Respondents overwhelmingly valued the team structure and feltthe conference was well organized. Every respondent said that they believed this was a valuable use of their timeand several commented that they strongly hope there will be another Algebraic Combinatorixx in the future.

The comments from the week and in the evaluations were incredibly positive, and we share the participants’gratitude to BIRS for providing such a fantastic space to work and amazing staff to help. Below is a small sampleof some of the comments about the significance of the week for them:

• “I feel like I learned more algebraic combinatorics in 4 days than in years before. I learned so much. I alsomade personal and professional connections that I will maintain going forward”

Page 110: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

106 Five-day Workshop Reports

• “It has helped to re-energize me. ... I learned a lot of cool math, and my group made serious progress on ourproject and has plans to continue.”

• (Listed as a strength of the workshop) “Bringing together an extraordinary group of mathematicians - [the]talks were so impressive and informative”

• “I think my group will write a paper and continue to collaborate.”

• “My team made great progress and I am confident that we will be able to write a nice paper.”

• “The supportive community here was exceptional.”

• “The discussions at meals, in groups, in the evening discussions were all amazingly friendly and helpful.”

• “I most appreciated the way the organizers and more experienced attendees fostered a positive, supportiveenvironment for collaboration and mentoring. I received so much valuable advice and mathematical insight.”

• “I really appreciated that there were different kinds of mathematicians here - those from liberal arts colleges[and] those from R1 schools. That mix was helpful; I benefited from many perspectives and many differentways of doing math.”

• “This networking is alone worth the trip.”

• “I just really enjoyed the experience overall and felt energized professionally and personally.”

• “I think this has been my favorite conference/workshop!”

• “I would do this every summer if I could.”

ParticipantsBallantine, Cristina (College of the Holy Cross)Barcelo, Helene (Mathematical Sciences Research Institute)Barnard, Emily (North Carolina State University)Beier, Julie (Earlham College)Benedetti , Carolina (Fields Institute)Benkart, Georgia (University of Wisconsin-Madison)Bernstein, Megan (Georgia Institute of Technology)Bockting-Conrad, Sarah (DePaul University)Brown, Tricia (Armstrong State University)Cai, Yue (Texas A&M University)Chavez, Anastasia (University of California - Berkeley)Cho, Soojin (Ajou University, South Korea)Colmenarejo, Laura (University of Massachusetts at Amherst)Dahlberg, Samantha (University of British Columbia)Daugherty, Zajj (City College of New York)Fishel, Susanna (Arizona State University)Gillespie, Maria (University of California - Davis)Hamel, Angele (Wilfrid Laurier University)Harris, Pamela (Williams College)Hicks, Angela (Lehigh University)Karaali, Gizem (Pomona College)Mason, Sarah (Wake Forest University)McNicholas, Erin (Willamette University)Milicevic, Elizabeth (Haverford College)

Page 111: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Algebraic Combinatorixx 2 107

Mishna, Marni (Mathematics, Simon Fraser University)Niese, Elizabeth (Marshall University)Nyman, Kathryn (Willamette University)Orellana, Rosa (Dartmouth College)Panova, Greta (University of Pennsylvania and IAS Princeton)Park, Kyoungsuk (IDECCA Inc.)Patrias, Rebecca (Universite du Quebec a Montreal)Readdy, Margaret (University of Kentucky)Rosas, Mercedes (Univeristy of Sevilla)Russell, Heather (University of Richmond)Schilling, Anne (University of California Davis)Sundaram, Sheila (Pierrepont School)Tenner, Bridget (DePaul University)van Willigenburg, Stephanie (University of British Columbia)Viel, Shira (North Carolina State University)Yip, Martha (University of Kentucky)Yoo, Meesue (Universitat Wien)Yu, Josephine (Georgia Institute of Technology)

Page 112: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] V. Baldoni and M Vergne, Multiplicity of compact group representations and applications to Kronecker coef-ficients, ArXiv e-prints, (2015).

[2] V. Baldoni, M. Vergne, and M. Walter, Computation of Dilated Kronecker Coefficients, ArXiv e-prints, (2016).

[3] E. Babson, H. Barcelo, M. de Longueville and R. Laubenbacher, Homotopy theory for graphs, J. AlgebraicComb. 24 (2006) 31-44.

[4] H. Barcelo, V. Capraro, and J. White, Discrete homology theory for metric spaces, Bulletin of the LondonMath. Soc. 46 (2014) 889-905.

[5] L. Billera, and R. Ehrenborg, Monotonicity of the cd-index for polytopes, Mathematische Zeitschrift 233(2000), 421-441.

[6] , E. Briand, R. Orellana, and M. Rosas, Reduced Kronecker coefficients and counter–examples to Mulmuley’sstrong conjecture SH, ArXiv:0810.3163 (2008)

[7] E. Briand, R. Orellana, and M. Rosas, The stability of the Kronecker product of Schur functions, J. Algebra,331 (2011), 11–27.

[8] Y. Cai and M. Readdy, q-Stirling numbers: A new view, Advances in Applied Mathematics 86 (2017), 50 - 80.

[9] S. Dahlberg, A. Foley, and S. van Willigenburg, Resolving Stanley’s e-positivity of claw-contractible-freegraphs, arXif:1703.05770 [math.CO] (2017).

[10] A. Grigor’yan, Y. Muranov, and S.-T. Yau, Homologies of digraphs and Kunneth formulas, Comm. Anal.Geom, To appear.

[11] R. King, Some remarks on characters of symmetric groups, Schur functions, Littlewood-Richardson and Kronecker coefficients. “Mathematical Foundations of Quantum Information”, Sevilla,http://congreso.us.es/enredo2009/School files/Sevilla King.pdf (2009).

[12] D. E. Littlewood., The Kronecker product of symmetric group representations. J. London Math. Soc., 31(1956), 89-93.

[13] F. D. Murnaghan, The Characters of the Symmetric Group. Amer. J. Math., 59 (1937), 739-753.

[14] B. Rhoades, Ordered set partition statistics and the delta conjecture. Arxiv:1605.04007 (2016).

[15] M. H. Rosas., The Kronecker product of Schur functions indexed by two-row shapes or hook shapes. J.Algebraic Combin., 14 (2001), 153–173.

[16] R. Stanley, A Symmetric function generalization of the chromatic polynomial of a graph, Adv. Math. 111(1995), 166-194.

[17] R. Stanley, Graph colorings and related symmetric functions: ideas and applications, Discrete Math 193(1998), 267-286.

108

Page 113: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 10

Recent Advances in Discrete and AnalyticAspects of Convexity (17w5074)

May 21 - 26, 2017Organizer(s): Karoly Bezdek (University of Calgary), Alexander Koldobsky (University ofMissouri), Dmitry Ryabogin (Kent State University), Vladyslav Yaskin (University of Alberta),Artem Zvavitch (Kent State University)

Overview of the FieldConvexity is a very old topic which can be traced back at very least to Archimedes. It shares ideas and meth-ods from many fields of Mathematics, including Differential Geometry, Discrete Geometry, Functional Analysis,Harmonic Analysis, Geometric Tomography, Combinatorics, Probability, and it has numerous applications. Theaim of the meeting was to discuss most recent developments in the area and interrelate new discrete and analyticmethods.

Presentation HighlightsThe topics of the workshop included convex geometry, discrete geometry, theory of valuations, geometric inequal-ities, probability, and geometric functional analysis.

Grigoris Paouris presented his work “Affine isoperimetric inequalities on flag manifolds” joint with S. Dannand P. Pivovarov. LetK be a compact set inRn (and convex, depending on the context) and k be an integer 1 ≤ k ≤n− 1. The following are known as affine quermassintegrals and dual affine quermassintegrals correspondingly:

Φk(K) =

(∫G(n,k)

|K|F |−n)−1/(nk)

dF,

Ψk(K) =

(∫G(n,k)

|K ∩ F |n)1/(nk)

dF,

where G(n, k) is the Grassmanian of the k-dimensional subspaces in Rn.The results of Furstenberg-Tzkoni and Grinberg show that Ψk is invariant under linear volume-preserving

transformations, and Φk is invariant under affine volume-preserving transformations. In this talk affine and dual

109

Page 114: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

110 Five-day Workshop Reports

affine quermassintegrals were extended from the Grassmanian to the flag manifolds. The corresponding invariantproperties were also established.

Busemann-Strauss and Grinberg established the following extremizing inequality for Ψk. Let K be of volume1 and Dn be the Euclidean ball of volume 1, then Ψk(K) ≤ Ψk(Dn). Lutwak asked whether the followinginequality is true: Φk(K) ≥ Φk(Dn). Paouris and Pivovarov earlier proved that the latter holds with an absolutemultiplicative constant. In this talk the authors extended these results to the setting of flag manifolds. Somefunctional forms of these constructions were also discussed.

Peter Pivovarov spoke about his joint work “On a quantitative reversal of Alexandrov’s inequality” withG. Paouris and P. Valettas. Alexandrov’s inequality implies that for a convex body K we have the following:(

Vn(K)

Vn(B)

)1/n

≤(Vn−1(K)

Vn−1(B)

)1/(n−1)

≤ · · · ≤ V1(K)

V1(B),

where B is the unit Euclidean ball and V1,..., Vn are the intrinsic volumes.Milman’s random version of Dvoretzky’s theorem shows that a large initial segment of this sequence is essen-

tially constant, up to a critical parameter called the Dvoretzky number. The authors showed that this near-constantbehavior actually extends further, up to a different parameter associated with K. Namely, set

β∗ = β∗(K) =V ar(hK(g))

(EhK(g))2,

where hK is the support function of K and g is a standard Gaussian random vector in Rn. It is shown that thereexists a constant c > 0 such that if K is a symmetric convex body in Rn and 1 ≤ k ≤ c/β∗(K), then

V1(K)

V1(B)≤(

1 + c

√kβ∗ log

(e

kβ∗

))(Vk(K)

Vk(B)

)1/k

.

This yields a new quantitative reverse inequality that sits between the approximate reverse Urysohn inequality,due to Figiel–Tomczak-Jaegermann and Pisier, and the sharp reverse Urysohn inequality for zonoids, due to Hug–Schneider.

Liran Rotem gave a talk “Powers of convex bodies” based on a joint work with V. Milman. The main questiondiscussed in the talk is the following: given a convex body K in Rn and a number α ∈ R, is there a naturaldefinition for the power Kα. If α = −1, then the natural definition of K−1 is Ko, the polar of K. This is dueto the fact that polarity satisfies properties analogous to those of the function x 7→ 1/x. The authors suggested aconstruction of Kα for 0 < α < 1 (again the motivation was that this operation should be similar to the functionx 7→ xα). This is done by first defining this operation for ellipsoids, in a natural way, and then passing to generalbodies by using ellipsoidal envelopes.

In her talk “Recent results on approximation of convex bodies by polytopes”, Elisabeth Werner discussed tworesults. The first one, joint with J. Grote, generalizes a theorem by Ludwig, Schuett and Werner on approximationof a convex body K in the symmetric difference metric by an arbitrarily placed polytope with a fixed number ofvertices. Namely, let K be a convex body in Rn, n ≥ 2, that is C2

+. Let f : ∂K → R+ be a continuous strictlypositive function with ∫

∂K

f(x)dµ∂K(x) = 1,

where µ∂K is the surface area measure on ∂K. Then for sufficiently large N there exists a polytope Pf in Rnhaving N vertices such that

voln(K∆Pf ) ≤ aN−2/(n−1)

∫∂K

κK(x)1/(n−1)

f(x)2/(n−1)dµ∂K(x),

where a > 0 is an absolute constant, and κK is the Gaussian curvature.The second recent result, joint with S. Hoehner and C. Schuett, gives a lower bound, in the surface deviation, on

the approximation of the Euclidean ball by an arbitrary positioned polytope with a fixed number of k-dimensionalfaces.

Page 115: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Recent Advances in Discrete and Analytic Aspects of Convexity 111

Dan Florentin spoke about “New Prekopa Leindler Type Inequalities and Geometric Inf-Convolution of Func-tions”. The talk is based on a joint work with S. Artstein and A. Segal. Consider the class Cvx0(Rn) of non-negative convex functions on Rn vanishing at the origin (these are called geometric convex functions). The authorsdefine a geometric inf-involution φλψ by first providing a geometric interpretation and then showing the preciseformula:

(φλ ψ)(z) = inf0<t<1

infz=(1−t)x+ty

max

1− t1− λφ(x),

t

λψ(y)

.

Then they prove that for λ ∈ (0, 1) and φ, ψ ∈ Cvx0(Rn) one has∫e−φλψ ≥

(1− λ∫e−φ

+λ∫e−ψ

)−1

.

They also discuss other forms of this inequality and explain why this is an analogue of the Prekopa-Leindlerinequality.

Bo’az Klartag presented his work on “Convex geometry and waist inequalities”. The spherical waist inequalitystates that any continuous function f from the unit sphere Sn to Rl has a large fiber, i.e., the (n − l)-dimensionalvolume of some fiber f−1(t) is at least as large as that of Sn−l.

Here the author proves the following result. Let K be a convex body in Rn and 1 ≤ 1 ≤ n. Then for anycontinuous function f : K → Rl,

supt∈Rl

voln−l(f−1(t)) · sup

E∈AG(n,l)

voll(K ∩ E) ≥ voln(K).

Furthermore, if K ⊂ Rn is a convex body of volume 1, then there exists a volume-preserving linear map TKsuch that K = TK(K) has the following property. Let 1 ≤ l ≤ n and f : K → Rl be a continuous map. Thenthere exists t ∈ Rl with

voln−l(f−1(t)) ≥ cn−l,

where c > 0 is a universal constant.The theme of waist inequalities is continued in the talk of Arseniy Akopyan “Waists of balls in different

spaces”, based on a joint work with R. Karasev and A. Hubard. The speaker starts with the following curiousquestion. Can one map a shape of width 1 into a shape inside the strip of width 0.99 in such a way that thedistances do not decrease? The answer is “No”, as follows from the following result of the authors. For any convexbody K ∈ Rn and a continuous map f : K → Rn−1 there exists a fiber f−1(y) of 1-Hausdorff measure at leastthe width of K.

Further the speaker presents a number of waist inequalities for balls in spaces of constant curvature, tori,parallelepipeds, projective spaces and other metric spaces.

Christos Saroglou spoke about his joint work with S. Myroshnychenko and D. Ryabogin on “Star bodies withcompletely symmetric sections”. The starting point of this research was the question: Let K be a convex body inRn, n ≥ 3. If all orthogonal projections of K are 1-symmetric, is it true that K must be a Euclidean ball? (A bodyis said to be 1-symmetric if its symmetry group contains the symmetry group of a cube of the same dimension).

The authors gave a positive answer to this problem, as well as other closely related questions. For, example thecorresponding problem for sections also has a positive answer. In fact, a more general statement is true. If f is aneven function on the sphere whose restriction to every equator is isotropic, then f is a constant.

Jaegil Kim presented a joint work with S. Dann and V. Yaskin titled “Busemann’s intersection inequality inhyperbolic and spherical spaces”. A version of Busemann’s intersection inequality says that ellipsoids in Rn arethe only maximizers of the quantity ∫

Sn−1

|K ∩ ξ⊥|ndξ (10.0.1)

in the class of star bodies of a fixed volume.The authors study this question in the hyperbolic space Hn and the sphere Sn. It is shown that in Hn the

centered balls are the unique maximizers of (10.0.1) in the class of star bodies of a fixed volume. However, on thesphere the situation is completely different. It is surprising that in Sn with n ≥ 3 the centered balls are neithermaximizers nor minimizers of (10.0.1), even in the class of origin-symmetric convex bodies.

Page 116: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

112 Five-day Workshop Reports

Ning Zhang spoke about his work “On bodies with congruent sections by cones or non-central planes”. Let Kand L be convex bodies in R3 such that the projections K|H and L|H are congruent for every subspace H . Doesthis imply that K is a translate of ±L. This question is open. Ryabogin solved a version of this problem whencongruency is replaced by rotation. In the language of functions his result can be stated as follows. Let f and gbe continuous functions on S2 such that for every 2-dimensional subspace H there is a rotation φH in H such thatf(θ) = g(φH(θ)) for all θ ∈ S2 ∩H . Then f(θ) = g(θ) or f(θ) = g(−θ) for all θ ∈ S2.

The speaker considered a similar question for small circles on the sphere, instead of great circles. Namely,consider a fixed t ∈ (0, 1) and let f and g be continuous functions on S2 such that for every 2-dimensional affinesubspace H that is distance t from the origin there is a rotation φH in H such that f(θ) = g(φH(θ)) for allθ ∈ S2 ∩H . Is it true that f(θ) = g(θ) for all θ ∈ S2. In this talk it is shown that the answer to this question isaffirmative if f and g are of class C2(S2).

Petros Valettas gave a talk about “A Gaussian small deviation inequality”, a joint work with G. Paouris. TheGaussian concentration phenomenon states that for any L-Lipschitz map f : mathbbRn → R one has

P (|f(Z)−M | > t) ≤ exp

(−1

2t2/L2

),

for all t > 0, where Z is an n-dimensional standard Gaussian random vector and M is the median for f(Z).The authors are interested in refining (a one-sided version of) this inequality by replacing L with the variance

Varf(Z). It is known that Varf(Z) ≤ L2. However, there are many Lipschitz maps for which Varf(Z) L2.Their main result reads as follows. For any convex map f ∈ L2(γn) one has

P (f(Z)−M < −t) ≤ exp1

2

(− π

1024t2/Varf(Z)

),

for all t > 0.Monika Ludwig presented her joint work with L. Silverstein “Valuations on lattice polytopes”. The study of

valuations on convex bodies is a classical area. There are well-known classifications of such valuations. In this talkthe authors study valuations on the spaceP(Zn) of lattices polytopes (these are convex hulls of finitely many pointsfrom Zn). A natural question is to classify SLn(Z) and translation invariant valuations on P(Zn). Some workin this direction was done by Betke and Kneser in the case of real-valued valuations. Here the authors prove thefollowing result about vector-valued valuations. Z : P(Zn)→ Rn is an SLn(Z) equivariant, translation covariantvaluation if and only if there exist c1,..., cn+1 ∈ R such that

Z = c1l1 + · · ·+ cn+1ln+1.

Here, li’s are defined by the formula

l(kP ) =

n+1∑i=1

li(P )ki,

where P is a lattice polytope, k is a positive integer, and

l(P ) =∑

x∈P∩Znx

is the discrete moment vector of P .The next natural step is to look at what happens for tensor-valued valuations. They find a classification of

SLn(Z) equivariant, translation covariant valuations with values in the space of symmetric tensors of rank at most8. The case of tensors of rank 9 and higher appears more complicated.

Franz Schuster talked about “Even SO(n) Equivariant Minkowski Valuations – An Update”. The main themeis finding Hadwiger-type theorems for Minkowski valuations. Recall that a map Φ from the set Kn of convexbodies in Rn to itself is called a Minkowski valuation if

Φ(K ∩ L) + Φ(K ∪ L) = Φ(K) + Φ(L),

Page 117: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Recent Advances in Discrete and Analytic Aspects of Convexity 113

whenever K, L, and K ∪ L are in Kn.The first problem is to describe the set MValSO(n) of continuous Minkowski valuations, which are translation

invariant and SO(n) equivariant.The second problem is to find a classification/description of MVal

SO(n)i . Here, MVal

SO(n)i = Φ ∈ MValSO(n) :

Φ(λK) = λiΦ(K).The speaker discussed recent progress and insights on these problems. In particular, Wannerer and Schuster

obtained the following result. If Φi ∈ MValSO(n),∞i (∞ refers to smooth valuations), 1 ≤ i ≤ n − 1, then there

exists a unique function gΦi ∈ C∞([−1, 1]) such that

h(ΦiK,u) =

∫Sn−1

gΦi(〈u, v〉)dSi(K, v), u ∈ Sn−1,

where dSi is the i-th surface area measure.Of course, the natural goal is to remove smoothness in the above theorem. Recently, Dorrek proved the follow-

ing. If Φi ∈ MValSO(n)i , then there exists a unique function gΦi ∈ L1([−1, 1]) such that

h(ΦiK,u) =

∫Sn−1

gΦi(〈u, v〉)dSi(K, v), u ∈ Sn−1.

The description of such functions gΦi is still an open problem.Wolfgang Weil spoke about “Integral representations of mixed volumes”, a joint work with D. Hug and J. Rataj.

Recall that mixed volumes arise as coefficients in the following expansion:

vol(t1K1 + · · ·+ tmKm) =

m∑i1=1

· · ·m∑

in=1

ti1 · · · tinV (Ki1 , ...,Kin),

where K1,...,Km are convex bodies and t1,...,tm are positive numbers.It would be important to represent mixed volumes by integrals of local quantities of K1,...,Km. In the case

when there are only two bodies such a formula was obtained earlier by the same authors. There is a function fj,n−jsuch that for all K, M (in suitable general position)

V (K[j],M [n− j]) =

∫F (n,n−j+1)

∫F (n,j+1)

fj,n−j(u1, L1, u2, L2)ψj(K, d(u1, L1))ψn−j(M,d(u2, L2)),

where ψj(K, ·) and ψn−j(M, ·) are flag measures of K and M and the integration is over the manifolds of flagsF (n, n− j + 1) and F (n, j + 1) respectively, which are defined as follows:

F (n, j + 1) = (u, L) : L ∈ G(n, j + 1), u ∈ Sn−1 ∩ L,

ψj(K, ·) =

∫G(n,j+1)

1((u, L) ∈ ·)S′j(K|L, du)dL.

Recently the authors extended this result to the case of more than two bodies. The formula is similar to the oneabove.

Martin Henk gave a talk about “The even dual Minkowski problem”, based on joint works with K. Boroczkyand H. Pollehn. The Minkowski-Christoffel problem asks for characterizations of area measures Si(K, ·) of aconvex body K among all finite Borel measures on the sphere. When 1 < i < n− 1 the problem is still open.

Lutwak initiated the dual Brunn-Minkowski theorem. Instead of the Minkowski addition as in the classicalsetting, here one uses the radial addition. In this theory there is a local dual Steiner formula, which allows todefine ith dual curvature measures Ci(K, ·). Moreover, there are explicit formulas for these measures that allowan extension from an integer i to a real number q. The dual Minkowski problem asks for necessary and sufficientconditions for a given finite Borel measure on the sphere to be the qth dual curvature measure of some convex bodyK. The authors found the following necessary condition. Let K be an origin-symmetric convex body, q ∈ (1, n),and L be a proper subspace. Then

Ci(K,Sn−1 ∩ L) < min

1,

dimL

q

Ci(K,S

n−1).

Page 118: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

114 Five-day Workshop Reports

Later Zhao, Boroczky, Lutwak, Yang, Zhang showed that this condition is also sufficient.What happens for other values of q? Zhao found a necessary and sufficient condition when q < 0. Henk and

Pollehn gave a necessary condition for q ≥ n + 1. Let K be an origin-symmetric convex body in Rn. Then forevery proper subspace L ⊂ Rn,

Ci(K,Sn−1 ∩ L) <

q − n+ dimL

qCi(K,S

n−1),

and the bound is best possible.An open question is whether this condition is also sufficient.Hermann Konig presented his work “Submultiplicative operators in Ck-spaces”, joint with D. Faifman and

V. Milman. Multiplicative operators have been studied by many authors and the corresponding characterizationwere obtained. In this talk the goal is to look at the stability properties. Can we go from multiplicative operatorsto submultiplicative?

The first main result is the following. Let I ⊂ Rn be an open set and k ∈ N0. Consider a map T : Ck(I) →Ck(I) that is bijective and submultiplicative, meaning that for all f , g ∈ Ck(I) one has

T (f · g) ≤ T (f) · T (g).

T is also assumed to be pointwise continuous and satisfying the property f ≥ 0 if and only if Tf ≥ 0. Then thereexist functions p, A ∈ C(I), satisfying p > 0, A ≥ 1, and a Ck-diffeomorphism u so that

for k = 0, T f(u(x)) =

f(x)p(x), f(x) ≥ 0,−A(x)|f(x)|p(x), f(x) < 0,

for k > 0, T f(u(x)) = f(x).

The second main result, which is joint with V. Milman, reads as follows. Let φ : R→ R be submultiplicative,i.e.

φ(xy) ≤ φ(x)φ(y), x, y ∈ R.φ is also assumed to be measurable, continuous at 0, 1, and satisfying φ(−1) < 0 < φ(1). Then there existnumbers p > 0, A ≥ 1 so that

φ(x) =

xp, x ≥ 0,−A|x|p, x < 0.

Konstantin Tikhomirov spoke about “Superconcentration, and randomized Dvoretzky’s theorem for spaceswith 1-unconditional bases”. For an origin-symmetric convex body B in Rn and a linear operator U : Rn → Rndefine

`(B,U) =(E‖U(G)‖2B

)1/2,

where G is the standard Gaussian vector in Rn. The body B is said to be in `-position if `(B, Idn) = 1 and

1 = det Idn = sup|detU | : U ∈ Rn×n, `(B,U) ≤ 1

.

The main result is the following. Let B be an origin-symmetric convex body in Rn in the `-position, and such thatthe space (Rn, ‖ · ‖B) has a 1-unconditional basis. Further, let ε ∈ (0, 1/2] and k ≤ cε log n/ log 1

ε . Then for arandom k-dimensional subspace E ⊂ Rn uniformly distributed according to the Haar measure, one has

PB ∩ E is (1 + ε)-spherical ≥ 1− 2n−cε,

where c > 0 is a universal constant.This shows that the “worst-case” dependence on epsilon in the randomized Dvoretzky theorem in the ell-

position is significantly better than in John’s position.Alexander Litvak presented his joint work with K. Tikhomirov titled “Order statistics of vectors with depen-

dent coordinates”. In 2000 Mallat and Zeitouni posed the following question. Let X = (X1, ..., Xn) be an

Page 119: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Recent Advances in Discrete and Analytic Aspects of Convexity 115

n-dimensional random Gaussian vector with independent centered coordinates (possibly with different variances).Let T be an orthogonal transformation of Rn and Y = T (X). Is it true that for every k ≤ n one has

Ek∑j=1

j- mini≤n

X2i ≤ E

k∑j=1

j- mini≤n

Y 2i ?

Here, for a sequence of numbers a1,..., an and j ≤ n, j- min1≤i≤n ai denotes the j-th smallest element of thesequence.

In their work the authors solve this problem in the affirmative (up to an absolute multiplicative constant). Thatis, there is an absolute constant C > 0 such that

Ek∑j=1

j- mini≤n

X2i ≤ CE

k∑j=1

j- mini≤n

Y 2i .

Matthew Stephen presented his joint work with N. Zhang “Grunbaum’s inequality for projections”. Let K bea convex body in Rn whose centroid is at the origin. Let ξ ∈ Sn−1 and denote ξ+ = x ∈ Rn : 〈x, ξ〉 ≥ 0.Grunbaum’s inequality says that

voln(K ∩ ξ+)

voln(K)≥(

n

n+ 1

)n,

with equality at the cone.Another closely related inequality is due to Minkowski and Radon. If K is a convex body with centroid at the

origin and hK is its support function, then

hK(ξ)

hK(ξ) + hK(−ξ) ≥1

n+ 1,

with equality at the cone.Stephen and Zhang realized that there should be a connection between these two inequalities. Moreover, they

found a general inequality that has these two as particular cases. They proved the following. Fix 1 ≤ k ≤ n andtake E ∈ G(n, k), ξ ∈ Sn−1 ∩ E. If K is a convex body with centroid at the origin, then

volk((K|E) ∩ ξ+)

volk(K|E)≥(

k

n+ 1

)k.

The equality condition is also characterized.Susanna Dann talked about “Flag area measures”, a joint work with J. Abardia and A. Bernig. Let V be an

Euclidean vector space of dimension n and let K(V ) be the space of non-empty compact convex subsets in V . Avaluation on V is a map µ : K(V )→ R satisfying

µ(K ∪ L) + µ(K ∩ L) = µ(K) + µ(L),

whenever K,L,K ∪ L ∈ K(V ). We say that µ is continuous if it is so with respect to the topology on K(V )induced by the Hausdorff metric. A flag area measure on V is a continuous translation-invariant valuation withvalues in the space of signed measures on the flag manifold consisting of a unit vector v and a (p+ 1)-dimensionallinear subspace containing v where 0 ≤ p ≤ n− 1.

Using local parallel sets Hinderer constructed examples of SO(n)-covariant flag area measures. There is anexplicit formula for his flag area measures evaluated on polytopes involving the squared cosine of the angle betweentwo subspaces. The authors construct a more general space of SO(n)-covariant flag area measures via integrationof appropriate differential forms. They also compute the dimension of this space, discuss their properties andprovide explicit formulas on polytopes, which are similar to the formulas for Hinderer’s examples, however with anarbitrary elementary symmetric polynomial in the squared cosines of the principal angles between two subspaces.Hinderer’s flag area measures correspond to special cases where the elementary symmetric polynomial is just the

Page 120: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

116 Five-day Workshop Reports

product. Moreover, they construct an explicit basis for this space, which gives a classification result in the spirit ofHadwiger’s theorem.

Boaz Slomka gave a talk “On convex bodies generated by Borel measures”, a joint work with H. Huang. Theirgoal is to introduce a natural way of generating convex bodies from Borel measures. They suggest the followingconstruction. Given a Borel measure µ on Rn, define

M(µ) =⋃

0≤f≤1∫Rn fdµ=1

∫Rnyf(y)dµ(y)

,

where the union is taken over all measurable function f : Rn → [0, 1] with∫Rn fdµ = 1. The set M(µ) is called

the metronoid generated by µ. In particular, if µ is a finite combination of δ-measures, then M(µ) is a polytope.The latter suggests that some classical quantities related to approximation of convex bodies by polytopes can beextended to the class of metronoids.

For example, one such quantity is

dR(K) = inf

N ∈ N : ∃P = conv(x1, . . . , xN ) ⊂ Rn, 1

RP ⊂ K ⊂ P

,

which naturally extends in the case of metronoids as follows:

d∗R(K) = inf

µ(Rn) :

1

RM(µ) ⊂ K ⊂M(µ)

.

Similarly, the vertex index introduced by Bezdek and Litvak:

vein(K) = inf

N∑i=1

‖xi‖K : K ⊂ P = conv(x1, . . . , xN )

in the setting of metronoids can be modified as follows:

vein∗(K) = inf

∫Rn‖x‖Kdµ(x) : K ⊂M(µ)

.

Next, the authors establish some bounds for the quantities they introduced. For example, they show that for anorigin-symmetric convex body K in Rn one has

c√n ≤ c vein∗(Bn2 ) ≤ vein∗(K) ≤ C1vein∗(Bn1 ) ≤ C2n.

Matt Alexander presented his work done with M. Fradelizi and A. Zvavitch “Polytopes of Maximal VolumeProduct”. Let K be an origin-symmetric convex body in Rn, and let

Ko := x ∈ Rn : 〈x, y〉 ≤ 1 ∀y ∈ K

be its polar body. Denote byP (K) = voln(K)voln(Ko)

the volume product of the body K.In the non-symmetric case the volume product is defined as follows. The polar body of a convex body K in

Rn with respect to a point z is

Kz := x ∈ Rn : 〈x− z, y − z〉 ≤ 1 ∀y ∈ K.

For a convex body K the Santalo point is the unique point s(K) such that

vol(Ks(K)) = minz∈intK

vol(Kz).

Page 121: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Recent Advances in Discrete and Analytic Aspects of Convexity 117

Define the volume product of K to be

P (K) = voln(K)voln(Ks(K)).

The maximum of the volume product in the class of convex bodies is known: the maximizers are centeredellipsoids. However, it is interesting to find the maximum in some specific classes of convex bodies.

The authors examine the volume product for classes of restricted polytopes. Let Pnm be the set of all polytopesin Rn with non-empty interior having at most m vertices. Denote

Mmn = sup

K∈PnmP (K).

They prove that this supremum is achieved at some polytope with exactly m vertices and the sequence Mnm is

strictly increasing in m.A polytope is called simplicial if every facet is a simplex. The authors show that if K is of maximal volume

product among polytopes with at most m vertices, then K is a simplicial polytope.They also investigate some particular classes Pnm. In particular, if K is an origin-symmetric convex body in

P38 , then the maximal volume product of such bodies is given by the double cone on a regular hexagonal base.

They also find the maximizers of the volume product for convex hulls of n+ 2 points in Rn.Gideon Schechtman presented his joint work with A.Naor on lower bounds for the distortion of bi-Lipschitz

embeddings of metric spaces. A metric space (X, dX) is said to admit a bi-Lipschitz embedding into a metricspace (Y, dY ) if there exist s ∈ (0,∞), D ∈ [1,∞) and a mapping f : X → Y such that

sdX(x, y) ≤ dY (f(x), f(y)) ≤ DsdX(x, y), ∀x, y ∈ X.

When this happens it is said that (X, dX) embeds into (Y, dY ) with distortion at most D. The authors have previ-ously done some work on bounding from below the distortion of embedding certain metric spaces into Lp.

In this talk the speaker considered embedding certain grids in Schatten p-classes Sp into Lp. In particular, letMn[m] be the grid of all n×nmatrices, whose entries have values in the set [m] = −m,−(m−1), ...,m−1,m,equipped with the S1 norm. The authors prove that for any n and m large enough with respect to n, the distortionof embedding Mn[m] into a Banach space X is at least of order n1/2/α(X). Here α(X) is the smallest constantK satisfying the so-called linear upper α inequality

Eεij=±1‖n∑

i,j=1

εijxij‖ ≤ KEεi,δj=±1‖n∑

i,j=1

εiδjxij‖

for all xij ∈ X. Much of the talk was devoted to α inequalities in different settings.Carsten Schuett spoke about his joint work with O.Giladi, J.Prochno, N.Tomczak-Jaegermann and E.Werner

on the geometry of triple tensor products of `np -spaces. Let X be an n-dimensional normed space with the unit ballBX . The volume ratio of X is defined by

vr(X) = infE⊂BX

(voln(BX)

voln(E)

)1/n

,

where the infimum is taken over all ellipsoids contained in BX , and voln is volume in Rn. In their earlier work,Schuett and Tomczak-Jaegermann established the exact behavior of the volume ratio of tensor products of thespaces `np and `nq . In the talk, Schuett explained the extension of this result to triple tensor products of the spaces`np , `

nq and `nr for all choices of p, q, r ∈ [1,∞]. The authors established the exact behavior of the volume ratio of

these triple tensor products.Karoly Bezdek has given a talk under the title “From dual bodies to the Kneser-Poulsen conjecture”. Let Md

denote the d-dimensional Euclidean, hyperbolic, or spherical space. The r-dual set of a given set in Md is theintersection of balls of radii r centered at the points of the given set. As a Blaschke–Santalo-type inequality forr-duality it was shown in the talk that for any set of given volume in Md the volume of the r-dual set becomesmaximal if the set is a ball. As an application also the following was proved. The Kneser–Poulsen Conjecture

Page 122: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

118 Five-day Workshop Reports

states that if the centers of a family ofN congruent balls in Euclidean d-space is contracted, then the volume of theintersection does not decrease. A uniform contraction is a contraction where all the pairwise distances in the firstset of centers are larger than all the pairwise distances in the second set of centers. Finally, the talk presented anoutline of the proof of the Kneser–Poulsen conjecture for uniform contractions (with N sufficiently large) inMd.

Marton Naszodi spoke about his joint work with Karoly Bezdek under the title “The Kneser–Poulsen conjecturefor special contractions”. Recall that the Kneser–Poulsen Conjecture in general states that if the centers of a familyof N unit balls in Ed is contracted, then the volume of the union (resp., intersection) does not increase (resp.,decrease). They consider two types of special contractions. First, a uniform contraction is a contraction where allthe pairwise distances in the first set of centers are larger than all the pairwise distances in the second set of centers.The authors obtain that a uniform contraction of the centers does not decrease the volume of the intersection ofthe balls, provided that N ≥ (1 +

√2)d. Their result extends to intrinsic volumes. They prove a similar result

concerning the volume of the union. Second, a strong contraction is a contraction in each coordinate. They showthat the conjecture holds for strong contractions. In fact, the result extends to arbitrary unconditional bodies in theplace of balls.

Igors Gorbovickis spoke about “The central set and its application to the Kneser-Poulsen conjecture”. He hasgiven new results about central sets of subsets of a Riemannian manifold and applied those results to prove newspecial cases of the Kneser-Poulsen conjecture in the two-dimensional sphere and the hyperbolic plane.

Robert Connelly talked about ”The isostatic conjecture”, which is a joint work of him with Evan Solomonidesand Maria Yampolskaya. They show that a jammed packing of disks with generic radii, in a generic container, issuch that the minimal number of contacts occurs and there is only one dimension of equilibrium stresses. Theyalso point out some connections to packings with different radii and results in the theory of circle packings whosegraph forms a triangulation of a given topological surface.

Oleg Musin has given a talk under the title “Representing graphs by sphere packings”. His talk surveyed recentmajor advances on the following three topics: Euclidean and spherical graph representations as two–distance sets;Euclidean and spherical graph representations as contact graphs of congruent sphere packings; generalizations ofSteiner’s porism and Soddy’s hexlet.

Janos Pach talked about ”Disjointness Graphs”, which is a joint work of him with Gabor Tardos and GezaToth. The disjointness graph G = G(S) of a set of segments S in Rd, d ≥ 2, is a graph whose vertex set is Sand two vertices are connected by an edge if and only if the corresponding segments are disjoint. They prove thatthe chromatic number of G satisfies κ(G) ≤ (ω(G))4 + (ω(G))3, where ω(G) denotes the clique number of G.It follows , that S has Ω(n

15 ) pairwise intersecting or pairwise disjoint elements. Stronger bounds are established

for lines in space, instead of segments. They show that computing ω(G) and κ(G) for disjointness graphs oflines in space are NP–hard tasks. However, they can design efficient algorithms to compute proper colorings ofG in which the number of colors satisfies the above upper bounds. One cannot expect similar results for sets ofcontinuous arcs, instead of segments, even in the plane. They construct families of arcs whose disjointness graphsare triangle-free (ω(G) = 2), but whose chromatic numbers are arbitrarily large.

Outcome of the MeetingThe meeting was very successful, we were lucky to bring together mathematicians from many countries and manyresearch areas, such as convex geometry, discrete geometry, probability, functional analysis. Besides the leadingscientists, we also had 6 graduate students and 12 postdocs or recent PhDs participating in the workshop. Femaleparticipation was above 21%. The friendly atmosphere created during the workshop helped many participants notonly to identify the promising ways to attack the old problems but also to get acquainted with many open newones.

ParticipantsAkopyan, Arseniy (IST Austria)Alexander, Matt (Kent State University)

Page 123: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Recent Advances in Discrete and Analytic Aspects of Convexity 119

Alfonseca-Cubero, Maria de los Angeles (North Dakota State University)Artstein, Shiri (Tel-Aviv University)Bezdek, Karoly (University of Calgary)Connelly, Robert (Cornell University)Dann, Susanna (University of Bogota)Florentin, Dan (Kent State University)Gorbovickis, Igors (Uppsala University)Henk, Martin (Technische Universitat Berlin)Khan, Muhammad (University of Calgary)Kim, Jaegil (University of Alberta)Klartag, Boaz (Weizmann institute)Koenig, Hermann (Universitaet Kiel)Koldobsky, Alexander (University of Missouri)Litvak, Alexander (University of Alberta)Livshyts, Galyna (Georgia Institute of Technology)Ludwig, Monika (Technische Universitat Wien)Musin, Oleg (University of Texas Rio Grande Valley)Myroshnychenko, Sergii (University of Alberta)Naszodi, Marton (Alfred Renyi Institute of Mathematics)Oliwa, Michael (University of Calgary)Pach, Janos (Ecole Polytechnique Federale de Lausanne)Paouris, Grigoris (Texas A&M University)Pivovarov, Peter (University of Missouri)Rotem, Liran (Technion)Ryabogin, Dmitry (Kent State University)Saroglou, Christos (Kent State University)Schechtman, Gideon (Weizmann Institute of Science)Schuster, Franz (Vienna University of Technology)Schutt, Carsten (Christian-Albrechts-Universitaet)Slomka, Boaz (University of Michigan)Stancu, Alina (Concordia University)Stephen, Matthew (University of Alberta)Tikhomirov, Konstantin (Princeton University)Tomczak-Jaegermann, Nicole (University of Alberta)Valettas, Petros (Universtiy of Missouri)Vritisou, Beatrice (University of Alberta)Weil, Wolfgang (Karlsruhe Institute of Technology)Werner, Elisabeth (Case Western Reserve University)Yaskin, Vladyslav (University of Alberta)Zhang, Ning (Huazhong University of Science and Technology)

Page 124: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 11

Arithmetic Aspects of Explicit ModuliProblems (17w5065)

May 28 - June 2, 2017Organizer(s): Nils Bruin (Simon Fraser University), Kiran Kedlaya (University of CaliforniaSan Diego), Samir Siksek (University of Warwick), John Voight (Dartmouth College)

Overview of the FieldA central theme of modern number theory is understanding the absolute Galois group of the rational numbersGQ = Gal(Q/Q), and more generally GK = Gal(K/K) where K is a number field. The principal approach tothis has been to study the action of GQ on objects arising in geometry, especially the p-torsion of elliptic curves.Let E be an elliptic curve defined over Q. This can be given by an equation of the form

E : Y 2 = X3 +AX +B, (A, B ∈ Z, 4A3 + 27B2 6= 0).

The points ofE form a group. The action ofGQ on the p-torsion subgroupE[p] gives rise to a mod p representationρE,p : GQ → GL2(Fp). Such representations are the subject of two of the most important conjectures in numbertheory. Both are due to Fields Medalist and Abel Prize winner Jean-Pierre Serre: Serre’s uniformity conjecture(1968) and Serre’s modularity conjecture (1986). Serre’s modularity conjecture was recently proved by Khare andWintenberger (2009). Another closely related conjecture is the modularity conjecture for elliptic curves over Q,proved by Wiles and Taylor (1995) for semistable elliptic curves over Q, and by Breuil, Conrad, Diamond andTaylor (2001) for all elliptic curves over Q. In proving modularity for semistable elliptic curves, Wiles provedFermat’s Last Theorem, a question that had vexed mathematicians for 350 years.

A modular curve classifies elliptic curves whose torsion points enjoy certain Galois properties. Understandingthe points of modular curves over number fields is key to many great theorems in number theory. Let us mention afew:

(I) Heegner’s resolution of Gauss’s class number 1 problem required the determination of rational points onvarious modular curves.

(II) Mazur proved that if E is an elliptic curve defined over Q and p > 163 then E does not have a rationalp-isogeny (an equivalent formulation is that ρE,p is irreducible for p > 163). The proof involved thedetermination of rational points on the family of modular curves X0(p). Mazur’s theorem is one of the threegreat pillars on which the proof of Fermat’s Last Theorem rests; the other two are Ribet’s level-loweringtheorem, and Wiles’ modularity of semistable elliptic curves.

120

Page 125: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Arithmetic Aspects of Explicit Moduli Problems 121

(III) Building on earlier work by Mazur and Kamienny, Merel proved the uniform boundedness conjecture: forany d ≥ 1, there is a Bd such that if E is an elliptic curve over a number field K of degree d and p > Bd aprime, then E does not have K-rational p-torsion. Merel’s theorem involves the study of rational points onsymmetric powers of X0(p) and X1(p).

(IV) The obstruction to modularity lifting for elliptic curves is represented by rational points on modular curves.The proof of the full modularity theorem for elliptic curves by Breuil, Conrad, Diamond and Taylor requiredthe determination of rational points on several modular curves; this computation was carried out by Elkies.

(IV) Serre’s uniformity conjecture that asserts that if p > 37 is prime and E/Q an elliptic curve without complexmultiplication then ρE,p is surjective. This conjecture reduces to the determination of rational points onthree families of modular curves X0(p), X+

s (p), X+ns(p).

Beyond modular curves there are equally intriguing but harder moduli problems for curves of higher genus,abelian varieties, abelian varieties with level structure, abelian varieties with certain endomorphism rings, etc.

Recent DevelopmentsIn recent years there have been many spectacular breakthroughs (both theoretical and algorithmic). These haveformed a strong motivation for the workshop. Among them we mention the following:

(I) The proof by Bilu, Parent and Rebolledo [2], [3] of the split Cartan case of Serre’s uniformity conjecture:they have determined the rational points on the modular curves X+

s (p) for p = 11 and p ≥ 17.

(II) The recent proof of modularity of elliptic curves over real quadratic fields by Le Hung, Freitas and Siksek[8]. This required the determination of quadratic points on several complicated modular curves.

(III) Systematic tables of modular curves of small genus due to Zywina and Sutherland.

(IV) Equations for Hilbert modular surfaces for all thirty fundamental discriminantsD of level 1 < D < 100 dueto Elkies and Kumar [7].

(V) Work of Derickx, Kamienny, Stein and Stoll [6] who determined the possible prime orders of torsion pointson elliptic curves over number fields of degrees 4, 5, 6.

(VI) A database of genus 2 curves due to Booker, Sijsling, Sutherland, Voight and Yasaki [4].

(VII) Work of Bruin and Nasserden [5] elucidating the arithmetic of the Burkhardt quartic which is the modulispace for principally polarized abelian surfaces with full level 3 structure.

Open ProblemsAn open problems session formed one of the highlights of the workshop. The participants were encouraged tosuggest good open problems as a means of stimulating further progress in the field.

David Zureick–BrownCompute XH(Q) from the following list of curves.

P2<x,y,z> := ProjectiveSpace(Rationals(),2);

// level 3ˆn curvesX33:= Curve(P2, -xˆ3*y + xˆ2*yˆ2 - x*yˆ3 + 3*x*zˆ3 + 3*y*zˆ3);X43:= Curve(P2, xˆ3*z - 6*xˆ2*zˆ2 + 3*x*yˆ3 + 3*x*zˆ3 + zˆ4);

Page 126: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

122 Five-day Workshop Reports

// level 5ˆn curvesR<x> := PolynomialRing(Rationals());S<a,b,c,d> := PolynomialRing(Rationals(),4);

h := xˆ3 + x + 1;f := 6*xˆ6 + 5*xˆ5 + 12*xˆ4 + 12*xˆ3 + 6*xˆ2 + 12*x - 4;X11 := HyperellipticCurve([f,h]);

h2 := xˆ3 + x + 1;f2 := xˆ6 - 13*xˆ4 - 38*xˆ3 + 6*xˆ2 + 22*x + 6;X15 := HyperellipticCurve([f2,h2]);

f1 := aˆ2 + 51*a*b + 648*bˆ2 - 900*a*c - 22086*b*c + 211572*cˆ2 - 25650*a*d- 629856*b*d + 11499732*c*d + 156402576*dˆ2;

f2 := a*bˆ2 + 24*bˆ3 - 438*a*b*c - 10818*bˆ2*c - 11232*a*cˆ2 - 186732*b*cˆ2- 243648*cˆ3 - 12996*a*b*d - 320382*bˆ2*d - 285444*a*c*d - 2161728*b*c*d- 104818536*cˆ2*d + 992412*a*dˆ2 + 90530136*b*dˆ2 - 5156170344*c*dˆ2- 67660478712*dˆ3;

X16 := Curve(ProjectiveSpace(Rationals(),3),[f1,f2]);

David Zureick–BrownIn Theorem 1.4 of Varilly-Alvarado–Viray

https://sites.math.washington.edu/˜bviray/papers/VAV_UniformBoundRank19K3.pdf

and degree r′′ = 2 (so over quadratic fields), apply results of Bruin–Najman

https://arxiv.org/pdf/1406.0655.pdf

so with finitely many exceptions, an elliptic curve over a quadratic extension with a cyclic n-isogeny is a Q-curve.

Eric KatzA question related to the Chabauty method: define iterated p-adic integrals in a down-to-earth way without usingFrobenius. Suppose C over Qp has good reduction. Classically, a p-adic integral comes about via

C(Cp) → J(Cp)Log−−→ LieJ(Cp);

so for iterated integrals, we need to replace J by a unipotent analogue.

Rene SchoofLet X be a nice curve over Q of genus g ≥ 1 given by equations in projective space Pn equipped with a heightfunction h. Let P0 ∈ X(Q), and use P0 to embedX(Q) → J(Q) by P 7→ [P −P0]. One has the canonical heighth on J(Q). Are there bounds for h(P ) in terms of h([P − P0])? If g = 1, there are bounds in Silverman. (Wewould use this to say that points in a box on J(Q) determine points in a box on X(Q).)

Kiran KedlayaBy an old result of Mumford, the closure of the moduli space of principally polarized abelian fourfolds with trivialgeometric endomorphism algebra but the Mumford-Tate group is nontrivial (SL2 × SL2 × SL2) is nonempty anda countable union of components of dimension 1.

Page 127: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Arithmetic Aspects of Explicit Moduli Problems 123

• Give an explicit model for one or more components.

• Give explicit points, especially on the Torelli locus.

• For points on the Torelli locus, what fields of definition are possible? (Is it possible to show or rule out theexistence of an example over Q?)

Jeroen SijslingAs in Problem 4, let X be a nice curve over Q of genus g ≥ 1 given by equations in Pn. Embed X(Q) → J(Q)by P 7→ [P − P0] for P0 ∈ X(Q).

Now let M : H0(X,ωX) → H0(X,ωX) be a matrix representing a candidate endomorphism α of J . Tocheck if α is an endomorphism, we compute

α([P − P0]) =

g∑i=1

[Qi − P0]

and make the corresponding graph Y ⊂ X ×X , the closure of the points (P,Qi) so obtained.

• The projection onto the first component is degree g. What is the degree of the projection onto the secondprojection?

• Which monomials are needed to define Y ⊆ Pn × Pn, i.e., those monomials in some set of generators forthe ideal of vanishing of Y ?

• What can one say about the sizes of the coefficients in the equations defining Y ?

Maarten DerickxDerickx–Kamienny–Mazur

http://www.math.harvard.edu/˜mazur/papers/For.Momose20.pdf

prove that every point onX1(17) defined over a quartic field comes from a rational function of degree 4 onX1(17);moreover, up to (Z/17Z)∗/±1, there are three such functions, with Galois group once S4 and twice D4. Notethere exists an elliptic curve E over a number field K with Gal(K/Q) ' C4 cyclic which has a direct explanation.

Find the rational points on those curves that classify when the Galois group of these points is smaller: for thenormal closure X → X1(17) → P1 and a subgroup H ≤ Gal(X/P1), we find modular curves X/H → P1 andthere are six left.

For more detail, see the file

http://www.birs.ca/workshops/2017/17w5065/files/X_1(17)_D4_S4.txt

Jennifer Johnson–LeungLet F be a Siegel paramodular form of level N with Fourier–Jacobi expansion

F (τ, τ ′, z) =∑k

fk(τ, z)qk.

Let χ be a quadratic character of conductor p, and consider the twist

F (τ, τ ′, z;χ) =∑k

χ(k)fk(τ, z)qk;

Page 128: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

124 Five-day Workshop Reports

the twist is no longer a Siegel paramodular form, but rather, it is stable under the stable paramodular groupKs(p

n) = K(pn) ∩ K(pn−1) where pn ‖ N and K(m) is the paramodular group of level m. The represen-tation theory of the group Ks(p

n) is very nice, worked out by Ralf Schmidt, with newspaces of dimension 1 whenthey are supposed to be—and there are Hecke operators.

Is there a geometric object associated to F (τ, τ ′, z;χ)? And is there some class of abelian surfaces for whichthe Galois representations coincide?

Bjorn PoonenLet p > 2 be a prime, let k = Fp(t) and X : yp = txp + x. Compute X(k). Is there a nice way to do it?

This curve is smooth and has the structure of an additive group. But over a base extension, the genus goesdown, and by work of Voloch the set of points is finite, so the answer is a finite abelian group. (For p = 2, thecurve is a conic birational to P1.)

Several people suggested an argument to prove that (0, 0) is the only solution. In particular, Bas Edixhovenused a parametrization of the curve over Fp(u) with up = t, and then imposed the conditions that dx/du anddy/du be zero to ensure that x and y are in Fp(t) instead of just Fp(u).

Drew SutherlandGiven a smooth plane quartic X over Q compute Jac(X)(Q)tors efficiently. This would be useful for the databaseof genus 3 curves going into the LMFDB.

For hyperelliptic of genus 3, in principle it has been worked out. Work modulo many primes to get an up-per bound and look for rational points to match. Perhaps Chaubauty’s method works (make Manin–Mumfordeffective)? Perhaps a Hensel lifting method works?

(It may also be interesting to work out the geometrically hyperelliptic but non-hyperelliptic curves.)

Elisa Lorenzo GarciaWhat modular curves X(Γ) have a smooth plane model? (In particular, all genus three non-hyperelliptic modularcurves.) Then g = (d − 1)(d − 2)/2 for a degree d, and we need a g2

d-linear system on X . Such a curve hasgonality

√g, so using an effective bound on the gonality this should reduce the problem to a finite list?

David Zureick–BrownIs there a surface S which is not the quotient of the product of two curves, with a nontrivial Albanese variety, suchthat one can apply Chabauty’s method?

Armand BrumerWe leave it to the reader to generalize this in the obvious manner. It is motivated by making sure that we mightsomeday be able to find all abelian surfaces over Q of given conductor.

Let S be a finite set of primes, A(S) be the finite set of abelian surfaces good outside S, and J (S) the set ofJacobians in A(S). Introduce an invariant d(S) and a set T (S) as follows. For each isogeny class in A(S), takethe minimum degree of any polarization and then let d(S) be the maximum over the isogeny classes in A(S). LetT (S) be a minimal set of places such that each isogeny class in J (S) contains a Jacobian Jac(C) such that C isgood outside T (S).

What can be said about d(S) and T (S). Is d(S) bounded as S grows?Even 30 years after Faltings, the only case understood is S = ∅! Even for S = 2 neither d(S) nor T (S)

are known. The work of Merriman–Smart only find the curves good outside 2, but there are many other examplesbeyond this list.

The problem is slightly easier if one restricts to semistable abelian varieties: for a few sets S, one may find allsemistable surfaces good outside S, up to isogeny, thanks to Schoof or Brumer–Kramer.

Page 129: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Arithmetic Aspects of Explicit Moduli Problems 125

Samuele AnniLet E/Q : y2 + y = x3−x (LMFDB label 37.a1). For every prime ` we have that Gal(Q(E[`])/Q) ∼= GL2(F`).This gives a realization of GL2(F`) as Galois group over Q for all primes ` using ”one object”. Is there ananalogous construction, i.e. simultaneous realization of GL2(F`) for all ` as Galois group using the “same object”,over any number field different from Q?

John VoightComputations with paramodular forms and L-functions suggest that there is an abelian surface A over Q of con-ductor 550 whose first few Euler factors (computed by David Farmer and Sally Koutsoliotas, the first few by CrisPoor and David Yuen) are as follows:

L2(T ) = (1 + T )(1 + 2T 2)

L3(T ) = 1− T 2 + 9T 4

L5(T ) = 1 + 3T + 5T 2

L7(T ) = 1 + 4T 2 + 49T 4

L11(T ) = (1 + T )(1− 3T + 11T 2)

L13(T ) = 1− 8T 2 + 169T 4

Show that such a surface exists! Because L3(T ) is irreducible, if A exists then A is simple over Q. The abeliansurface A may or may not have a principal polarization over Q. We expect that A[2] is an extension of E1[2] byE2[2], whereE1 andE2 are elliptic curves of conductors 11 and 50 respectively. The first few Dirichlet coefficientsof the L-function are:

1,−1, 0,−1,−3, 0, 0, 1, 1, 3, 2, 0, 0, 0, 0, 3,−3,−1, 1, 3, 0,−2,−3, 0, 4, 0, 0, 0, 0, 0,−5,−3, 0, 3, 0,−1,

3,−1, 0,−3,−3, 0, 12,−2,−3, 3, 6, 0,−4,−4, 0, 0,−6, 0,−6, 0, 0, 0, 3, 0,−14, 5, 0,−5, 0, 0, 0,

3, 0, 0, 3, 1,−3,−3, 0,−1, 0, 0, 10,−9,−8, 3,−3, 0, 9,−12, 0, 2, 0, 3, 0, 3, 0,−6,−3, 0, 9, 4, 2,−4, 12,

0, 6, 0, 0, 6, 21, 0, 4, 6, 0, 0, 0, 0, 9, 0, 0,−3, 0, 0,−4, 14, 0, 5, 3, 0,−18, 5, 0, 0, 0, 0, 0, 0, 0,−3,−6, 0, 1,

0, 0,−3, 0, 3, 0, 3, 0,−3,−6, 0,−8, 1,−3, 0, 15, 0,−21,−10, 0, 9, 0, 8, 9, 3, 0, 3, 6, 0, 8,−9,

1,−12,−18, 0, 0, 6, 0, 0, 12, 3, 13, 0, 0,−3,−9, 0,−6,−6, 0, 3,−3, 0,−15,−9, 0, 4, 12,−2,−32, 4,

0,−12, 0, 0, 9,−6,−3, 0, 2, 0, 1.

Drew SutherlandLet ρf : Gal(Q/Q) → GL2(F`) be an odd irreducible mod-` Galois representation associated to a classicalmodular form f , and let p be a prime not dividing the level of f . Is there a way to determine the conjugacy classof ρf (Frobp) directly from f (given by its q-expansion, say)?

When the eigenvalues of ρf (Frobp) are distinct, this is clear, but if ρf (Frobp) has trace 2 and determinant 1,

for example, is it possible to distinguish the conjugacy classes of(

1 10 1

)from the identity without computing

separately the torsion of an associated abelian variety?

John VoightIs there an efficient (or at least practical) algorithm that, given a genus 2 curve X over Q, computes the isogenygraph of abelian surfaces isogenous to Jac(X) as principally polarized abelian varieties over Q, and the minimaldegree of isogenies between them—like for elliptic curves?

If one allows isogenies that do not respect the principal polarization (so we allow polarizations of arbitrarydegree), is the corresponding set finite?

Page 130: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

126 Five-day Workshop Reports

Presentation Highlights

Balakrishnan and Muller: Rational Points on X+ns(13)

At the workshop Jennifer Balakrishnan and Jan-Steffen Muller created much excitement by announcing the deter-mination of the rational points on the modular curveX+

ns(13), as part of joint work with Netan Dogra, Jan Tuitmanand Jan Vonk. This has been a famous open problem for many years. More significantly, it demonstrates the ideaspioneered initially by Minhyong Kim, which were extensively studied at BIRS workshop 07w5063, February 4-9,2007, have substantial applications to modular curves.

Let X/Q be a curve of genus g ≥ 2 with Jacobian J and let ` be a prime of good reduction. Using Selmervarieties, Kim defines a decreasing sequence

X(Q`) ⊇ X(Q`)1 ⊇ X(Q`)2 ⊇ · · ·

all containing X(Q). Thanks to the work of Coleman, the ‘Chabauty set’ X(Q`)1 is known to be finite providedthe ‘Chabauty condition’ rankJ(Q) < g holds. In this case one has a practical strategy that often succeeds incomputing the set of rational points X(Q). Alas, for the family of modular curves X+

ns(p) with p ≥ 13 it is known(assuming BSD) that rankJ(Q) is at least the genus, making the methods of Mazur, Kamienny and Merel (as wellas Coleman–Chabauty) inapplicable.

Balakrishnan and Dogra [4] have recently shown that the ‘quadratic Chabauty set’ X(Q`)2 is finite provided

rankJ(Q) < g + rankNS(J)− 1,

where NS(J) is the Neron-Severi group of J/Q. It is known for modular curves X of genus g ≥ 3 thatrankNS(J) ≥ 2, and thus quadratic Chabauty is strictly more powerful than classical Chabauty in the modu-lar context. The joint work alluded to above turns quadratic Chabauty into a practical computational tool that canbe used to attack explicit examples, and the application to X+

ns(13) is expected to the first of many breakthroughswith this method.

Zureick-Brown: Mazur’s Problem B

Mazur’s Problem B (also known as Mazur’s vertical uniformity problem) asks for the determination of possibleimages of the representations ρE,p∞ : GQ → GL2(Zp) for elliptic curves E over the rationals and all primesp. For p > 37 it is easy to give an answer conditional on Serre’s uniformity conjecture. Recently this questionhas been resolved completely by Rouse and Zureick-Brown [9] for p = 2. Zureick-Brown’s talk gave a detailedoverview of the proof which involves the computation of models of modular curves XH and rational points onthese modular curves for around 700 arithmetically minimal subgroups H of GL2(Z2).

Andrew Sutherland: Modular curves of prime-power level with infinitely many rationalpoints

For each open subgroup G of GL2(Z) containing −I and having full determinant there is a a modular curve XG

defined over Q whose non-cuspidal points parametrize elliptic curves E/Q such that the image of ρE : GQ →GL2(Z) is contained inG. When the index ofG is sufficiently large, the curveXG has genus≥ 2 and so by Faltingshas finitely many rational points. This raise the interesting question of for which G does the modular curve XG

have infinitely many rational points. This talk gave an overview of recent work by Sutherland and Zywina inwhich they give a full answer to this question where G has prime-power level. They find (up to conjugacy) 248such groups where XG(Q) is infinite, with 220 being curves of genus 0, and 28 being elliptic curves with positiverank. This is indeed a step towards Mazur’s vertical uniformity conjecture, as for each prime p it gives an explicitclassification of possible p-adic images with the possible exception of finitely many j-invariants.

Page 131: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Arithmetic Aspects of Explicit Moduli Problems 127

Pierre Parent: Rational points of Modular Curves–An Arakelovian Point of ViewLet p be a prime and let Je denote Merel’s winding quotient of J0(p); this is the maximal quotient that has analyticrank 0. One knows thanks to deep work of Kolyvagin, Logachev and Kato that Je(Q) is finite. Write J ∼ Je×J⊥e .Let P be a degree d point on X0(p) (that is a point defined over a number field of degree d). Let P denote thecorresponding rational point on the d-th symmetric power X0(p)(d). The image of this on J0(p) belongs to theintersection of the two cycles (J⊥e + torsion) and X0(p)(d). Parent’s talk explained that knowing the heights anddegrees of the two cycles, allows via an arithmetic Bezout theorem, to give an upper bound for the height of theintersection. Of course the smaller the dimension of J⊥e (and hence equivalently the larger the dimesion of Je), thebetter control we have on the intersection. A theorem of Iwaniec and Sarnak gives

1

4+ o(p) ≤ dim(Je)

dim(J0)≤ 1

2+ o(p).

A conjecture of Brumer assertsdim(Je)

dim(J0)=

1

2+ o(p).

Parent sketched a proof of the following theorem: under Brumer’s conjecture, the j-height of quadratic points onX0(p2) is O(p5 log p). Remarkably the bound is independent of the quadratic field!

Scientific Progress MadeThe workshop schedule was designed to give participants plenty of time for collaboration and discussions. We haveasked the participants to report on the progress to existing projects made and also on any new projects initiatedduring the workshop.

• Samuele Anni and Elisa Lorenzo Garcia: we are designing an algorithm to compute endomorphism rings ofthreefolds in positive characteristic. Using this algorithm, we are also studying endomorphisms in charac-teristic zero through liftings, giving a completely algebraic alternative to the known algorithms.

• Samuele Anni and Ekin Ozman. We want to study local points on fibred products of X0(p) for differentprimes p. This is connected to local-global questions studied in Ekin and my thesis from different points ofview.

• Samuele Anni and Samir Siksek. A new paper on modularity of elliptic curves over totally real subfields ofcyclotomic fields is in preparation.

• Andrew Sutherland, Jeroen Sijsling and John Voight. We have worked on our Genus 2 automorphy paper.This is still a work in progress, but we moved the ball forward.

• David Zureick-Brown received lots of advice and help from other participants towards completing Mazur’sprogramme B. The progress made includes:

– Determination of rational points on several XH .

– Andrew Sutherland was able to compute traces of Frobenius at the first 1000 primes for some of thelarge genus subgroups H . This allowed the recognition that a few pairs of XH , XK , with H , K notconjugate, were accidentally isomorphic (and then it was proved). Sutherland will optimize his codeto allow computation of zeta functions (and hence, whether JH is simple, etc) for several of the H forwhich there are currently no nice equations.

• Maarten Derickx and David Zureick-Brown. We were able to find the rational points on 4 of Maarten’s 6curves, and in one of the remaining 2 cases we were able to rule out most of the standard techniques fromworking.

Page 132: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

128 Five-day Workshop Reports

• Eric Katz and David Zureick-Brown. We made fair progress on our Buium project (that was the subject ofKatz’s talk), and some progress on another project (about “Total Jet Spaces”).

• Sara Arias-de-Reyna, Elisa Lorenzo Garcia and Christophe Ritzenthaler discussed some aspects of Jacobiansof genus 3 curves. This is expected to lead to improvements on the Arias-de-Reyna’s work, presented at theworkshop, on the realisation of GSp6(F`) as a Galois group of a tamely ramified extension.

• Rachel Pries and Ekin Ozman were able to complete their project on p-ranks of trielliptic curves.

• Mark van Hoeij and David Zureick-Brown started an new collaboration. In Zureick-Brown’s talk he dis-played several curves having high degree plane models. Van Hoeij computed plane models of much lowerdegrees, which will help Zureick-Brown study the arithmetic of these curves.

• Francesc Fite, Elisa Lorenzo Garcia and Andrew Sutherland. We have continued our work on our paperSato-Tate groups of twists of Fermat and Klein quartics. This is a project we have been working on for along time, but we finally cracked the last stumbling block during the week and should be able to wrap up thepaper shortly.

• Francesc Fite: In my talk I explained the theorem that ensures that if the square of an elliptic curve withCM admits a rational model up to Q-isogeny, then the quadratic imaginary field of the CM has either classnumber 1, class number 2, or class group C2 × C2. While one easily shows that all quadratic imaginaryfields with class numbers 1 and 2 arise, the question on whether quadratic imaginary fields with class groupC2 × C2 actually occur was open before the workshop. The afternoon after my talk John Voight showedto me an example having class group C2 × C2. We expect to start a collaboration in which we determineexactly which quadratic imaginary fields with class group C2 × C2 can arise.

• Nils Bruin, Armand Brumer, Chritophe Ritzenthaler and Jaap Top: we had a discussion which may leadto a new way to compute Serre’s obstruction for abelian threefolds. We have not proved anything yet butthe strategy seems coherent and effective. We consider a non-hyperelliptic genus 4 curve C in P3 as theintersection of a (unique) quadric Q and cubic over a field K. We ask that the discriminant of Q is a squarein K and Jac(C) to have a rational non-zero two-torsion point. We can then construct an unramified doublecover D → C and its Prym is a principally polarized abelian threefold (A, a). We conjecture that if (A, a)is geometrically undecomposable, then (A, a) is the Jacobian of a genus 3 curve over K if and only if theGalois closure of D → C → P1 (the last map coming from any of the rational rulings on Q) is defined overK.

• Christophe Ritzenthaler: I received interesting feedback after my talk from Brumer, Viray, Elkies and Voight.During the problem session, I think we also proved that except forX(7), none of the otherX(n) (with n > 6)can have a plane model because their automorphism groups are not automorphism groups of plane curves(by results from Harui).

• Ekin Ozman and Samir Siksek: we are now many steps closer to completing our project of determining thequadratic points on X0(p) of genera 3, 4 and 5.

Outcome of the MeetingAs the above feedback amply demonstrates, this was a great meeting at which much progress has been madetowards fundamental questions in the arithmetic of moduli problems. Many of our participants wrote to tell ushow useful the workshop has been to them. We conclude with a few quotes from our participants highlighting thesuccess of the meeting.

Jeroen Sijsling had this to say:

“The venue and facilities of the workshop were top-notch. The planning encouraged the researchersinvolved to discuss as much as possible, an invitation that was certainly taken up. Especially useful

Page 133: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Arithmetic Aspects of Explicit Moduli Problems 129

was the open problems session on Tuesday, where some participants, myself included, asked someopen questions of theirs to the audience. My question got resolved quite rapidly by other participants.Also in this way the workshop contributed to advancing its field of research.”

Jennifer Johnson-Leung said:

“The BIRS workshop 17w5065 had a strong positive impact on my research program, I had severalvaluable interactions with my colleagues. In particular, I learned of certain surfaces that Brumerhas recently constructed that he believes to be paramodular. However, he expects the representation tohave a non-trivial central character. This is not possible in the paramodular theory for essential reasons.He hopes that I and my collaborators will be able to reconcile this. I also had useful conversationswith Balakrishnan, Sutherland and Voight about classes of known examples of paramodular surfaces.I found that I had a basic error in my understanding which I was able to correct. I also had theopportunity to meet several colleagues in person for the first time. This makes it much easier for meto write to them with specific questions or ideas. I was able to attend this conference only becauseof the generous family accommodations. I travel very little due to my husband’s disability, and thisworkshop provided me with the opportunity to interact with collaborators and colleagues very closeto my research area. I also learned of useful results and techniques from the lectures.”

Andrew Sutherland said:

“This workshop was an extremely productive one from my perspective. The participants includedmany leading experts in the field, and I was able to make forward progress on two existing projectswith collaborators who were also in attendance, as well as obtaining an entirely new result. I can pin-point the exact moment when the new insight occurred: it was on the trail up Tunnel Mountain whiletaking a quick hike I took during the lunch break before the afternoon session. The combination of thetheoretical beauty of the mathematical content of the talks and the natural beauty of the environmentaround BIRS was wonderfully exhilirating.”

Finally we quote Armand Brumer, one of our most distinguished participants:

“It was a great pleasure having a chance to participate in the workshop. Neither the beauty of thesurroundings nor the great amenities could make a dent on the stimulating talks and mathematicalconversations!”

ParticipantsAchter, Jeff (Colorado State University)Anni, Samuele (Heidelberg University)Arias-de-Reyna, Sara (University of Sevilla)Balakrishnan, Jennifer (Boston University)Bruin, Nils (Simon Fraser University)Bruin, Peter (Universiteit Leiden)Brumer, Armand (Fordham University)Cesnavicius, Kestutis (University of California, Berkeley)Derickx, Maarten (Universiteit Leiden)Edixhoven, Bas (Mathematical Institute, Universiteit Leiden)Elkies, Noam D. (Harvard University)Fite, Francesc (Essen)Goren, Eyal (McGill University)Harvey, David (University of New South Wales)Ho, Wei (University of Michigan)

Page 134: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

130 Five-day Workshop Reports

Johnson-Leung, Jennifer (University of Idaho)Kani, Ernst (Queen’s University)Katz, Eric (Ohio State University)Kedlaya, Kiran (University of California, San Diego)Khuri-Makdisi, Kamal (American University of Beirut)Lorenzo Garcia, Elisa (Universite de Rennes 1)Mueller, Jan Steffen (Universitat Oldenburg)Najman, Filip (University of Zagreb)Newton, Rachel (University of Reading)Ozman, Ekin (Bogazici University)Parent, Pierre (Universite de Bordeaux)Park, Jennifer (Ohio State University)Poonen, Bjorn (Massachusetts Institute of Technology)Pries, Rachel (Colorado State University)Rebolledo, Marusia (Universite Clermont Auvergne)Ritzenthaler, Christophe (Rennes)Schoof, Rene (University of Rome II)Sijsling, Jeroen (Universitat Ulm)Siksek, Samir (University of Warwick)Stoll, Michael (Universitat Bayreuth)Sutherland, Andrew (Massachusetts Institute of Technology)Top, Jaap (University of Groningen)van Hoeij, Mark (Florida State University)Viray, Bianca (University of Washington)Voight, John (Dartmouth College)Zureick-Brown, David (Emory University)Zywina, David (Cornell University)

Page 135: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] J. S. Balakrishnan and N. Dogra, Quadratic Chabauty and rational points I: p-adic heights,arXiv:1601.00388.

[2] Yu. Bilu and P. Parent, Serre’s uniformity problem in the split Cartan case, Annals of Math. 173 (2011),569–584.

[3] Yu. Bilu, P. Parent and M. Rebolledo, Rational points on X+0 (pr)(Q), Annales de l’Institut Fourier 63 (2013),

957–984.

[4] A. R. Booker, J. Sijsling, A. V. Sutherland, J. Voight and D. Yasaki, A database of genus 2 curves over therational numbers, LMS Jour. Comp. Math. 19 (2016), 235–254.

[5] N. Bruin and B. Nasserden, Arithmetic aspects of the Burkhardt quartic threefold, arXiv:1705.09006.

[6] M. Derickx, S. Kamienny, W. Stein and M. Stoll, Torsion points on elliptic curves over number fields of smalldegree, arXiv:1707.00364.

[7] N. Elkies and A. Kumar, K3 surfaces and equations for Hilbert modular surfaces, Algebra & Number Theory8 (2014), 2297–2411.

[8] N. Freitas, B. Le Hung and S. Siksek, Elliptic curves over real quadratic fields are modular, InventionesMathematicae 201 (2015), 159–206.

[9] J. Rouse and D. Zureick-Brown, Elliptic curves and 2-adic images of Galois, Research in Number Theory(2015), 1:1.

131

Page 136: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 12

Mathematical Analysis of BiologicalInteraction Networks (17w5099)

June 4 - 9, 2017Organizer(s): David Anderson (University of Wisconsin-Madison), Matthew Johnston (SanJose State University), Casian Pantea (West Virginia University), Lea Popovic (Concordia Uni-versity), Anne Shiu (Texas A&M University)

Overview of the FieldRecent breakthroughs in experimental and high-throughput data analysis techniques have provided an unprece-dented window into the complexity of the biochemical interactions in living cells. Fundamental metabolic andsignaling pathways corresponding to basic cellular functions such as osmolarity, chemotaxis, the cell cycle, andapoptosis are now known to consist of dozens of metabolites interacting via hundreds of interdependent chemicalreactions. New experimental techniques such as fluorescent proteins, meanwhile, have allowed real-time observa-tions of such complex dynamical behavior as periodicity, hysteresis, and stochastic fluctuations.

Mathematics has emerged as a pivotal player in grappling with the complexity of these biochemical systems,and is a cornerstone of current systems biology research. Two common mathematical modeling frameworks fornetworks of biochemical interactions include systems of ordinary differential equations (ODEs) and stochasticcontinuous-time Markov chains (CTMCs). The classical mass-action form for the interaction rates reduces theODE formulation to a polynomial dynamical system, which has led to wide-spread interest from researchers inalgebraic geometry. When properly posed, these models have the ability to capture important properties of thephysical system without the expense and time-consumption of laboratory experimentation and data collection.These models can also allow the generation of hypotheses which can then be tested in the laboratory.

Significant mathematical research has been conducted on relating structural properties of the underlying net-work of chemical interactions to admissible dynamical behaviors and properties. These approaches have attemptedto identify motifs which contribute to biologically important behaviors such as biochemical switching, oscillatorybehavior, concentration robustness, and persistence (i.e. non-exhaustion) of metabolites. Classical network-basedapproaches have included flux balance analysis, stoichiometric network analysis, chemical reaction network the-ory, and monotone systems theory. Remarkably, many results have been derived which guaranteed qualitativebehavior which is robust to the system’s parameters and initial conditions.

Research on network-based approaches to biochemical reaction models is now conducted by prominent re-searchers in a wide breadth of complementary mathematical disciplines, including differential equations and con-trol theory, stochastic processes, algebraic geometry, optimization and computation, and analysis. While work inthese areas is often distinct in flavor, it has been the community’s experience that the most successful approaches

132

Page 137: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Mathematical Analysis of Biological Interaction Networks 133

come from sharing ideas across lines of expertise. This workshop focused on establishing communication amongthe different disciplines and building new collaborations.

Workshop StructureThe workshop contained a mix of prepared talks and free time for collaboration and discussion. Since the work-shop participants had areas of expertise spanning many distinct mathematical disciplines, the invited talks werechosen to reflect a diversity of mathematical approaches. Eight speakers were selected to give 40 minute talks onMonday and Tuesday which served as overviews of the notation and common approaches taken in their respectivedisciplines. On Monday morning, David F. Anderson and Gheorghe Craciun (both UW-Madison) introduced thenotation relevant to two common dynamical modeling frameworks for biochemical interaction networks: deter-ministic ODE models and stochastic CTMC models. On Monday afternoon, Anne Shiu (Texas A&M) and AliciaDickenstein (University of Buenos Aires) outlined algebraic approaches for the study of chemical interaction net-works. Early Tuesday morning, Anne Condon (UBC) and Erik Winfree (Caltech) explained how reaction networksare used as computational tools, both in theory and in the laboratory. Later on Tuesday morning, Yiannis Kaznessis(University of Minnesota) and Eduardo Sontag (Rutgers University) discussed some engineering applications ofreaction network theory. The remaining talks were 25 minutes in length and focused predominantly on current re-search. Speakers were also encouraged to present open problems for discussion. In total, 26 of the 42 participantsgave talks, with many of the other participants leading breakout sessions.

The schedule was left deliberately more open in the later part of the week, so that as the week progressed itwould allow for the open problems that were raised to have more time to be discussed in self-organized sessions.In total, two hours of breakout session / free time was offered during Monday (late afternoon), two and a halfhours during Tuesday (early afternoon), all of Wednesday afternoon, two and a half hours on Thursday (earlyafternoon), and complete free time on Friday. Prior to the daily breakout sessions, participants were invited topropose open problems / discussion areas on a white board in the TCPL foyer. At the beginning of the breakoutsession time, the participants self-organized into groups and session rooms were assigned according to the sizeof the interested groups. Several of these groups continued discussing over multiple days, this included a groupdiscussing extinction events in discrete reaction networks and one discussing the possibility of lifting oscillationsfrom small networks to large ones.

Presentation HighlightsDavid Anderson (University of Wisconsin-Madison): Stochastic models of CRNs. An overview.

The workshop opened with a survey talk on the field of reaction networks, highlighting one of the core ques-tions in the field: what are the connections between dynamical behavior and network structure in CRNs? Thispresentation set the stage for many subsequent talks by reviewing CRN terminology (reaction networks, stoichio-metric space, linkage classes, weak reversibility, complex-balancing, deficiency, etc.) and CRN mass-action mod-els, both deterministic and stochastic. In particular, a number of ways to represent stochastic CRNs were surveyed,including Gillespie algorithms, Kurtz’s random time change representation, and the chemical master equation. Anoverview of current research directions in stochastic CRNs followed. One of the important open questions is de-termining when well-behaved deterministic dynamics of a CRN (for example, uniqueness of positive equilibria,stability of equilibria, etc) guarantees that the stochastic dynamics is also well-behaved (for example, positive re-current). This is known to be true for deficiency zero, weakly reversible networks, which are known to be positiverecurrent [21]. The search for other classes of networks with this property is another area of current interest; it isconjectured that weak reversibility is a sufficient condition for positive recurrence. The list of open questions alsoincluded finding conditions for stochastic CRNs to undergo an extinction event, and studying when combiningwell-behaved stochastic CRNs results in a well-behaved CRN. These questions have reoccured in later talks, andhave also been the topic of some of the breakout sessions.

Gheorghe Craciun (University of Wisconsin-Madison): On possible future directions for the mathematicalanalysis of reaction networks.

Page 138: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

134 Five-day Workshop Reports

Prof. Craciun’s talk comprised a historical overview of the developments leading to the current state-of-the-artin deterministic CRNs (including the proof of the global attractor Conjecture). Many of the ideas in the field (forexample detail and complex balancing) go back to Boltzmann, and attempts at proving the stability of detailed(complex) balanced equilibria go back to work of Shear and Higgins in the 1960’s and before the celebratedwork of Horn and Jackson. A first possible research direction identified in this talk is applying the machinery ofdeterministic CRNs to the Boltzmann equation. There is an interesting connection between the convergence toequilibrium in complex-balanced CRNs and Boltzmann’s H-theorem (the distribution of molecule velocities in anideal gas converges to the Maxwell distribution). This connection has been studied with the simplifying assumptionof spacial homogeneity and when finite number of velocities are possible [22], and an open question was proposedon extending this approach to an infinite number of velocities. A theory of detail balanced and complex-balancedinfinite reaction networks would be a key ingredient in this direction. Another area of exploration is the studyof piecewise constant differential inclusions, a far-reaching generalization of complex-balanced systems. Theseobjects have a very rich algebraic and combinatorial structure, are connected to toric dynamical systems, and area key ingredient in the recent proof of the global attractor conjecture [5]. An interesting open question proposedin the talk is whether piecewise linear differential inclusions or some related general object can give rise to aLyapunov function the way toric dynamical systems do.

Daniele Cappeletti (University of Wisconsin-Madison): Stochastically Modeled Reaction Networks withAbsolute Concentration Robustness.

It is known [20] that certain structural conditions of a deterministic mass-action CRN imply absolute concen-tration robustness (i.e. some of the species have the same same concentration at all positive steady states). Recentwork [21] has however also shown that under the same conditions on the network the stochastic CRN undergoesan extinction event with probability 1. This talk discussed a new result resolving this discrepancy between the de-terministic and stochastic models. A key observation is that while in general when rescaled stochastic trajectoriesconverge to a deterministic trajectory, in complex systems with differences of orders of magnitude the hypothesisneeded for such general rescaling and convergence is violated, so the result does not apply. The talk presented theidea to divide the dynamics into a slow and a fast subsystem, and hypothesize that the fast subsystem has a productform stationary distribution. Under certain conditions, one can then show that up to any fixed time the averagesof the ACR species counts tend to their ACR equilibria. It is an open question how fast this convergence is, andwhether the convergence rate depends on the initial state. Finally, a conjecture proposing that all ACR systemshave extinction events was shown false by way of counterexample.

Badal Joshi (California State University San Marco): Graphically balanced equilibria and stationary measuresof reaction networks.

Graph-related symmetries of a reaction network give rise to certain special equilibria (such as complex balancedequilibria) in deterministic models. Correspondingly, in stochastic models these symmetries give rise to certainspecial stationary measures. Some new balance measures were discussed in the stochastic setting: the reactionbalanced measure, the complex balanced measure, the reaction vector balanced measure and the cycle balancedmeasure. The reaction vector balancing is the same notion as detailed balancing in Markov chain theory. Relationsbetween these balance measures in the stochastic setting were discussed. For example, reaction balance impliescomplex balance, reaction vector balance, and cycle balance. The talk discussed the idea of decomposing bothdeterministic and stochastic systems into so-called “factor systems” and establishing the correspondence betweenfactors of a complex balanced deterministic system and those of the corresponding stochastic system. It is knownby work of Dickenstein and Perez Millan that complex balance and cycle balance implies reaction balance. In thestochastic setting complex balance and reaction vector balance implies cycle balance and reaction balance, but thisis not true in the deterministic setting.

Alicia Dickenstein (University of Buenos Aires): Algebraic tools in the study of reaction systems.This overview talk focused on questions of multistationarity in mass-action CRN models. Large classes of

systems (especially enzymatic mechanisms) give rise to steady state varieties having particularly nice parametriza-tions (in work of Gunawardena, Thompson, Feliu, Wiuf, Dickenstein, Perez Millan, etc.). These parametrizationsrewrite the steady state algebraic equations using a minimal number of variables. For example, while the n sitephosphorylation network involves 3n + 3 variables, its steady state variety can be parametrized using only threevariables. This reduction proves very useful in answering questions of existence and number of positive steady

Page 139: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Mathematical Analysis of Biological Interaction Networks 135

states, and finding regimes of parameters (reaction rate constants, total masses) for which multistationarity is pos-sible. A recent paper of Conradi et al. uses degree theory and the parametrization of the steady state variety todevise explicit conditions on parameters that guarantee the existence of multiple equilibria. On the other hand,purely algebraic techniques (not relying on the Jacobian) are applicable for classes of enzymatic systems. Theexample of enzymatic cascades was presented: algebraic methods show that if the enzymes are different, only onepositive steady state is possible; however, multistationarity is possible if the enzymes coincide.

Anne Shiu (Texas A& M): Algebraic methods for analyzing bistability and oscillations in reaction systems.This overview talk reviewed the current state of the art in the dynamics of phosphorylation networks (where

the phosphorylation mechanism is either processive, distributive, or mixed), as well as other important biologi-cal pathways, including the ERK network. The talk focused on the questions of multistationarity and oscillation,although results on stability of equilibria were also briefly mentioned. These networks exhibit an variety of com-binations of multistationarity/oscillation behaviors. For example, processive networks are globally stable (Conradiand Shiu 2015); distributive networks are multistationary; mixed networks admit oscillations; the ERK networkadmits multistationarity. The proofs of these facts rely on parametrizations of the steady state varieties, and useof the Routh-Hurwitz conditions, or of Yang’s Hopf-bifurcation criterion. It is an open question if the doublephosphorylation mechanism can admit oscillations. One possible approach to attacking this question is lifting theperiodic orbits of the corresponding mixed phosphorylation network (which are known to exist), using techniquesfrom the recent work of M. Banaji. This became the topic of one of the breakout sessions.

Anne Condon (University of British Columbia): An introduction to molecular programming with stochasticCRNs.

CRNs can be viewed as a programming language. Prof. Condon introduced this idea by way of examples: thesum of two integers n1 + n2 can be computed by the network X1 → Y,X2 → Y as the count of Y when allreactions stopped firing and the initial molecule counts for X1 and X2 are n1 and n2, respectively. Similarly, thenetworkX1+X2 → Y computes min(n1, n2), and networkX1 → T,X2+Y → 0 computes n1−n2. Thinking ofthem as programs, it is desirable that networks are output stable (i.e. output does not change in any reachable statefrom the output state) and that they stably compute the output (i.e. output is always reachable from all possibleinputs). A small discussion on conditions implying these properties followed, along with considerations on thespeed of the computations. These have to do the propensity of the reactions in the stochastic model; it can beshowed that computing the sum is O(log n), whereas computing the minimum is o(n). CRNs can also be used tocompute logical predicates, like “n1 > n2”: a corresponding network is L → Y,X2 + Y → N,X1 + N → Y ,where the output is Y if n1 > n2 and N otherwise. It can be shown that union and intersections of predicates canbe computed using CRNs. However, many of these predicates are semi-linear and are slow computing (they requirelinear time). In these cases once can construct faster programs (networks) that converge to the correct output withhigh probability.

Erik Winfree (Caltech): Computing with CRNs.Prof Winfree started his talk by discussing the similarities between programming molecules to perform a cer-

tain task in synthetic biology and programming electrical systems, but also pointing out that they are also vastlydifferent. Our gap in the component-wise understanding of biological systems (smallest living thing is composedof thousands of interacting components) requires a distinct approach to bioengineering. The dynamics of CRNsare extremely varied, encompassing stable systems, oscillations, and chaos. He then proceeded to describe theimplementation of the kinetic control of DNA strand displacement, and how it can be used to create desired CRN.He presented toehold-mediated strand displacement, where an incumbent strand is initially hybridized to a com-plementary OdisplacementO domain of a target strand by base pairing, and an invader strand is complementaryboth to the displacement domain of the target and to an adjacent OtoeholdO domain, allowing it to displace theincumbent to form a more stable duplex. The displacement is initiated by binding of the invader to the toehold, thisis then followed by a branch migration process in which base pairs between target and incumbent break and arereplaced by base pairs with the invader, and the overall reaction rate for this displacement is strongly dependent onthe toehold binding strength. Distinct species get distinct domains, and reactions in the system are then based ondisplacement. Experimental displacillator results were presented: the dynamics is slow but works reasonably well(no sustained oscillations, though). Probabilistic inference was discussed, as were chemical Boltzmann machines,and the complexity of robust CRN distributions.

Page 140: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

136 Five-day Workshop Reports

Yiannis Kaznessis (University of Minnesota): Closure scheme for chemical master equations.Probability distributions (in stochastic CRNs) can be described by moments, which encode the complete infor-

mation on the distribution. Moments obey differential equations that govern the dynamic and steady state behaviorof stochastic reaction networks. However, moment equations are generally not closed (derivatives of momentsdepend on higher order moments). The zero-information closure scheme for the moment equation starts withthe premise that while higher order probability moments may not be numerically negligible, they contain littleinformation to reconstruct the master probability. Higher order moments are then related to lower order ones bymaximizing the information entropy −∑P (x) ln(P (x) of the network; this is done by solving a system of non-linear algebraic equations coming from the corresponding Lagrange multipliers. Once the moments are computed,they are used to reconstruct the probability distribution, and the stability around steady states can be assessed bycomputing the eigenvalues of the moment Jacobian. The Schlogl model was used to illustrate the power of theapproach, but also its limitations: for example, bimodality is lost when the system size is increased.

Eduardo Sontag (Rutgers): Dynamic response phenotypes in systems biology: Scale-invariance and monotoneI/O systems.

This overview talk focused on output responses to external inputs. Some adaptive systems (ranging frombacterial chemotaxis pathways to signal transduction mechanisms) have an additional feature: scale invariance,i.e. the property that output depends not on the absolute change of the input, but rather on the scaling factorby which the input changes. A theoretical framework characterizing scale invariance was discussed, and typicalstructures for scale invariant mechanisms were presented (for example, incoherent feed-forward loops). The theorycorrectly predicts that the E. coli output behavior is unchanged under scaling of its ligand input signal, which hasbeen verified experimentally. Chemosensing mechanisms, as well as cancer immunotherapy mechanisms have alsobeen observed experimentally to exhibit scale invariance, with implications to model selection and validation.

Robert Brijder (Hasselt University): Sufficient Conditions for the Eventual Dying of Reactions in DiscreteChemical Reaction Networks.

In this talk the speaker presented conditions on CRNs for some of the reactions to stop firing in the long term,followed by a corollary that gives a sufficient condition for an extinction event to occur. The condition is based onthe structure of the network alone (phrased in terms of Petri nets), and it produces a statement about the reachabilityof certain states, being therefore independent of the stochastic rates one may assign. A subconservative CRN ofstoichiometric matrix Γ is defined by the existence of a a positive vector c such that cΓ ≤ 0. A CRN is enlarged byadding edges from componentwise “larger” to “smaller” complexes, and in such a way that the reactions added donot end at complexes of a certain set Y (the enlarged network is called dom-CRN Y-admissible). It turns out thatif the set Y of complexes has a certain structural property, if the CRN is subconservative, then any Y-admissibledom-CRN is guaranteed an extinction event. Finding sufficient conditions for extinction in discrete CRNs thatenlarge the class presented here is an open problem, as is the study of computational decidability of extinctionquestions. These issues have been the focus of one of the breakout sessions.

Robert Johnson (Caltech): Formal Verification of Chemical Reaction Network Equivalence: A BisimulationApproach

This talk defined two CRNs to be equivalent if the have the same simulated behavior. This notion of equiv-alence is relevant in molecular programming, where the simplest mechanism for performing a task are often de-sirable. The setup is that of classical CRNs endowed with mass-action kinetics of rate constants set to one. Theequivalence definition relies on syntactic Markovian bisimulation, which is also central to devising algorithms thatcheck CRN equivalence. This area of research presents a series of open questions, for example can bisimulationprove some feature that a correct implementation must have, or can it give bounds bounds on the size of a correctimplementation?

Nicolette Meshkat (Santa Clara University): Using algebraic matroids and avoiding differential algebra inidentifiability, observability, and indistinguishability.

The questions of structural identifiability (i.e. uniquely recovering parameters), structural observability (uniquelyrecovering trajectories), structural indistinguishability (uniquely recovering models) in CRN input/output systemscan be attacked using algebraic tools. The example of a linear 2-compartment model was used to exemplify the

Page 141: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Mathematical Analysis of Biological Interaction Networks 137

techniques, but these are applicable to general systems. If one of the species is used as the output (and thus ob-served without error for all time, and for any input), can one then uniquely identify all the parameters? It turnsout that the answer is “no” even in this simple case. An algorithm for identifying rational combinations of param-eters is based on the input/output equations and using differential algebra. This information can be then used toextract individual parameters that are identifiable using Groebner basis and elimination ideals; however, choosinga monomial order that leads to the desired result is in general hard to do. In new work, the speaker used elementsof matroid theory to devise methods of choosing the correct order that allows certain parameters to be uniquelyidentified. Moreover, similar techniques are applicable to questions of indistinguishability.

Atsushi Mochizuki (RIKEN, Japan): Observation and Control of Complex Nonlinear Systems Based onNetwork Structures.

Understanding the dynamics of CRNs is a difficult task even when measurements on all species are possible.For example, measuring responses to changes in total amounts of enzymes is very subtle, as some changes arecounter-intuitive, while some metabolites do not change at all. This talk presents a theory based in part on sen-sitivity analysis that links qualitative responses of a CRNs to its structure alone. In particular, it is shown that aparticular set of nodes in the CRNs the feedback vertex set determines the CRN dynamics. In other words, thisimplies that steady states, periodic oscillations, quasi-periodic oscillations and other long-term behaviors, can beidentified by measurements of a subset of molecules in the network, and this subset is determined by the networkstructure alone. Moreover, change in the dynamics (for example switching from one stable steady state to another)can be achieved by controlling only the feedback vertex set. A concrete regulatory network of 90 genes was ana-lyzed using these techniques, and the theory predicted five genes that can be used to control the overall dynamics,in agreement with experimental observations.

Ankit Gupta (ETH Zurich): Numerical estimation of the stationary solution of the chemical master equation.This talk focused on numerical estimation of the stationary solution for the chemical master equation. Solving

the (generally infinite) system of ODE corresponding to the chemical master equation is a daunting task, as is, ingeneral, finding the stationary solution of the chemical master equation. If the state space is infinite, the latter isequivalent to solving an infinite dimensional linear system, which can’t be done directly. Instead, a truncation ofthe state space is considered (e.g. by truncating at certain a order of magnitude, by cutting off absorbing states, andby sending reactions to absorbing space to a designated state in the truncated space). It is shown that the stationarydistribution corresponding to the truncated system approximates the stationary distribution of the original system,if the infinite state-space is irreducible, and if the reaction dynamics is exponential ergodic (which can be checkedby constructing a Foster-Lyapunov function). The algorithm is shown to be effective even for very large statespaces (∼ 109 states), and on par with SSA performance for concrete protein pathways examples.

Matthew Johnston (San Jose State University): Network Translation and Absolute Concentration Robustness.A species is said to have absolute concentration robustness (ACR) if its attains the same value at all positive

steady states. This suggests evolved structures in biochemical reaction networks which are capable of maintainingnarrow ranges for certain reactants. This talk presented new results expanding the class of networks known to haveACR (e.g., from work of Feinberg and Shinar), and conditions that ensure that the ACR values can be determined.The technique uses tools from generalized CRNs and translated CRNs, and show that (1) if translation is deficiencyzero and weakly reversible, all complexes on common linkage class are robust; (2) if translation is deficiency one,then all pairs of nonterminal complexes are robust; and (3) the ACR value can be determined if deficiency is zeroand under a few additional conditions. These results are illustrated by way of examples, including the futile cycleand the EnvZ-OmpR signaling pathway.

Heinz Koeppl (Technische Universitat Darmstadt): Biochemical networks in random environments: modelingand inference.

Many cellular processes on the single-cell level show significant cell-to-cell variability even when the cellsare genetically identical and share the same growth conditions. An additional source of variation in intracellularsystems may be needed to account for this, as the noise from standard stochastic CRN models is insufficient toexplain the magnitude of these differences. This talk focused on adding a source of variability from the randomenvironment in which the cellular process is embedded. A generative joint model was presented in which the re-actions had additional static random variables factoring multiplicatively into the kinetic rate. The main question of

Page 142: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

138 Five-day Workshop Reports

interest was whether one could represent this enhanced model with a simpler model which behaves as if embeddedin a random environment. Various approaches, based on the chemical master equation, the Fokker-Planck equa-tion, and the Liouville equation were discussed. In an example of a first order decay reaction filtering theory wasused to make a successful simplification, in which conditional information on the additional static random variablefactors was obtained from observations of the species counts over time.

Mercedes Perez Millan (University of Buenos Aires): Checking multistationarity in MESSI systems.Many enzyme mechanisms fall under the class of MESSI (Modifications of type Enzyme-Substrate or Swap

with Intermediates) systems. Here swaps includes transfer reactions, e.g. Xp + Y XpY → X + Yp. This talkpresented new results on the study of steady states of MESSI systems; in particular, the focus was on toric MESSIsystems, i.e. systems whose steady state variety can be described by binomial equations. A theorem characterizingmultistationarity (and based on elimination of intermediates) for these systems is presented. It is shown that thereexists a unique positive steady state if and only if there is a certain sign compatibility between minors of twomatrices (one of which is the stoichiometric matrix, and the other comes from exponents of binomials in thesteady state variety). A number of common examples can be studies using this results: for example, it followsthat enzymatic cascades with common phosphotase for both layers is multistationary. The sign conditions canbe rephrased in terms of certain subspaces intersecting certain orthants. The talk concluded with a discussion ofalgorithms checking these sign conditions and their connection with finding circuits in oriented matroids.

Lea Popovic (Concordia University): Rare event calculations for chemical reaction systems.Theory of large deviations for random processes and rate calculations for frequencies of rare events was applied

to stochastic models of CRNs. In order to define “rare” events one first needs to establish a concentration ofmeasure property for the sequence of variables / sequence of processes based on some scaling parameter N (typicalexamples include the sample average of i.i.d variables and the empirical measure of i.i.d variables or of sequentialoutcomes of a Markov chain). In the chemical reaction network context the standard scaling would be to considerlarge amounts of all species and fast rates of all reactions. Rare events can then play a key role in switchingbetween stable states in CRNs whose mean dynamics exhibits bistability. This talk showed how to determine thetime scale of switching based on calculations from large deviation theory, and also compared the calculations forthe rate of switching based on a scaled Markov chain model for the CRN to the calculations for this rate based on asmall diffusion model for the same CRN. A framework for large deviation theory and results for multi-scale CRNswas also presented.

Greg Rempala (The Ohio State University): Law of large numbers for the SIR process on a random graph.Motivated by the recent Ebola outbreak, and informed by epidemiological data coming from this outbreak, the

speaker developed an SIR model evolving on a multilayer random graph with given degree distribution, whereedges in different layers correspond to potentially infectious contacts of different types. The evolution of thegraph follows the assumption that infectious individuals drop each of their contacts according to an exponentialdistribution, and that the dropped contacts, which could account for behavioral changes due to disease such asisolation or decreased mobility, cannot be recovered. This setup allows the derivation of a law of large numbers fora large graph that results in a system of ODEs which drive the evolution of various quantities of interest, such asthe proportions of infected and susceptible vertices, as the number of nodes tends to infinity. The result is shownto be in concordance with data collected during the recent Ebola outbreak in Africa.

Alan Rendall (Johannes Gutenberg-Universitı¿œt Mainz): Multiple steady states in models for the Calvincycle of photosynthesis.

A number of models have been proposed in the literature for the Calvin cycle, leading to a number of state-ments (many of which are based on numerical simulations) on the number and stability of steady states (see Prof.Rendall’s recent paper “A Calvin Bestiary”). However, many of these results are not very careful, and sometimeoverlook physical requirements on the models (e.g. that the steady states must be positive). The focus of the talk isthe rigorous analysis of a simple model with five unknowns: it is proved that there are parameters for which thereare two positive steady states, one of them stable, and a special case where there is a continuum of steady states.The analysis is made possible by identifying a weak fold bifurcation with 1D center manifold. More detailedmechanisms were also discussed. Using elementary flux modes, the Pettersson model (with fifteen unknowns

Page 143: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Mathematical Analysis of Biological Interaction Networks 139

and expressed as a system of differential-algebraic equations) is shown to have two positive equilibria for certainchoices of parameters. The Poolman model is also discussed, and it can give rise to three positive steady states.

Carsten Conradi (HTW Berlin): Multistationarity in biochemical reaction networks.There is an extensive literature on necessary conditions for multistationarity for mass-action CRNs. Most of

those results depend on the structure of the network alone, are based on injectivity of the vector field, and go backto the work of Craciun and Feinberg. Roughly speaking, if the Jacobian of the system doesn’t change sign on thepositive orthant for all values of rate constants, multistationarity is ruled out. On the other hand, finding sufficientconditions for multistationarity, and parameter regimes that allow it, is still, by and large, an open question. Thistalk presented new results in this direction (Conradi, Mincheva, Feliu, Wiuf). Using degree theory, one shows that ifthe vector field is dissipative, does not admit boundary steady states, its Brouwer degree restricted to compatibilityclasses has a predetermined sign, and if the Jacobian does not change sign, then multistationarity is not possible.On the other hand (and perhaps most importantly), if the determinant changes sign and a certain parametrizationof the steady state variety is possible, then the system admits at least two positive steady states.

Stefan Mueller (University of Vienna): Sign conditions in chemical reaction network theory.Many results on existence and uniqueness of equilibria in mass-action, or generalized mass-action systems are

expressed in terms of sign conditions on certain subspaces. This talk is an overview of these results. In the classicaltheory of complex-balanced mass-action networks, there is a very useful parametrization of the steady states. Thisextends to generalized mass action (where the monomial rates of reactions are not necessarily linked to reactantcomplexes). Two notions of deficiency can be defined in this setting, and the kinetic deficiency is intimatelylinked to complex-balancing: zero kinetic deficiency implies existence of complex balanced equilibria for all rateconstants, while kinetic deficiency equal to δ > 0 implies the existence of δ conditions on the rate constants forcomplex balancing. The existence and uniqueness of steady states for generalized mass-action systems correspondsto surjectivity and injectivity of generalized polynomial maps, which can be translated in terms of sign conditionson vectors of the stoichiometric subspace and its orthogonal complement. The talk concluded with a completecharacterization of long-term dynamics of generalized Lotka systems: depending on the kinetic monomial, thesystem exhibits various behaviors, from center oscillations to global stability.

Maya Mincheva (Northern Illinois University): Interplay between diffusion and delay.Diffusion and delay play a common role in biological pattern formation, and have mathematical similarities as

well: they both destabilize a stable equilibrium of a CRN system of differential equations. The question consideredhere is whether, when both delay and diffusion are present, the destabilizing can be achieved by diffusion or by thedelay alone. It is known since Turing that diffusion alone can destabilize an equilibrium if the diffusion constantsare not equal; but does that happen if the delay cannot destabilize the system equilibrium? The analysis presentedin this talk shows that the answer is no: If the nonsingular matrix A is strongly stable with respect to delay (i.e.linearly stable for any choice of delays), then A−D is strongly stable with respect to delay for any diagonaldiffusion matrix D. (Mincheva, Hinow, 2016). The analysis makes use of a matrix-theoretic characterizationof strong delay stability due to Hopfbauer, and involves calculations on the (quasi-linear) Jacobian of the delaysystem.

Casian Pantea (West Virginia University): Inheritance of multistationarity in CRNs: lifting nondegenerate(stable) steady states). This talk presented a series of modifications on a CRN that preserves its capacity for multi-stability/multistationarity. Some of these modifications have already appeared in the literature (in work of Craciun,Feinberg, and Joshi, Shiu); however, the talk discusses a new common framework for all these results using theimplicit function theorem on local coordinates on stoichiometric classes. The notion of “reduced Jacobian”, i.e.the Jacobian of the vector field restricted to stoichiometric classes) is a central object of the analysis. Examplesof modifications that preserve multistability are (1) adding dependent reactions (e.g. reverse reactions); (2) addinginflow and outflow; (3) adding new species and inflows and outflows of that species; (4) adding new reversiblereactions (with a nondegenerancy condition: full column rank of new species/reaction submatrix is required); (5)adding intermediate complexes involving new species (with a certain rank nondegeneracy condition); (6) addingenzymatic mechanism, i.e. replacing y → y′ by E + y I → E + y′.

Jinsu Kim (University of Wisconsin-Madison): Lyapunov functions and Tiers for ergodicity and mixingtimes of stochastic reaction networks.

Page 144: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

140 Five-day Workshop Reports

In this talk it was shown how the standard Lyapunov function for ODE models of mass-action CRN can be usedto verify ergodicity of the stochastic CRN model. More precisely, it was shown that under certain conditions theLyapunov-Foster criteria are satisfied. The approach is based on splitting monomials in tiers (following Anderson’sproof of the global attractor conjecture for the one linkage class case). A sufficient condition based on the networkalone is presented: if (1) the network is double full (i.e. it contains complexes 2A for all species A) and (2) everydouble complex has a path to a single or empty complex, then the Markov chain is positive recurrent. The talk alsoincluded a discussion on mixing time (time until distance between probability profile and stationary distributionis epsilon small). It was shown that the conditions above give rise to a “super Lyapunov function” (in whichcase the expected time to extinction has a uniform upper bound regardless of initial condition, and convergence tothe stationary distribution is in fact exponential). It is conjectured that detailed balance also implies exponentialconvergence to the stationary distribution.

James Brunner (University of Wisconsin-Madison): Robust permanence of deterministic reaction networkmodels.

In this talk the notion of tropically endotactic differential inclusions is presented. This is a far-reaching gen-eralization of toric differential inclusions, and it emphasizes the rich combinatorial structure of vector fields cor-responding to, for example, weakly-reversible mass-action networks. Tropically endotactic networks are definedstarting with a certain polyhedral fan, whose exponentiation cuts out regions in the positive orthant on each ofwhich the differential inclusion is defined by a constant cone. These regions correspond to areas of dominanceof specific source monomials, which drive the dynamics in the direction of small cones around their respectivereactions. A theorem is presented for 2D tropically endotactic differential inclusions, which are shown to be per-manent – a continuum of trapping regions (much like level sets of a Lyapunov function) is defined, which drivethe trajectory towards a compact set in the interior of the positive orthant. As an example, it is shown that if theLotka-Volterra network is perturbed by pushing reaction vectors inwards, then the resulting system gives rise to atropically endotactic differential inclusion, and the system is therefore persistent.

German Enciso (University of California-Irvine): Gave a tutorial on the stochastic models of reaction net-works on Wednesday evening.

Scientific Progress Made

Breakout sessions summaries

Tuesday June 6

Cycle balance I (reported by G. Craciun). This group focused on determining whether cycle balance couldreplace complex balance in the construction of a Lyapunov function, along the lines of the Horn and Jacksontheorem. The group established that, as stated, the question can be answered in the negative by way of counterex-ample: they constructed a cycle balanced network with three positive equilibria, which therefore can not have aglobal Lyapunov function.

Constructing Lyapunov functions for stochastic CRNs (reported by J. Kim). The standard Lyapunov functionfrom the ODE CRN model does not readily work in the stochastic case. This group started considering alternativeconstructions, including piecewise linear Lyapunov functions. For bimolecular networks, the group also considereddefining Lyapunov functions as solutions of a certain PDE, using the classical Zubov’s method.

Stochastic CRNs with extinction events I (reported by R. Brijder). The group considered the question ofwhether a given CRN has an extinction event, and what conditions on the network characterize this property.Issues of decidability and complexity of this question were also considered.

Oscillations in ODE phosphorylation models I (reported by A. Shiu). This group considered the existence ofoscillations in the double phosphorylation system. One approach discussed here was “lifting” the oscillation from

Page 145: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Mathematical Analysis of Biological Interaction Networks 141

the mixed mechanism phosphorylation (shown to exist in a recent paper by Suwanmajo and Krishnan). Using arecent result of Banaji, a new model of mixed mechanism network was shown to exhibit oscillation.

General mass-action I (reported by M. Johnston). The group discussed results from the paper of Mueller and Re-gensburger, focussing on how existence of steady states and complex-balanced steady states relate to zero networkdeficiency and zero kinetic deficiency. Connections with toric dynamical systems were also discussed.

Thursday June 8

Combining networks in stochastic CRN models (reported by D. Anderson). The group considered the questionof combining two “well-behaved” (for example, positive recurrent) stochastic CRNs: under what conditions is theresulting network also well-behaved? The group proved a series of preliminary results for small cases.

Stochastic CRNs with extinction events II (reported by R. Johnson). The use of Petri nets was considered inrelation to the question: what are the states starting at which all reactions are guaranteed to fire for all times withnonzero probability? Complexity of this and related questions have been discussed, as well as decidability issues:the group identified four quantifiers relevant to these questions.

General mass-action and cyclic balance II (reported by D. Siegel). Relevant results due to Volpert were dis-cussed, in particular the invariance of the positive orthant without monotonicity hypotheses, the connection be-tween balance equalities and convergence to equilibrium, and the role of the Petri net cycles.

Oscillations in ODE phosphorylation models II (reported by A. Shiu). The group performed numerical searchesof parameters for Hopf bifurcations in the double phosphorylation system, and also discussed the gaps in a recentpaper claiming to have found such parameters. Moreover, candidates for minimal models of oscillations wereconsidered.

Identifiability in stochastic CRNs models (reported by G. Rempala). The group fixed the gap in a long-standingproof about using MLE methods to consistently estimate parameters in stochastic models of CRNs, when incom-plete observations of trajectories are observed.

ParticipantsAnderson, David (University of Wisconsin-Madison)Boros, Balazs (Johann Radon Institute for Computational and Applied Mathematics)Brijder, Robert (Hasselt University)Brunner, Jim (University of Wisconsin-Madison)Cappelletti, Daniele (ETH Zurich)Condon, Anne (University of British Columbia)Conradi, Carsten (Hochschule fur Technik und Wirtschaft Berlin)Craciun, Gheorghe (University of Wisconsin Madison)Dickenstein, Alicia (Universidad de Buenos Aires)Enciso, German (University of California Irvine)Giaroli, Magali (University of Buenos Aires)Gupta, Ankit (ETH Zurich)Johnson, Robert (California Institute of Technology)Johnston, Matthew (San Jose State University)Joshi, Badal (California State University San Marcos)Kang, Hye-Won (University of Maryland, Baltimore County)

Page 146: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

142 Five-day Workshop Reports

Kaznessis, Yiannis (University of Minnesota)Kim, Jinsu Kim (University of Wisconsin-Madison)Koeppl, Heinz (Technische Universitaet Darmstadt)Kurtz, Thomas (University of Wisconsin)Meshkat, Nicolette (Santa Clara University)Mincheva, Maya (Northern Illinois University)Mochizuki, Atsushi (Kyoto University)Mueller, Stefan (RICAM, Austrian Academy of Sciences)Ortiz Munoz, Andres (California Institute of Technology)Pantea, Casian (West Virginia University)Perez Millan, Mercedes (Universidad de Buenos Aires)Popovic, Lea (Concordia University)Regensburger, Georg (Johannes Kepler University Linz (JKU))Rempala, Grzegorz (The Ohio State University)Rendall, Alan (Johannes Gutenberg-Universitat Mainz)Schnoerr, David (University of Edinburgh)Shiu, Anne (Texas A&M University)Siegel, David (University of Waterloo)Snarski, Michael (Brown University)Sontag, Eduardo (Rutgers University)Szederkenyi, Gabor (Computer and Automation Research Institute)Tang, Xiaoxian (Texas A&M University)Tonello, Elisa (University of Nottingham)Winfree, Erik (California Institute of Technology)Yu, Polly (University of Wisconsin-Madison)

Page 147: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] M.A. Al-Radhawi and D. Angeli. New approach to the stability of chemical reaction networks: Piecewiselinear in rates Lyapunov functions. 2015. DOI 10.1109/TAC.2015.2427691.

[2] David F. Anderson, German Enciso, and Matthew D. Johnston. Stochastic analysis of chemical reaction net-works with absolute concentration robustness. J. R. Soc. Interface, 11(93):20130943, 2014.

[3] David Angeli, Patrick de Leenheer, and Eduardo Sontag. A petri net approach to the study of persistence inchemical reaction networks. Math. Biosci., 210(2):598–618, 2007.

[4] Robert Brijder. Dominant and T-Invariants for Petri Nets and Chemical Reaction Networks. Lecture Notes inComput. Sci., 9211:1–15, 2015.

[5] Gheorghe Craciun. Toric differential inclusions and a proof of the global attractor conjecture. 2015.

[6] Gheorghe Craciun, Alicia Dickenstein, Anne Shiu, and Bernd Sturmfels. Toric dynamical systems. J. SymbolicComput., 44(11):1551–1565, 2009.

[7] Patrick de Leenheer, David Angeli, and Eduardo Sontag. Monotone chemical reaction networks. J. Math.Chem., 41:295–314, 2007.

[8] Pete Donnell and Murad Banaji. Local and global stability of equilibria for a class of chemical reaction net-works. SIAM J. Appl. Dyn. Syst., 12(2):899–920, 2013.

[9] Pete Donnell, Murad Banaji, Anca Marginean, and Casian Pantea. CoNtRol: an open source framework forthe analysis of chemical reaction networks. Bioinformatics, 30(11):1633–1634,2014.

[10] Elisenda Feliu. Injectivity, multiple zeros, and multistationarity in reaction networks. Proc. R. Soc. A.,471:2173, 2015.

[11] Elisenda Feliu and Carsten Wiuf. Enzyme sharing as a cause of multistationarity in signaling systems. J. Roy.Soc. Int., 9(71):1224–1232, 2012.

[12] Jeremy Gunawardena. Distributivity and processivity in multisite phosphorylation can be distinguishedthrough steady-state invariants. Biophys. J., 93:3828–3834, 2007.

[13] Matthew D. Johnston, David Siegel, and Gabor Szederkenyi. A linear programming approach to weak re-versibility and linear conjugacy of chemical reaction networks. J. Math. Chem., 50(1):274–288, 2012.

[14] R.L. Karp, M. Perez Millan, T. Dasgupta, A. Dickenstein, and J. Gunawardena. Complex linear invariants ofbiochemical networks. J. Theor. Biol., 311:130–138, 2012.

[15] Arjun Manrai and Jeremy Gunawardena. The geometry of multisite phosphorylation. Biophys. J., 95:5533–5543, 2009.

[16] Mercedes Perez Millan, Alicia Dickenstein, Anne Shiu, and Carsten Conradi. Chemical reaction systemswith toric steady states. Bull. Math. Biol., 74(5):1027–1065, 2012.

143

Page 148: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

144 Five-day Workshop Reports

[17] Stefan Muller, Elisenda Feliu, George Regensburger, Carsten Conradi, Anne Shiu, and Alicia Dickenstein.Sign conditions for injectivity of generalized polynomial maps with applications to chemical reaction networksand real algebraic geometry. Found. Comput. Math., 2015.

[18] Stefan Muller and Georg Regensburger. Generalized mass action systems: Complex balancing equilibria andsign vectors of the stoichiometric and kinetic-order subspaces. SIAM J. Appl. Math., 72(6):1926–1947, 2012.

[19] Murad Banaji and Casian Pantea. Some results on injectivity and multistationarity in chemical reaction net-works, SIAM J. Appl. Dyn. Syst., 15(2), 807?869

[20] Loic Pauleve, Gheorghe Craciun, and Keinz Koeppl. Dynamical properties of discrete reaction networks. J.Math. Biol., 69(1):55–72, 2014.

[21] Guy Shinar and Martin Feinberg. Structural sources of robustness in biochemical reaction networks. Science,327(5971):1389–1391, 2010.

[22] David Anderson, Gheorghe Craciun, and Thomas Kurtz. Product-form stationary distributions for deficiencyzero chemical reaction networks, Bulletin of mathematical biology 72, 2010

[23] Gheorghe Craciun, Minh-Binh Tran. A toric dynamical system approach to the convergence to equilibriumof quantum Boltzmann equations for Bose gases, 2016.

Page 149: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 13

Nilpotent Fundamental Groups (17w5112)

June 18 - 23, 2017Organizer(s): Jan Minac (Western University), Florian Pop (University of Pennsylvania),Adam Topaz (Oxford), Kirsten Wickelgren (Georgia Institute of Technology)

Overview of the Field

One of the main guiding principles in modern Galois theory is the deep interaction between arithmetic and ge-ometry. This principle is evident in Grothendieck’s etale fundamental group of a geometrically connected variety,which is an extension of the absolute Galois group of the base field by the geometric fundamental group. Suchgroups have been studied via their nilpotent/unipotent completions for many decades, simply because such com-pletions are more amenable to cohomological calculations. This point of view has seen a recent surge of activity,primarily revolving around new results in anabelian geometry, as well as new structural results about absoluteGalois groups arising from the norm-residue isomorphism.

Recent Developments

Anabelian Geometry

Nilpotent and solvable quotients of fundamental/Galois groups play a central role in recent results in anabelian ge-ometry. In relationship with the section conjecture, nilpotent quotients of geometric fundamental groups providea plethora of obstructions for the existence of sections, and therefore also to the existence of rational points. Asmentioned above, the main benefit of working with such obstructions is that they can be studied using cohomo-logical methods. More precisely, these obstructions manifest themselves as Massey products in cohomology; suchresults have been pervasive in the work of Wickelgren. However, it is still unknown whether these obstructionsare complete. For instance, there are known counter-examples due to Hoshi to the geometrically pro-p variant ofthe section conjecture, as well as counter-examples due to Wickelgren to the geometrically 2-step nilpotent sectionconjecture. But it is still unknown which minimal variants of the section conjecture are plausible.

Nilpotent quotients of Galois groups also play a key role in the birational setting. For instance, Pop proveda refinement of the birational p-adic section conjecture using mod-p meta-abelian quotients of absolute Galoisgroups. This strengthens Koenigsmann’s original proof of the birational section conjecture which used absoluteGalois groups. It’s important to mention, however, that many of Pop’s methods actually work with the smaller“abelian-by-central” (i.e. 2-step nilpotent) quotients, but it is still open whether or not the mod-p abelian-by-central variant of the birational p-adic section conjecture holds true.

145

Page 150: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

146 Five-day Workshop Reports

Abelian-by-central quotients of Galois groups play a central role in Bogomolov’s program in birational an-abelian geometry, whose goal is to reconstruct higher-dimensional geometric function fields using the pro-` abelian-by-central quotient of their absolute Galois group. This program was formulated into a precise functorial conjectureby Pop, and this formulation is now commonly known as the Bogomolov-Pop conjecture. While the general state-ment of the conjecture is still wide open, it has been proven in several important cases by Bogomolov-Tschinkel,Pop and Silberstein. If successful, Bogomolov’s program would go far beyond Grothendieck’s original anabelianphilosophy for two primary reasons. First, it considers purely geometric objects whereas Grothendieck’s anabelianphilosophy revolved around a strong interaction of arithmetic and geometry. And second, and it considers “almost-abelian” Galois groups, whereas Grothendieck’s anabelian philosophy revolved around Galois groups which werehighly non-abelian.

Some of the key steps in Bogomolov’s Program are by now well-developed, but there are still many openquestions. For instance, the “local theory” detects decomposition/inertia groups of so-called “quasi-divisorialvaluations” using pro-` abelian-by-central Galois groups; this theory was developed over the last several yearsby Bogomolov-Tschinkel, Pop, and Topaz. It is still a major open problem to give similar recipes for divisorialvaluations. The “global theory” is the second major component of Bogomolov’s Program, and Pop has somegeneral results in this direction. In the global theory, it is still a major open problem to construct so-called “rationalquotients” using group-theoretical methods. Finally, there is some recent work due to Topaz which suggests theplausibility of a mod-` abelian-by-central variant of Bogomolov’s Program, although there currently no knowncomplete case of this mod-` variant.

Lastly, we discuss the question of Ihara from the 1980’s about finding a (non-tautological) combinatorial de-scription of absolute Galois groups. Being motivated by Ihara’s work on braid groups, this combinatorial de-scription should arise from the Galois action of absolute Galois groups on geometric fundamental groups. Thisquestion was formulated into a precise conjecture by Oda-Matsumoto, based on motivic evidence, and this ques-tion/conjecture is now commonly referred to as the “I/OM.” The original I/OM using the full geometric funda-mental groups of algebraic varieties was proven by Pop (unpublished) in the 90’s. Nevertheless, using anabeliantechniques mentioned above, Pop recently proved a strengthening of this result which instead considers the Galoisaction on pro-` abelian-by-central quotients of geometric fundamental groups. It is still a major open questionwhether similar results hold for the geometric fundamental group of certain small categories of varieties, such asthe Teichmuller modular tower.

Galois Groups and the Norm-Residue IsomorphismIn another direction, one has the subject of determining explicit properties about the structure of large Galoisgroups of fields, specifically absolute Galois groups and maximal pro-` Galois groups. Most such results rely onthe recent proof of the norm-residue isomorphism theorem due to Voevodsky-Rost et al. This highly-celebratedtheorem imposes strict restrictions on the Galois cohomology ring, and determining the precise implications ofthese restrictions on the structure of absolute Galois groups is an active research area. In the same vein, there isalso the open question of making some aspects of the norm-residue isomorphism explicit by tying it to explicitconstructions of certain pro-` Galois extensions.

In the last few years, there has been a surge of interest in this area, specifically concentrated around the van-ishing of higher Massey products in Galois cohomology. This is particularly highlighted in the works of Hopkins-Wickelgren, Minac-Tan and Efrat-Matzri, which show that certain higher Massey products vanish over essentiallyall fields. Still, at this point it remains open whether all higher Massey products vanish when they are defined.In any case, such results have important group-theoretical consequences which are summarized in the so-calledKernel-Unipotent Conjecture. Some partial results towards this conjecture were recently obtained by Efrat-Minacand Minac-Tan, but the general conjecture remains open.

A related topic is about formality and 1-formality in Galois cohomology. Massey products are a basic ob-struction to the formality and 1-formailty of the differential graded algebra (DGA) underlying Galois cohomology.Positselski has recently provided examples where absolute Galois groups are not formal. This implies that there ismore information in the DGA than is available from the Norm Residue theorem. The work discussed above givesinformation about this DGA, which could potentially lead to a refinement of the Norm Residue theorem itself.There are also interesting questions worth exploring in connection with the cyclotomic character and formality

Page 151: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Nilpotent Fundamental Groups 147

of Galois cohomology in certain cases, and some striking connections, due to Positselski, between Koszulity andBogomolov’s conjecture about the freeness of the commutator subgroup of p-Sylow subgroups of absolute Galoisgroups.

A second facet in this subject comes from the Elementary Type Conjecture, which aims to determine thestructure of finitely-generated maximal pro-`Galois groups of fields. These results are closely related to the “local-theory” in anabelian geometry which was mentioned above. In the last several years, it has become apparent thatabelian-by-central quotients of absolute Galois groups completely control much of the local theory, in “tame”situations. On the other hand, the very recent work of Koenigsmann-Strommen suggests that similar results mightalso hold in “wild” situations. A complete picture of the local theory in both the tame and wild situations couldpotentially lead to the resolution of the Elementary-Type conjecture.

Unipotent Fundamental Groups and Hyperplane ArrangementsThe pro-nilpotent completion of the geometric fundamental group has a natural algebraic analogue in the pro-unipotent fundamental group. The pro-unipotent fundamental group has been an object of great interest ever sincethe work of Deligne, Sullivan, Morgan and Griffiths, who studied the de Rham fundamental groups of compactKahler manifolds using Hodge theory. More generally, Deligne’s theory of mixed hodge structures and the weightfiltration has been extensively used to study unipotent fundamental groups of possibly non-proper varieties. Bynow, unipotent fundamental groups are fairly ubiquitous, and they are particularly pervasive throughout the workof Hain-Matsumoto, Deligne-Goncharov, as well as many others.

Such ideas have also taken center stage in the theory of hyperplane arrangements, as appears in the workof Cohen, Dimca, Denham, Papadima, Suciu, Wang and others. In this context, the connection between the(unipotent) fundamental group of the complement of a hyperplane arrangement and the combinatorial structure(matroid) of the hyperplane arrangement is of particular interest, and determining the precise connection is a majorresearch area. A very similar question comes up in higher-dimensional (birational) anabelian geometry, where onemust understand the precise connection between the geometry of a divisor and the group-theoretical structure ofinertia elements in the fundamental group of its complement. In connection with section (2), formality, 1-formality,graded-formality, etc. for fundamental groups of complements of a hyperplane arrangements are again questionswhich are of great interest in the theory, where such questions can be studied using the weight filtration in de-Rhamand/or `-adic cohomology.

The theory of unipotent fundamental groups and hyperplane arrangements can therefore be seen as a bridgebetween subjects (1) and (2) above. Moreover, there is a clear overlap between the three subjects, and somesimilarity between the techniques they use. But there are also some significant differences between their methods,as they strive for different goals. Therefore, we believe that increased interaction between these three subjectscould lead to new collaborations and new results, by rethinking current research areas from new points of view. Inthis respect, the primary goal of this workshop was to facilitate such new interaction between the three subjectsmentioned above.

Open ProblemsDuring two open problem sessions on Thursday and Friday, workshop participants presented and discussed openproblems in the field. Other open problems were discussed during talks and breaks. Here is a list of these openproblems.

Thursday1. (K. Wickelgren, presenting an open problem posed by Micheal J. Hopkins and K. Wickelgren). Let k be a

field. The nth Milnor K-theory group KMn (k) is defined by tensoring the underlying multiplicative group

k∗ of k with itself n-times, and quotienting out by the subgroup generated by elements a1 ⊗ a2 ⊗ . . .⊗ anwhere ai + aj = 1 for some i 6= j, i.e.,

KMn (k) = (k∗)⊗n/〈a1 ⊗ a2 ⊗ . . .⊗ an : ai + aj = 1〉. (13.0.1)

Page 152: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

148 Five-day Workshop Reports

Milnor K-theory thus combines the multiplication on k with the addition in some interesting manner. Viewthe presentation (13.0.1) as a field arithmetic description.

Let Galk denote the absolute Galois group of k. The Milnor/Bloch-Kato Conjecture proved by Voevodskyand Rost provides the following description of the cohomology ring H∗(Galk) of absolute Galois groups.Let N be prime to the chracteristic of k. Applying H∗(Galk,−) to the Kummer exact sequence

1→ µN → Gm z 7→zN→ Gm → 1

produces the sequence

k∗z 7→zN→ k∗ → H1(Galk, µN )→ 0,

and therefore the Kummer isomorphism k∗/(k∗)N ∼= H1(Galk, µN ) = H1(Galk,Z/N(1)). The cupproduct then produces a map (k∗)⊗n → Hn(Galk,Z/N(n)). The Steinberg relation implies that this mapfactors through a map from mod N Milnor K-theory

KMn (k)/N ∼= Hn(Galk,Z/N(n)),

and the Milnor/Bloch-Kato Conjecture is that this map is an isomorphism. For example, we have a ringisomorphism

H∗(Galk,Z/2) ∼= ⊕n(k∗)⊗n/〈x⊗ (1− x), 2〉,(the Milnor conjecture). The Milnor/Bloch-Kato Conjecture is also called the Norm-Residue isomorphismtheorem.

There is a differential graded algebra DGA of cochains C∗(Galk,Z/2) computing the cohomology withZ/2-coefficients, and the coefficient group can be generalized as well, but with Z/2-coefficients, the com-plication from the twists is absent. One could alternatively assume that k contains roots of unity, or perhapsuse all twists. Let C∗(Galk) denote a DGA arising in this way, but not specifying the coefficients.

Question: Is there a field arithmetic description of the DGA of cochainsC∗(Galk) refining the Milnor/Bloch-Kato description of Galois cohomology?

In [13], it was asked if C∗(Galk) is formal, i.e., quasi-isomorphic to H∗(Galk). If it were, then the fieldarithmetic description given in Milnor/Bloch-Kato would be the desired refinement. Interestingly, it is not,as shown by Positselski [30]. So there is information contained in C∗(Galk) which is lost on passage tocohomology. Is it possible to give a generator-relation description of C∗(Galk) in terms of the arithmetic ofk?

In his paper [30, page 226] Positselski mentions that he is not aware of any counterexample to formality ofC∗(Galk) if all roots of unity of powers ln are in the field where Z/l are considered coefficients in Galoiscohomology. Also it is interesting to notice that Example 6.3 can be considered as analogue of an examplein [MS]. This was discussed with J. Minac and A. Topaz during the conference, and there is still a possibilitythat that Galois cohomology and cyclotomic character can contain information obtained from C∗(Galk). Infact there maybe possibly “enhanced notion of formality” taking into account also some Bockstein mapsin Galois cohomology which could possibly explain Positselski’s examples and could still lead to a deepformality-like property of C∗(Galk).

2. (F. Bogomolov) Let K be a function field over an algebraically closed field k of characteristic 6= `. Let Gbe a Sylow-` subgroup of the absolute Galois group of K. It turns out that the isomorphism class of G (as apro-` group) only depends on trdeg(K|k) and k. Let G(2) denote the commutator subgroup of the profinitegroup G.

Conjecture: G(2) is a free pro-` group.

This conjecture provides a strategy to give an alternative proof of the Bloch-Kato conjecture. Indeed, con-sider the spectral sequence associated to G Gab:

Hi(Gab,Hj(G(2),Z`))⇒ Hi+j(G,Z`)

Page 153: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Nilpotent Fundamental Groups 149

The conjecture implies that the Ei,j2 terms of this spectral sequence all vanish for j ≥ 2. The main point isthat elements of Ei,02 = Hi(Gab,Z`) can all be represented as sums of symbols.

The cohomology of Gab behaves similarly to the cohomology of an abelian variety. It might be the case thatGab can be seen as a fundamental group of some sort of “universal Albanese variety” associated to k andtrdeg(K|k). For a more detailed discussion, refer to [1].

3. (D. Litt) Let X be a smooth geometrically-connected curve over a finite field k of characteristic 6= `. Afterextending the base field, we may assume that the Frobenius acts trivially on H1(X,F`). Consider the actionof Frobenius (Frob) on the F` group-algebra

F`[[π1(X)]].

Question: What are the sizes of the Jordan blocks of this action? More precisely, define:

r(n) = minm : (Frob− I)m | F`[[π1]]/In = 0

(here I denotes the augmentation ideal.) Here are some interesting questions one might ask:

(a) Can r(n) be bounded non-trivially?

(b) How does r(n) change on passing to finite etale covers of X?

Satisfactory answers to these questions would lead to an alternate proof of a conjecture of de Jong (whichhas been proven by Gaitsgory). Idea: The above construction should give a notion of weights which shouldbehave well when dealing with F` coefficients.

4. (M. Florence) Let U be an open subscheme of SpecZ and let X → U be a smooth projective (arithmetic)curve. For each closed point x ∈ U , consider the fibre Xx above x, as well as the number of rational pointsXx(k(x)).

Question: Is there an algorithmic way to describe #(Xx(k(x))), as x varies, using the polynomials definingX?Comment: Since we expect equidistribution of Frobenius elements, the number of points should behaverandomly.

5. (D. Harbater) LetX be a smooth affine hyperbolic curve over an algebraically closed field k of characteristicp > 0.

Question: Does πet1 (X) determine X up-to isomorphism?

If X is the affine line over the algebraic closure of a finite field, then the answer is yes (this is a theo-rem of Tamagawa). It was pointed out by Hoshi during the discussions, that the answer is also yes for aonce-punctured elliptic curve.

In general, it is known that πet1 (X) (in the above context) determines the genus of X , the number of punc-

tures, the cardinality and the characteristic of the base field. (This is also due to Tamagawa.)

Remark (Bogomolov): In the cases that we know, it suffices to use certain small quotients of π1.

6. (M. Florence) Let G be a profinite group endowed with a character χ : G → Z×p . Let Z/pn(1) denote theG-module whose underlying abelian group is Z/pn, and such that G acts on Z/pn(1) via χ. Assume that(G,χ) satisfies a formal version of Hilbert 90. In other words, for all open subgroups U of G, the canonicalmap

H1(U,Z/pn(1))→ H1(U,Z/p(1))

is surjective. Some questions that one could ask about such (G,χ):

(a) Which groups satisfy this property? (Examples include free profinite groups, Demuskin groups.)

Page 154: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

150 Five-day Workshop Reports

(b) Do such groups always arise as Galois groups? I.e., does there exist a field F of characteristic 6= p, andan extension E of F which is p-closed, such that G = Gal(E|F ) and χ is the cyclotomic character?

Related topics were discussed in C. Quadrelli’s talk. See [7, 16, 21] for related papers.

Friday

1. (A. Topaz) Let g be a positive integer, and consider the (discrete) group:

Σg := 〈a1, b1, . . . , ag, bg :∏i

[ai, bi] = 1〉.

Let Σg denote the pro-` completion of Σg . Does there exist a field F of characteristic 6= ` which containsµ`∞ , such that Σg is isomorphic to Gal(F (`)|F )? Does such a field have any “distinguished” valuations?Note: This is a special case of question (6)(b) above.

2. (J. Minac, continuing with (1)) More generally, do all pro-p Demuskin groups arise as maximal pro-p Galoisgroups of fields (of characteristic 6= p)? See [15, 16, 25, 26].

3. (J. Minac) For a field F of characteristic 6= 2, let W (F ) denote the Witt ring of quadratic forms of F . Doesthere exists a field F such that W (F ) is isomorphic to W (Q2), and such that 2 is a square in F ?

4. (J. Minac) Let F be a field of characteristic 6= p which contains µp, and let Gal(n)F denote the p-Zassenhaus

filtration on GalF . Can one describe the cohomology rings H∗(GalF /Gal(n)F ,Z/p) in terms of generators

and relations? (Note, the answer is “yes” for n = 2 and n =∞).

5. (A. Topaz) Let X be a smooth variety over C, and let U be a (Zariski) open subvariety of X . Does thereexist a smaller (Zariski) open subvariety V of U such that V is (rationally) 1-formal?

6. (F. Pop, J. Minac) Does there exist a proof of the Bloch-Kato conjecture which is purely formal in nature?In other words, what group-theoretical properties of the absolute Galois group of a field (endowed with acharacter mimicking the cyclotomic character) ensure that the Bloch-Kato conjecture holds true?

7. (P. Guillot, J. Minac, A. Topaz, N. D. Tan) Let F be a field of characteristic 6= ` such that µ` ⊂ F , andlet x1, . . . , xn ∈ H1(F,Z/`) be given. Assume that the n-fold Massey product 〈x1, . . . , xn〉 is non-empty.Does 〈x1, . . . , xn〉 contain 0? This is the so-called n-Massey vanishing conjecture in Galois cohomology,which was formulated by Minac-Tan [24, 23], and which was strongly influenced by the previous workof Hopkins-Wickelgren [13]. For recent results towards this conjecture and beyond, refer to [11, 12, 19].One can also consider a weaker version of the above question, where one additionally requires that all thecup-products xi ∪ xj vanish; this weaker version seems easier, but it is still far from trivial.

Presentation Highlights

The presentations in the workshop consisted of five survey talks (60 minutes) and 14 research talks (45-50 minutes).The survey talks were certainly among the highlights, while the shorter research talks presented the state of the arton the various topics covered in the workshop. All of the talks in the workshop were extremely interesting andadded to the success of the workshop. Several speakers provided supplemental materials, all of which has beenmade available on the workshop web-page. Finally, almost all of the talks were recorded and can be viewed on theworkshop web-page.

Page 155: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Nilpotent Fundamental Groups 151

Survey Talks

The survey talks were some of the major highlights of the workshop. They helped considerably to introduce someof the main topics and results of the workshop, and they were successful in stimulating many further discussionsthroughout the week. Here is a brief summary of each of these survey talks.

A. Topaz gave a survey of almost-abelian anabelian geometry. As described in this talk, the goal of almostabelian anabelian geometry is to recover arithmetic and geometric information from two-step nilpotent Galois-theoretical data. The talk focused primarily on two topics: Bogomolov’s Programme in birational anabelian geom-etry, and the Ihara/Oda-Matsumoto conjecture. Aspects from both the pro-` and mod-` variants of the theory werediscussed. Some relevant references include [3, 4, 28, 29, 27, 33].

T. Szamuely gave a survey of fundamental groups and their variants. This was an introduction to several vari-ants of fundamental groups that arise in algebraic and arithmetic geometry. In addition to the usual topologicalfundamental group and Grothendieck’s etale fundamental group, this survey included a description of Nori’s fun-damental group scheme, and Deligne’s pro-algebraic fundamental group. Wherever possible, this talk highlightedboth the Tannakian point of view, and the point of view using torsors. The speaker graciously provided typesetnotes from his talk, which are available on the workshop webpage.

J. Stix gave a survey of the section conjecture. This is a major open conjecture, which he explained and thendiscussed examples and analogues, including a proof for abelian varieties over finite fields, and birational andminimalistic analogues. The speaker has written an excellent book on the subject [31].

K. Wickelgren gave a survey of Massey products in Galois cohomology, discussing vanishing results, notablythe major advance to 4-fold Massey products in [12], as well as applications to the structure of Galois groups andautomatic realization results. The first problem listed in the Open Problems section was also discussed.

A. Suciu gave a survey of formality notions for spaces and groups, and the slides from his talk are availableon the workshop webpage. His talk provides many open problems: namely, he gave results on cdga’s over fieldsof characteristic 0 and explained to the audience that many of these results may hold (and may not hold) in finitecharacteristic, and suggested the possibility of understanding the analogous structure in the case of finite charac-teristic. He discussed resonance varieties, characteristic varieties, and the Tangent cone Theorem as an alternativeto Massey products for disproving the formality of a space.

Research Talks

The first day of the workshop had two research talks which focused on birational anabelian geometry and the Galoisaction on fundamental groups. M. Ludtke spoke about his recent results [18] concerning anabelian geometryfor one-dimensional function fields over algebraically closed fields. D. Litt spoke about his recent results [17]concerning the Galois action on `-adic unipotent completions of geometric fundamental groups.

One primary focus of the second day was the section conjecture and its variants. I. Dan-Cohen spoke abouthis recent preprint [9], joint with T. Schlank, about rational motivic homotopy theory and its connections withKim’s unipotent variant of the section conjecture. A. Betts spoke about his recent preprint [5] which studies localhights on abelian varieties from a motivic/anabelian point of view; a handout from this lecture is available onthe workshop web-page. F. Bogomolov spoke about a rational, almost-abelian variant of the section conjecturefor higher-dimensional function fields over algebraically closed fields, which appears in his joint work with M.Rovinsky and Y. Tschinkel [2].

There were two additional talks on the second day whose topic was unrelated to the section conjecture. E.Bayer-Fluckiger spoke about cohomological invariants of G-Galois algebras, and recent results establishing theHasse principle for their self-dual normal bases; the slides from this talk are available on the workshop web-page.D. Neftin spoke about local approximation and specialization, emphasizing recent new results in the context of theGrunwald problem.

The third day of the workshop focused on recent developments in the context of Massey products in Galoiscohomology. E. Matzri spoke about his recent work [19] concerning the vanishing of triple Massey products withhigher weight in Galois cohomology. P. Guillot spoke about his recent joint work with J. Minac, A. Topaz, N. D.Tan and O. Wittenberg [11, 12], concerning vanishing of quadruple Massey products in the Galois cohomology ofnumber fields.

Page 156: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

152 Five-day Workshop Reports

The morning session on the fourth day of the workshop saw two talks related to Galois cohomology. A. Schultzspoke about the parametrizing space of bi-cyclic elementary p-extensions and its Galois-module structure, in anal-ogy with Kummer theory. S. Chebolu spoke about his joint work with J. Carlson and J. Minac [6], concerning thefinite-generation problem in Tate cohomology of finite groups.

The afternoon session on the fourth day saw two additional talks with different focuses. Y. Hoshi spokeabout his recent joint work with S. Mochizuki and A. Minamide [14], which gives a new simple group-theoreticalcharacterization of the Grothendieck-Teichmuller group. D. Harbater spoke about the Galois theory of arithmeticfunction fields, especially his work on local-to-global principles using patching techniques, which is part of hislong-term ongoing project joint with J. Hartmann, D. Krashen, J.-L. Colliot-Thelene, Parimala and Suresh.

The last day of the workshop saw a single research talk by C. Quadrelli, who spoke about his ongoing jointwork with I. Efrat, J. Minac and T. Weigel, which studies the group-theoretical structure of absolute Galois groups,endowed with their cyclotomic character, from a cohomological point of view.

Informal Discussions

In addition to the survey and research talks, many participants of the workshop also engaged in several informaldiscussions and explanations of recent research. Here are three which we are aware of. T. Feng explained hisrecent paper [8] using cohomology operations to study a question of Tate. M. Florence explained some ideasbehind his recent joint work with C. De Clercq [10], which aims to give a new simple and purely algebraic proofof the norm-residue isomorphism theorem. A. Topaz explained his recent work [32] concerning reconstruction ofhigher dimensional function fields from their generic cohomology endowed with their infinite-dimensional mixedHodge structure.

Scientific Progress Made

J. Minac and A. Topaz discussed ideas related to the differential graded algebra of etale cochains on Spec k. R.Davis and K. Wickelgren made progress on a project on the 2-nilpotent mod p quotient of the etale fundamentalgroup, which is joint with R. Pries. F. Bogomolov, F. Pop, A. Suciu and A. Topaz had some discussions around theopen problem concerning formality (c.f. §3.2 (5)) and its connection with structure of cohomology and anabeliangeometry. P. Guillot, J. Minac, A. Topaz and O. Wittenberg had some discussions concerning Massey productsin Galois cohomology, which could lead to further progress in their work. Further discussions between the teamabove, E. Matzri, and D. Neftin also took place, which could possibly lead to new advances towards the n-Masseyvanishing conjecture.

These are just a few of the research discussions that we are aware of, but certainly many more took place.Especially important were several informal discussions between young researchers, including graduate studentsand postdocs, and more senior researchers, which will lead to new research exchanges. We expect that many ofthese discussions will eventually lead to new collaborations and new results, which combine ideas and tool fromdifferent areas.

Outcome of the Meeting

The primary goal of this workshop was to increase interaction between the three subjects mentioned in theoverview: anabelian geometry; the group theory of pro-` and absolute Galois groups via the norm-residue isomor-phism; and (unipotent) pro-algebraic fundamental groups. From this point of view, the workshop was extremelysuccessful, and this is backed up by comments we received from several participants. We hope to organize a similarworkshop again in the near future.

Page 157: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Nilpotent Fundamental Groups 153

ParticipantsBayer-Fluckiger, Eva (Ecole Polytechnique Federale de Lausanne)Betts, Alexander (University of Oxford)Bogomolov, Fedor (New York University)Chebolu, Sunil (Illinois State University)Chetard, Beatrice (The University of Western Ontario)Dan-Cohen, Ishai (Ben Gurion University of the Negev)Davis, Rachel (University of Wisconsin-Madison)Feng, Tony (MIT)Florence, Mathieu (Universite Paris 6)Frankel, Brett (Northwestern University)Gille, Stefan (University of Alberta)Guillot, Pierre (Universite de Strasbourg)Haase, Bastian (Emory University)Hall, Chris (Western University)Harbater, David (University of Pennsylvania)Hoshi, Yuichiro (Kyoto University)Litt, Daniel (Columbia University)Ludtke, Martin (Universitat Frankfurt)Matzri, Eliyahu (Bar-Ilan University)Merling, Mona (Johns Hopkins University)Minac, Jan (Western University)Neftin, Danny (Technion)Obus, Andrew (University of Virginia)Palaisti, Marina (The University of Western Ontario)Pop, Florian (University of Pennsylvania)Quadrelli, Claudio (University of Milan-Bicocca)Schultz, Andrew (Wellesley College)Silberstein, Aaron (University of Chicago)Srinivasan, Padmavathi (Georgia Institute of Technology)Stix, Jakob (Goethe-Universitat Frankfurt)Suciu, Alexandru (Northeastern University)Szamuely, Tamas (Renyi Institute, Hungarian Academy of Sciences)Topaz, Adam (University of Oxford)Tripathy, Arnav (Harvard University)Vogt, Isabel (Massachusetts Institute of Technology)Wickelgren, Kirsten (Georgia Institute of Technology)Wittenberg, Olivier (Ecole normale superieure)

Page 158: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] F. A. Bogomolov, On the structure of Galois groups of the fields of rational functions, K-theory and algebraicgeometry: connections with quadratic forms and division algebras (Santa Barbara, CA, 1992), 83-88,

[2] F. A. Bogomolov, M. Rovinsky and Y. Tschinkel, Multiplicative homomorphisms of fields preserving algebraicdependence, preprint, arXiv:1709.00639.

[3] F. A. Bogomolov and Y. Tschinkel, Reconstruction of function fields, Geom. Funct. Anal. 18 (2008), no. 2,400-462.

[4] F. A. Bogomolov and Y. Tschinkel, Reconstruction of higher-dimensional function fields, Mosc. Math. J. 11(2011), no. 2, 185-206.

[5] A. Betts, The motivic anabelian geometry of local heights on abelian varieties, preprint (2017),arXiv:1706.04850.

[6] J. Carlson, S. Chebolu and J. Minac, Ghosts and Strong Ghosts in the Stable Module Category, Can. Math.Bull. 59 (2016), 682-692.

[7] I. Efrat and C. Quadrelli, The Kummerian property and maximal pro-p Galois groups, preprint (2017)arXiv:1707.07018.

[8] T. Feng, Steenrod Operations in Etale Cohomology and Applications to the Brauer Group of a Surface over afinite Field arXiv: 1706.00151, 2017.

[9] I. Dan-Cohen and T. Schlank, Rational motivic path spaces and Kim’s relative unipotent section conjecture,preprint (2017), arXiv:1703.10776.

[10] C. De Clercq and M. Florence, Smooth profinite groups and applications, preprint (2016).

[11] P. Guillot, J. Minac and N.D. Tan, Four-fold Massey Products in Galois cohomology for all primes p, preprint.

[12] P. Guillot, J. Minac and A. Topaz, with an appendix by O. Wittenberg. Four-fold Massey products in Galoiscohomology, preprint (2016) arXiv:1610.05748.

[13] M. J. Hopkins and K. G. Wickelgren, Splitting varieties for triple Massey products, J. Pure Appl. Algebra219 (2015), no. 5, 1304–1319.

[14] Y. Hoshi, A. Minamide and S. Mochizuki, Group-theoreticity of numerical invariants and distinguishedsubgroups of configuration space groups, RIMS preprint: RIMS-1870 (2017).

[15] B. Jacob and R. Ware, A recursive description of the maximal pro-2 Galois group via Witt rings, Math. Z.200, (1989), no. 3, 379–396.

[16] J. Labute, Classification of Demushkin groups, Canad. J. Math. 19 (1967),106–132.

[17] D. Litt, Arithmetic Restrictions on Geometric Monodromy, preprint (2016), arXiv:1607.05740.

154

Page 159: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Nilpotent Fundamental Groups 155

[18] M. Ludtke, A Birational Anabelian Reconstruction Theorem for Curves over Algebraically Closed Fields inArbitrary Characteristic, preprint (2017), arXiv:1706.01713.

[19] E. Matzri, Triple Massey Products with weight in Galois cohomology, preprint (2017), arXiv:1609.07927.

[20] J. Minac, F.W. Pasini, C. Quadrelli and N.D. Tan, Quadratic duals and Koszul Algebras in Galoiscohomology, preprint.

[21] J. Minac, M. Rogelstad and N.D.Tan, Relations in the maximal pro-p-quotient of an absolute Galois group,preprint.

[22] J. Minac and M. Spira, Witt rings and Galois groups, Ann. Math. (144) (1996), 35-60.

[23] J. Minac and N. D. Tan, Triple Massey products and Galois theory, J. Eur. Math. Soc. 19 (2017), no. 1,255-284.

[24] J. Minac and N. D. Tan, The kernel unipotent conjecture and the vanishing of Massey products for odd rigidfields, Adv. Math. 273 (2015), 242-270.

[25] J. Minac and R. Ware, Pro-2-Demushkin groups of rank ℵ0 as Galois groups of maximal 2-extensions offields, Math. Ann. 292 (1992), no. 2, 337–353.

[26] J. Minac and R. Ware, Demushkin groups of rank ℵ0 as absolute Galois groups, Manuscripta Math. 73 (1991),no. 4, 411–421.

[27] F. Pop, Finite tripod variants of I/OM, preprint (2016).

[28] F. Pop, On the birational anabelian program initiated by Bogomolov I, Invent. Math. 187 (2012), no. 3,511-533.

[29] F. Pop, Reconstruction of Function Fields from their pro-l Abelian divisorial inertia, preprint (2016).

[30] L. Positselski, Koszulity of cohomology = K(π, 1)-ness + quasi-formality J. Algebra 483 (2017), 188–229.

[31] J. Stix, Rational Points and Arithmetic of Fundamental Groups: Evidence for the Section Conjecture LectureNoes in Mathematics 2054, Springer-Verlag Berlin Heidelberg, 2013.

[32] A. Topaz, A Torelli Theorem for Higher-Dimensional Function Fields, preprint (2017), arXiv:1705.01084.

[33] A. Topaz, The Galois action on geometric lattices and the mod-` I/OM, preprint (2016), arXiv:1510.08836.

[34] V. Voevodsky, On motivic cohomology with Z/l-coefficients, Ann. of Math. (2) 174 (2011), no. 1, 401–438.

Page 160: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 14

Diophantine Approximation and AlgebraicCurves (17w5045)

July 2 - 7, 2017Organizer(s): Michael Bennett (University of British Columbia), Aaron Levin (Michigan StateUniversity), Jeff Thunder (Northern Illinois University)

Introduction

The first topic of the workshop, Diophantine approximation, has at its core the study of rational numbers whichclosely approximate a given real number. This topic has an ancient history, going back at least to the first ra-tional approximations for π. The adjective Diophantine comes from the third century Hellenistic mathematicianDiophantus, who wrote an influential text solving various equations in integers and rational numbers (and whosename is now also attached to such “Diophantine equations”). The subject rose to prominence in the last centurybeginning with the seminal work of Thue, and it has been the source of a large number of deep and influentialresults, including far-reaching work of Siegel, Roth, Baker, Schmidt, and Faltings, among others. It remains ahighly active area at the forefront of number theory and mathematics.

The second topic of the workshop, algebraic curves, has long been implicit and in the background of Diophan-tine problems. In this direction, the fundamental role of algebraic curves was cemented in the influential 1922conjecture of Mordell that an algebraic curve of genus at least two possesses only finitely many rational points.The interplay between geometry and arithmetic has increased rapidly since that time, and the use of increasinglyadvanced tools from algebraic and arithmetic geometry has led to the solution of many outstanding and previouslyinaccessible problems, including the resolution of Mordell’s conjecture (by Faltings).

The workshop centered on the interplay between Diophantine approximation and algebraic curves, with inter-connections to a diverse array of topics in algebra, geometry, analysis, and logic, among others.

Overview of the Field

A foundational topic in Diophantine approximation is the study of rational numbers which closely approximatea given real number. Naıvely, since the rational numbers are dense in the real numbers, one may approximatea real number arbitrarily well by a rational number. Diophantine approximation studies the closeness of thisapproximation in terms of the “size” of the rational number (e.g., the denominator). Specifically, if α is a realnumber, one can study the rational solutions p

q to an inequality of the form

156

Page 161: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Diophantine Approximation and Algebraic Curves 157

∣∣∣∣α− p

q

∣∣∣∣ < c(α)

qδ, (14.0.1)

where c(α) and δ are positive real numbers.If δ = 2, then an elementary result of Dirichlet shows that for any irrational real number α, the set of rational

solutions pq to (14.0.1) is infinite when c(α) = 1 (or even c(α) = 1√

5, due to Hurwitz). The theory of continued

fractions provides a method for explicitly calculating such good approximations.In a complementary direction, Liouville (1844) showed that if α is an algebraic number of degree d ≥ 2,

then there exists an explicit positive constant c(α) such that (14.0.1) has no solutions for δ ≥ d. Liouville’sresult is already sufficient to show that numbers such as

∑∞i=1 10−i! are transcendental, giving a first link between

Diophantine approximation and transcendence problems. On the other hand, Liouville’s inequality is typically tooweak to obtain interesting consequences for Diophantine equations.

A breakthrough occurred in 1909 when Thue [53] improved the exponent in Liouville’s result to δ > 12d+1. As

an application, Thue proved the finiteness of integer solutions x, y ∈ Z to equations (now called Thue equations)of the form F (x, y) = a, where a ∈ Z and F ∈ Z[x, y] is an irreducible binary form of degree d ≥ 3.

Siegel [50], in 1921, expanded on Thue’s method to prove finiteness for exponents δ > 2√d. The improvement

to an exponent δ of order o(d) proved essential when, in 1929, Siegel [52] proved his famous theorem on integralpoints on curves: if C ⊂ An is an affine curve defined over a number field k with positive geometric genus orpossessing at least 3 (geometric) points at infinity, then the set of points ofC with coordinates in the ring of integersof k, C(Ok), is finite. More generally, in this case finiteness holds for any set of S-integral points C(Ok,S), whereOk,S (S finite) is a ring of S-integers of k. Siegel’s theorem on integral points provides a basic link betweenthe two topics of the workshop, and more generally, represents an early instance of the deep and fundamentalinterplay between geometry and arithmetic. In the case of rational affine curves, Siegel’s theorem is equivalent tothe finiteness of solutions to the so-called S-unit equation

au+ bv = c, u, v ∈ O∗k,S , (14.0.2)

where a, b, c ∈ k∗ are nonzero constants. This equation had been studied earlier by Siegel (when O∗k,S = O∗k) andin the above generality by Mahler. In fact, one can study the S-unit equation (and Siegel’s theorem) in far moregeneral contexts (e.g., when u, v lie in a finitely generated subgroup of C∗).

Siegel’s result was improved to δ >√

2d by Gelfand and Dyson (1947) (independently), and finally, theoptimal result δ > 2 was obtained in 1955 by Roth [43]: If α is an algebraic number, ε > 0, and C > 0, then thereare only finitely many rational numbers p

q ∈ Q satisfying∣∣∣∣α− p

q

∣∣∣∣ < C

q2+ε. (14.0.3)

Beginning with work of Mahler, Roth’s theorem was subsequently generalized by Ridout and Lang [33, 42]to an arbitrary fixed number field k (in place of Q) and to allow for finite sets of absolute values (including non-archimedean ones).

In contrast to Liouville’s result, Roth’s theorem is ineffective. That is, given α an algebraic number, the methodof proof does not yield a way, in general, to compute the finitely many solutions to (14.0.3). In fact, this defect isalready present in all of the results on the inequality (14.0.1) beginning with Thue (consequently, Siegel’s theoremon integral points is also ineffective). This problem of effectivity remains a major open problem. On the otherhand, it is possible to derive an upper bound on the number of solutions to Roth’s inequality (14.0.3) and relatedproblems. For instance, beginning with work of Evertse [25], many authors have studied the problem of producingbounds for the number of solutions to the S-unit equation (14.0.2) that, remarkably, depend only on the cardinality|S|. For example, Beukers and Schlickewei [7] have proven the bound 216|S|.

In the 1960’s, in a series of papers, Alan Baker [5] revolutionized the subject by producing effective lowerbounds for linear combinations of logarithms of algebraic numbers. Baker’s results found numerous applications,including effective bounds for solutions to the unit equation (14.0.2) (from which many other Diophantine equa-tions may be effectively solved). Analogous lower bounds for linear forms in p-adic logarithms were obtained byvan der Poorten [40], Yu [59], and others.

Page 162: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

158 Five-day Workshop Reports

In a different direction, Schmidt [48] proved a deep higher-dimensional generalization of Roth’s theorem tothe setting of hyperplanes in projective space. Schmidt’s Subspace Theorem, as it is known, has found numerousapplications to Diophantine problems (more precisely, the version typically applied includes subsequent improve-ments due to Schlickewei [45], analogous to Ridout-Lang’s generalization of Roth’s theorem). Among the manyapplications is the fundamental result of Evertse [24] and van der Poorten and Schlickewei [41] on the n-term unitequation: all but finitely many solutions of the unit equation

u1 + u2 + . . .+ un = 1, u1, . . . , un ∈ O∗k,S , (14.0.4)

satisfy an equation of the form∑i∈I ui = 0, where I is a nonempty subset of 1, . . . , n.

In 1983, another leap forward occurred when Faltings [27] proved Mordell’s conjecture dating from 1922: forany number field k and any smooth projective curve C of genus at least two, the set of rational points C(k) isfinite. Thus, the qualitative behavior of rational points on algebraic curves is completely determined by the crudestgeometric invariant of a curve, the genus. Although Faltings’ original proof avoided Diophantine approximation,Vojta [55] subsequently found a new and influential proof of the Mordell conjecture based in Diophantine approxi-mation. Building on this work, Faltings [28] proved a conjecture of Lang classifying the subvarieties of an abelianvariety which contain a Zariski dense set of rational points, and another conjecture of Lang generalizing Siegel’stheorem to affine open subsets of an abelian variety. Even more generally, Vojta [56, 57] extended Faltings’ resultsto subvarieties of semi-abelian varieties.

Generalizing Mordell’s conjecture, Bombieri and Lang conjectured that if X is a variety of general type, overa number field k, then the set of rational points X(k) is not Zariski dense in X . In fact, the conjecture can beextended to include integral points and varieties of log general type. In this form, the Bombieri-Lang conjectureunifies all of the aforementioned qualitative results on integral and rational points. In particular, it contains asspecial cases the results of Siegel and Faltings for curves, and the results of Faltings and Vojta for integral andrational points on subvarieties of semi-abelian varieties.

Returning to Diophantine approximation, and ending the overview, we mention Vojta’s conjectures [54], whichposit a precise inequality for algebraic points on a variety, at once quantifying the Bombieri-Lang conjecture andgeneralizing the Diophantine approximation inequalities of Roth and Schmidt. Vojta’s conjectures came as theresult of the surprising discovery of analogies between Diophantine approximation and Nevanlinna theory, thequantitative theory that grew out of Picard’s classical theorem in complex analysis on entire functions omitting twovalues. This has led to the development of strong ties and analogies between arithmetic and complex geometry, inaddition to the more classical analogies between number fields and function fields.

Recent Developments and Presentation HighlightsIn this section we discuss select recent developments, with a view towards the presentations given at the workshop,and give a description of some of the presentation highlights.

Schmidt’s Subspace Theorem and Its GeneralizationsSchmidt’s Subspace Theorem remains one of the most powerful Diophantine approximation results available, andconstitutes the central tool in a diverse and increasingly large number of applications (e.g., see Bilu’s survey article[8]). Generalizing the Subspace Theorem to other contexts began with work of Faltings and Wustholz [29], andmore recently, Corvaja and Zannier [22] and Evertse and Ferretti [26] have proven generalizations to higher-degreehypersurfaces in projective space and to more general projective varieties. Closely related techniques have beenused to study integral points on varieties, beginning with a new proof of Siegel’s theorem by Corvaja and Zannier[21], and continuing in their work [23] and work of Levin [34], Corvaja, Levin, and Zannier [19], and Autissier[2].

Continuing this line of research, Paul Vojta (University of California, Berkeley) spoke on joint work withMin Ru on birational Nevanlinna constants. In earlier work of Ru, a new invariant, the Nevanlinna constant,was introduced to clarify and unify the Diophantine approximation method (and analogues in Nevanlinna theory)developed in the previous work. Vojta spoke on variants of this constant, and corresponding generalizations of the

Page 163: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Diophantine Approximation and Algebraic Curves 159

Subspace Theorem, which go further by incorporating the filtrations introduced in Autissier’s work, in addition toother novel ideas.

Unit equations and other related equationsThe unit equation (14.0.2) (and more generally (14.0.4)) is ubiquitous in number theory, and many other famousDiophantine equations (e.g., Thue-Mahler equations, Mordell equations, hyperelliptic equations) can be reducedto it. Such an observation goes back to at least the 1926 work of Siegel [51] (written under the pseudonym X), whoused the unit equation to study integral points on affine hyperelliptic curves. The groundbreaking work of Bakerled to effective height bounds for solutions to the unit equation (and a host of other Diophantine equations) and thepossibility of practical general algorithms. We can (loosely) organize a number of the presentations around thesecore themes.

Height bounds and algorithmic aspects

The classical approach to producing effective height bounds for solutions to the unit equation (and related Diophan-tine equations) comes from the theory of linear forms in logarithms (including p-adic and elliptic versions). Sincethe bounds produced by this method are typically too large to admit a naıve exhaustive search, finding practicalalgorithms for the resolution of such Diophantine equations has been intensively studied, with key developmentsgoing back to Baker and Davenport [4] and the introduction of the LLL-algorithm by de Weger [58].

Building on observations of Frey [30] and the proof of the Taniyama-Shimura conjecture [14], recently “modu-lar” approaches to the unit equation, and related equations, have been developed (over the rational numbers). Workin this direction includes results of Murty and Pasten [37], Bennett and Billerey [5], and Benjamin Matschke(University of Bordeaux), in joint work with Rafael von Kanel, who spoke on solving S-unit, Mordell, Thue,Thue-Mahler, and generalized Ramanujan-Nagell equations via the Taniyama-Shimura conjecture. There areclose relations between all of these classical Diophantine equations, and Matschke focused on the Mordell equa-tion y2 = x3 + a, where a ∈ Z is fixed and x and y lie in a ring of S-integers ZS = Z[1/NS ], where S is agiven set of rational primes and N =

∏p∈S p. Several improvements to the theory and practical resolution of

the Mordell equation were given, including improved height bounds for the solutions (using an approach basedon the Taniyama-Shimura conjecture, as opposed to linear forms in logarithms) and improvements to the knownalgorithms for solving the Mordell equation, including novel sieving techniques. Applications to elliptic curvedatabases were discussed, including the explicit computation of all elliptic curves (up to isomorphism) with goodreduction outside a set of of sufficiently small primes.

Beth Malmskog (Villanova University) discussed implementing a solver for S-unit equations in Sage andapplications to algebraic curves (joint work with Alejandra Alvarado, Angelos Koutsianas, Christopher Rasmussen,Christelle Vincent, and Mckenzie West). In contrast to Matschke’s talk, a key focus of the algorithm was handlingS-unit equations over arbitrary number fields. As an application, all 63 Q-isomorphism classes of Picard curveswith good reduction away from 3 were computed (by a result of Borner-Bouw-Wewers, all Picard curves over Qhave bad reduction at 3).

Adela Gherga (University of British Columbia) spoke on implementing algorithms to compute elliptic curvesover Q. Specifically, she discussed difficulties with implementing the algorithms of de Weger and Tzanakis,particularly with respect to performing p-adic computations. This is an area where many current computer algebrapackages are rather lacking.

Closely related inequalities and equations

Jan-Hendrik Evertse (Universiteit Leiden) discussed joint work with Yann Bugeaud and Kalman Gyory on S-parts of values of binary forms and decomposable forms. If S = p1, . . . , pn is a finite set of primes, a ∈ Z,a 6= 0, and we have a factorization a = pi11 · · · pinn a′ with pi - a′, 1 ≤ i ≤ n, then the S-part of a is defined to be[a]S = pi11 · · · pinn . For a decomposable form F ∈ Z[X1, . . . , Xm], let

α(F ) = infα | ∀S,∃c > 0 such that [F (x)]S ≤ c|F (x)|α for all x ∈ Zm, gcd(x1, . . . , xm) = 1, F (x) 6= 0.

Page 164: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

160 Five-day Workshop Reports

If F is a binary form (m = 2) of degree n ≥ 3 and nonzero discriminant, then it is proven that α(F ) = 2n .

More generally, they prove results for decomposable forms under certain conditions, and give quantitative resultscounting the number of solutions to related inequalities.

Yann Bugeaud (University of Strasbourg) discussed the binary representation of smooth numbers. He proveda series of results all with the general theme that a large integer cannot simultaneously have only very small primedivisors and very few nonzero binary digits. The given new results are effective, and rely on estimates for linearforms in complex and p-adic logarithms of algebraic numbers.

Shabnam Akhtari (University of Oregon) discussed joint work with Jeffrey Vaaler on finding height inequal-ities for units in the ring of S-integers of a number field. More specifically, if h(·) denotes the (absolute) Weilheight, then Akhtari discussed the problem of finding estimates for the quantity

minh(βγ) | γ ∈ Γ,

where Γ is a given subset of a group of S-units O∗S of k, and β 6= 0 is a nonzero S-integer. Aside from intrinsicinterest, such height inequalities have applications to Diophantine equations, and in particular, the relevance tosolving norm form equations was discussed.

Unit equations and arithmetic dynamics

The application of unit equations to arithmetic dynamics, and in particular as a tool to study periodic and preperi-odic points of a rational function, goes back to at least Narkiewicz [38], who used unit equations to study rationalperiodic points of monic polynomials. In the more general setting of rational functions, recent works of Canci[15], Canci and Paladino [16], and Canci and Vishkautsan [3] have used unit equations as an essential tool tostudy rational preperiodic points and give partial results towards the Morton-Silverman conjecture in arithmeticdynamics.

Extending this line of results, Sebastian Troncoso (Michigan State University) spoke on rational preperiodicpoints and hypersurfaces in projective space. In the one-dimensional case, if φ : P1 → P1 is an endomorphismof degree d ≥ 2 over a number field k, and φ has good reduction outside of a set of places S of k (includingarchimedean places), then Troncoso shows that the number |PrePer(φ, k)| of k-rational preperiodic points satisfies

|PrePer(φ, k)| ≤ 5 · 216|S|d3 + 3.

A sharper bound is obtained for strictly preperiodic points. In higher dimensions, quantitative results are obtainedfor the number of rational preperiodic hypersurfaces satisfying certain conditions. Key ingredients in the proofsare bounds for the number of solutions to S-unit equations and Thue-Mahler equations.

Runge’s methodAn old method of Runge [44], dating from 1887, proves the effective finiteness of the set of integral points oncertain affine curves. In its most basic form, Runge proved that if f ∈ Q[x, y] is an absolutely irreducible polyno-mial and the leading term of f factors nontrivially over Q into nonconstant coprime polynomials, then the set ofsolutions to

f(x, y) = 0, x, y ∈ Z,is finite and can be effectively computed. More generally, Runge’s method can be used to prove effective finitenessof S-integral points on an affine curve C when |S| is small compared to the number of rational components of Cat infinity. Bombieri [12] proved a uniform version of Runge’s theorem, allowing the number field K and set ofplaces S to vary.

More recent developments include a generalization of Runge’s method to higher dimensions by Levin [35], aseries of papers by Bilu and Parent [9, 10, 11] applying Runge’s method to modular curves and solving certaincases of a well-known conjecture of Serre, an application in arithmetic dynamics to integral points in orbits byCorvaja, Sookdeo, Tucker, and Zannier [20], and an application to the Nagell-Ljunggren equation by Bennett andLevin [6].

Page 165: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Diophantine Approximation and Algebraic Curves 161

Further developing these themes, Samuel Le Fourn (ENS Lyon) spoke on Runge’s method and its applica-tion to integral points on Siegel modular varieties. Expanding on Levin’s higher-dimensional version of Runge’smethod, Le Fourn proves a “tubular” version of Runge’s method which permits wider applicability to higher-dimensional problems on integral points. As an application, Le Fourn proves an explicit finiteness result forintegral points on the Siegel modular variety A2(2).

Unlikely intersectionsIf X and Y are subvarieties in a space of dimension n > dimX + dimY , then one usually expects that theintersectionX∩Y will be empty (the intersection is “unlikely”). In fact, ifX is fixed and Y runs through an infinitefamily of suitable subvarieties with dimX + dimY < n, then one still expects that X has small intersection withthe entire family unless X is of a certain special type related to the family. One may view conjectures of Lang,Manin-Mumford, Zilber, Andre-Oort, Bombieri-Masser-Zannier, and others in this general setting. The topic hasbeen developing rapidly, with notable contributions including work of Bombieri, Masser, and Zannier [13], Maurin[36], Habegger [31, 32], and Pila [39].

Within this framework, Laura Capuano (Oxford University) spoke on joint work with Barroero on unlikelyintersections in families of abelian varieties. In particular, the case where X is a curve in a family of abelianvarieties was studied, and applications were given to the study of families of polynomial Pell equations.

Perfectoid spaces and p-adic hyperbolicityVojta’s dictionary [54] gives a precise way to “translate” between statements in Diophantine approximation andstatements in (complex) Nevanlinna theory. This analogy between the two subjects has proved very fruitful andinfluential, and has led to substantial progress in both subjects. Since at least the 1980’s, a theory of p-adic (or non-Archimedean) Nevanlinna theory has been developed, and in particular, various notions of p-adic hyperbolicitywere introduced by Cherry [18]. Similar to the classical case, a dictionary between p-adic Nevanlinna theory andDiophantine approximation over the rational numbers was studied by An, Levin, and Wang in [1].

The recent theory of perfectoid spaces, due to Scholze [49], has been quickly recognized as a major advance,and has already found many deep applications in diverse areas. In this context, Ariyan Javanpeykar (Universityof Mainz) discussed the notion of p-adic hyperbolicity and its relation to other common notions of hyperbolicity.Using Scholze’s theory of perfectoid spaces, he presented a proof of a strong p-adic hyperbolicity statement formoduli spaces of polarized abelian varieties.

Transcendence TheoryMichael Coons (University of Newcastle) spoke on joint work with Yohei Tachiya on problems in the transcen-dence theory of functions of one complex variable. The talk centered on proving generalizations and extensionsof a result of Duffin and Schaeffer. A primary result yielded the transcendence, over the field of meromorphicfunctions, of complex functions with a finite (positive) bounded radius of convergence satisfying certain radialunboundedness conditions. As an application, one obtains that the field C(z)(ez,

∑n≥0 z

2n) has transcendencedegree two over C(z), answering a function-theoretic analogue of a question of Nishioka on the algebraic inde-pendence of e and the number

∑n≥0 2−2n .

Noriko Hirata-Kohno (Nihon University) discussed the independence of values of the polylogarithm functionand a generalized polylogarithm function with periodic coefficients. The method of proof relies on the constructionof new Pade approximations built on earlier (unpublished) work of Nishimoto.

Further Presentation HighlightsRichard Guy (University of Calgary) spoke on some relations between number theory and the Euclidean geometryof triangles. He discussed some classical geometric configurations of lines and circles associated to triangles,including configurations arising from the Simson line theorem, and raised some open problems about the structureof a related graph.

Page 166: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

162 Five-day Workshop Reports

Lajos Hajdu (University of Debrecen) discussed the problem of finding well approximating lattices for afinite set of points (joint work with Andras Hajdu and Robert Tijdeman). After giving some motivation for theproblem coming from medical image processing, Hajdu defined an appropriate measure for the approximation ofa lattice to a finite set of points. Under this notion, results on finding well-approximating lattices were given inall dimensions, with more complete results in the one-dimensional case. The techniques used include the LLLalgorithm and techniques from simultaneous Diophantine approximation.

Fabien Pazuki (University of Copenhagen) spoke on inequalities for the j-invariants of isogenous ellipticcurves. Specifically, Pazuki showed that if h denotes the absolute logarithmic Weil height, E1 and E2 are ellipticcurves over Q with j-invariants j1 and j2, respectively, and φ : E1 → E2 is an isogeny, then the following twoinequalities hold:

|h(j1)− h(j2)| ≤ 9.204 + 12 log deg φ,

h(j1)− h(j2) ≤ 10.68 + 6 log deg φ+ 6 log(1 + h(j1)).

Some applications of the inequalities were discussed, including the problem of bounding the coefficients of mod-ular polynomials.

Cameron Stewart (University of Waterloo) spoke on joint work with Stanley Xiao on the representation ofintegers by binary forms. Let F be a binary form with integer coefficients, non-zero discriminant, and degreed ≥ 2. Let RF (Z) denote the number of integers of absolute value at most Z which are represented by F .Computing RF (Z) in the case of binary quadratic forms (d = 2) is classical, and goes back to work of Fermat,Lagrange, Legendre, and Gauss. The cubic case (d = 3) was studied by Hooley, and many authors have studiedvarious higher degree classes of binary forms. For d ≥ 3, Stewart and Xiao are able to give a precise asymptoticresult, showing that there is a positive number CF such thatRF (Z) is asymptotic to CFZ2/d. The techniques usedinclude Heath-Brown’s p-adic determinant method, as refined by Salberger.

Amos Turchet (University of Washington) discussed joint work with Kenneth Ascher on the uniformity ofintegral points on curves and surfaces. A well-known result of Caporaso-Mazur-Harris shows that, assuming theBombieri-Lang-Vojta conjecture, there should exist a uniform bound B(k, g) for the number of k-rational pointson a smooth projective curve C, where B(k, g) depends only on the number field k and the genus g of C. Higher-dimensional uniformity results on rational points in the same vein were proven by Hassett, Abramovich, andAbramovich-Voloch. Turchet discussed partial results towards a program for proving an integral points analogueof Caporaso-Mazur-Harris’s result, extending results of Abramovich for integral points on elliptic curves. Geo-metrically, this involves extending the previous uniformity results from varieties of general type to varieties of loggeneral type.

ParticipantsAkhtari, Shabnam (University of Oregon)Bauer, Mark (University of Calgary)Bennett, Michael (University of British Columbia)Bremner, Andrew (Arizona State University)Bugeaud, Yann (Universite de Strasbourg)Capuano, Laura (Oxford University)Coons, Michael (University of Newcastle)Evertse, Jan-Hendrik (Universiteit Leiden)Gherga, Adela (University of British Columbia)Guy, Richard (The University of Calgary)Hajdu, Lajos (university of Debrecen)HIRATA-Kohno, Noriko (Nihon University)Ingram, Patrick (York University)Javanpeykar, Ariyan (University of Mainz)Kundu, Debanjana (University of Toronto)Le Fourn, Samuel (University of Lyon (ENS de Lyon))

Page 167: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Diophantine Approximation and Algebraic Curves 163

Levesque, Claude (Universite Laval)Levin, Aaron (Michigan State University)Malmskog, Beth (Villanova University)Matschke, Benjamin (Boston University)McKinnon, David (University of Waterloo)Morrow, Jackson (Emory University)Mosunov, Anton (University of Waterloo)Nguyen, Khoa (University of Calgary)Park, Jennifer (Ohio State University)Pazuki, Fabien (University of Copenhagen)Pinter, Akos (University of Debrecen)Plante, Thomas (Michigan State University)Sharma, Divyum (University of Waterloo)Stewart, Cameron (University of Waterloo)Thunder, Jeff (Northern Illinois University)Troncoso, Sebastian (Michigan State University)Tucker, Tom (University of Rochester)Turchet, Amos (University of Washington)Tzanakis, Nikos (University of Crete)Vojta, Paul (University of California at Berkeley)Walsh, Gary (University of Ottawa)Wang, Julie Tzu-Yueh (Academia Sinica)Widmer, Martin (Royal Holloway, University of London)Yasufuku, Yu (Nihon University)Yazdani, Soroosh (University of Waterloo)

Page 168: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] T. T. H. An, A. Levin, and J. Wang, A p-adic Nevanlinna-Diophantine correspondence, Acta Arith. 146 (2011),no. 4, 379–397.

[2] P. Autissier, Sur la non-densite des points entiers, Duke Math. J. 158 (2011), no. 1, 13–27.

[3] A. Baker, Linear forms in the logarithms of algebraic numbers. I, II, III, Mathematika 13 (1966), 204-216;ibid. 14 (1967), 102-107; ibid. 14 (1967), 220–228.

[4] A. Baker and H. Davenport, The equations 3x2 − 2 = y2 and 8x2 − 7 = z2, Quart. J. Math. Oxford Ser. (2)20 (1969), 129–137.

[5] M. A. Bennett and N. Billerey, Sums of two S-units via Frey-Hellegouarch curves, Math. Comp. 86 (2017),no. 305, 1375–1401.

[6] M. Bennett and A. Levin, The Nagell-Ljunggren equation via Runge’s method, Monatsh. Math. 177 (2015),no. 1, 15–31.

[7] F. Beukers and H. P. Schlickewei, The equation x+ y = 1 in finitely generated groups, Acta Arith. 78 (1996),no. 2, 189–199.

[8] Y. Bilu, The many faces of the subspace theorem [after Adamczewski, Bugeaud, Corvaja, Zannier. . .],Asterisque (2008), no. 317, Exp. No. 967, vii, 1–38, Seminaire Bourbaki. Vol. 2006/2007.

[9] Y. Bilu and P. Parent, Integral j-invariants and Cartan structures for elliptic curves, C. R. Math. Acad. Sci.Paris 346 (2008), no. 11-12, 599–602.

[10] Y. Bilu and P. Parent, Runge’s method and modular curves, Int. Math. Res. Not. IMRN (2011), no. 9, 1997–2027.

[11] Y. Bilu and P. Parent, Serre’s uniformity problem in the split Cartan case, Ann. of Math. (2) 173 (2011), no. 1,569–584.

[12] E. Bombieri, On Weil’s “theoreme de decomposition”, Amer. J. Math. 105 (1983), no. 2, 295–308.

[13] E. Bombieri, D. Masser, and U. Zannier, Intersecting a curve with algebraic subgroups of multiplicativegroups, Internat. Math. Res. Notices (1999), no. 20, 1119–1140.

[14] C. Breuil, B. Conrad, F. Diamond, and R. Taylor, On the modularity of elliptic curves over Q: wild 3-adicexercises, J. Amer. Math. Soc. 14 (2001), no. 4, 843–939.

[15] J. Canci, Finite orbits for rational functions, Indag. Math. (N.S.) 18 (2007), no. 2, 203–214.

[16] J. Canci and L. Paladino, Preperiodic points for rational functions defined over a global field in terms of goodreduction, Proc. Amer. Math. Soc. 144 (2016), no. 12, 5141–5158.

[17] J. Canci and S. Vishkautsan, Scarcity of cycles for rational functions over a number field, Trans. Amer. Math.Soc. Ser. B, to appear.

164

Page 169: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Diophantine Approximation and Algebraic Curves 165

[18] W. Cherry, Hyperbolic p-adic analytic spaces, Ann Arbor, MI, 1993, Thesis (Ph.D.)–Yale University.

[19] P. Corvaja, A. Levin, and U. Zannier, Integral points on threefolds and other varieties, Tohoku Math. J. (2) 61(2009), no. 4, 589–601.

[20] P. Corvaja, V. Sookdeo, T. J. Tucker, and U. Zannier, Integral points in two-parameter orbits, J. Reine Angew.Math. 706 (2015), 19–33.

[21] P. Corvaja and U. Zannier, A subspace theorem approach to integral points on curves, C. R. Math. Acad. Sci.Paris 334 (2002), no. 4, 267–271.

[22] P. Corvaja and U. Zannier, On a general Thue’s equation, Amer. J. Math. 126 (2004), no. 5, 1033–1055.

[23] P. Corvaja and U. Zannier, On integral points on surfaces, Ann. of Math. 160 (2004), no. 2, 705–726.

[24] J.-H. Evertse, On sums of S-units and linear recurrences, Compositio Math. 53 (1984), no. 2, 225–244.

[25] J.-H. Evertse, On equations in S-units and the Thue-Mahler equation, Invent. Math. 75 (1984), no. 3, 561–584.

[26] J.-H. Evertse and R. G. Ferretti, A generalization of the Subspace Theorem with polynomials of higherdegree, Diophantine approximation, Dev. Math., vol. 16, SpringerWienNewYork, Vienna, 2008, pp. 175–198.

[27] G. Faltings, Endlichkeitssatze fur abelsche Varietaten uber Zahlkorpern, Invent. Math. 73 (1983), no. 3, 349–366.

[28] G. Faltings, The general case of S. Lang’s conjecture, Barsotti Symposium in Algebraic Geometry (AbanoTerme, 1991), Perspect. Math., vol. 15, Academic Press, San Diego, CA, 1994, pp. 175–182.

[29] G. Faltings and G. Wustholz, Diophantine approximations on projective spaces, Invent. Math. 116 (1994),no. 1-3, 109–138.

[30] G. Frey, Links between solutions of A−B = C and elliptic curves, Number theory (Ulm, 1987), LectureNotes in Math., vol. 1380, Springer, New York, 1989, pp. 31–62.

[31] P. Habegger, Intersecting subvarieties of abelian varieties with algebraic subgroups of complementarydimension, Invent. Math. 176 (2009), no. 2, 405–447.

[32] P. Habegger, On the bounded height conjecture, Int. Math. Res. Not. IMRN (2009), no. 5, 860–886.

[33] S. Lang, Fundamentals of Diophantine geometry, Springer-Verlag, New York, 1983.

[34] A. Levin, Generalizations of Siegel’s and Picard’s theorems, Ann. of Math. (2) 170 (2009), no. 2, 609–655.

[35] A. Levin, Variations on a theme of Runge: effective determination of integral points on certain varieties, J.Theor. Nombres Bordeaux 20 (2008), no. 2, 385–417.

[36] G. Maurin, Courbes algebriques et equations multiplicatives, Math. Ann. 341 (2008), no. 4, 789–824.

[37] M. R. Murty and H. Pasten, Modular forms and effective Diophantine approximation, J. Number Theory 133(2013), no. 11, 3739–3754.

[38] W. Narkiewicz, Polynomial cycles in algebraic number fields, Colloq. Math. 58 (1989), no. 1, 151–155.

[39] J. Pila, O-minimality and the Andre-Oort conjecture for Cn, Ann. of Math. (2) 173 (2011), no. 3, 1779–1840.

[40] A. J. van der Poorten, Linear forms in logarithms in the p-adic case, (1977), 29–57.

[41] A. J. van der Poorten and H. P. Schlickewei, Additive relations in fields, J. Austral. Math. Soc. Ser. A 51(1991), no. 1, 154–170.

Page 170: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

166 Five-day Workshop Reports

[42] D. Ridout, The p-adic generalization of the Thue-Siegel-Roth theorem, Mathematika 5 (1958), 40–48.

[43] K. F. Roth, Rational approximations to algebraic numbers, Mathematika 2 (1955), 1–20; corrigendum, 168.

[44] C. Runge, Uber ganzzahlige Losungen von Gleichungen zwischen zwei Veranderlichen, J. Reine Angew.Math. 100 (1887), 425–435.

[45] H. P. Schlickewei, The p-adic Thue-Siegel-Roth-Schmidt theorem, Arch. Math. (Basel) 29 (1977), no. 3,267–270.

[46] W. M. Schmidt,, Approximation of algebraic numbers, Diophantine approximation (Cetraro, 2000), LectureNotes in Math., vol. 1819, Springer, Berlin, 2003, pp. 107–170.

[47] W. M. Schmidt, Simultaneous approximation to algebraic numbers by rationals, Acta Math. 125 (1970),189–201.

[48] W. M. Schmidt, Norm form equations, Ann. of Math. (2) 96 (1972), 526–551.

[49] P. Scholze, Perfectoid spaces, Publ. Math. Inst. Hautes Etudes Sci. 116 (2012), 245–313.

[50] C. L. Siegel, Approximation algebraischer Zahlen, Math. Z. 10 (1921), no. 3-4, 173–213.

[51] C. L. Siegel, The Integer Solutions of the Equation y2 = axn + bxn−1 + · · ·+ k, J. London Math. Soc. 1(1926), 66–68.

[52] C. L. Siegel, Uber einige Anwendungen diophantischer Approximationen, Abhandlungen der PreußischenAkademie der Wissenschaften. Physikalisch-mathematische Klasse 1929, Nr. 1.

[53] A. Thue, uber Annaherungswerte algebraischer Zahlen, J. Reine Angew. Math. 135 (1909), 284–305.

[54] P. Vojta, Diophantine approximations and value distribution theory, Lecture Notes in Mathematics, vol. 1239,Springer-Verlag, Berlin, 1987.

[55] P. Vojta, Siegel’s theorem in the compact case, Ann. of Math. (2) 133 (1991), no. 3, 509–548.

[56] P. Vojta, Integral points on subvarieties of semiabelian varieties. I, Invent. Math. 126 (1996), no. 1, 133–181.

[57] P. Vojta, Integral points on subvarieties of semiabelian varieties. II, Amer. J. Math. 121 (1999), no. 2, 283–313.

[58] B. M. M. de Weger, Algorithms for Diophantine equations, CWI Tract, vol. 65, Stichting MathematischCentrum, Centrum voor Wiskunde en Informatica, Amsterdam, 1989.

[59] K. Yu, Linear forms in p-adic logarithms. III, Compositio Math. 91 (1994), no. 3, 241–276.

Page 171: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 15

Women in Control: New Trends in InfiniteDimensions (17w5123)

July 16 - 21, 2017Organizer(s): Luz de Teresa (Universidad Nacional Autonoma de Mexico), Irena Lasiecka(University of Memphis), Kirsten Morris (University of Waterloo)

Overview

Control theory as a discipline has received considerable attention over the years. There are important applicationsto such diverse fields as engineering and medical sciences and challenging mathematical questions have made itrelevant from both the abstract and applications point of view. The first developments in mathematical theory werefor ordinary differential equation models. The need for more accurate descriptions and the constitutive relationsin physical models such as structures and fluids give rise to partial differential equations (PDE’s). The analysis ofthese models is strongly tied to several areas of mathematics, both pure and applied. Besides PDE’s, geometricanalysis, functional analysis, harmonic analysis, differential geometry, combinatorics and graph theory, optimiza-tion, numerical analysis and computational mathematics are all relevant. The last 20 years or so have witnessedgreat interaction and synergism where long-standing questions have found unexpected answers but these resultshave opened up new avenues of research. Because the state of systems with PDE models is a time-varying func-tion that evolves on an infinite-dimensional state space, such systems are often referred to as infinite-dimensionalsystems.

With the impetus of technological problems demonstrating a need for models based on a continuum ratherthan a discrete description of phenomena, control theory has evolved into infinite-dimensions. This means theuse of models described as the objects in infinite-dimensional Hilbert/Banach spaces with concrete representationsstemming from PDE’s. It was the goal of this workshop to review the major trends in this area and to introducea wealth of problems and techniques which could provide a springboard to further research. This conferencewas a perfect opportunity to put together people working in the field of infinite-dimensional systems from diversebackgrounds and with different expertise.

Control theory in the last 20 years or so has undergone tremendous evolution. From vibrant developmentsin linear finite-dimensional control, with almost complete understanding of fundamental concepts such as stabi-lization, controllability, optimal control in this context. A wealth of applications pushed the borders into PDE’smodeling by asking similar questions for infinite-dimensional systems. The unifying language then became that ofsemigroups which provide a common description of dynamics in both finite and infinite-dimensions. Not surpris-ingly, the answers are not the same, and in depth study of analysis, geometry, topology has paved the way to new

167

Page 172: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

168 Five-day Workshop Reports

control theoretical results within the realm of infinite-dimensions. These results were often formulated first in anabstract form in the language of functional analysis. However new conditions which transpired had to be verifiedby concrete representations described by PDE’s or other infinite-dimensional functional equations. Thus the theoryprogresses in parallel-at an abstract level and also on a more specific PDE level, where verification of the sought-after properties became an art in itself. In infinite-dimensional systems research the focus is on development ofan abstract and axiomatized framework. The talks presented at the workshop represented both trends: PDE’s andinfinite-dimensional system theory.The interaction between these two approaches and the resulting benefits as wellas for collaborations were heavily emphasized. Applications to biological, fluid dynamics and mechanical mod-els give a very tangible payoff in the applied world with solutions obtained via numerical methods; extension ofalgorithms to this class is non-trivial. Plenary talks were meant to represent diverse areas of infinite-dimensionalcontrol theory where new developments have emerged and are projected to flourish into the future. It was hopedthat these talks would be accessible to non-specialists and providing a basis for the material in a number of shortercontributed talks.

Control theory, like other STEM disciplines, has a marked gender gap. Gender studies confirm that havingscientists and engineers with diverse backgrounds, interests, and cultures assures better scientific and technologicalresults. Improving female representation in the control field is not just an equity issue, but also an opportunity toimprove the quality of work in the branch by having an impact on highly dedicated female researchers with variedexpertise ranging from very applied work being done in engineering, to very theoretical mathematical work. Allbranches of control are inter-disciplinary. Inter-disciplinary work can add to the difficulties faced by minorityresearchers since it is easy to criticize contributions and grant proposals as not being “mainstream”. Anotherdifficulty, faced by female researchers in all STEM fields, is in finding collaborations since men tend to collaboratewith other men.

The meeting had several inter-connected objectives: to provide support for young women interested in this field,introduce young female graduate students to potential future advisors and collaborators. We hope the workshophelped retention and will increase the participation of women in research activities in control of PDE’s in thelong-term. Another objective was to deepen worldwide networks of collaboration by inviting women from LatinAmerica.

State of the FieldDevelopments in PDE optimal control theory started with considerations of parabolic partial differential equations- see for instance, the celebrated work of J.L. Lions [2]) . Later developments brought up front hyperbolic systems-first in one space dimension, starting with the pioneering papers of D. Russell [4] and W. Littman [3]. Already inone space dimension, it became clear that the nature of control mechanisms and capabilities is vastly different fromthat in finitedimensional spaces. This has provided an drive in abstract infinite-dimensional theory to seek counter-examples and also positive results within more restrictive frameworks. One can get a basic understanding of thearea by realizing that properties such as stabilization are relatively simple for parabolic systems (this is essentiallyfinite-dimensional theory due to spectral analysis), while (exact) controllability becomes almost impossible. Inhyperbolic dynamics, controllability and more generally inverse problem are practical and solvable. On the otherhand, stabilization is much more difficult for hyperbolic systems and requires special feedback that is intrinsicallyinfinite-dimensional. While controllability is an open loop control problem, stabilization is realized by a closedloop or feedback control acting on an infinite time horizon T = ∞. Optimal control could be either open loopor closed loop and either finite or infinite horizon. As we shall see later, there is strong synergism between theseconcepts - solution of one problem feeds into another problem.

Consider an abstract model that provides a unified treatment for most of infinite-dimensional control problems.Let T > 0 be finite or infinite and let H and U be two separable Hilbert spaces with scalar product and associatednorm respectively denoted by (·, ·)H , (·, ·)U , ‖ · ‖H and ‖·‖U . Let us consider the abstract control system:

y(t) = Ay(t) +Bu(t) in (0, T ) ,

y(0) = y0 ∈ H.(15.0.1)

In this system y0 ∈ H is the initial datum and u ∈ L2(0, T ;U) is the control, is exerted on the system through the

Page 173: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Control: New Trends in Infinite Dimensions 169

operator B. Assume that A and B are unbounded operators respectively defined on D(A) ⊂ H and D(B) ⊂ U .Let us also assume that system (15.0.1) is well-posed ; that is, for any (y0, u) ∈ H × L2(0, T ;U) there exists aunique weak solution y ∈ C0([0, T ];H) to problem (15.0.1) which depends continuously on the data. Denote byy(t; y0, u) ∈ H the solution to system (15.0.1) at time t ∈ [0, T ] corresponding to (y0, u) ∈ H × L2(0, T ;U).Controllability is an open loop control problem where one seeks a control u ∈ L2(0, T ;U) in order to “hit” a giventarget. Hitting may be exact or approximate. These concepts are quantitized by the following definitions.

1. System (15.0.1) is exactly controllable at time T if, for all (y0, y1) ∈ H ×H , there exists u ∈ L2(0, T ;U)such that the solution y of (15.0.1) satisfies

y(T ; y0, u) = y1.

2. System (15.0.1) is controllable to trajectories at time T if, for every (y0, y0) ∈ H ×H and u ∈ L2(0, T ;U),there exists u ∈ L2(0, T ;U) such that the corresponding weak solution to (15.0.1) satisfies

y(T ; y0, u) = y(T ; y0, u).

3. System (15.0.1) is null controllable or exactly controllable to zero at time T if, for every y0 ∈ H there existsu ∈ L2(0, T ;U) such that the corresponding weak solution to (15.0.1) satisfies

y(T ; y0, u) = 0.

4. System (15.0.1) is approximately controllable at time T if, for every (y0, y1) ∈ H × H , and every ε > 0,there exists u ∈ L2(0, T ;U) such that the corresponding weak solution to (15.0.1) satisfies

‖y(T ; y0, u)− y1‖H < ε.

Stabilization can be formulated as seeking a feedback control u = F (y) where F is an operator from U → H[bounded or unbounded, linear or nonlinear] with the property that when inserted into the dynamics, it leads tostable dynamics. Stability may be asymptotic or uniform. In a simplest case when F is bounded, asymptoticstability amounts to the property that the semigroup e(A+BF ) is asymptotically stable, that is,

e(A+BF )ty0 → 0, as t→∞∀y0 ∈ H (15.0.2)

Uniform stability requires that stability property be uniform with respect to the underlined topology. This is to say

||e(A+BF )t||t→∞ → 0 (15.0.3)

Several developments and new trends in various areas were discussed during the workshop.

Controllability and Stabilization of Parabolic SystemsThe theory of infinite-dimensional systems often bifurcates into parabolic or hyperbolic frameworks. This is dueto axiomatized assumptions imposed where parabolicity exhibits very special properties such as smoothing andinfinite speed of propagation while hyperbolicity is marked by finite speed of propagation and intrinsic lack ofsmoothing by the dynamics. These properties have a fundamental effect on abstract control theory. Treatments ofboth type of dynamics often require different methods. These differences were also emphasized at the workshopwhere parabolicity [or more generally analyticity of the underlying semigroups] was exhibited by systems of heattransfer, Navier Stokes equations, viscoelastic systems, equations of thermoelasticity. In the case of parabolicdynamics expected properties are stabilization and null-controllability [or controllability to trajectories]. Talks byAssia Benabdallah, Catherine Lebiedzik, Suzanne Lenhart, Jing Zhang provide good representation of the area.However, the results presented by Assia Benabdallah showed that parabolic coupled equations can present hyper-bolic behavior related to controllability; more specifically, minimal time of controllability and regional dependenceof the control region.

Page 174: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

170 Five-day Workshop Reports

Controllability and Stabilization of Hyperbolic SystemsControllability and stabilization of hyperbolic systems is different than for parabolic systems. Here the situationis opposite: exact controllability is a natural property to expect due to time reversibility and propagation property.which does not change the regularity. However, it takes time for the signal to travel. Thus finite speed of propa-gation occurs and a typical results on localized/boundary controllability takes place after some time. Stabilization,instead, is more demanding due to the fact that instability in hyperbolic dynamics is typically infinite-dimensional.Typically the unstable part of the spectrum lies on the imaginary axis. In parabolic dynamics the unstable sub-space is finite-dimensional. The workshop had many talks devoted to this issue. Valeria Cavalcanti presentedsome results on stabilization of wave equation via the transmission of viscoelasticity. Daniela Sforza consideredsystems of equation with memory terms. Paola Loreti presented an approach where controllability of a large classof hyperbolic systems can be resolved by methods of Fourier’s Analysis and exponential functions.

Optimal controlIn optimal control the goal is to find a control to minimize a given performance index, J(u, y0), subject to dynamics(15.0.1). Thus one seeks an optimal control u0 ∈ L2(0, T ;U) such that

J(u0, y(u0)) = infu∈L2(0,T ;U)

J(u, y(u)). (15.0.4)

Optimal control has strong synergy with controllability and stabilization. For instance, consider T =∞. Seekingan optimal control requires that the functional cost be finite for some control. This is guaranteed by the so calledfinite cost condition, and this may be deuced from controllability. Once the optimal control is found, for linearsystems it can be synthesized in a feedback form. There exists an operator P so that the optimal control

u0(t) = −B∗Py(t).

In addition, subject to coercivity of the functional cost, the corresponding optimal system is exponentially stable.Classical optimal control involves minimization of a chosen functional (representing the cost and the target)

subject to dynamics depending on control function and possibly constraints on both state and control. This consti-tutes the core of calculus of variations and optimal control theory. This class of control problems has been studiedfirst within the context of ODE’s, then evolved in the direction of PDE’s, mostly abstract systems with boundedcontrol operators. Unbounded controls such as point or boundary control have been considered more recentlystarting with parabolic systems, due to their nice regularity properties. Again, motivated by important applicationsin mechanics and medicine optimal control problems, both boundary and point controls have been considered, andsome theory has been obtained for hyperbolic dynamics. We have new phenomena coming into picture, such asfinite speed of propagation or unexpected hidden regularity. These properties have a heavy influence on the overalltheory. New trends that are emerging in the field include optimal and constrained control of nonlinear systems andoptimal control of hyperbolic- like models with boundary controls.

There were several talks on this topic, For instance, Suzanne Lenhart and Wendi Ding on optimal control ofparabolic PDE’s with application to biology and Francesca Bucci on optimal control including optimal feedbackcontrol of the third order system with boundary control. This latter reduces to a second-order system with a finitespeed of propagation. All these talks, in addition to presenting mathematical contribution to the field, were stronglyrouted in life science applications. Lenhart’s and Ding in biology while Bucci’s presentation dealt with a problemof controlling radiation and high frequency ultrasounds. Constanza Sanchez presented optimal control problemapplicable to a Schrodinger equation, an equatio thatn in some sense combines parabolic and hyperbolic effects -it displays infinite speed of propagation but lack of smoothing on bounded domains.

Abstract Infinite-Dimensional SystemsInfinite-dimensional system theory is an area where the goal is to axiomatize certain properties in control theorywithin the context of infinite-dimensional theory with standard matrices replaced by operators. The aim is toverify which properties from finite-dimensional control theory persists and which are no longer true and thus need

Page 175: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Control: New Trends in Infinite Dimensions 171

to be replaced by an appropriate framework. The importance of the field lies in its generality. Of course, suchabstract theory must be supported by examples that provide a verification and justification for the content. Thus,not surprisingly, there is a strong synergism between infinite-dimensional system theory and PDE control. Thisworkshop was a good example of existing collaborations and the need for future research exchanges. Talks byBirgit Jacob and Jacquelien Scherpen provided overviews of some recent developments in these areas.

Control of Hybrid Systems and Systems with TransmissionAn important area of current theoretical and practical interest is combining several different types of dynamics (asdictated by modeling -e.g. fluid–structure interaction) where control acquires a unique role in transferring desirableproperties from one part of dynamics onto another. Propagation of stability, controllability and regularity [the latterin optimal control problems] through an interface becomes a centerpiece of the underlying analysis.

Canonical examples of such systems are fluid/ flow structure interactions, structural acoustic interactions andcomposite structures. Applications are abundant in various other areas of sciences such as medicine, biology,engineering, aeroelasticity etc. The main characteristic feature of such systems is the interaction of two differentPDE dynamics via some interface (a manifold of co-dimension one) or via transmission conditions separating twodifferent media. From the mathematical point of view control of such systems present formidable challenges.Any progress in this area must be preceded by good understanding of control theoretic aspects of a single PDEequation -both parabolic and hyperbolic.. Only then one can venture into trying to control the interaction by takingadvantage of the desirable properties each PDE brings to the table. For instance, dissipativity and natural stabilizingmechanism in parabolic flows (Navier Stokes equation ) enables controlling vibrations/oscillation in mechanicalstructures (bridges, vessels, arterias). In order to exhibit such phenomena one must assure appropriate geometricconditions. Several talks at the workshop [Valeria Cavalcanti, Lorena Bociu, Irina Ryzhkova, Weiwei Hu, KatieSzulc, Jing Zhang] were devoted to this topic which then was discussed in smaller groups of interest with plans forfurther collaboration. The topic, itself, is very rich and interdisciplinary. It involves modeling, PDE analysis, fluidmechanics and experiment/numerics. Speakers at the workshop were able to provide different perspectives andaspects of the problem, so that the cumulative experience is greater than the sum of ingredients. Formulation ofrelevant mathematical problem requires strong back up of modeling and of applied relevance. Thus it was naturalto engage in collaborative work between USA, Brazil and France.

Control of models Arising in Biology and MedicineThis is a field where theoretical methods of optimal control find their way into truly applied problems. Dueto the array of PDE models describing biological phenomena, a maximum principle in optimal control as wellas feedback stabilization and synthesis provide for critical tools in improving therapies and treatments. Typicaltheoretical results are local (as relevant in applications), due to a nonlinear nature of the models. Suzanne Lenhart’stalk on optimal control provided an overview of math biology applications, Francesca’s Bucci lecture dealt withmedical applications in inverse problems-ultrasound technology. Bianca Calsavara’s presentation described localcontrollability results in genetic networks.

Nonlinear controlAll these problems can be also be studied within the nonlinear framework, integral equations, delayed equations,Banach spaces etc. The applications give a vast scenario of models in the infinite-dimensional framework. Non-linear models with nonlinear controls can be written

y = A(y) +B(u)

where A(y) and B(u) are nonlinear operators may also be considered. With feedback control this becomes y =A(y) + BF (y) -where the ultimate equation may represent a well studied dynamical system. Clearly, manypossible scenarios are under consideration, leading to an array of results with strong applied ramifications. Manytalks at the workshop addressed these nonlinear aspects of control theory.

Page 176: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

172 Five-day Workshop Reports

Every nonlinear problem needs its own precise statement and development. Some of the techniques presentedin the workshop include fixed point arguments (from Browder to Kakutani fixed point theorems), Coron’s “returnmethod” [1] or the use of the Inverse Function Theorem.

Panel Discussions

We had several panel discussions that participants found useful. Our area is very inter-disciplinary which can beexciting but also an issue for women, particularly in hiring and promotion, granting agencies etc. The fact thatresearch may not fit into a mainstream area can be used as an excuse for turning down a proposal or promotion.The efficacy of some kind of mentoring was clear. Of course this can be of great help for any young researcher butin particular for women since gender imbalance and society pressures and attitudes discourage women in a numberof aspects of the academic life.

Open research problems

Identification of research problems that are interesting (either for applications or theory) but not already beenstudied by another group, is critical to success as an academic. This, and effective time management, are twoskills that need to be developed by young academics in order to obtain tenure and promotion. New faculty need todistance themselves from their advisor to develop their own program and must write papers independently in orderto obtain tenure. The following points were also made during the discussion.

• Mathematical biology has a lot of job and grant opportunities and there are lots of interesting unsolvedproblems.

• For inter-disciplinary work, such as biology, working with an expert in the non-math area is very helpful.

• Slow/fast dynamics arise in many dynamics and methods are needed for them.

• Coupled PDE’s, particularly hyperbolic/parabolic equations, have many interesting mathematical issues withconsequences for applications (a number of talks at the workshop were concerned with such. systems)

• Systems with memory are receiving interest.

• Fractional Laplacians and other fractional derivatives have interesting theoretical issues and connectionswith applications, such as thermo-elastic materials, but some people won’t touch them.

• Interdisciplinary work can be a good source of interesting problems, but there is a danger of falling out ofthe mainstream work in your department or the granting agencies

• Illustrations of differences with finite-dimensional systems useful.

• Keep in touch with numerical and experimental people. They can be a source of insight and new problemseven if you are only interested in theory.

• Try to find problems that will develop collaborators in your department.

• Go to workshops in your field and related fields.

• Think of your strengths/uniqueness and play to those.

• Choose problems that excite you!

Page 177: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Control: New Trends in Infinite Dimensions 173

Grant WritingGrants are important for academics in several respects. On a practical level, they provide funds for travel, collab-oration and graduate students. They are also used by institutions as a criterioni for tenure and promotion, partlybecause they are regarded as an indication of the respect that a researcher’s work holds and partly because grantoverhead is important to university budgets.

It was clear in the discussion that several younger faculty had been discouraged by unsuccessful grant appli-cations. However several respected and well-funded senior participants shared that often applications are turneddown. The success rate for most programs is quite low, and applications for interesting research are turned down.Sometimes it is just bad luck. The important thing is use any reviews as useful information for revising and rewrit-ing and to persist; either with a revised proposal to the same program or else to a different program. A number ofother comments and advice were made:

• Look at all sources of funding, be creative in thinking of programs to which to submit applications.

• Think of who will be reading the application. Is it government personnel ? Experts in the field? This shouldaffect how the research is presented.

• Useful to have someone read and advise on the proposal. Many universities have personnel to help.

• Start with applications for smaller grants.

• Proposal writing is time-consuming.

• Find out and address the priorities of the grant program.

• If you’re trying to involve an industrial partner, “talk their language” and think about their priorities.

• Use common language for inter-disciplinary work.

• Collaborators can be helpful, particularly for inter-disciplinary work.

• Read reviews carefully and use to make changes for new proposals. Often you can make changes andresubmit to the same program. Or, the research may be more suitable for another program.

• Even unsuccessful proposals can help in focusing ideas and identification of open problems.

• Networking is important; go to conferences and meet people.

HighlightsAs outlined above, there is a synergistic diagram

exact controllability ↔ solvability of optimal control↔ exponential stabilizability

and also important applications of stabilizability and optimal control. There was a balance in the topics: noteverything was theoretical nor everything was applied. A wide variety of different aspects of control theory ofPDE’s was covered. The workshop gave an opportunity to explore links and future collaborations. Although thiswas not planned, a number of talks featured work on different aspects of coupled systems. This was marked bya number of participants as a possibly fruitful area of research and also as a good topic for a conference invitedsession.

This was a very good meeting from the scientific point of view. All participants expressed enthusiasm forthe high technical content and atmosphere of the workshop.The research presented was of the usual calibre ofmajor international meetings. This provided encouragement to the younger women about the quality of women’scontributions to our field.

The participants obtained interesting new research ideas and bibliography. Some of the participants have al-ready invited to give seminars at other participants’ universities and we all have new names of excellent researchers

Page 178: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

174 Five-day Workshop Reports

in the area of control of PDE’s that can be invited to meetings, workshops and seminars. The experience was greatin the sense that we put in touch different groups working on this interesting subject all around the world, withdifferent techniques, backgrounds and area of application. Talks included very abstract setting to very applied one,from biology to engineering, from physics to abstract analysis.

Typical comments were:

“I want to thank you for inviting me to the workshop women in control, it’s really the best conferenceI have attended till now. Thank you.”

“First of all, thank you again for organizing the workshop! It was a great learning experience, andI found it much more beneficial than the usual conferences I attend. The schedule was great, andallowed participants to interact, collaborate, discuss potential common projects, and give/receiveadvice regarding professional development. I found all the small group research talks and panelsvery helpful. Also, I really enjoyed the atmosphere of the workshop. I wish we had more workshopslike this.”

“In general, I think that the Workshop “Women in Control: New Trends in Infinite Dimensions” wasa really useful and reach experience in many senses. One of the interesting things I think I obtainedfrom the Workshop is that I could make contact with people with a lot of knowledge in control theoryand learn and see other applications and techniques of different areas of control problems. I keep intouch with them and I am opened to establish common work with some of them.”

Meeting OutcomeThe high quality of the talks, despite the restriction to women working in a small field, was very encouraging toall participants. There was discussion at the workshop about organization of sessions at international conferencesin 2018, such as MTNS and IFIP, that will involve participants as speakers.

The format of the meeting facilitated discussion in small groups around open problems which helped par-ticipants develop their networks and think about research problems. Networks in five sub-areas were create toestablish new collaboration, to mentor young women with their research proposals, to share specific material andpapers, propose new research, and so on. This list will also help in organization of sessions at conferences. Thelist of sub-areas, along with acronyms, is

• CIDS: Control of infinite Dimensional Systems

• OCO: Optimal control and Optimization

• CMBM:Control of mathematical-biological Models

• SLTB: Stabilization and long time behaviour

• NM: Numerical Methods

Table 15.1 lists the members of each group.We have also created a closed group in Facebook and a “basecamp” webtool for our group to share files and

emails. Only invited members can receive the updates and to see the files.Some representatives from Asia were missing. Although possible participants in China were identified, lack of

support for their travel expenses did not allow them to attend. Also the control of PDE’s community in France islarge but not well represented. We believe that after the present positive experience a future meeting will be evenmore successful, particularly with financial support. We would like to encourage more participation from Asia andFrance, as well as women working on numerical issues. The financial and administrative support of BIRS thatenabled this meeting to happen was very much appreciated. A future meeting, maybe in China, is already beingdiscussed.

As described above the topics and lectures presented are characterized by great synergy and gradual buildupof knowledge in the area which is progressive with a strong slant to modern applications and technologies. For

Page 179: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Control: New Trends in Infinite Dimensions 175

Person Country NetworkingsApraiz, Jone Spain CIDS/OCOBelkhatir, Zehor Saudi Arabia CMBM/NMBenabdallah, Assia France CIDS/CMBMBociu, Lorena U.S.A OCO/CMBMBucci, Francesca Italy OCOCalsavara, Bianca Brazil CIDS/CMBM/OCOCavalcanti, Valeria Neves Domingos Brazil SLTBde Teresa, Luz Mexico CIDS/OCODing, Wandi U.S.A CMBMDoubova, Anna Spain CIDS/CMBM/NMFu, Xiaoyu China SLTBGomes, Susana N. U.K. SLTB/NM/OCOHu, Weiwei USA SLTB/NM/OCOJacob, Birgit Germany CIDS/SLTBJamieson, Jessie USA NMLaasri, Hafida Gernamy CIDS/SLTBLasiecka, Irena USA SLTB/CIDSLebiedzik, Catherine USA SLTBLenhart, Suzanne USA CMBMLoreti, Paola Italy CMBMMorris, Kirsten Canada SLTB/OCO/NMRivas, Ivonne Colombia CIDS/SLTBRyzhkova-Gerasymova, Iryna Ukraine SLTBSanchez de la Vega, Constanza Argentina CIDS/OCOScherpen, Jacquelien Netherlands NM/SLTBSforza, Daniela Italy CMBMSzulc, Katarzyna Poland NMTang, Shuxia Canada SLTB/OCOTegling, Emma Sweden OCOZhang, Jing USA SLTB/OCO

Table 15.1: Networkings

Page 180: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

176 Five-day Workshop Reports

example: the results presented in topics 1-4 display great synergy. Developments achieved at the abstract level,when combined with parabolic and hyperbolic discoveries culminate in the study of systems where various typesof dynamics interact. This led to strong interaction among participants who find proper mentoring in the areascomplementary [but important] to their expertise. Similarly topics 5-7 pave the way to a treatment of appliedproblems that can be resolved with a help of computational mathematics.

Both senior and junior researchers found the workshop useful for advancing themselves and the field. Thesenior ones by being exposed to disciplines a bit outside their expertise and the junior ones by being introducedgradually into the arcanes and developments in each area. On site contacts, formalized by creation of the WorkingGroups [see above] are expected to provide an effective mentoring forum with the goal of advancing the field andcareers of women researchers.

ParticipantsApraiz, Jone (University of the Basque Country)Belkhatir, Zehor (King Abdullah University of Science and Technology (KAUST))Benabdallah, Assia (Aix Marseille University)Bociu, Lorena (NC State University)Bucci, Francesca (Universita’ degli Studi di Firenze)Calsavara, Bianca (University of Campinas)Cavalcanti, Valeria Neves Domingos (University of Maringa Brazil)de Teresa, Luz (Universidad Nacional Autonoma de Mexico)Ding, Wandi (Middle Tennessee State University)Doubova, Anna (Universidad de Sevilla Spain)Fu, Xiaoyu (Sichuan University China)Gomes, Susana N. (Imperial College London)Hu, Weiwei (Oklahoma State University)Jacob, Birgit (University of Wuppertal)Jamieson, Jessie (University of Nebraska-Lincoln)Laasri, Hafida (Fern Universitat Hagen)Lasiecka, Irena (University of Memphis)Lebiedzik, Catherine (Wayne State University USA)Lenhart, Suzanne (University of Tennessee)Loreti, Paola (University of Rome)Morris, Kirsten (University of Waterloo)Rivas, Ivonne (Universidad del Valle Colombia)Ryzhkova-Gerasymova, Iryna (Kharkiv National University)Sanchez de la Vega, Constanza (Universidad de Buenos Aires Argentina)Scherpen, Jacqueline (University of Groningen Netherlands)Sforza, Daniela (Sapienza Universita di Roma)Szulc, Katarzyna (Polish Academy of Sciences)Tang, Shuxia (University of Waterloo)Tegling, Emma (KTH Royal Institute of Technology)Zhang, Jing (Virginia State University)

Page 181: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] J. M. Coron, Control and non linearity AMS, 2007.

[2] J. L. Lions. Controle optimal de systemes gouvernes par des equations aux derivees partielles Dunod, Paris,1968.

[3] W. Littman. Boundary control theory for hyperbolic and parabolic partial differential equations with constantcoefficients Annali della Scuola Normale Superiore di Pisa 5, Issue: 3, 567–580 (1978).

[4] D.L. Russell, Nonharmonic Fourier series in the control theory of distributed parameter systems. J. Math. Anal.Appl. 18, 542–560 (1967).

177

Page 182: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 16

Latest Advances in the Theory andApplications of Design and Analysis ofExperiments (17w5007)

August 6 - 11, 2017Organizer(s): Holger Dette (Ruhr-Universitat Bochum), Douglas Wiens (University of Al-berta), Weng Kee Wong (University of California, Los Angeles)

Design and analysis of experiments is an area in statistical methodology, which has applications in manyresearch fields where experiments are performed to develop new theory and/or improve the quality of processes andservices. For example, experiments are continually carried out to investigate and improve manufacturing processes,understand climate changes, provide better utilization of large-scale primary care services, evaluate environmentaldamages, and develop more effective drugs for patients, to name a few. Optimal design techniques are also usedin education testing, cryptology or fMRI analysis of brain data. Even small improvements in methodology canhave huge effects in applications. Because of rapidly rising costs in experimentation, researchers are increasinglyinterested in using efficient designs that save costs without sacrificing the quality of ensuing statistical inferences.In toxicology, for example, in addition to saving labour, time and material cost, researchers are increasingly underpressure to use the fewest possible animals in laboratory research.

There have been many exciting new developments in this area in the last two decades. In particular, researchershave made advances in both theory and applications of optimal designs for mixed effects models, longitudinalmodels and nonlinear models, including more efficient designs in health care studies, computer experiments, andexperiments for processes with time or space dynamics. The main aims of the workshop were to develop newtheory for optimal design for new and more challenging problems, apply novel applications of optimal designsto new problems and keep us up to date of the state-of-the-art optimization techniques used in engineering andcomputer science. Some focus areas of the workshop are described below.

New Theory and Construction Methods for Nonlinear ModelsThe theory of optimal designs for non-linear models is very important because they have many applications inthe medical and pharmaceutical sciences, biometry and in various other industrial experiments. However, mostdesign work has focused on models with independent and normally distributed errors, fixed effects, and assumingthe outcome is either continuous or binary. The workshop provided a forum to stimulate development of advancedmethods for constructing optimal designs in nonlinear models with mixed effects, longitudinal models, and mod-els with non-normally correlated errors or models with outcomes that may comprise of continuous and discrete

178

Page 183: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Latest Advances in the Theory and Applications of Design and Analysis of Experiments 179

responses. Progress was also made in the construction of optimal designs with correlated observations, which hasparticular mathematical challenges, due to the non-convexity of the optimization problems. Other attendees hadalready made, and continue to make, important contributions to design in health care and clinical trials. These in-clude strategies for designing studies where the main outcomes are categorized (treatment success, partial successor failure), optimal randomization allocation plans for patients in a multi-arm clinical trial, and optimal designstrategies for testing in enzyme-kinetic studies.

Two different approaches of experimental design for the so called Fractional Polynomials (FP) family of modelswere described, independently, by Professors Steven Gilmour and Jesus Lopez-Fidalgo. This family has beenshown to be much more flexible than polynomials for fitting continuous outcomes in the biological and healthsciences. In particular, being linear models they can be used instead of a variety of non-linear models that addmuch complexity to the usual inferences. Despite their increasing popularity, design issues for FP models had neverbeen addressed. D- and I-optimal experimental designs were computed for prediction using FP models. Detailedproofs of the closed-formed design were given when possible. Otherwise numerical computations were made.Their properties were shown and a catalogue of design points useful for FP models was provided in illustrativegraphics. As applications, linear mixed effects models for longitudinal studies were considered. An example usinggene expression data was considered by Professor Lopez-Fidalgo, comparing the designs used in practice. Takinginto account the final choice of one of the potential models, an interesting problem is finding designs for effectivemodel discrimination for FP models. They are exploring KL-optimality and he showed some of their advances.

On the other hand Professor Gilmour noted that with the availability of routines for fitting nonlinear models instatistical packages, FP models are increasingly being used. However as in all experiments the design should bechosen such that the model parameters are estimated as efficiently as possible. The design choice for such modelsinvolves the usual difficulties of nonlinear models design. He presented Bayesian optimal exact designs for severalfractional factorial models. The optimum designs were compared to various standard designs in response surfaceproblems. Some unusual problems in the choices of prior and optimization method were noted. As an introductionto his presentation Professor Lopez-Fidalgo made some comments about the present and future of experimentaldesign. In particular he mentioned the paper “Manifesto for reproducible science” published recently in Nature,where the authors, coming from different sciences, ask for a rigorous research methodology and give key measuresto optimize the scientific process. He pointed out the word ‘design’ appears 25 times in 7 pages in all sections ofthe paper.

Experimental design applications for discriminating between models have been hampered by the assumptionto know beforehand which model is the true or more adequate one, which is counter to the very aim of the exper-iment. Previous approaches to alleviate this requirement were either symmetrizations of asymmetric techniquessuch as compound T -optimality, or Bayesian, minimax and sequential approaches. In their joint talk Professors Ra-doslav Harman and Werner Muller presented a novel, genuinely symmetric criterion based on a linearised distancebetween mean-value surfaces and the newly introduced notion of nominal confidence sets . The computationalefficiency of the proposed approach was shown and a Monte-Carlo evaluation of its discrimination performanceon the basis of the likelihood-ratio was provided. Additionally these authors demonstrated the applicability of thenew method for a pair of competing models in enzyme kinetics.

Professor Anatoly Zhigljavsky (Dette et al. 2016, 2017) reviewed a series of works related to optimal de-signs with correlated errors and noted the potential importance of implementing designs with correlated errors inpractical applications. His talk highlighted new and important theoretical investigations toward that direction. Hepresented a complete solution of this challenging optimal design problem for a broad class of regression modelsand covariance kernels. As a by-product he derived explicit expressions for the BLUE in the continuous timemodel and analytic expressions for the optimal designs in a wide class of regression models. Professor AnatolyZhigljavsky also reviewed the celebrated Sacks-Ylvisaker approach and discussed several approximate methods ofconstruction of optimal estimators.

Yu Shi from UCLA Biostatistics Department described her group’s recent research into utilizing advancedoptimization and integration techniques to find Bayesian optimal designs for random effects models, which poten-tially has profound applications in HIV studies, PK/PD studies and many other studies in biomedicine. She alsotalked about using these techniques in developing biomarkers in oncology studies. This work is interesting bothmethodologically and in practice, and the group puts great efforts into developing user-friendly software that ispromising to benefit scientist and practitioners at large. Her message reinforced Dr. Kim’s use of novel algorithms

Page 184: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

180 Five-day Workshop Reports

used by engineers and computer scientists to solve hard design problems. This algorithm is a member of the classof nature-inspired metaheuristic algorithms which has been shown to be very flexible, assumptions free and easy toimplement and use. They are powerful in that they have been reported in computer science and engineering arenasto be able to solve optimization problems with hundreds of variables in a relatively very short period of time. Inshort, they offer exciting opportunities and great promise when current numerical methods in statistics do not workwell. When designing a study for a complicated nonlinear problem or a high dimensional design problem, thesealgorithms can also help determine the structure of the optimal design quickly and hence help the researcher makesan informed guess on the correct number of points required, their locations and the number of replicates at each ofthe design points. Subsequent applications of an equivalence theorem can then produce a formula for the optimaldesign, which can be elusive. Chen el at. (2017) provide examples.

Professor Mong-Na Lo Huang (Huang, et al., 2017) proposed optimal design methodology for group testingexperiments, where the goal is to estimate the prevalence of a trait using a test with uncertain sensitivity andspecificity. She and her co-authors determined approximate optimal designs for simultaneously estimating theprevalence, sensitivity, and specificity The optimal designs require three different group sizes with equal frequen-cies. On the other and, if estimating prevalence as accurately as possible is the only focus, the optimal strategy is tohave three group sizes with unequal frequencies. Based on a Chlamydia study in the United States, she comparedthe competing designs and provided Additionally she discussed extensions on budget-constrained optimal grouptesting designs, where both subjects and tests incur costs, and assays have uncertain sensitivity and specificity thatmay be linked to the group sizes.

Dr. Stephanie Biedermann (Lee, et al., 2017) described her newly developed methodology for designing ex-periments when outcome values are not missing at random. This is the first explorative investigation into thistopic, with some promising results so far, and thus opens up a new avenue of research on the interface of designof experiments and incomplete data problems, in particular the notoriously challenging situation when missingdata are not missing at random. In the ensuing discussion, an interesting modification to the optimality criterionwas proposed, which may lead to an increased benefit of the approach in the planning of scientific experiments.Professor Rosemary Bailey described recent work obtaining good block designs for 36 treatments in blocks of size64, thus filling in a gap in the famous square lattice designs introduced by Frank Yates 80 years ago. Professor JeffWu also referred to the ongoing importance of Yates’s work in comments on another talk.

Prof Henry Wynn introduced a new optimal design criterion (Kuriki and Wynn, 2017), which can be usedto determine an experimental design that minimise the width of simultaneous confidence bands for nonlinearregression models, which are constructed by evaluating the volume of a tube about a curve that is defined as atrajectory of a regression basis vector. Professor France Mentre investigated optimal designs for nonlinear mixedeffect models (NLMEM), which are used in longitudinal studies. She reviewed methods evaluating the expectedFisher information based on Monte-Carlo Hamiltonian Monte-Carlo (MC/HMC), which require a priori knowledgeon models and parameters, and extended these MC/HMC-based methods to account for uncertainty in parametersand in models.

Professor Min Yang described his recent research into data reduction of big data. Extraordinary amountsof data are being produced in many branches of science. Proven statistical methods are no longer applicable withextraordinary large data sets due to computational limitations. A critical step in Big Data analysis is data reduction.Professor Yang introduced a novel approach called Information-Based Optimal Subdata Selection (IBOSS) fordata reduction. The proposed research will open an entirely new avenue for research. It will make a significantcontribution by developing methods that extract information from big data more efficiently. The impact will befelt in many areas of science. Existing sampling-based methods face the limitation that the amount of informationthat they extract is constrained by the subsample size, and does often not increase with the full data size. Theproposed IBOSS strategy alleviates this problem. Preliminary results show that the IBOSS strategy retains muchmore information than the sampling-based methods. During the discussion that followed, the comparison betweendivide-and-conquer method, and new and unexpected applications of this work to high-dimensional data, such asgenomics data were brought forth.

Page 185: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Latest Advances in the Theory and Applications of Design and Analysis of Experiments 181

Robustness of Design

Suppose that an investigator anticipates planning a study that will result in a number of observations on a randomvariable Y , whose probability distribution – often merely through its expected value – depends on a vector x ofcovariates that can be set by the investigator – hence the design. After the data are gathered the relationshipbetween Y and x is to be assessed. This will generally involve both estimation and prediction, and is often donein the context of a particular model of which the experimenter might have only partial knowledge and in which hemight have little faith – hence the robustness requirement. ‘Robustness’ has numerous meanings in statistics. Thenotion appears to have been introduced by Box (Box, 1953), and was given a firm mathematical basis by Huber(Huber, 1964, 1981), for whom it generally – but certainly not exclusively – meant the relative insensitivity of astatistical procedure to departures from the assumed Gaussian error distribution. In design the usual performancemeasures depend on the error distribution only through the first two moments, and beyond this the distributionalshape is not so relevant. One does however have in mind a particular model to be fitted once the data are gathered.In ‘classical’ optimal design theory, one believes explicitly that the model one fits is the correct one, and measuresthe quality of a design through a ‘loss function’ such as the determinant, or trace, or maximum eigenvalue of thecovariance matrix, corresponding to the well known D-, A- and E-optimality criteria. In ‘model robust’ designtheory one instead anticipates that the model that will be fitted by the experimenter is not necessarily the true one– a simple example to bear in mind is that of fitting a straight line regression when the true response function ispossibly not exactly linear in the covariate – and so the loss function highlights some more general feature such asthe mean squared error (mse). This will of course depend on the true, rather than fitted, model and so one seeksa design minimizing some scalar quantity summarizing the increased loss – perhaps the maximum, or average, ofthe mse over the predicted values – as the true model varies. A common framework is that an experimenter plans tofit a particular model to his data, while realizing that any model is at best an approximation. He seeks protection,at the design stage, from increased loss incurred by model mis-specification. Canada has become a noted regionof concentration for research in Robustness of Design, through the work of (contact organizer) D. Wiens (Wiens,2015), with his colleagues and former students, several of whom attended and gave presentations.

Professor Julie Zhou (U. Victoria) presented her recent research on optimal regression designs under thesecond-order least squares estimator (SLSE). Since the SLSE is more efficient than the least squares estimatorwhen the error distribution in regression model is asymmetric, new optimality criteria under the SLSE have gooddesign properties. An attendee from outside the design community commented on the usefulness of Julie’s workto his work, involving the modelling of survival data.

The workshop provided an excellent opportunity for sharing and discussing research ideas. After ProfessorMong-Na Lo Huang gave her presentation on optimal group testing designs, Professors Zhou and Huang startedto explore possible joint research projects on algorithms for computing optimal group testing designs and robustdesigns.

In a thought-provoking and elegantly presented address Professor Tim Waite (Manchester U.) proposed usageof minimax efficient random designs based on a frequentist decision-theoretic experimental design approach, withapplication to model-robust design for prediction. These seem to address in an efficient manner a long openproblem – the inadmissibility of a class of design presented by Wiens (Wiens, 1992).

Professor Dave Woods (University of Southampton, UK) presented recent results and ongoing research on se-quential design for Bayesian optimisation of physical systems with application to automatic experimentation in thechemical and pharmaceutical sciences. This work, in collaboration with chemists and chemical engineers, aimsto develop novel design of experimental methods and apply them in conjunction with automated equipment toincrease lab efficiency and reduce costs. A particularly important element of the work is methodology to automat-ically detect and down-weight potentially outlying observations. Useful feedback from the talk included links toother optimisation methods for maximising expected improvement in computer experiments.

Xiaojian Xu (Brock U.) discussed the effects of maximum likelihood estimation for a generalized linear mixedmodel (GLMM) when possible departures may appear from its assumed form. The commonly occurring depar-tures involved in a GLMM may include imprecision in the assumed linear predictor, misspecified random effectdistribution, or possibly both. She outlined the construction of D-optimal sequential designs which are robustunder consideration of these types of departures. Since the computational work involved in GLMMs can be veryintensive, an approximate approach was also proposed. Some comparisons were given through simulations.

Page 186: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

182 Five-day Workshop Reports

Computer Experiments

The traditional idea of an experiment involves observation of a system of interest under controlled conditions,with the intent of learning something about that system. The system of interest varies by discipline: engineersand physicists may be interested in systems involving physical material, biologists may focus on living organisms(or collections or components of them), while social scientists may be interested in experiments involving thebehaviour of human beings. In contrast, the system of interest in a computer experiment is often a computermodel, usually a mathematical description of a real system of interest, represented as a computer program. Thecomputer representation is usually necessary due to the complexity of the model. Experimental goals are oftensimilar to those in traditional experiments. While the computer model must, in principle, be fully known, it isgenerally so complex that a useful understanding of its behavior requires the empirical approach of an experiment(Morris and Moore, 2015).

Computer experiments are increasingly used in many important scientific studies, in part because they can ac-count for the sophisticated correlation structures often found in complex studies that generate very large data setsquickly and continuously, such as in climate changes. For such complicated and expensive physical experiments, awell-designed computer experiment using, say, Latin square designs and sequential designs can reduce the experi-mental costs significantly. Canada has become somewhat of a hotbed for such work; evidence of this was providedby the attendance of D. Bingham and B. Tang from Simon Fraser University, William Welch from University ofBritish Columbia, Devon Lin from Queen’s University and other world leaders in this area, such as Jeff Wu fromthe Georgia Institute of Technology and Peter Chien from the U. Wisconsin-Madison, together with some of theircurrent and former students.

Professor Will Welch described his recent research into emulation of computer experiments. His talk motivatedsome to think of new ways to link Gaussian process models with additive correlation functions to general functionaladditive models. This could lead to a new research direction in emulation. Will pointed out that Gaussian processes(GPs) are widely used for analysis of the input-output relationship(s) of a deterministic computer experiment.While there are many tweaks of the basic model, they turn out to be fairly unimportant for prediction accuracy. Inparticular, complex input-output functions remain difficult to model with useful accuracy whatever method is usedwithin the usual GP universe of models.

Professor Bingham introduced new types of space-filling designs motivated by several real applications. Theproposed designs are unconventional and novel. This is a product of the cross-fertilization between experimentaldesign, combinatorics, and applied mathematics. It shows a new aspect of space-filling designs. The approachwas demonstrated on several examples, including the cosmology study of non-linear power spectrum simulationmodels that motivated the work.

Devon brought ‘big data’ into focus, via ‘fat data’ with high dimensional inputs and/or outputs. She proposeda computationally efficient modelling approach to build emulators for large-scale dynamic computer experiments.This approach sequentially finds a set of local design points based on a new criterion specifically designed foremulating dynamic computer simulators. Singular value decomposition based Gaussian process models are builtwith the sequentially chosen local data. To update the models efficiently, an empirical Bayesian approach wasintroduced. When a target observation is available, estimating the inputs of the computer simulator that producethe matching response as close as possible is known as inverse problem. She proposed a new criterion-basedestimation method to address the inverse problem of dynamic computer experiments.

Professor Jeff Wu described his recent and potentially groundbreaking research into rocket injection designthrough spatial-temporal modelling; issues on uncertainty quantification were also discussed. He detailed a recentstudy to illustrate how physics and data are used jointly to learn about the “truth” of the physical world. Inthe quest for advanced propulsion systems, a new design methodology is needed which combines engineeringphysics, computer simulations and statistical modelling. There are two key challenges: the simulation of high-fidelity spatial-temporal flows (using the Navier-Stokes equations) is computationally expensive, and the analysisand modelling of this data requires physical insights and statistical tools. In this method, a surrogate model isfirst presented for efficient flow prediction in swirl injectors with varying geometries, devices commonly used inmany engineering applications. The novelty lies in incorporating properties of the fluid flow as simplifying modelassumptions, which allows for quick emulation in practical turnaround times, and also reveals interesting flowphysics which can guide further investigations. Next, a flame transfer function framework is used for modelling

Page 187: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Latest Advances in the Theory and Applications of Design and Analysis of Experiments 183

unsteady heat release in a rocket injector. Such a model is useful not only for analyzing the stability of an injectordesign, but also identifies key physics which contribute to combustion instability. During the discussion thatfollowed, Professor Wu specifically mentioned the importance of computer experiments as well as sequentialdesigns for future developments, especially in industrial applications. This has provided a very good direction forfuture study on design of experiments.

Professor Peter Chien’s talk consisted of three topics on the design and analysis of computer experiments withcomplex data. The first topic dealt with computer codes with gradients. The gradient-enhanced Gaussian processemulator is widely used to analyze all outputs from a computer model with gradient information. The gradient-enhanced Gaussian process emulator has more numeric problems than in many multivariate cases because of thedependence of the model output and each gradient output. He had derived, and presented, a statistical theory tounderstand why this problem happens and proposed a solution using a data selection approach. The second topicconcerned computer models with invariance properties, which appear in materials science, physics and biology. Heproposed a new statistical framework for building emulators to preserve invariance. The framework uses a weightedcomplete graph to represent the geometry and introduces a new class of function, called the relabeling symmetricfunctions, associated with the graph. The effectiveness of the method was illustrated by several examples frommaterials science. The third topic presented a new class of statistical design inspired by the Samurai Sudokupuzzle. These designs have overlapping components and are useful for cross-validating data or models frommultiple sources.

Advanced Numerical Methods for Searching Optimal Designs

Analytical optimal designs are typically available only for relatively simple models or models with a very specialstructure, where mathematics can be cleverly applied to find formulas for the optimal designs and to study theirproperties. With large data becoming increasingly more available and improved modelling techniques, more real-istic but more complicated models are employed to draw accurate inference from scientific studies; for example,in fMRI studies on how the brain functions and reacts to different stimuli or how water resources can be betterconserved and efficiently utilized by studying the whole water system involving hundreds of variables in hydrol-ogy. A focus in the workshop was to develop more efficient numerical methods for finding optimal designs forhigh dimensional problems. To this end, the workshop promoted cross-disciplinary research by having engineersand computer scientists at the workshop as well. Eminent engineering experts in optimization, as well as currentgraduate students and recent graduates from statistics programmes, demonstrated how to apply state-of-the-artalgorithms, such as nature-inspired metaheuristic algorithms to solve complex design problems.

Professor Luc Pronzato discussed the problem of obtaining good experimental designs for Gaussian processmodelling in computer experiments. To determine designs, which have satisfactory space-filling properties heproposed minimax-distance (sub-)optimal designs, which minimise the maximum distance between a point ofthe region of interest and its closest design point, and thus have attractive properties in this context. As theirconstruction is difficult, even in moderate dimension Professor Pronzato considered several discretisation methodsof the experimental region, such as the determination of Chebyshev-centroidal Voronoi tessellations obtained fromfixed-point iterations of Lloyds’ method, and the construction of any-time (nested) suboptimal solutions by greedyalgorithms applied to submodular surrogates of the minimax-distance criterion.

Dr. Seongho Kim, an Associate Professor of Biostatistics at Wayne State University/Karmanos Cancer Insti-tute, described his pioneering developments in pharmacokinetics and phase II clinical trial designs using a stochas-tic global optimization algorithm, particle swarm optimization (PSO). During his presentation, he brought up threecrucial but challenging issues in parameter estimation, global optima, speed of convergence, and statistical identifi-ability, and demonstrated that PSO is able to deal with those challenges efficiently, leading to the possible openingof a new avenue of research.

Some members of the optimization community attended and gave interesting talks on this important aspect ofdesign construction. In particular, Professor Lieven Vandenberghe from Department of Engineering Departmenatat UCLA described connections between experimental designs and semidefinite programming, and provided up-to-date computing algorithms, useful for finding optimal designs efficiently. Semidefinite programming (SDP) hasimportant applications in optimisation problems that involve moment cones or, by duality, cones of nonnegative

Page 188: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

184 Five-day Workshop Reports

polynomials. Examples can be found in statistics, signal processing, control, and non-convex polynomial optimi-sation. Professor Guillaume Sagnol used semidefinite programming (SDP) and second-order cone programming(SOCP) to compute distributionally-robust (DRO) optimal designs. The distributionally robust approach can beseen as a robust counterpart of the Bayesian approach, where one optimises against the worst-case of all priorsbelonging to a family of probability distributions. Both these speakers talked about techniques to solving opti-mization problems not commonly used in the subfield of optimal designs and so attendees are now likely moreequipped with additional tools to tackle tough optimization design problems, including high dimensional opti-mizationa design problems with many covariates.

New ResearchersA morning was devoted to presentations from graduate students and recent graduates.

Dr. Rui Hu – a freshly minted Ph.D. – presented her method of constructing robust sampling designs for theestimation of threshold probabilities in spatial studies. Her method, at the design stage, addressed the problemof possible model misspecification when estimating threshold probabilities which are critical in many disciplinessuch as ecology, epidemiology, etc. This work is a successful application of the theory of robust optimal designand sparked a lot of interest and discussion among participants.

The work presented by Dr. Kirsten Schorning (Feller, et al., 2017) discussed a new field of optimal designtheory, namely the construction of optimal designs for several models, sharing some common parameter. Herresults have many applications in pharmaceutical investigations where good designs for dose response curves withcommon parameters are required (for example the placebo effect in daily or twice daily treatment). In particular shedetermined optimal designs for a recently preformed Phase II clinical trial, where the goal goal was a comparisonof two dose response curves corresponding to different treatment groups

A third talk was delivered by Dr. Maryna Prus (Prus and Schwabe, 2016), who investigated optimal designproblems for random coefficient regression (RCR) models, which are frequently used in many statistical appli-cations; especially in biosciences and medical research. In the case of multiple group RCR models individualsin different groups get different kinds of treatment and Dr. Maryna Prus determined the optimal group sizes forthe prediction of the individual parameters in multi group RCR models with a common population mean for allindividuals across all groups.

One of the young researchers scheduled to present at the meeting is Ahipasaoglu Selin from Singapore. Sheis an Assistant Professor in the Engineering Systems & Design Pillar at Singapore University of Technology andDesign since April 2012. She received her doctorate at Cornell University from Prof. Michael Todd in 2009.Afterwards, she worked as a researcher at the Operations Research and Financial Engineering Department inPrinceton University and the Management Science Group at London School of Economics. She is interested inConvex Optimization, Sparse Learning, Combinatorial Auctions, and Global Optimization. She is new to optimaldesign of experiments and was looking very forward to attend and meet top researchers in the field. However,being a Turkish citizen, her visa application was surprisingly denied. The organizers arranged to her present hertalk via video and it was very well received. Her talk followed up on Drs. Zhou and Vandenburghe’s talks onusing semi-definite programming to solve difficult optimization design problems in statistics. She presented a newperspective that many in the audience had not seen before.

Special EventThe meeting featured an evening group presentation, with much audience participation, on the Future of Design ofExperiements, with presenters Joachim Kunert, Dennis Lin, John Stufken and Boxin Tang, with Rainer Schwabe asmoderator and discussant. Joachim emphasized the continuing role of the concepts of blinding, randomization andpermutation tests. Dennis shared some of his personal views on the evolution of DOE since R. A. Fisher, and wenton to talk about some of his recent work on Design for Order-of-Addition. John asked general questions aboutour relevance as a group, with suggestions for meeting current challenges. His comments regarding the potentialof design of experiments in Big Data inspired comments on the use of such methods in methods – design andanalysis – for the IT industry. Boxin took a look at some possible future developments in computer experiments,

Page 189: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Latest Advances in the Theory and Applications of Design and Analysis of Experiments 185

factorial designs and beyond, through the lenses of optimality, robustness and fusion. Probably more discussionwas generated in this session than in any others. Some key topics raised were the need to interact with otherdisciplines, including computer science, and the need to be involved in big data and data science applications tohelp ensure scientific rigour through the application of design principles. Comments of Professor Lin regardingour collective and sometimes discouraging methods of statistical nomenclature led to the possible opening of anew avenue of research – ‘heartfelt learning’ – to exploit methodological links with AI.

Another point that was emphasised in this discussion was the ongoing need for the foundation ideas of ex-perimental design, such as randomisation, replication, blinding and blocking. These are sometimes overlooked innew areas of research. All found the panel on the future of design of experiments extremely insightful. New ideasdiscussed in the panel, such as subsampling of Big Data, are highly relevant to current research projects and canopen new angles to solve those problems.

Some take home messages from the workshop are

• All agreed that design is an important component of many scientific studies and these are well reported andreaffirmed in regulatory and scientific committee reports,

• Researchers in optimal design should work more collaboratively to realize a common goal - greater impactof the subfield of optimal experimental designs in scientific discovery,

• Discussions on how to work strategically to realize the common goal were discussed but there was no clearconsensus,

• Some thought that it is increasingly important to do more interdisciplinary work and learn from relateddisciplines, such as engineering and computer science, in how they solve optimization problems.

• All agreed that the workshop we had at Banff was intellectually stimulating, very helpful in exchangingresearch ideas and in promoting the subfield of optimal design.

ParticipantsBailey, Rosemary (University of St Andrews)Biedermann, Stefanie (University of Southampton)Chien, Peter (University of Wisconsin-Madison)Dette, Holger (Ruhr-Universitat Bochum)Gao, Lucy (University of Washington)Gilmour, Steven (Kings College London)Harman, Radoslav (Comenius University)Hu, Rui (MacEwan University)Huang, Mong-Na Lo (National Sun Yat-sen University)Kim, Seongho (Wayne State University)Kim, Skim (University of Alberta)Kunert, Joachim (TU Dortmund)Lin, Devon (Queen’s University)Lin, Dennis (PennState)Lopez-Fidalgo, Jesus (University of Navarre)Lu, Xuewen (University of Calgary)Mentre, France (University Paris Diderot and INSERM)Muller, Werner (Johannes Kepler Universitat Linz)Plagemann, Angela (Canadian Statistical Sciences Institute)Pronzato, Luc (CNRS/Universite de Nice–Sophia Antipolis)Prus, Maryna (Otto von Guericke Universitat Magdeburg)Sagnol, Guillaume (Technical University of Berlin)Schorning, Kirsten (Ruhr-Universitat Bochum)

Page 190: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

186 Five-day Workshop Reports

Schwabe, Rainer (Otto von Guericke Universitat Magdeburg)Shi, Yu (UCLA)Stufken, John (Arizona State University)Tang, Boxin (Simon Fraser University)Vandenberghe, Lieven (UCLA)Waite, Timothy (University of Manchester)Welch, Will (University of British Columbia)Wiens, Douglas (University of Alberta)Wong, Weng Kee (University of California, Los Angeles)Woods, Dave (University of Southampton)Wu, Jeff (Georgia Institute of Technology)Wynn, Henry (London School of Economics & Political Science)Xu, Xiaojian (Brock University)Yang, Min (University of Illinois at Chicago)Zhang, Zizhao (UCLA)Zhigljavsky, Anatoly (Cardiff University)Zhou, Julie (University of Victoria)

References

G. E. P. Box. (1953). “Non-normality and tests on variances,” Biometrika, 40, 318-335.

P. Y. Chen, R. B. Chen, and W. K. Wong. (2017). “Standardized maximin D-optimal designs for phar-macological models via particle swarm optimization techniques, ” Chemometrics and Intelligent LaboratorySystems. In press.

H. Dette, A. Pepelyshev, A. Zhigljavsky (2016). “Optimal designs in regression with correlated errors,”Annals of Statistics, 44(1), 113-152.

H. Dette, M. Konstantinou, A. Zhigljavsky (2017). “A new approach to optimal designs for correlatedobservations,” Annals of Statistics, 45(4), 1579–1608.

C. Feller, K. Schorning. H. Dette, G. Bermann, B. Bornkamp (2017). “Optimal designs for dose responsecurves with common parameters,” Annals of Statistics, In press.

S. H. Huang, M.-N. L. Huang, K. Shedden, W. K. Wong (2017). “Optimal group testing designs for esti-mating prevalence with uncertain testing errors,” Journal of the Royal Statistical Society, Ser.B, In press.

P. J. Huber, (1964). “Robust estimation of a location parameter, ” The Annals of Mathematical Statistics,35, 73-101.

P. J. Huber, (1981). Robust Statistics, Wiley.

Page 191: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Latest Advances in the Theory and Applications of Design and Analysis of Experiments 187

S. Kuriki, H. P. Wynn (2017). “Optimal experimental design that minimizes the width of simultaneousconfidence bands,” arXiv 1704.03995.

K. M. Lee, R. Mitra, and S. Biedermann. (2017) “Optimal design when outcome values are not missingat random, ” Statistica Sinica. In press.

M. D. Morris and L. M. Moore. (2015). Design of Computer Experiments: Introduction and Back-ground, In Handbook of Design and Analysis of Experiments, 577–591, CRM.

M. Prus and R. Schwabe (2016). “Optimal designs for the prediction of individual parameters in hierar-chical models,” Journal of the Royal Statistical Society, Ser.B, 78, 175–191.

D. P. Wiens, (1992). “Minimax designs for approximately linear regression, ” Journal of Statistical Plan-ning and Inference, 31, 353–371.

D. P. Wiens, (2015). “Robustness of Design. In Handbook of Design and Analysis of Experiments, 719–754, CRM 2015.

Page 192: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 17

Women in Numbers 4 (17w5083)

August 13 - 18, 2017Organizer(s): Chantal David (Concordia University), Michelle Manes (University of Hawaiiat Manoa), Jennifer Balakrishnan (Boston University), Bianca Viray (University of Wash-ington)

Conference at BIRS

Rationale and Goals

The first Women in Numbers workshop was held at BIRS in 2008, with the explicit goals of increasing theparticipation of women in number theory research and highlighting the contributions of women who werealready doing high-quality research in the field. In their original proposal, the organizers cited the lack ofrepresentation of women in major institutions and at major international conferences.

Since that first workshop, 84 different women have participated in three WIN conferences at BIRS, withmore than 60 other participants at the Women in Numbers – Europe conferences held at CIRM in Luminyand at the Lorentz Center. The number of women doing research in number theory is steadily growing.Though it is difficult to know exactly how much of this increase is due (directly or indirectly) to WIN, thevisibility of the WIN conferences — due both to the community that has developed and the high quality ofresearch output from these workshops — has helped to raise awareness in the broader community of theimportant contributions made by these researchers.

Another major indicator of the success of the WIN conferences is the publication output: three proceed-ings volumes containing a total of 36 papers (including some survey papers), as well as more than 10 journalarticles published elsewhere. This far surpasses the output of a typical research workshop or graduatefocused workshop such as the Arizona Winter School.

Less tangible but equally important outcomes include the lasting bonds formed between beginning re-searchers and their mentors, and the invigorating effect of the conferences on this mathematical community.WIN created a new model of working research conference designed to build supportive networks for women.Inspired by the success of WIN conferences, women in other fields have organized similar conferences atmath institutes over the past few years, each focused on building collaboration groups consisting of seniorand junior women in a given area. These include: Algebraic Combinatorixx and Women in Topology (WIT)at BIRS; Women in Shape (WiSh) at IPAM; and two Women in Applied Math conferences at IMA, Dy-namical Systems with Applications to Biology and Medicine (WhAM!) and Numerical PDEs and ScientificComputing (WhAM2!). Each of these conferences has resulted in new, high-quality mathematics researchas well as lasting collaborations among attendees.

188

Page 193: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Numbers 4 189

These are positive changes, but our work is not done. Visibility of women at international number theoryworkshops and conferences, though increasing, continues to be low, with percentages of speakers who arewomen rarely exceeding 20%. Women are still underrepresented in major research universities, especiallyat the most prestigious institutions. Women receive disproportionately less grant money from the NSFthan their peers. It is imperative that we continue to build on the success of the WIN and other researchcollaboration conferences for women, creating supportive networks for women at the early stage of theircareers. The WIN4 workshop was the next step in this continuing story.

The specific goals of the workshop were:

• to generate research in significant topics in number theory;

• to broaden the research programs of women and gender minorities working in number theory, espe-cially pre-tenure;

• to train graduate students and postdocs in number theory, by providing experience with collaborativeresearch and the publication process;

• to strengthen and extend a research network of potential collaborators in number theory and relatedfields;

• to enable faculty at small colleges to participate actively in research activities including mentoringgraduate students and postdocs; and

• to highlight research activities of women in number theory.

We would like to thank the following organizations for their support of this workshop: the Associationfor Women in Mathematics, BIRS, the Clay Mathematics Institute, Microsoft Research, the National ScienceFoundation, the Number Theory Foundation, and the Pacific Institute for the Mathematical Sciences.

Participants and Format

The focus of the workshop was on supporting new research collaborations within small groups. Before theworkshop, each participant was assigned to a working group according to her research interests. Eachgroup had two leaders chosen for their skill in both research and communication. These leaders designedprojects and provided background reading and references for their groups. At the conference, there were afew talks, but most of the time was dedicated to working groups. Each group also submitted a short writtenprogress report on their project. These reports, along with the project title and the names of the groupmembers, are included in Section 17.

There were a total of forty-two participants (with sixteen of those project leaders1), all women or genderminorities. Of those, we had:

• 16 in tenured or tenure-track faculty positions,

• 12 early-career faculty or postdocs,

• 13 graduate students, and

• 1 industry mathematician.

Significant effort was made to enlarge the WIN community as much as possible: we had 4 project leaderswho had never attended a WIN before and 24 out of 26 group members were first-time WIN-ers.

13 additional project leaders participated remotely.

Page 194: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

190 Five-day Workshop Reports

ScheduleSunday16:00 – 17:30 Check-in begins18:00 – 19:30 Dinner20:00 – 22:00 Informal Gathering

Monday07:00 – 08:45 Breakfast08:45 – 09:00 Introduction and Welcome by

BIRS Station Manager09:00 – 10:00 Introduction and Welcome

by Organizers10:00 – 10:30 Coffee Break10:30 – 12:00 Group Work12:00 – 13:30 Lunch13:30 – 14:30 Group Work14:30 – 15:30 Coffee Break15:30 – 16:30 Group Work16:30 – 17:50 Project Introductions18:00 – 19:30 Dinner

Tuesday Thursday07:00 – 09:00 Breakfast 07:00 – 09:00 Breakfast09:00 – 10:00 Group Work 09:00 – 10:00 Group Work10:00 – 11:00 Coffee Break 10:00 – 11:00 Coffee Break11:00 – 12:00 Group Work 11:00 – 12:00 Group Work12:00 – 13:30 Lunch 12:00 – 13:30 Lunch13:30 – 14:30 Group Work 13:30 – 14:30 Group Work14:30 – 15:30 Coffee Break 14:30 – 15:30 Coffee Break15:30 – 16:30 Group Work 15:30 – 16:30 Group Work16:30 – 17:50 Project Introductions 16:30 – 16:35 Group Photo18:00 – 19:30 Dinner 16:35 – 17:55 Project Introductions20:30 – 21:30 Panel Discussion 18:00 – 19:30 Dinner

Wednesday Friday07:00 – 09:00 Breakfast 07:00 – 09:00 Breakfast09:00 – 10:00 Group Work 09:00 – 10:10 Project Presentations10:00 – 11:00 Coffee Break 10:15 – 10:45 Coffee Break11:00 – 12:00 Group Work 10:50 – 11:20 Project Presentations12:00 – 13:30 Lunch 11:30 – 12:00 Checkout by Noon13:30 – 17:30 Free Afternoon 12:00 – 13:30 Lunch18:00 – 19:30 Dinner20:30 – 21:30 Panel Discussion

Outcomes

The obvious outcomes of WIN4 are the research advances made by each of the nine working groups. As withthe previous WIN workshops, these results will appear in peer-reviewed journals over the next few years. Inaddition, two of the organizers (Jennifer Balakrishnan and Michelle Manes) together with Amanda Folsomand Matilde Lalın will edit a Proceedings volume. We have recently signed a contract with Springer, and theProceedings will appear in their AWM Series. We anticipate that each group will submit at least one paper— either a survey paper or a paper with new results — for peer review. We have announced a deadline of

Page 195: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Numbers 4 191

March 31, 2018 for submissions, with the Proceedings appearing in 2019.In our minds, more important than the specific mathematical results that emerge from the conference

are the collaborations and mentoring that happens there. In particular, we held two after-dinner mentoringpanels in the lounge at BIRS. The first was primarily for early-career mathematicians, focused on job appli-cations, interviews, publishing, and grant proposals. The second was focused on long-range career planning,relevant to both early- and mid-career participants.

Only time will prove the success of the WIN4 workshop, but the excitement and energy there werepalpable. Besides plans for the Proceedings volume, plans are already in place for a third WINE (Womenin Number-Europe) conference and potential organizers for a WIN5 workshop are being contacted. Wehad a number of WIN4 participants volunteer to help keep the WIN network active, whether it be throughorganizing special sessions at AMS or AWM meetings, leading projects at future WIN/WINE workshops,editing future WIN proceedings volumes, or even organizing a future WIN or WINE workshop. We takethe enthusiasm of these volunteers as additional evidence of the benefits of the mentoring and collaborationof that occurs at WIN workshops.

Project Reports

Horizontal distribution questions for elliptic curves over QParticipants: Chantal David, Ayla Gafni, Amita Malik, Lillian Pierce, Neha Prabhu, Caroline Turnage-Butterbaugh

Let E denote an elliptic curve over Q. There are many “horizontal distribution” questions about ellipticcurves in the literature, that is, questions about the properties of the reduced curves modulo primes p onaverage over the primes. Most of them remain open questions, such as the Lang-Trotter conjecture, or theKoblitz conjecture, while some of were recently solved, as the Sato-Tate conjecture. The following conjec-ture for “extremal primes” was considered recently by [25] and [26] (where James and Pollack prove theconjecture for CM elliptic curves in the second paper). For a non-CM elliptic curve, it is conjectured that

#p ≤ x : ap(E) = [2√p] ∼ 16

x1/4

log x. (17.0.1)

This conjecture is completely open, and there are not even non-trivial upper bounds for the number of suchprimes.

Similar to Birch’s take on the Sato-Tate conjecture [5], one can look at the “vertical distribution” to getevidence for conjecture (17.0.1). This means to consider the number of curves over Fp such that ap(E) =[2√p] on average over the primes. In a recent paper, David, Koukoulopoulos and Smith [13] presented

a general framework to address such vertical distribution, which makes it possible to attack the verticaldistribution for extremal primes.

Our first goal is then to use the axiomatic framework of [13] to prove the “vertical distribution”

∑p≤x

#E/Fp : ap(E) = [2√p]′

p∼ 16

x1/4

log x,

where the dash in the numerator indicates that we are counting the number of curves E up to isogeny.A key ingredient that is needed to apply the results of [13] is to know that [2

√p] and p are independently

well distributed modulo `, i.e., a result of the type

#p ≤ x : [2√p] ≡ a mod `, p ≡ b mod ` ∼ π(x)

`(`− 1), (17.0.2)

with a good error term with sufficient uniformity in `.

Page 196: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

192 Five-day Workshop Reports

This leads to the distribution of the fractional part √p. Indeed, it is easy to see that

[2√p] = k`+ a ⇐⇒

2√p

`

∈(a

`,a+ 1

`

).

The distribution of √p has been studied by Balog [4].The first goal of our project was then to generalize Balog’s result to include more general intervals for

√p, to include a congruence condition on the primes p, and with sufficient uniformity in `. We started towork on that specific problem during the week at BIRS, and have made very good progress: the approach ofBalog generalizes to this case, with more work to control the various parameters involved. This should leadto a vertical distribution for champion primes, and we are currently writing the details of the argument.

After looking at the vertical distribution, we also want to address the problem of finding a non-trivialupper bound for (17.0.1), and this will lead to mixed distribution between the fractional parts √p andprimes splitting in the Galois extensions obtained by adding the `-torsion point of E toQ, i.e. a mix betweenthe results of Balog and the Chebotarev Density Theorem. This is analogous to (17.0.2), which can be seenas the independence between Balog’s result on the equidistribution of fractional parts and the equidistri-bution of primes in arithmetic progressions (which is the Chebotarev Density Theorem for the cyclotomicextensions).

Apollonian circle packingsParticipants: Holley Friedlander, Elena Fuchs, Piper Harron, Catherine Hsu, Damaris Schindler, Kather-ine Stange

Apollonian circle packings are fractal sets in the plane which are obtained by repeatedly adding circlesinto an initial constellation of three mutually tangent circles. Starting from such a collection of three mu-tually tangent circles C1, C2, C3 one adds all circles that are tangent to all three of those. By a theorem ofApollonius there are exactly two such circles (viewing a line as a circle with infinite radius). Given those fivecircles one can continue the process of adding circles that are tangent to three of the circles that are alreadyin the picture. We repeat this process ad infinitum and let P be the union of all the circles obtained in thisway.

Apollonian circle packings are very interesting from a number theoretic point of view. By a theo-rem of Descartes, four mutually tangent circles C1, C2, C3, C4 with curvatures a, b, c, d fulfill the followingquadratic equation

(a+ b+ c+ d)2 − 2(a2 + b2 + c2 + d2) = 0.

From this equation one can deduce that if four mutually tangent circles C1, C2, C3, C4 in the starting con-figuration have integer curvatures, then all curvatures in the packing P , as constructed above, are integers.This gives rise to a number of questions on arithmetic properties of the set of curvatures that appear in sucha packing.

It is known that for a fixed primitive integral packingP not all integersm ∈ N can occur as the curvatureof a circle in the packing. As Fuchs showed in [21], for every fixed packing P there is a non-trivial subsetof residue classes modulo 24, say Σ ⊂ Z/24Z, to which the curvatures are restricted, and this is the onlycongruence obstruction. The local-global conjecture for Apollonian packings states that every sufficientlylarge integer m with m mod 24 ∈ Σ, also occurs as a curvature in P . Building on work of Sarnak [35] andBourgain and Fuchs [6], Bourgain and Kontorovich showed in [7] that the local-global conjecture holds foralmost all positive integers m ≤ X , with an exceptional set of size at most O(X1−ε) for some positive ε.

The goal of our project is to understand the local-global conjecture for certain subcomponents of aprimitive integral packing P . We start by constructing a “tree” TP,r (it isn’t quite a tree) of tangent circlesinside a primitive integral Apollonian packing P . Fix a positive integer r and two tangent circles Ca, Cc inthe packing of curvatures a and c, such that a and c are both r-almost primes (their existence follows from[35]). We begin drawing TP,r by drawing the (tangent) circles Ca, Cc of curvature a and c. Now, add intoTP,r all those circles in P that are tangent to either Ca or Cc. Of those, let Wr,1 be the set of the circles in

Page 197: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Numbers 4 193

TP,r so far that have r-almost prime curvature, and add into TP,r all circles tangent to a circle in Wr,1. Ofthese new circles, let Wr,2 be the set of the circles that have r-almost prime curvature, and add into TP,rall circles tangent to a circle in Wr,2. Continue this process indefinitely (this process does indeed continueindefinitely as shown in [35]) to construct the tree TP,r. Note that if r = 1, when we delete from TP,r(v) allof the circles whose curvatures are not prime we will get precisely a full prime component as described atthe end of [35].

Our question is then, does there exist an r so that the resulting curvatures of circles in TP,r must satisfya local-global principle?

During the workshop we worked on numerical experiments to test such a conjecture and we discussedways to define Cayley-like graphs (which play an important role in [7]) for r-almost prime trees. Moreover,we worked on lower bounds on the number of curvatures less than X appearing in an r-almost primecomponent. Following the methods in Bourgain and Fuchs [6], we expect that we can show that the numberN(X) of curvatures of size at most X that appear in the r-almost prime tree TP,r is at least

N(X) X

(log logX)1/2.

We hope to improve this to a positive density result in the future.

Arithmetic dynamics and Galois representationsParticipants: Jamie Juul, Holly Krieger, Nicole Looper, Michelle Manes, Bianca Thompson, Laura Walton

Let K be a number field and f ∈ K(z) a rational function of degree d ≥ 2. Let Tf be the infinitepreimage tree rooted at 0, with absolute Galois group Gf acting as a subgroup of the automorphisms of Tf .There is a growing body of literature on these arboreal Galois representations. In particular, Joes [28] has aspecific conjecture for quadratic functions:

Conjecture1 1 Let K be a number field, and f(z) ∈ K(z) have degree 2. Then the index [Aut(Tf ) : Gf ] isfinite unless one of the following holds:

1. f is post-critically finite

2. f commutes with a non-trivial Mobius transformation that fixes 0

3. the critical points c1, c2 of f satisfy fr(c1) = fr(c2) for some r ≥ 2

4. 0 is a periodic point of f

Cases of this conjecture have been dealt with by Jones [28], Jones-Manes [29], and others. It is knownthat if any of these conditions holds, then the arboreal Galois representation cannot be finite index. Ourplan was to investigate this conjecture with an eye towards a very difficult question:

Question1 1 F = ft be a one-parameter family of quadratic rational maps, defined over Q, and at a markedpoint defined over Q. Let Tt be the infinite preimage tree rooted at at and Gt the absolute Galois group, as asubgroup of Aut(Tt). When is

SF := t : [Aut(Tt) : Gt] =∞a set of bounded height?

It is known by Pink [33] that the set SF will not have bounded height if any of the following conditionsare met:

• if the family is a curve of quadratic rational maps defined by a specific critical orbit relation — specif-ically, there is some iterate r > 0 such that the critical points c1, c2 satisfy fr(c1) = fr(c2) and at ≡ 0,

Page 198: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

194 Five-day Workshop Reports

• if the family is a curve for which at is periodic, or

• if f commutes with a non-trivial Mobius transformation.

However, there is in all those cases a reasonable alternative to the full automorphism group (for examplein the latter case, the centralizer of Aut(f) in Aut(Tf )) in which the Galois group has finite index, and thequestion above can be modified accordingly. The set of post-crticially finite maps in the space of quadraticrational maps is known to have bounded height by work of Benedetto-Ingram-Jones-Levy. Hence the ques-tion can be viewed as a weaker version of the conjecture; however, the question could be refined to searchfor a subgroup for which the index is finite (as in the case of the centralizer for f with nontrivial automor-phisms).

During our week at BIRS, we investigated a particular family of quadratic rational functions. We provedsome results related to height bounds on the parameter outside of which we had an attracting fixed point thatwould control much of the dynamics, and for several particular parameter choices we were able to provethat the image of Galois is the full automorphism group of the tree. We also proved that assuming Vojta’sConjecture, a similar set of conditions to those in Conjecture 1 characterizes the set of cubic polynomialsf ∈ K[x] such that [Aut(T ) : GK(f)] < ∞, where K is a number field. To do so, we make use of a resultfrom [24]. Specifically, we have the following.

Theorem 17.0.1 Let K be a number field, and let f ∈ K[x] have degree d ≥ 2, where d is prime. Supposefn(x) has at most r irreducible factors over K as n→∞, so that

fN+n(x) = fN,1(fn(x))fN,2(fn(x)) · · · fN,r(fn(x))

is the prime factorization of fn+N (x) in K[x] for any sufficiently large N . Suppose that there is an M suchthat the following holds: for all n ≥ M , and for all 1 ≤ j ≤ r, there is a multiplicity one critical point γ1 of fand a prime pn of K such that:

• vpn(fN,j(fn(γ1))) = 1

• vpn(fN,j(fn(γt))) = 0 for all t 6= 1

• vpn(fm(γt)) = 0 for all critical points γt, and all m < N + n

• vpn(d) = 0.

Then [Aut(T ) : GK(f)] <∞.

Chabauty-Coleman experiments on genus three hyperelliptic curvesParticipants: Jennifer Balakrishnan, Francesca Bianchi, Victoria Cantoral-Farfan, Mirela Ciperiani, Anas-tassia Etropolski

Let C be a hyperelliptic curve of genus g = 3 defined overQ. An affine model for C can be written in theform y2 = f(x), where f(x) is a polynomial with rational coefficients and 7 ≤ deg f(x) ≤ 8. As a specialcase of Faltings’ theorem, we know that C(Q), the set of rational points on C, is finite; moreover, by theMordell-Weil theorem, the set of rational points of the Jacobian J of C forms a finitely generated abeliangroup, i.e., J(Q) ' Zr ⊕ T .

When r < g, the Chabauty-Coleman method [12] can be used to help compute C(Q). The methodproduces a finite set of p-adic points on C containing C(Q), via the construction of p-adic integrals ofannihilating regular 1-forms. In the case when r = 1 (and g = 3), the method produces two indepen-dent regular 1-forms α, β whose (p-adic) Coleman integrals can be used to compute C(Q). It is of interestto see what other points on C the integrals of α and β cut out.

Indeed, under these hypotheses (hyperelliptic curves with g = 3 and r = 1), our goal is to producealgorithms that would allow us to use the Chabauty-Coleman method to compute the rational points of

Page 199: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Numbers 4 195

any such curve. One fully worked out example of this technique appears in the Ph.D. thesis of Wetherell[39] where he does various computations on the Jacobian of a genus 3 hyperelliptic curve to carry outChabauty-Coleman; our goal is to automate this method, while replacing the computations on the Jacobianwith computations directly on the curve.

We now also fix a prime p of good reduction and a naive height bound B ∈ N. We first determine the setSB ⊆ C(Q) of points of naive height bounded by B. We proceed if SB is non-empty. Suppose, for ease ofexposition, that we have deg f(x) = 7. Fix the embedding C → J that sends P to the class [(P ) − (∞)] ofthe divisor (P )− (∞). If P ∈ C(Q), then Q := P −∞ gives us a rational point in the Jacobian of C, and ifQ is non-torsion, by computing the three Coleman integrals

∫ P∞ ω0,

∫ P∞ ω1,

∫ P∞ ω2, where ωi = xidx

2y formsa basis of H0(C,Ω1), we may determine α and β.

We then consider the indefinite Coleman integrals of α and β as p-adic power series over all residue disksof C. The zero locus of these integrals gives a set C(Qp)1 ⊆ C(Qp) such that C(Q) ⊆ C(Qp)1. We wantto determine, in practice, how much larger C(Qp)1 is than C(Q), and further, if we can describe C(Qp)1 interms of C(Q) and other arithmetically relevant p-adic points on C.

During our week at BIRS, we implemented the algorithm described above to determine C(Qp)1 andstudied the data produced by a short list of curves pulled from the genus 3 database of hyperelliptic curvessoon to be in the L-functions and Modular Forms DataBase (LMFDB) [37]. We plan on running thesealgorithms on further curves in the LMFDB and analyzing the resulting data.

Computational aspects of supersingular elliptic curves

Participants: Efrat Bank, Catalina Camacho, Kirsten Eisentrager, Jennifer Park

Project Goal: Our goal during the week at BIRS was to solve the following problem:

Problem: Given a supersingular elliptic curve, compute its endomorphism ring.

Setup and context: One of the methods for computing the endomorphism ring is to study the `-isogenygraph of supersingular elliptic curves defined over a field of characteristic p, where p and ` are distinctprimes. Kohel’s thesis [30] outlines an algorithm to try to compute this endomorphism ring, but it is notcompletely explicit in the sense that some parts of the arguments are only sketched. Thus, we hope to makethis algorithm more concrete, and prove the existence of some extra features in the `-isogeny graph thatcould eventually aid in the efficiency of the algorithm to computing the endomorphism rings.

The setup that we will consider is as follows: For distinct primes p and `, we can define the `-isogenygraph G(p, `) as follows:

1. The vertex set V is the set of isomorphism classes of supersingular elliptic curves over the finite fieldFp2 . (This gives us, up to isomorphism, all supersingular elliptic curves in characteristic p becauseevery supersingular elliptic curve has a model defined over Fp2 .) We can think of the vertices labeledby the j-invariants of the supersingular elliptic curves. The number of vertices is roughly p/12 anddepends on the congruence class of p modulo 12 (see [36]).

2. The edge set can be described as follows: given a supersingular j-invariant j, write down an ellipticcurve E in Weierstrass form with j(E) = j. This can be done explicitly.

Choose a subgroup H of E of order `, and let E′ be the elliptic curve E′ = E/H . Now connect thevertices j = j(E) and j′ = j(E′) by an edge, which represents the `-isogeny with torsion H . Since the`-torsion on the elliptic curve E is isomorphic to Z/`Z × Z/`Z, there are ` + 1 subgroups of order `.Hence, with this construction, we get an ` + 1-regular directed graph, and the edges leaving a vertexj(E) are labelled by subgroups of E of order `. The graph is a multigraph, which means that twovertices can be connected by more than one edge, and allows self-loops.

In this setup, a loop beginning and ending at a vertex j can be interpreted as an element of the endomor-phism ring of the supersingular elliptic curve E with j(E) = j.

Page 200: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

196 Five-day Workshop Reports

By Deuring’s theorem, each endomorphism ring corresponds to a maximal order in the unique quater-nion algebra ramified at p and∞. When one can identify the maximal order corresponding to the endomor-phism ring of a particular elliptic curve, we say that the endomorphism ring has been computed.

By identifying the elements of the quaternion algebra corresponding to these loops, one has a chance offinding the generators of the maximal order, according to Kohel’s thesis [30, Chapter 7].

Parts of the project that were finished during the week at BIRS: During the week, we were able to:

1. Find a short list of at most eight elements of the quaternion algebra that could correspond to a givenloop;

2. For certain primes (p = 31, 101, 103), identify the maximal orders attached to all of the supersingularelliptic curves over Fp by using the above short list;

3. Identify some criteria for when two given loops generate an order of rank < 4;

4. Identify some criteria for when two given loops generate a non-maximal order;

5. Conjecturally identify some criteria for when two given loops generate an order of rank 4.

Quantum modular forms and singular combinatorial seriesParticipants: Amanda Folsom, Min-Joo Jang, Susie Kimport, Holly Swisher

Mock modular forms are complex-valued functions which share similar transformation properties toordinary modular forms with respect to the action of SL2(Z) on the upper-half complex plane H, but gaintheir modular transformation properties on H at the expense of losing their holomorphic properties there.The theory of mock modular forms, and the more overarching theory of harmonic Maass forms, has largelydeveloped during this century [9]; however, their origins trace back to both the original Maass forms andRamanujan’s mock theta functions from the early-to-mid 1900s. The development of the theory of harmonicMaass and mock modular forms has been of great importance within the theory of modular forms, itself amajor area of research in Number Theory related to the Langlands Program, the BSD Conjecture, andFermat’s Last Theorem, for example. Applications of harmonic Maass and mock modular forms have alsoemerged in the diverse areas of Mathematical Physics, Representation Theory, Topology, and more.

Quantum modular forms were defined by Zagier in 2010 [40]; they are similar to mock modular forms inthat they feign modularity in some way, with the notable exception that their domain is not H, but rather Q.More precisely, a weight k quantum modular form (k ∈ 1

2Z) is a complex-valued function f on P1(Q) \ S,for some appropriate S, such that for all γ = ( a bc d ) ∈ Γ ⊆ SL2(Z), the functions hγ(x) = hf,γ(x) :=f(x)− ε−1(γ)(cx+ d)−kf

(ax+bcx+d

)satisfy a ‘suitable’ property of continuity or analyticity in R (such as real

analyticity). Here, the ε(γ) ∈ C satisfy |ε(γ)| = 1.Questions of interest to many (see for example [9, 18, 19, 22, 23, 27, 40]) have been to understand spaces

of quantum modular forms, determine explicit examples of and sources of quantum modular forms, and tounderstand the relationship, if any, between quantum modular and mock modular forms. Rational cuspsare a natural boundary to the fundamental domain of a mock modular form, so on one hand, the latterproblem is a natural one to study. On the other hand, a relationship is not immediate as the domains andanalytic properties of mock and quantum modular forms are a priori different.

In this project, we address the above questions in our study of a (k+1)-variable combinatorial generatingfunction Rk(w1, w2; . . . , wk; q), k ≥ 1, for k-marked Durfee symbols introduced by Andrews [3]. Histori-cally, combinatorial functions are known to occasionally be a source of automorphic functions; likewise,such automorphic properties can sometimes be used to prove combinatorial theorems. Andrews’ celebrated(k + 1)-variable function Rk is a vast generalization of the one-variable combinatorial generating functionfor integer partitions, an ordinary modular form whose automorphic properties were famously studied anddeveloped along with the Circle Method by Hardy and Ramanujan. The automorphic properties of Rkwhen viewed as a function of τ ∈ H with q = e2πiτ and fixed wj have slowly been uncovered in a series of

Page 201: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Numbers 4 197

papers [8, 10, 17], the last of which was authored by participants Folsom and Kimport. Rk indeed exhibits“mock” behavior, though the exact type of transformations it exhibits on H depend on k, and also the wj .Our project seeks to understand the quantum properties of Rk, if any, on Q.

A number of complications arose in our study. First, it is unclear that Rk is even defined when viewed asa function on Q. Second, the automorphic properties of Rk on H depend on k and the wj . Third, showingerrors to modularity extend to analytic functions on R is non-trivial. Barring these obstacles, our strategyis to ultimately make use of the mock-automorphic transformation properties possessed by Rk on H, and touse analytic continuation to obtain quantum transformation properties on Q.

To this end, our first result establishes a quantum set of rationals on which Rk is defined; this involvesworking with a (k + 1)-fold q-hypergeometric series related to Rk by work of Andrews [3]. We then workwith general automorphic transformation properties on H established by Folsom and Kimport in [17] toproduce explicit errors to modularity exhibited by Rk. The errors which emerge are in terms of the non-holomorphic function

S(z; τ) :=∑

n∈ 12 +Z

(sgn(n)− E

((n+ Im(z)

Im(τ)

)√2 Im(τ)

)(−1)

n− 12 e−2πinzq−

n2

2

),

where E(z) := 2∫ z

0e−πt

2

dt. In certain settings, our errors to modularity in fact involve limiting versions ofS, further complicating their understanding. We next transform our errors to modularity from functionsinvolving the non-holomorphic S to period integrals

∫ i∞a/b

g(ρ)(−i(τ + ρ))−12 dρ, where the g are ordinary

modular forms, and ab ∈ Q. We are also able to turn the limiting versions of S which appear into derivatives

of period integrals after working with properties of S as proved in [41], among other things. We then employdelicate analytic arguments to deduce that the errors to modularity indeed extend to analytic functions inR, and use analytic continuation to establish that the Rk is a quantum modular form. As corollaries toour results, we also obtain – non-trivially – exact formulas for period integrals of modular forms and theirderivatives as evaluations of simple finite q-hypergeometric multi-sums when parameters are specialized toroots of unity.

Newton polygons of cyclic covers of the projective lineParticipants: Wanlin Li, Elena Mantovan, Rachel Pries, Yunqing Tang

A fundamental problem in arithmetic geometry is understanding which abelian varieties arise as Jaco-bians of (smooth) curves. This question is equivalent to studying (the interior of) the Torelli locus in Siegelvarieties.

In positive characteristic p, there are an abundance of discrete invariants associated with abelian vari-eties, e.g., the p-rank, the Newton polygon, and the Ekedahl–Oort type. These invariants give informationabout the Frobenius morphism and the number of points of the abelian variety defined over finite fields. Itis a natural question to ask which of these invariants are realized by Jacobians. As each type of discreteinvariant yields a stratification of the reduction modulo p of the Siegel variety, this problem is equivalent tounderstanding which strata intersect the Torelli locus, and its interior.

Ultimately, one would like to understand the geometry of the induced stratifications of the Torelli locus(e.g., the connected components of each stratum and their closure), in the same way that the geometry ofthe corresponding stratifications of Siegel varieties is understood. For example, in [16], Faber and van derGeer prove that for any genus g and prime p, the p-rank strata are non-empty and have the appropriatecodimension. See also [1] and [2].

In the case of the Newton polygon, more precisely the Newton polygon of the characteristic polynomialof Frobenius, much less is known beyond genus 3. Most interestingly, in 2005, Oort observed that for genusg ≥ 9, a dimension count suggests that it is unlikely for all Newton polygons to occur for Jacobians.

This project focuses on understanding the Newton polygons of cyclic covers of the project line. In [15],for any family T of cyclic covers of the projective line, Deligne and Mostow construct a PEL-type Shimuravariety containing T . In [31], Moonen shows that there are precisely twenty families of cyclic covers of the

Page 202: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

198 Five-day Workshop Reports

projective line which give rise to special subvarieties of Siegel varieties, and each of these twenty special fam-ilies agrees with the associated Shimura variety constructed in [15]. The Newton polygon and Ekedahl–Oortstratifications of PEL-type Shimura varieties are well understood by the work of Viehmann and Wedhorn[38]. Combining these results thus enables us to compile the list of all Newton polygons and Ekedahl–Oorttypes of Jacobians in each of the twenty special families in [31]. (Some of the necessary computations werecarried out using MAGMA.)

Given this list, we proceed to investigate which invariants arise for smooth curves in the families. Forthe Newton polygon corresponding to the open stratum (i.e., the µ-ordinary polygon) the answer is alwaysaffirmative. Beyond that it is less clear. In the cases when the closed stratum (which corresponds to theNewton polygon called basic) has codimension 1, a count of the number of connected components of theboundary implies that the answer is affirmative for sufficiently large primes. These two results yield severalnew examples of Newton polygons of Jacobians of smooth curves for low genera for infinitely many p.

Finally, to attack the same questions for arbitrarily large genera, we develop a new induction argument,similar in flavor to the one initially used in [16] for p-ranks and its most recent refinement for Newtonpolygons in [34]. The value of the induction argument is that it yields information about the geometry of theNewton polygon strata for higher genus, starting from geometric information for the strata in low genus.This yields new occurrences for Newton polygons of smooth cyclic covers of the projective line varying innon-special families. Unfortunately, one hypothesis is in general not stable under the inductive argument,and so far our result only yields finitely many new Newton polygons. As we have not yet exhausted allpossible settings, there is still hope to construct new examples for arbitrary large genera. Either way, weexpect to establish the non-emptiness of many/most/all Newton polygon strata for genera g ≤ 8 undercertain congruence conditions on the prime p.

Ramanujan graphs in CryptographyParticipants: Anamaria Costache, Brooke Feigon, Kristin Lauter, Maike Massierer, Anna Puskas

Our WIN4 group studied the security of a new proposal for Post-Quantum Cryptography (PQC) fromboth a number theoretic and cryptographic perspective. National Institute of Standards and Technology(NIST) will be running an international competition over the next few years to select a new system for PQC.One of the possible candidates is based on the hardness of finding isogenies between supersingular ellipticcurves. This hard problem was first proposed by Charles-Goren-Lauter in 2006 ([11]) as the basis for anew cryptographic hash function construction. A Pizer graph is the isogeny graph of supersingular ellipticcurves, SS(`, p), which can be interpreted in terms of Brandt matrices representing Hecke operators actingon spaces of modular forms. The idea behind the cryptographic applications is to use the hardness of findingpaths in these Ramanujan graphs (or inverting random walks) as a way to construct a one-way function.The hard problem is then to find paths in the graph, given the starting and ending point. In the samepaper, Charles-Goren-Lauter also proposed a PQC system based on another family of Ramanujan graphs,those of Lubotzky-Phillips-Sarnak (LPS). A 2008 paper by Petit-Lauter-Quisquater ([32]) breaks the hashfunction based on LPS graphs, whereas recent work has built on the hardness of finding isogenies betweensupersingular elliptic curves. In particular, in [14] De Feo-Jao-Plut proposed a cryptographic system basedon supersingular isogeny Diffie-Hellman as well as a set of five hard problems.

Our group had two goals related to the crypto-systems on these Ramanujan graphs. On the one side wewanted to study the weaknesses and reductions in the problems proposed by De Feo-Jao-Plut. On the otherside we wanted to study the relation between the Pizer and LPS graphs by viewing both from a numbertheoretic perspective.

For the first goal, we started to understand the relationships between the 5 problems posed by De Feo-Jao-Plut and to relate them to the hardness of the Supersingular Isogeny Graph problem which is the foun-dation for the CGL hash function.

For the second goal, we started explicating the relation between LPS graphs and Pizer graphs. Thevertex set of both of these graphs can be viewed as a set of double cosets obtained from the adelic pointsof the multiplicative group of a particular quaternion algebra, B, acted on by an order of the algebra.

Page 203: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Numbers 4 199

The particular algebra and the order one must choose depend on the graphs. For LPS graphs we must fixB = B2,∞, the quaternion algebra over Q that is ramified at 2 and ∞, and choose a non-maximal orderthat will depend on a prime p. Whereas for the Pizer graphs we choose B = Bp,∞, the quaternion algebraover Q that is ramified at p and infinity, and take the maximal order in B. Thus we were able to see that thePizer and LPS considered in [11] are not isomorphic. However, there are generalizations of both of thesegraphs that we believe do come from the same choices of quaternion algebras and orders. We are continuingto explore how explicit we can make this isomorphism and whether or not this can provide any insight intowhy the hash-function for LPS graphs has been broken, but the one for Pizer graphs has not been.

Torsion structures on elliptic curvesParticipants: Abbey Bourdon, Ozlem Ejder, Yuan Liu, Frances Odumodu, Bianca Viray

A result of Frey, which relies on Faltings’s theorem about rational points on subvarieties of abelianvarieties, states that a curve C/Q has infinitely many points of degree at most d only if 2d is at least theQ-gonality of C [20]. Moreover, the proof shows that if C has infinitely many points of degree at most dand d < gonQ(C), then the image of Symd(C) in Jac(C) contains a translate of a positive rank subabelianvariety. One can think of Frey’s result as saying that there are infinitely many points of degree at most donly when there is an infinite family parametrizing them (e.g., a P1 or a positive rank abelian variety).

Degree d points that are not parametrized by an infinite family are less understood. Such points arecalled sporadic; precisely, a closed point x on a curve C is sporadic if C has only finitely many closed pointsof degree at most deg(x). In our project, we study sporadic points on the modular curves X1(N)N≥1,whose non-cuspidal points correspond to isomorphism classes of pairs of elliptic curves with a marked pointof order N . In particular, we seek to better understand the properties of elliptic curves that can give rise tosporadic points on X1(N).

Let E/Q be an elliptic curve and let ` be a prime. If there is a sporadic point x ∈ X1(`) with j(x) = j(E),thenE must achieve a rational point of order ` in unusually low degree. This in turn implies that the image ofthe mod ` Galois representation must be unusually small. Thus one may anticipate that elliptic curves withcomplex multiplication (CM) are good candidates for producing sporadic points, and indeed any CM ellipticcurve will give rise to infinitely many sporadic points in

⋃`X1(`). In contrast, if E/Q is a non-CM elliptic

curve, Serre’s Uniformity Conjecture states that the image of the mod ` Galois representation associated Eis surjective for all primes ` > C, where C is a constant that does not depend on E (a standard guess isthat C = 37). If true, this would imply there are only finitely many non-CM non-cuspidal sporadic pointson ∪`X1(`) that correspond to elliptic curves E with j(E) ∈ Q. This observation inspires the followingquestion:

Question 17.0.2 Does there exist an absolute constant C such that if N > C, there are no non-CM non-cuspidal sporadic points on X1(N) corresponding to elliptic curves with rational j-invariant?

More generally, one may ask:

Question 17.0.3 Fix a positive integer d. Does there exist a constant C = C(d) such that if N > C, thereare no non-CM non-cuspidal sporadic points x on X1(N) with [k(j(x)) : Q] ≤ d, where j : X1(N) → X(1)denotes the natural map to X(1) = P1

j?

During our week at BIRS, our group succeeding in proving the following partial affirmative answer tothis question. We write Spor(N) for the closed subset of X1(N) consisting of all sporadic points on X1(N).

Theorem 17.0.4 Fix a number field k and assume Serre’s Uniformity Conjecture for k. Then there exists apositive integer A = A(k) such that

j

( ⋃N∈N

Spor(N)

)∩ P1

j (k) ⊂ j

⋃N |A

Spor(N)

.

Page 204: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

200 Five-day Workshop Reports

In particular, the set of k-rational j-invariants of sporadic points is finite.Moreover if the constant in Serre’s Uniformity conjecture can be taken to depend only on the degree d of k

for all number fields k of degree d, then the same is true for A. In particular, then there are only finitely manyj-invariants corresponding to sporadic points x with [k(j(x)) : Q] bounded.

ParticipantsBalakrishnan, Jennifer (Boston University)Bank, Efrat (University of Michigan)Bianchi, Francesca (University of Oxford)Bourdon, Abbey (University of Georgia)Camacho Navarro, Ana Catalina (Colorado State University)Cantoral Farfan, Victoria (ICTP)Ciperiani, Mirela (The University of Texas at Austin)Costache, Anamaria (University of Bristol)David, Chantal (Concordia University)Eisentraeger, Kirsten (The Pennsylvania State University)Ejder, Ozlem (University of Southern California)Etropolski, Anastassia (Rice University)Feigon, Brooke (The City College of New York (CUNY))Folsom, Amanda (Amherst College)Friedlander, Holley (Dickinson College)Fuchs, Elena (University of California, Davis)Gafni, Ayla (University of Rochester)Harron, Piper (University of Hawaii at Manoa)Hsu, Catherine (University of Oregon)Jang, Min-Joo (Cologne University)Juul, Jamie (Colorado State University)Kimport, Susie (Stanford University)Krieger, Holly (University of Cambridge)Lauter, Kristin (Microsoft Research)Li, Wanlin (University of Wisconsin-Madison)Liu, Yuan (University of Wisconsin - Madison)Looper, Nicole (Northwestern University)Malik, Amita (University of Illinois at Urbana-Champaign)Manes, Michelle (University of Hawaii at Manoa)Mantovan, Elena (California Institute of Technology)Massierer, Maike (University of New South Wales)Odumodu, Frances (Universite Bordeaux)Park, Jennifer (Ohio State University)Prabhu, Neha (Indian Institute of Science Education and Research-Pune)Pries, Rachel (Colorado State University)Puskas, Anna (University of Alberta)Schindler, Damaris (Utrecht University)Tang, Yunqing (IAS/Princeton University)Thompson, Bianca (Harvey Mudd College)Turnage-Butterbaugh, Caroline (Duke University)Viray, Bianca (University of Washington)Walton, Laura (Brown University)

Page 205: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] J. Achter and R. Pries. Monodromy of the p-rank strata of the moduli space of curves. Int. Math. Res.Not. IMRN, (15):Art. ID rnn053, 25, 2008.

[2] J. Achter and R. Pries. The p-rank strata of the moduli space of hyperelliptic curves. Adv. Math.,227(5):1846–1872, 2011.

[3] G.E. Andrews, Partitions, Durfee symbols, and the Atkin-Garvan moments of ranks, Invent. Math. 169(2007), no. 1, 37–73.

[4] A. Balog, On the fractional parts of pθ, Arch. Math. (Basel) 40 (1983), no. 5, 434–440.

[5] B. J. Birch, How the number of points of an elliptic curve over a fixed prime field varies. J. London Math.Soc. 43 (1968) 57–60.

[6] J. Bourgain, E. Fuchs, A proof of the positive density conjecture for integer Apollonian circle packings,Journal of the AMS, 24, (4), pp. 945–967, 2011.

[7] J. Bourgain, A. Kontorovich, On the local-global conjecture for integral Apollonian gaskets, Invent. Math.,196 (3), pp. 589-650, 2014

[8] K. Bringmann, On the explicit construction of higher deformations of partition statistics, Duke Math.J., 144 (2008), 195–233.

[9] K. Bringmann, A. Folsom, K. Ono, and L. Rolen, Harmonic Maass forms and mock modular forms:theory and applications, American Mathematical Society Colloquium Publications, Providence, R.I., toappear.

[10] K. Bringmann, F. Garvan, and K. Mahlburg, Partition statistics and quasiweak Maass forms, Int.Math. Res. Notices, 1 (2009), 63–97.

[11] D. Charles, K. Lauter and E. Goren, Cryptographic hash functions from expander graphs, Journal ofCryptology. The Journal of the International Association for Cryptologic Research 22 (2009), 93–113.

[12] R. F. Coleman, Effective Chabauty, Duke Mathematical Journal 52 (1985), No. 3, 765–770.

[13] C. David, D. Koukoulopoulos and E. Smith, Sums of Euler products and statistics of elliptic curves, toappear in Mathematische Annalen.

[14] L. De Feo, D. Jao and J. Plut, Towards quantum-resistant cryptosystems from supersingular ellipticcurve isogenies, Journal of Mathematical Cryptology 8 (2014), 209–247.

[15] P. Deligne and G. D. Mostow. Monodromy of hypergeometric functions and nonlattice integral mon-odromy. Inst. Hautes Etudes Sci. Publ. Math., (63):5–89, 1986.

[16] C. Faber and G. van der Geer. Complete subvarieties of moduli spaces and the Prym map. J. ReineAngew. Math., 573:117–137, 2004.

201

Page 206: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

202 Five-day Workshop Reports

[17] A. Folsom and S. Kimport, Mock modular forms and singular combinatorial series, Acta Arith. 159(2013), no. 3, 257–297.

[18] A. Folsom, S. Garthwaite, S-Y Kang, H. Swisher, and S. Treneer, Quantum mock modular formsarising from eta-theta functions, Research in Number Theory 2:14 (2016), 41pp.

[19] A. Folsom, K. Ono, and R.C. Rhoades, Mock theta functions and quantum modular forms, ForumMath. Pi 1 (2013), e2, 27 pp.

[20] G. Frey, Curves with infinitely many points of fixed degree, Israel J. Math. 85 (1994), no. 1–3, 79–83.

[21] E. Fuchs, Strong approximation in the Apollonian group, J. Number Theory, 131 (12), pp. 2282–2302,2011.

[22] S. Garoufalidis and T.T.Q. Le, Nahm sums, stability and the colored Jones polynomial, Res. Math. Sci.2 (2015), Art. 1, 55 pp.

[23] K. Hikami, Quantum invariant for torus link and modular forms, Comm. Math. Phys. 246 (2004), no.2, 403–426.

[24] K. Huang. Generalized Greatest Common Divisors for the Orbits under Rational Functions. Preprintavailable at arxiv: 1702.03881.

[25] K. James, B. Tran, M.-T. Trinh, P. Wertheimer, and D. Zantout, Extremal primes for elliptic curves, J.Number Theory 164 (2016), 282–298.

[26] K. James and P. Pollack, Extremal primes for elliptic curves with complex multiplication, J. NumberTheory 172 (2017), 383–391

[27] M-J Jang and S. Lobrich, Radial limits of the universal mock theta function g3, Proc. Amer. Math.Soc. 145 (2017), no. 3, 925–935.

[28] R. Jones. Galois representations from pre-image trees: an arboreal survey. Pub. Math. Besancon.(2013) 107-136.

[29] R. Jones and M. Manes. Galois theory of quadratic rational functions. Comment. Math. Helv. 89(1)(2014), 173-213.

[30] D. Kohel. Endomorphism rings of elliptic curves over finite fields. Ph.D. thesis, University of Californiaat Berkeley, 1996.

[31] B. Moonen. Special subvarieties arising from families of cyclic covers of the projective line. Doc. Math.,15:793–819, 2010.

[32] C. Petit, K. Lauter, J. Quisquater, Full cryptanalysis of LPS and Morgenstern hash functions, Securityand Cryptography for Networks (2008), 263-277.

[33] R. Pink. Profinite iterated monodromy groups arising from quadratic morphisms with infinite post-critical orbits. arXiv:1309.5804.

[34] R. Pries. Current results on Newton polygons of curves. Submitted.

[35] P. Sarnak, Letter to Lagarias on Apollonian circle packings, http://web.math.princeton.edu/sarnak/AppolonianPackings.pdf

[36] J. H. Silverman. The arithmetic of elliptic curves. Springer, New York, 2 edition, 2009.

[37] A. Sutherland, personal communication, August 2017.

Page 207: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Women in Numbers 4 203

[38] E. Viehmann and T. Wedhorn. Ekedahl-Oort and Newton strata for Shimura varieties of PEL type.Math. Ann., 356(4):1493–1550, 2013.

[39] J. L. Wetherell, Bounding the Number of Rational Points on Certain Curves of High Rank, Ph.D. thesis,University of California at Berkeley, 1997.

[40] D. Zagier, Quantum modular forms, Quanta of maths, 659–675, Clay Math. Proc., 11, Amer. Math.Soc., Providence, RI, 2010.

[41] S. Zwegers, Mock theta functions, Ph.D. Thesis, Universiteit Utrecht, 2002.

Page 208: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 18

Future targets in the classificationprogram for amenable C∗-algebras(17w5127)

September 3 - 8, 2017Organizer(s): George Elliott (University of Toronto), Zhuang Niu (University of Wyoming),Aaron Tikuisis (University of Ottawa)

Overview of the Field

C∗-algebra theory is a branch of mathematics with connections to other areas such as topology, symbolic andmeasurable dynamics, harmonic analysis, number theory, differential geometry, set theory, and geometricgroup theory. A C∗-algebra is an algebra of bounded operators on Hilbert space, that is closed under takingadjoints and in the operator norm topology. Operator algebras – C∗-algebras and von Neumann algebras –arose in the mathematical rigorization of quantum mechanics in the 1920s and ’30s.

Constructions of C∗-algebras from other mathematical objects, such as groups or dynamical systems,appeared shortly after the original physics-inspired inception of C∗-algebras. These constructions encodeinteresting information about the input objects: e.g., for groups the C∗-algebra captures unitary representa-tion theory; for dynamical systems it concerns the orbit-equivalence relation. Many other constructions havesince arisen, producing C∗-algebras associated to such objects as directed graphs, foliations, coarse metricspaces, integral domains, higher-rank graphs, and C∗-dynamical systems. Understanding the structure ofsuch C∗-algebras, and how this structure reflects the input, has been a driving force behind C∗-algebraresearch.

The classification program is a major research direction in C∗-algebra theory, with the aim of showingthat C∗-algebras which agree on K-theoretic (and thereby computable) invariants are isomorphic. K-theoryfor C∗-algebras is a generalization of topological K-theory; it is also necessary to include the simplex oftracial states and its pairing with the K0-group in the K-theoretic invariants used for classification. Theclassification program systematically tackles the question of the structure of C∗-algebras, since a given ab-stract algebra will then be isomorphic to a model with known structure – constructed to have the sameinvariant.

204

Page 209: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Future targets in the classification program for amenable C∗-algebras 205

Classification of C∗-algebras has its origins in the work of Dixmier and Glimm in the ’60s; in the early’90s, Elliott brought it to prominence as a problem in its own right, and it has subsequently seen tremendousdevelopment. Much of the recent focus has been on simple, separable, amenable, unital, infinite dimensionalC∗-algebras; the nonunital case has also seen very recent developments. The following are the major turningpoints in the theory, which have set the foundations for exciting recent developments:

• The classification, by Bratteli and Elliott, of approximately finite dimensional (AF) C∗-algebras [2, 6].

• The development by Brown–Douglas–Fillmore and Kasparov of KK-theory, a bivariant version ofK-theory [3, 18].

• Elliott’s conjecture of the classifiability of simple separable amenable unital C∗-algebras [7].

• The Kirchberg–Phillips classification of purely infinite simple separable amenable unital C∗-algebraswhich satisfy the Universal Coefficient Theorem (UCT) [29].

• The unexpected discovery of simple separable amenable unital C∗-algebras which cannot be distin-guished by traditional K-theoretic invariants, with obstructions related to high topological dimension[45, 46, 47, 36, 42, 38].

• The Toms–Winter conjecture, postulating, in response to the last point, a robust characterizationof low topological dimension for simple separable amenable unital C∗-algebras (see more in RecentDevelopments and Open Problems.) [44, 10]

• Tracial approximation, pioneered by Lin based on results of Elliott, Gong, and L. Li (see [23, 8]), andon Popa’s local quantization ([31]).

Recent Developments

The recent developments in C∗-algebra theory may be viewed in the context of three streams:

Classification. Classification, underpinned by the concept of tracial approximation, culminated recently inthe following:

Theorem 18.0.1 (Elliott, Gong, Lin, Niu) [9] LetA,B be simple unital infinite dimensional separable amenableC∗-algebras with finite nuclear dimension and for which all traces are quasidiagonal, which satisfy the Uni-versal Coefficient Theorem for KK-theory, and which have a non-zero trace. Then A ∼= B if and only ifEll(A) ∼= Ell(B).

In the above theorem, the Elliott invariant, Ell(A) of a C∗-algebra A is

Ell(A) := (K0(A),K0(A)+, [1A]K0(A),K1(A), T (A), ρA : K0(A)× T (A)→ R).

The hypotheses of finite nuclear dimension and the Universal Coefficient Theorem are discussed below. Thecase with no non-zero trace was dealt with by Kirchberg and Phillips (see [20, 29]). (In both cases, the resultfollowed a significant amount of earlier work, such as [35, 24, 25, 50, 51].)

In fact, this result came about in two stages. In the first stage, Gong, Lin, and Niu proved classificationof Z-stable C∗-algebras which satisfy certain tracial approximation hypothesis. Then, using a key resultof Winter, it was proven that the hypotheses of the above theorem are sufficient to establish the tracialapproximation hypothesis.

These ideas have been pushed further, yielding certain (but not yet complete) classification results inthe nonunital setting. For instance, Elliott, Gong, Lin, and Niu have classified simple separable C∗-algebras

Page 210: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

206 Five-day Workshop Reports

with finite nuclear dimension which are KK-contractible (UCT with zero K-groups) – the Elliott invariant inthis (non-unital) case reduces to the cone of lower semicontinuous traces, together with the subset of tracialstates

Structure. The Toms–Winter conjecture is an idea central to the structure theory of C∗-algebras, whichcomplements the classification results mentioned above. It states the following:

Conjecture 18.0.2 (The Toms–Winter conjecture) [44, 10] Let A be a simple separable unital infinite dimen-sional amenable C∗-algebra. The following conditions are equivalent.(i) A has finite nuclear dimension: dimnuc(A) <∞.(ii) A is Z-stable: A ∼= A⊗Z .(iii) A has strict comparison of positive elements.

The three eclectic properties in this conjecture have varied pertinence in different settings, or construc-tions. Powerful structural consequences ensue for C∗-algebras known to have all three of these properties.

A C∗-algebra is said to have nuclear dimension at most n if it has the completely positive approxima-tion property (the identity map approximately factorizes via uniformly bounded completely positive mapsthrough finite dimensional algebras), in a particularly controlled manner which is inspired by Lebesgue’snotion of covering dimension. This property was introduced by Winter and Zacharias, and is based on anearlier property called decomposition rank due to Kirchberg and Winter [22].

The property of Z-stability says that the C∗-algebra A is isomorphic to its tensor product with a certainspecial C∗-algebra, namely the Jiang–Su algebra Z . This algebra Z has the important property of beinga strongly self-absorbing C∗-algebra, meaning that Z ∼= Z ⊗ Z , and the isomorphism can be chosen tobe approximately unitarily equivalent to the first-factor embedding Z → Z ⊗ 1 ⊂ Z ⊗ Z , and it is notisomorphic to C [43]. Every other strongly self-absorbing C∗-algebra is Z-stable [48], and the Jiang–Sualgebra is in fact characterized by this property. The Jiang–Su algebra may be built indirectly using UHF-algebras; this enables certain properties of UHF algebras to be generalized to the algebra Z , a prominentexample being [50].

Strict comparison of positive elements is a property for a C∗-algebra that factors through a particularinvariant, the Cuntz semigroup. The Cuntz semigroup is an ordered abelian semigroup which capturescertain cohomological information about the C∗-algebra. Strict comparison is equivalent to almost unper-foration of the Cuntz semigroup, a property which captures a sense of good (or expected) behaviour in thisinvariant.

The conjecture is closely linked to the classification of C∗-algebras. It arose in response to the counterex-amples to Elliott’s classification conjecture. The hypothesis of finite nuclear dimension appears in Theorem18.0.1, and is known to be a necessary hypothesis. In fact, Z-stability is also used crucially in parts of theproof of Theorem 18.0.1, whence the known implication (i)⇒ (ii) of the Toms–Winter conjecture is essentialto this theorem.

In fact, this conjecture is very close to being established, owing to results that were announced at thisworkshop. (i)⇒ (ii) is due to Winter [49] and (ii)⇒ (iii) to Rørdam [37]. After being established in variousspecial cases (including [26, 39, 1]), it was announced at this workshop that the implication (ii) ⇒ (i) hasbeen proven in full generality, by Castillejos, Evington, Tikuisis, White, and Winter. While the validity of(iii) ⇒ (ii) has yet to be determined (there are partial results, such as [21]), a significant step forward waspresented at the workshop by Thiel. He showed that C∗-algebras with stable rank one and strict comparisonof positive elements automatically have almost divisible Cuntz semigroups; when one additionally assumeslocally finite nuclear dimension, Z-stability follows by [49].

Quasidiagonality, of C∗-algebras and of traces, is another key hypothesis in Theorem 18.0.1. In 2015,it was proven by Tikuisis, White, and Winter that for amenable C∗-algebras which satisfy the UniversalCoefficient Theorem, all faithful traces are automatically quasidiagonal [41]. It remained mysterious what

Page 211: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Future targets in the classification program for amenable C∗-algebras 207

role the Universal Coefficient Theorem really played in this result: was it just a technical hypothesis, or wasit truly necessary? Indeed, many questions around the Universal Coefficient Theorem remain mysterious(more on this below).

Putting together the exciting developments mentioned above yields the following result of a fairly defini-tive nature.

Theorem 18.0.3 Let A,B be a simple unital infinite dimensional separable amenable C∗-algebras with thefollowing properties:

(i) either Z-stability, or stable rank one plus locally finite nuclear dimension, and

(ii) satisfying the Universal Coefficient Theorem.

Then A ∼= B if and only if Ell(A) ∼= Ell(B).

A separable C∗-algebra A satisfies the Universal Coefficient Theorem if a canonical sequence,

0→ ExtZ1 (K∗(A),K∗(B))→ KK(A,B)→ Hom(K∗(A),K∗(B))→ 0,

is exact, for every separable C∗-algebra A. Many C∗-algebras are known to satisfy the Universal CoefficientTheorem; one of the key results in this direction is due to Tu, which says that C∗-algebras constructed fromamenable groupoids automatically satisfy the Universal Coefficient Theorem. However, the following is amajor open question.

Question 18.0.4 Does every separable amenable C∗-algebra satisfy the Universal Coefficient Theorem?

Dadarlat gave an excellent talk at the workshop discussing some of the subtleties of this problem.

Group actions and crossed products. As the classification of simple unital separable amenable C∗-algebrasreaches maturity, there is increasing interest in looking at group actions on C∗-algebras and at crossedproducts, where the classification results may be put to use.

Given a group G and an action α of the group (by ∗-automorphisms) on a C∗-algebra A, the crossedproduct, denoted A oα G, is a new C∗-algebra which contains A and encodes the action. An importantspecial case is when A is a commutative C∗-algebra, necessarily of the form C0(X,C); in this case, theaction corresponds to an action of G on X by homeomorphisms.

There have been a variety of approaches to showing classifiability of crossed products; however, while thestandard classification hypotheses (separability, unitality, simplicity, amenability) are either automatic orwell-understood for crossed products, we still have a lot to learn about how the hypotheses ofZ-stability/finitenuclear dimension and of the Universal Coefficient Theorem behave.

Some highlights among what is already known:

• If G is a finitely generated group with polynomial growth (equivalently, a virtually nilpotent group[14, 27]) andX is a finite dimensional locally compact Hausdorff space, then for any action ofG onX ,the crossed productC0(X)oG has finite nuclear dimension. (A result of Hirshberg and Wu, presentedin the workshop.)

• For any countable amenable group G, the generic action α of G on the Cantor set has a Z-stablecrossed product [4]. This crossed product is also known to satisfy the other hypotheses of Theorem18.0.1, so it is classifiable.

Page 212: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

208 Five-day Workshop Reports

• If G is an inductive limit of residually finite groups and A is a classifiable C∗-algebra (i.e., satisfyingthe hypotheses of Theorem 18.0.1) for which the trace space T (A) is a Bauer simplex, and α is anaction of G on A which is pointwise strongly outer, and which acts trivially (or at least periodically)on T (A), then the crossed product AoG has finite nuclear dimension (a result of Gardella–Hirshbergand Gardella–Phillips–Wang, presented in the workshop). If the crossed product also satisfies theUniversal Coefficient Theorem, then it is classifiable.

• Kerr recently defined a notion of “almost finite” actions, and proved that the crossed products of suchactions are Z-stable.

Another natural problem to turn to is the classification of actions of groups on C∗-algebras. A groupaction consists of a homomorphism from the given group to the group of ∗-automorphisms of the C∗-algebra;in case the group is equipped with a topology, one asks that this homomorphism be continuous, using thepoint-norm topology on the set of automorphisms. There are several notions of equivalence which one mayconsider; although this is not immediately apparent, cocycle conjugacy turns out to be one of the mostnatural. However, the classification problem is believed to be intractable for general actions, and generallyit is natural to restrict to actions with the Rokhlin property.

This problem is at an early stage, and there is much to do. A few highlights:

• Actions with the Rokhlin property of finite groups on a classifiable Kirchberg algebra or on a classifi-able C∗-algebra with tracial rank zero have been classified by Izumi [15, 16].

• Actions of the circle T on classifiable Kirchberg algebras with the continuous Rokhlin property havebeen classified by Gardella.

• Actions of the real numbers, R, also called flows, on classifiable Kirchberg algebras with the Rokhlinproperty have been classified by Szabo [40], an exciting new result that was presented at the workshop.

Open Problems

All participants contributed to the following list of open problems. Our thanks to Hannes Thiel for recordingthe list.

(i) The UCT-problem: Does every nuclear C∗-algebra satisfy the universal coefficient theorem (UCT)?

(ii) How bad can crossed products be? When is C(X)oG classifiable?

(iii) Let D be a strongly self-absorbing (s.s.a.) C∗-algebra with D O2. Is there a unital embeddingD → O∞ ⊗ Q, where Q denotes the universal UHF-algebra? (This can be considered as an infiniteversion of the quasidiagonality problem [52].)

(iv) Develop applications of classification. Establish Giordano–Putnam–Skau type (orbit equivalence) the-orems ([13]) for spaces that are not Cantor spaces.

(v) More examples of actions of Zd on Cantor sets ([11, 12]).

(vi) Sell classification. Simplify the proof and make it accessible to other areas of mathematics.

(vii) Is conjugacy of shifts of finite type (SFT) decidable? Can this be rephrased in terms of C∗-algebras?

(viii) Do simple C∗-algebras with real rank zero necessarily have strict comparison of positive elements?Are they Z-stable?

(ix) Range results for Cuntz semigroups.

Page 213: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Future targets in the classification program for amenable C∗-algebras 209

(x) Which rigid C∗-tensor categories embed into the representation theory of Z?

(xi) LetD be a s.s.a. C∗-algebra, letG be a torsion-free amenable group. Does there exist a unique, stronglyouter action of G on D?

(xii) Develop a theory of subfactors for inclusions of classifiable C∗-algebras ([17, 30]).

(xiii) Let G be an etale, amenable groupoid with G(0) a Cantor set such that C∗(G) is simple. Is C∗(G)Z-stable?

(xiv) LetA be a simple, classifiable C∗-algebra, and letG be a (finite) group of automorphisms of the Elliottinvariant Ell(A). Does G lift to an action on A? Does the map Aut(A)→ Aut(Ell(A)) split?

(xv) Let A be a simple, classifiable C∗-algebra, let a ∈ A, and let π : A → B(H) be an irreducible repre-sentation. Does H have an π(a)-invariant subspace?

(xvi) Is every simple, classifiable C∗-algebra a groupoid C∗-algebra ([5, 34])? Is every simple (exact) C∗-algebra a groupoid C∗-algebra?

(xvii) Develop analogies of Popa’s rigidity theory (for actions of property (T) groups on s.s.a. C∗-algebras)([32, 28, 33]). Is there a theory of intertwining by bimodules for C∗-algebras?

(xviii) Classify non-simple TAF algebras.

(xix) Is every real rank zero ASH algebra with slow dimension growth TAF?

(xx) Develop a better understanding of completely positive approximations. In particular, when is an in-ductive limit, in the category of operator systems, of finite-dimensional C∗-algebras, order-isomorphicto a C∗-algebra?

(xxi) What are the possible Cowling–Haagerup invariants for separable, simple, exact C∗-algebras?

Scientific Progress Made

The workshop brought together experts in various aspects of C∗-algebra theory, and created an environmentwhich fostered collaboration and the generation of new ideas. Here are a few examples of specific progressmade during the meeting.

• Francesc Perera, Leonel Robert, and Hannes Thiel made advances in understanding the Cuntz semi-group, and specifically addressing the question of when infima exist in this ordered C∗-algebra invari-ant.

• Søren Eilers, Jamie Gabe, and Takeshi Katsura made a decisive breakthrough on the problem ofdetermining when an extension of two simple graph C∗-algebras is again a graph C∗-algebra.

• Gabor Szabo benefitted from discussions with experts in C∗-algebra classification, leading to an insightthat will likely lead to new results.

• Jose Carrion, Chris Schafhauser, Aaron Tikuisis, and Stuart White initiated a new collaborativeproject on aspects of C∗-algebra classification.

Page 214: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

210 Five-day Workshop Reports

Outcome of the Meeting

There was a lot of sharing of expertise and learning at this meeting. This led to new collaborations, andinspired serious discussions of future directions for researchers in the classification of C∗-algebras, andmore generally, directions for the field of C∗-algebras itself. It is expected that there will be a number offuture meetings which follow up on these directions, including Noncommutative Geometry and OperatorAlgebras 2018: C∗-algebras and Dynamics (Munster).

Participants

an Huef, Astrid (Victoria University of Wellington)Blackadar, Bruce (University of Nevada at Reno)Brenken, Berndt (University of Calgary)Carrion, Jose (Texas Christian University)Dadarlat, Marius (Purdue University)Deeley, Robin (University of Hawaii)Eckhardt, Caleb (Miami University)Eilers, Soren (University of Copenhagen)Elliott, George (University of Toronto)Evington, Samuel (University of Glasgow)Farah, Ilijas (York University)Gabe, Jamie (University of Southampton)Gardella, Eusebio (University of Muenster)Geffen, Shirly (Ben Gurion University)Giordano, Thierry (University of Ottawa)Gong, Guihua (University of Peurto Rico)Hirshberg, Ilan (Ben Gurion University of the Negev)Ivanescu, Cristian (MacEwan University)Jacelon, Bhishan (University of Toronto)Katsura, Takeshi (Keio University)Kennedy, Matthew (University of Waterloo)Lee, Hyun Ho (University of Ulsan)Li, Xin (Queen Mary University of London)Li, Kang (Westfalische Wilhelms-Universitat Munster)Lin, Huaxin (University of Oregon)Lutley, James (University of Toronto)Musat, Magdalena (University of Copenhagen)Ng, Ping Wong (University of Lousiana at Lafayette)Niu, Zhuang (University of Wyoming)Perera, Francesc (Universitat Autonoma de Barcelona)Phillips, Christopher (University of Oregon)Putnam, Ian (University of Victoria)Robert, Leonel (University of Louisiana at Lafayette)Rordam, Mikael (University of Copenhagen)Sato, Yasuhiko (Kyoto University)Schafhauser, Christopher (University of Waterloo)Skau, Christian (Norwegian University of Science and Technology)Strung, Karen (Radboud University)Szabo, Gabor (University of Aberdeen)Thiel, Hannes (University of Munster)

Page 215: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Future targets in the classification program for amenable C∗-algebras 211

Tikuisis, Aaron (University of Ottawa)Toms, Andrew (Purdue University)Viola, Maria Grazia (Lakehead University)Wang, Qingyun (University of Oregon)White, Stuart (University of Glasgow)Winter, Wilhelm (University of Muenster)Wu, Jianchao (Texas A&M University)Zacharias, Joachim (University of Glasgow)

Page 216: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] Joan Bosa, Nathanial P. Brown, Yasuhiko Sato, Aaron Tikuisis, Stuart White, and Wilhelm Winter.Covering dimension of C∗-algebras and 2-coloured classification. Mem. Amer. Math. Soc. To appear.arXiv:1506.03974.

[2] Ola Bratteli. Inductive limits of finite dimensional C∗-algebras. Trans. Amer. Math. Soc., 171:195–234,1972.

[3] Lawrence G. Brown, Ronald G. Douglas, and Peter A. Fillmore. Extensions of C∗-algebras and k-homology. Ann. of Math. (2), 105(2):265–324, 1977.

[4] Clinton T. Conley, Steve Jackson, David Kerr, Andrew Marks, Brandon Seward, and Robin Tucker-Drob. Følner tilings for actions of amenable groups. arXiv:1704.00699.

[5] Robin Deeley, Ian F. Putnam, and Karen Strung. Constructing minimal homeomorphisms on point-like spaces and a dynamical presentation of the Jiang–Su algebra. Trans. Amer. Math. Soc. To appear.arXiv:1503.03800.

[6] George A. Elliott. On the classification of inductive limits of sequences of semisimple finite-dimensionalalgebras. J. Algebra, 38(1):29–44, 1976.

[7] George A. Elliott. The classification problem for amenable C∗-algebras. In Proceedings of theInternational Congress of Mathematicians, Vol. 1, 2 (Zurich, 1994), pages 922–932. Birkhauser, Basel,1995.

[8] George A. Elliott and Guihua Gong. On the classification of C∗-algebras of real rank zero. II. Ann. ofMath. (2), 144(3):497–610, 1996.

[9] George A. Elliott, Guihua Gong, Huaxin Lin, and Zhuang Niu. On the classification of simple amenableC∗-algebras with finite decomposition rank, II. arXiv:1507.03437, 2015.

[10] George A. Elliott and Andrew S. Toms. Regularity properties in the classification program for separa-ble amenable C∗-algebras. Bull. Amer. Math. Soc. (N.S.), 45(2):229–245, 2008.

[11] Thierry Giordano, Hiroki Matui, Ian F. Putnam, and Christian F. Skau. Orbit equivalence for Cantorminimal Z2-systems. J. Amer. Math. Soc., 21(3):863–892, 2008.

[12] Thierry Giordano, Hiroki Matui, Ian F. Putnam, and Christian F. Skau. Orbit equivalence for Cantorminimal Zd-systems. Invent. Math., 179(1):119–158, 2010.

[13] Thierry Giordano, Ian F. Putnam, and Christian F. Skau. Topological orbit equivalence and C∗-crossedproducts. J. Reine Angew. Math., 469:51–111, 1995.

[14] Mikhael Gromov. Groups of polynomial growth and expanding maps. Inst. Hautes Etudes Sci. Publ.Math., (53):53–73, 1981.

212

Page 217: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Future targets in the classification program for amenable C∗-algebras 213

[15] Masaki Izumi. Finite group actions on C∗-algebras with the Rohlin property. I. Duke Math. J.,122(2):233–280, 2004.

[16] Masaki Izumi. Finite group actions on C∗-algebras with the Rohlin property. II. Adv. Math.,184(1):119–160, 2004.

[17] Vaughan F. R. Jones. Index for subfactors. Invent. Math., 72(1):1–25, 1983.

[18] Gennadi G. Kasparov. Equivariant KK-theory and the Novikov conjecture. Invent. Math., 91(1):147–201, 1988.

[19] Eberhard Kirchberg. Central sequences in C∗-algebras and strongly purely infinite algebras. InOperator Algebras: The Abel Symposium 2004, volume 1 of Abel Symp., pages 175–231. Springer,Berlin, 2006.

[20] Eberhard Kirchberg and N. Christopher Phillips. Embedding of exact C∗-algebras in the Cuntz alge-bra O2. J. Reine Angew. Math., 525:17–53, 2000.

[21] Eberhard Kirchberg and Mikael Rørdam. Central sequence C∗-algebras and tensorial absorption ofthe Jiang-Su algebra. J. Reine Angew. Math., 695:175–214, 2014.

[22] Eberhard Kirchberg and Wilhelm Winter. Covering dimension and quasidiagonality. Internat. J.Math., 15(1):63–85, 2004.

[23] Huaxin Lin. Tracially AF C∗-algebras. Trans. Amer. Math. Soc., 353(2):693–722, 2001.

[24] Huaxin Lin. Classification of simple C∗-algebras of tracial topological rank zero. Duke Math. J.,125(1):91–119, 2004.

[25] Huaxin Lin. Asymptotic unitary equivalence and classification of simple amenable C∗-algebras. Invent.Math., 183(2):385–450, 2011.

[26] Hiroki Matui and Yasuhiko Sato. Decomposition rank of UHF-absorbing C∗-algebras. Duke Math. J.,163(14):2687–2708, 2014.

[27] John Milnor. Growth of finitely generated solvable groups. J. Differential Geometry, 2:447–449, 1968.

[28] Narutaka Ozawa and Sorin Popa. On a class of II1 factors with at most one Cartan subalgebra. Ann.of Math. (2), 172(1):713–749, 2010.

[29] N. Christopher Phillips. A classification theorem for nuclear purely infinite simple C∗-algebras. Doc.Math., 5:49–114 (electronic), 2000.

[30] Sorin Popa. Classification of amenable subfactors of type II. Acta Math., 172(2):163–255, 1994.

[31] Sorin Popa. On local finite-dimensional approximation of C∗-algebras. Pacific J. Math., 181(1):141–158, 1997.

[32] Sorin Popa. Strong rigidity of II1 factors arising from malleable actions of w-rigid groups. I. Invent.Math., 165(2):369–408, 2006.

[33] Sorin Popa and Stefaan Vaes. Unique Cartan decomposition for II1 factors arising from arbitraryactions of free groups. Acta Math., 212(1):141–198, 2014.

[34] Ian F. Putnam. Some classifiable groupoid C∗-algebras with prescribed K-theory. Math. Ann. Toappear. arXiv:1611.04649.

[35] Mikael Rørdam. Classification of inductive limits of Cuntz algebras. J. Reine Angew. Math., 440:175–200, 1993.

Page 218: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

214 Five-day Workshop Reports

[36] Mikael Rørdam. A simple C∗-algebra with a finite and an infinite projection. Acta Math., 191(1):109–142, 2003.

[37] Mikael Rørdam. The stable and the real rank of Z-absorbing C∗-algebras. Internat. J. Math.,15(10):1065–1084, 2004.

[38] Mikael Rørdam and Erling Størmer. Classification of nuclear C∗-algebras. Entropy in operatoralgebras, volume 126 of Encyclopaedia of Mathematical Sciences. Springer-Verlag, Berlin, 2002. Op-erator Algebras and Non-commutative Geometry, 7.

[39] Yasuhiko Sato, Stuart White, and Wilhelm Winter. Nuclear dimension and Z-stability. Invent. Math.To appear. arXiv:1403.0747.

[40] Gabor Szabo. The classification of Rokhlin flows on C∗-algebras. arXiv:1706.09276.

[41] Aaron Tikuisis, Stuart White, and Wilhelm Winter. Quasidiagonality of nuclear C∗-algebras. Ann. ofMath. (2), 185(1):229–284, 2016.

[42] Andrew S. Toms. On the classification problem for nuclear C∗-algebras. Ann. of Math. (2),167(3):1029–1044, 2008.

[43] Andrew S. Toms and Wilhelm Winter. Strongly self-absorbing C∗-algebras. Trans. Amer. Math. Soc.,359(8):3999–4029, 2007.

[44] Andrew S. Toms and Wilhelm Winter. The Elliott conjecture for Villadsen algebras of the first type. J.Funct. Anal., 256(5):1311–1340, 2009.

[45] Jesper Villadsen. Simple C∗-algebras with perforation. J. Funct. Anal., 154(1):110–116, 1998.

[46] Jesper Villadsen. On the stable rank of simple C∗-algebras. J. Amer. Math. Soc., 12(4):1091–1102,1999.

[47] Jesper Villadsen. Comparison of projections and unitary elements in simple C∗-algebras. J. ReineAngew. Math., 549:23–45, 2002.

[48] Wilhelm Winter. Strongly self-absorbing C∗-algebras are Z-stable. J. Noncommut. Geom., 5(2):253–264, 2011.

[49] Wilhelm Winter. Nuclear dimension and Z-stability of pure C∗-algebras. Invent. Math., 187(2):259–342, 2012.

[50] Wilhelm Winter. Localizing the Elliott conjecture at strongly self-absorbing C∗-algebras. J. ReineAngew. Math., 692:193–231, 2014.

[51] Wilhelm Winter. Classifying crossed product C∗-algebras. Amer. J. Math., 138(3):793–820, 2016.

[52] Wilhelm Winter. QDQ vs. UCT. In Operator Algebras and Applications, volume 12 of Abel Symp.,pages 321–342. Springer, Cham, 2016.

Page 219: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 19

Lattice walks at the Interface of Algebra,Analysis and Combinatorics (17w5090)

September 17 - 22, 2017Organizer(s): Marni Mishna (Simon Fraser University), Mireille Bousquet-Melou (Uni-versite de Bordeaux/ Centre National de la Recherche Scientifique), Michael Singer (NorthCarolina State University), Stephen Melczer (University of Waterloo & ENS Lyon)

Overview of the field

Lattice Path Models

Lattice paths are a classic object of mathematics, with applications in a wide range of areas including com-binatorics, theoretical computer science and queuing theory. In the past ten years, several new approacheshave emerged to determine formulas for exact enumeration. In particular, several systematic methods havearisen from very disparate areas of mathematics. The study of lattice walks restricted to the first quadrantis surprisingly illuminating; it reveals connections between abstract combinatorial structures and classes ofanalytic functions.

Here we define a lattice path model to be a combinatorial class of walks on the integer lattice, typi-cally starting at the origin, and remaining in some cone. It is prescribed by a finite set of vectors S, theallowable steps one can take in the lattice. Much attention has been paid to walks with small steps, whereS ⊆ −1, 0, 12, constrained to the quarter plane, Z2

≥0 = (x, y) ∈ Z2 : x ≥ 0, y ≥ 0. We frequentlyabbreviate the small steps using compass directions or vector diagrams. A walk of length n which startsat (x0, y0) is a sequence of steps (s1, . . . sn) such that si ∈ S and all points (x0, y0) +

∑`i=1 si : 0 ≤

` ≤ n remain in the designated cone. Figure 19.1 is a walk from the quarter plane model with step setS = (0, 1), (0,−1), (1, 0), (−1, 0).

The basic enumerative problem is to determine formulas (exact or asymptotic) for the number of walksof length n in a given model. Early studies [24, 21] determined formulas for particular models, generallyusing adhoc methods. Fayolle, Iagnogorski and Malyshev completed a comprehensive study of stationarydistribution of small step models in the quarter plane. They built a functional equation, and providedan analytic toolkit to resolve cases. The work inspired Bousquet-Melou, who adapted their methods for

215

Page 220: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

216 Five-day Workshop Reports

Figure 19.1: A walk of length 1000 from the simple quarter plane model with step set S =(−1, 0), (1, 0), (0,−1), (0, 1)

exact enumeration. She was able to directly derive the results of Kreweras and Gessel in a straightforwardmanner. This approach was adapted to many models by Bousquet-Melou and Mishna [12], who provided astructured analysis of the generating functions for the small step models restricted to the first quadrant.

Specifically, they determined that of the 28 − 1 possible small step models, there were precisely 79 thatwere combinatorially distinct and nontrivial. They provided generating function expressions for 22 of them,and made conjectures on the nature of the generating functions of the rest. The conjectures use a relatedgroup defined for each step set (inspired by the work of Fayolle et al.) This presented a tidy, and clearlyintriguing, classification challenge.

A wide variety of techniques have been developed in the interim, and this meeting sought to bring re-searchers together to discuss commonalities, feasible extensions, and future directions. The approachesdiscussed at this meeting included bijective methods, probability, computer algebra, complex analysis, al-gebraic geometry, and differential Galois theory. There are many references for these methods [12, 4, 19,7, 27, 16]. In this document we outline some of these topics as they apply to lattice path enumeration, anddiscuss the implications in generating function classification.

The generating function and its kernel

We revisit the idea of more general cones towards the end of the document. For now, consider the quarterplane and a fixed given step set S. Let e(k,`)→(k′,`′)(n) denote the number of walks in the cone startingat (k, `), ending at (k′, `′), taking n steps, each from S. The complete generating function for these numbersis the following series, viewed to be an element of Q[x, y][[t]]:∑

k,`,n≥0

e(0,0)→(k,`)(n)xk y`tn.

Formally, weights on the steps add no significant difficulty to the set up. Each step is assigned a (real,positive) weight, and the weight of a walk is the product of weights of its component steps. Remark, werecover the unweighted model by setting the weight of each step to be 1 (and hence the weight of each walkis 1). Alternatively, if the sum of the weights is 1, we describe a probability model. More precisely, let(di,j)(i,j)∈0,±12 be a family of elements of Q ∩ [0, 1] such that

∑i,j di,j = 1. The weight di,j can be viewed

as the probability that a walk take a step in the direction (i, j), with d0,0 the probability that it does notmove.

For any (i, j) ∈ Z2≥0 and any n ∈ Z≥0, we let qi,j(n) be the sum of the weights of all walks of length n

Page 221: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Lattice walks at the Interface of Algebra, Analysis and Combinatorics 217

starting at the origin, and ending at the point (i, j). Our main focus here is the trivariate generating series

Q(x, y; t) :=∑i,j

qi,j(n)xiyjtn. (19.0.1)

It is sufficient to consider the case∑i,j di,j = 1, by suitably scaling the t variable (although many tech-

niques are custom made for the unweighted version). Then, for any n ∈ Z≥0, |qi,j(n)| ≤ ∑i,j |qi,j(n)| ≤(∑i,j |di,j |)n = 1. We deduce that Q(x, y; t) converges for all (x, y; t) ∈ C3 such that |x| < 1, |y| < 1 and

|t| ≤ 1.

The inventory of the step set is a Laurent polynomial which we might write as a function of eithervariable x or y:

S(x, y) =∑

(i,j)∈0,±12 di,jxiyj

= A−1(x) 1y +A0(x) +A1(x)y

= B−1(y) 1x +B0(y) +B1(y)x.

(19.0.2)

Here Ai(x) ∈ x−1Q[x], and Bi(y) ∈ y−1Q[y]. The kernel of the walk is defined by

K(x, y, t) := xy(1− tS(x, y)). (19.0.3)

A walk is either a walk of length 0, or a walk followed by a valid step. This straightforward decomposi-tion has a natural translation into generating functions, into what we call the kernel equation. A propertyspecific to models of walks in the quarter plane is that the generating function for walks that end on aboundary can be obtained from an evaluation of Q(x, y; t). For example, the family of walks that begin andend at the origin have generating function

∑n≥0 q0,0(n)tn = Q(0, 0; t). The walks that end on the y-axis

have generating function∑n,k≥0 qk,0(n)xktn = Q(x, 0; t). This is a particularly convenient observation.

Following [18, Section 1], and translating the aforementioned combinatorial recurrence, we may provethat the generating series Q(x, y, t) satisfies the following functional equation:

K(x, y, t)Q(x, y; t) = xy −R(x, t)− S(y, t) + td−1,−1Q(0, 0, t) (19.0.4)

whereR(x, t) := K(x, 0, t)Q(x, 0; t), and S(y, t) := K(0, y, t)Q(0, y; t). (19.0.5)

Many enumeration strategies start with Equation (19.0.4), and then differ in their subsequent manipu-lations. This report summarizes only some of them. Central to many of these methods is the property ofsmall step walks is that K(x, y; t) is quadratic. Considering larger steps is considerably more complicated.Nonetheless, stepsets with negative steps of length 2 have recently been thoroughly considered [10].

Classification of generating functions

Lattice walks provide a unified context to examine combinatorial origins of different function classes. Thecombinatorial complexity of a given model is correlated to the analytic complexity of its generating function.It remains open to completely formalize many of the known connections.

Rational A generating function is rational if it is the quotient of two polynomials. Generating functionsof regular languages are rational. The coefficients of rational series, and their asymptotics are extremelywell understood, see [20] for a pedagogical treatment. For example, the the quarter plane model with stepset N,NE,E is unconstrained, and consequently it is easy to show the generating function is rational:QN,NE,E(x, y; t) = 1

1−t(x+xy+y) .

Page 222: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

218 Five-day Workshop Reports

Figure 19.2: Different classes of generating functions

Algebraic An univariate generating function f(t) is algebraic if there is a polynomial P (u, v) ∈ C[u, v]such that P (f(t), t) = 0. The polynomial P provides a finite encoding for f(t), and can be used in compu-tations and closure properties. The asymptotics are also relatively well understood. A multivariate functionof m varables is algebraic if it solves a polynomial equation in m + 1 variables. Rational functions arealgebraic.

From the combinatorial perspective, combinatorial classes such as trees and maps possess the archetypalalgebraic structure, and have algebraic generating functions.

In the realm of lattice models, unidimensional walks have algebraic generating functions, and many two-dimensional models can be reduced to this case. Expressions for the generating functions, and the coefficientasymptotics are straightforward to compute.

Transcendental D-finite A univariate function is differentiably finite, commonly called D-finite or holonomic,if it satisfies a linear differential equation with polynomial coefficients. In 1980 Stanley highlighted the in-terest to combinatorialists by pointing out the very combinatorial collection of closure properties: includingdifferentiation, algebraic substitution and Hadamard product. It remains open to find a combinatorialmechanism that adequately captures this class of functions. It strictly contains algebraic functions.

The following result (compiled from work of Kath, Andre, and Garoufalidis) is useful to show that afunction is not D-finite.

Theorem 19.0.1 If a series∑ant

n in Z[t] is D-finite, with radius of convergence in (0,∞), then its singularpoints are regular with rational exponents. Consequently, the asymptotic expansion has the form of a finite sum

an ∼∑

λ−nnα logk(n)fλ,α,k(1/n)

where α is rational, λ is algebraic, k is a nonnegative integer, and f is a polynomial over Q[x].

For example, Bostan, Raschel and Salvy proved that several models of excursions have non-D-finite gener-ating functions because the sub-exponential growth is irrational [7].

A multivariate function is D-finite with respect to a set of variables if it is D-finite with respect to each ofthe variables.

Page 223: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Lattice walks at the Interface of Algebra, Analysis and Combinatorics 219

Diagonals The diagonal of a formal power series∑i1,...,id≥0 fi1,...,idz

i11 · · · zidd is the series (∆F )(t) =∑

n≥0 fn,...,ntn.

The diagonal of a multivariate function is defined for a particular series expansion of that functionaround the origin.

Furstenburg showed in 1967 that algebraic functions can be expressed as a diagonal of a bivariate ratio-nal function by providing the rational based on the polynomial that the function satisfies. Lipshitz showedthat the diagonal of a D-finite multivariate series was D-finite. There do exist D-finite functions which couldnot be a diagonal of a rational, for example ex, but perhaps if the conditions are right, there might be anequivalence. The following conjecture has been considered for a while:

Question 19.0.2 (Christol, 1990) Can any D-finite series with integer coefficients and a positive radius of con-vergence can be expressed as a diagonal of a rational series?

Differentiably algebraic A function is differentiably algebraic, or D-algebraic, if there exists an algebraicrelationship between f(x) and its derivatives. More precisely, there exists a polynomial P ∈ C[x] so that

P (f(x), f ′(x), . . . , f (k)(x)) = 0.

This class clearly contains the D-finite functions, but is otherwise less well understood.

Beyond this Finally, we say that the remaining functions are hyper transcendental.

Classifying Lattice paths

So where do the generating functions of the 79 small step quarter plane models fall? Attempts to answer thisquestion have spawned a very active area of research. The answers provide insight on the subtle differencesof the function classes. Figure 19.3 is a compilation of results from numerous articles.

Slightly less is known about the univariate counting generating functions. Since these classes are closedunder algebraic substitutions, which includes evaluations, we can deduce some containments, but we cannotconclude that many of the step sets with non-D-finite complete generating function have a non-D-finitecounting generating function.

Question 19.0.3 Describe combinatorial criteria on 2D lattice models which determine the nature of the gener-ating functions. Which criteria hold in problems of higher dimension?

Question 19.0.4 Prove the remaining unclassified small step univariate counting generating functions non-D-finite.

Computer Algebra

Part of the appeal of lattice path models is the ease with which one can experiment using computer algebratools. It is relatively straightfoward to develop a series expansion from the basic recurrence, and then formhypotheses and conjectures about the generating functions.

Page 224: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

220 Five-day Workshop Reports

Figure 19.3: The nature of the complete generating function Q(x, y; t) for the 79 small step quarter plane latticepath models

Guess ’n’ prove

Software to guess the algebraic or differential equation satisfied by a series has been used enumerativecombinatorics for nearly 30 years. The main implementions work by generating a high order truncation ofthe (possibly multivariate) generating function, followed by a guessing stage which tries to fit this truncationinto algebraic or differential equations of various orders and degrees using Pade approximants. In Maple,this is implemented in the gfun [30] package.

The equations that are found remain conjectural, until they are proved. In this field, the most famousexample, perhaps, is known as “Gessel’s walk”. In 2000 Ira Gessel, via personal communication, suggestedthat the generating function for the quarter plane walks with step set SW,W,NE,E should be D-finite.This was based on the observation that the counting sequence for excursions, that is, the walks that start andend at the origin, appeared to fit very nice formula. This lead to a very vigourous discussion, as the othermodels with D-finite generating functions were easily guessed to be, and this model resisted the approach.

The D-finiteness of the excursion generating function was published in 2009 [23]. The proof used acutting edge holonomic systems approach, and creative telescoping to confirm Gessel’s conjecture. Still, itremained open to determine the nature of the complete generating function.

Around the same time, Bostan and Kauers [4] used the aforementioned approach based on Pade-Hermiteapproximants to systematically guess algebraic and differential equations satisfied by the 79 non-isomorphictwo dimensional models in the quarter plane. In addition to detailing several algebraic and arithmeticconditions which help one believe that a guessed differential equation truely annihilates the multivariategenerating function, the authors produced an influential table of guessed asymptotics for the 23 models theypredicted to have D-finite generating function.

It surprised many when Bostan and Kauers [5] proved not only the D-finiteness, but the algebraicityof the multivariate Gessel generating function Q(x, y; t)! Their approach rigorously certified a guessedminimal polynomial of Q(x, y; t) using algebraic elimination theory. The result was a computational tourde force – the algebraic equation that Q(x, y; t) satisfies requires several MB to store. Researchers werechallenged to demonstrate the algebraicity using “human” techniques. The community delivered, and suchresults appeared in the following few years. Indeed, resolving this problem is a recurrent theme. For moredetails, see the approachable survey of Bostan and Raschel [6].

Computer algebra methods have proved indispensible for providing bulk results. Notably, Bostan,Raschel and Salvy [7] iterated through to show that the excursions in 56 of the models did not have a D-finitegenerating function because the asymptotic expressions were incompatible. Specifically, the subexponentialgrowth had an irrational exponent.

Recently, Bostan et al. [11] proved all guessed annihilating algebraic and differential equations for the23 D-finite models using creative telescoping on diagonal expressions, and wrote lattice path generatingfunctions as explicit integrals of hypergeometric functions. These expressions, in turn, allowed the authors

Page 225: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Lattice walks at the Interface of Algebra, Analysis and Combinatorics 221

to classify all transcendental D-finite generating functions, and determine which combinatorially interestingspecializations of the transcendental generating functions are algebraic.

In this workshop, the topic was introduced by a thorough mini-course given by Manuel Kauers. Severaltalks addressed computational aspects, including how to prove the transcendance of a D-finite generatingfunction from an annihilating differential equation.

This is but a small snapshot of the computer algebra methods in this field. A far more complete anddetailed summary is the habilitation thesis of Bostan [8].

Question 19.0.5 The computational methods of Bostan et al. [9] show that the number of excursions on thestep set (−1, 0), (−1, 0), (−1, 1), (−1,−1), (1, 0), (1, 1) (note the double (−1, 0) step) satisfy

a2n =6(6n+ 1)!(2n+ 1)!

(3n)!(4n+ 3)!(n+ 1)!

(there are no excursions of odd length). Find a ‘human’ proof of this result.

Question 19.0.6 The four quadrant models with algebraic multivariate generating functions have zero orbit-sum, meaning the kernel method does not allow one to represent these generating functions as diagonals ofrational functions. As algebraic functions, they can be written as diagonals of bivariate rational functionsthrough known constructions, but these representations are unweildy and hard to analyze compared to the nice‘combinatorial’ expressions arising from the kernel method. Do there exist simple ‘combinatorial’ rationalfunctions whose diagonals give the generating functions of the 4 models with algebraic generating function?

Enhanced data manipulation

The strategies for guessing apprimations of asymptotic formulas from initial series data are becoming in-creasingly sophisticated. If one assumes that the generating function f(z) =

∑anz

n has a power lawsingularity of the form

f(x) ∼ C(

1− x

x0

)α,

the series can be analyzed to produce high precision numerical estimates for the unknown quantities. In thisworkshop, Tony Guttman talked about how to use differential approximants to predict subsequent termsfrom some initial data. Every differential approximant naturally reproduces exactly all coefficients used inits derivation. Being a D-finite differential equation, it implies the value of all subsequent coefficients. Thesesubsequent coefficients will usually be approximate.

These techniques have been used by the Melbourne school for a variety of combinatorial problems, andrecent work by Guttman and Elvey Price on the enumeration of Planar Eulerian Orientations [17] waspresented at this workshop.

The Kernel Method

Kernel equations here denote functional equations of a particular form that arise from combinatorial re-cursions. One separate the terms into two classes: on the left hand side is an expression with the completegenerating function, known in some circles as the bulk. One the right hand side, there are functions whichinvolve the boundary or more generally, evaluations of the complete generating function. The kernel deter-mines the bulk behaviour. The two main ways to proceed are to manipulate the equation to either eliminatethe boundary conditions, or to eliminate the bulk.

Page 226: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

222 Five-day Workshop Reports

Andrew Rechnitzer provided a short course on variants of the kernel method for lattice walks, with aview to the study of polymers. He identified the main steps:

• identify a catalytic or auxillary variable to track a useful parameter;

• use a combinatorial recursion to establish a functional equation for the complete generating function;

• rearrange the terms and write the equation in kernel form;

• evaluate the auxiliary variables to either eliminate the kernel, or to eliminate the boundary terms;

• extract the complete generating function.

Remark the two principle variants: the first one uses substitutes solutions to the kernel into the main equa-tion, and manipulates the boundary terms (“eliminate the bulk”); the second uses invariants of the kernelto eliminate the boundary terms (“eliminate the boundary”).

Eliminating the bulk can find explicit generating expressions. In polymer modelling problems, it isused to understand the behaviour for different evaluations of the auxiliary parameter. A typical exampleis contacts with a boundary, well explained by the talk of Aleks Owczarek on three interacting friendlywalks [31]. This is also central to several analytic approaches which we describe below.

Eliminating the boundary using invariants of the kernel has been very useful for lattice path enumera-tion. The finiteness of a subgroup of the kernel invariants is implicated in the classification of the completegenerating function. Bousquet-Melou and Mishna described how to use such invariants to express the com-plete generating function as a sub-series of a Laurent series expansion of a computable rational function.

Analytic combinatorics of several variables

Complex analysis methods to determine asymptoic enumeration formulas for combinatorial classes havebecome standard [20]. Initially the majority of the result came from the singularity analysis of of univariategenerating functions. The multivariate problem is substantially harder, but steady progress over the pasttwo decades has lead to some systematic strategies. Relevant here has been the work developed by Peman-tle, Wilson, Baryshnikov, and coauthors under the name of Analytic Combinatorics in Several Variables,abreviated (ACSV) [28].

Melczer and Mishna [26] noted the applicability of these methods to lattice enumeration. They restrictedto models with significant symmetry, but were able to generalize the strategy to higher dimensions. Melczerand Wilson performed a case by case analysis [27] to determine asymptotic formulas for most of the smallstep 2D models with a finite group. Courtiel et al. [15] used ACSV to determine asymptotic formulas for theweighted Gouyou-Beauchamps model.

Differential Galois Theory

The strategy to eliminate the “bulk” leads to important theoretical results. The key is to understand thatthe kernel is eliminated on an elliptic curve. Charlotte Hardouin gave an excellent course on how to applymethods in differential Galois theory to prove the hypertranscendance of lattice model generating functions.Similar strategies have been used to show the transcendance of the Gamma function.

We first fix the value of t, admittedly a nontraditional strategy for generating function combinatorics.Specifically, let us fix 0 < t < 1 with t /∈ Q.

Page 227: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Lattice walks at the Interface of Algebra, Analysis and Combinatorics 223

We start from Equation (19.0.4)

K(x, y, t)Q(x, y; t) = xy −R(x, t)− S(y, t) + td−1,−1Q(0, 0, t). (19.0.6)

Consider the algebraic curve Et, which is defined as the zero set in P1(C) × P1(C) of the followinghomogeneous polynomial

K(x0, x1, y0, y1, t) = x0x1y0y1 − t2∑

i,j=0

di−1,j−1xi0x

2−i1 yj0y

2−j1 = x2

1y21K(

x0

x1,y0

y1, t). (19.0.7)

This curve is an an elliptic curve. This is the situation studied in [12, 25], and for the Galoisian point of view,see [16].

Thanks to uniformization, we can identify Et with C/Zω1 + Zω2. Specifically, consider the map

C → Etω 7→ (q1(ω), q2(ω)),

where q1, q2 are rational functions of p and its derivative dp/dω, p the Weirestrass function associated withthe lattice Zω1 + Zω2 (cf. [25, Section 3.2]). Therefore one can lift the functions R(x, t) and S(y, t) tofunctions rx(ω) = R(q1(ω), t) and ry(ω) = S(q2(ω), t).

One can deduce from [25, Theorems 3 and 4], that the functions rx(ω) and ry(ω) can be continuedmeromorphically as univalent functions on the universal cover C. Furthermore, for any ω ∈ C, we have

τ(rx(ω))− rx(ω) = b1, where b1 = ι1(q2(ω))(τ(q1(ω))− q1(ω)) (19.0.8)τ(ry(ω))− ry(ω) = b2, where b2 = q1(ω)(ι1(q2(ω))− q2(ω)) (19.0.9)

rx(ω + ω1) = rx(ω) (19.0.10)ry(ω + ω1) = ry(ω). (19.0.11)

where τ is the automorphism of the field of meromorphic functions sending f(ω) onto f(ω + ω3) and ω3 isexplicitely given in [25, Section 3.2].

One can show that rx(ω) is differentially algebraic with respect to ddω over CE , the field of elliptic func-

tions with respect to Et if and only if Q(x, 0, t) is differentially algebraic with respect to ddx over C(x).

The order of the differential operator

In [16], it is shown that if rx(ω) is differentially algebraic, then there is a second order linear differentialoperator L with constant coefficients such that L(rx)−g is an ω3 periodic function, that is, it is left invariantby τ .

For the 51 unweighted walks associated to genus 1 curve and infinite group of the walk, only 9 walks giverise to a differentially algebraic generating function (see [16]). For these walks, it seems that the results of[3] show that there exists g ∈ CE such that rx(ω)− g(ω) is ω3-periodic.

This line of research has great potential, and there remain many natural questions.

Question 19.0.7 For weighted walks associated to genus 1 curve and infinite group of the walk, is it always truethat when rx(ω) is differentially algebraic then there exists g ∈ CE such that rx(ω)− g(ω) is ω3-periodic?

Question 19.0.8 What can be dervied from this approach about the differential dependence in t for the 51unweighted walks associated to genus 1 curve and infinite group of the walk?

Page 228: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

224 Five-day Workshop Reports

Finite group of the walk

Let us consider a walk associated with an elliptic curve Et = C/Zω1 +Zω2 (the curve defined by the kernelequation) and let us assume that this walk has finite group. This is equivalent to the fact that an integermultiple of ω3 above belongs to Zω1 + Zω2. One can ask the following questions:

Question 19.0.9 The prolongation of rx(ω) into a meromorphic function over C as well as the functional equa-tion (19.0.8) obtained in [25] for the 51 unweighted walks associated to genus 1 curve and infinite group of thewalk, depend heavily on the fact that the group of the walk is infinite. Can one still obtain analogous results inthe finite group case?

Question 19.0.10 For weighted walks associated to genus 1 curve and finite group of the walk, is there a generalclosed form for the generating series?

Random walk processes and the critical exponent

Random walk processes are a classic topic of probability. Processes conditioned never to leave the coneare connected to Harmonic functions and Random Matrix theory. The persistance probabilities of randomwalks are linked to statistical physics and population models.

Many results on the expected first exit time from a cone can be converted into asymptotic enumerationresults. The course of Kilian Raschel provided many very clear connections. If the number q(n) of walksof length n grows asymptotically like κρ−nn−α as n tends to infinity, then we call α the critical exponent.Probability results give straightforward access to information on the critical exponents. The classical ex-ponents for 1 dimensional models are 0, 1/2 and 3/2 which depend on the drift. The simple walk in twodimensions is a far richer. This is modelled by the simple N,E, S,W walks in an arbitrary angular sectorθ. Varopoulos, and Denisov and Wachtel give that the exponent for the total number of walks is π/2θ andfor excursions is π/θ + 1. These results imply that the generating functions are not D-finite whenever θ isnot a rational multiple of π.

The non simple models can be similarly analyzed, once the model is suitably transformed. This plan ofattack was used with great success by Bostan, Raschel and Salvy who demonstrated via a case analysis thatfor all small step models, if the group of the step set was infinite, the excursion generating function is notD-finite [7].

Question 19.0.11 Is there a direct connection between the infiniteness of the group and the irrationality ofcritical exponent? If so, does this correspondence hold in higher dimnesions?

In fact, the results of Denisov and Wachtel are for arbitrary dimension, so in theory a similar approachshould work in higher dimension. The first model under consideration is the model known as the 3D Krew-eras model, with step set (1, 1, 1), (−1, 0, 0), (0,−1, 0), (0, 0,−1) restricted to the orthant Z3

≥0. Severalgroups [1, 22] have tried to estimate the critical exponent with a goal of establishing its irrationality.

Question 19.0.12 Is the generating function for the 3D Kreweras model D-finite?

Page 229: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Lattice walks at the Interface of Algebra, Analysis and Combinatorics 225

Extensions of the original problem

Non convex regions

Non convex regions require a modified analysis. The 3/4 plane is the next candidate for a systematic study.Bousquet-Melou has launched this for the simple steps [13], and has demonstrated that, while more techni-cal, many of the same basic strategies can apply.

The most recent advances on this problem were presented by Trotignon during the meeting. Rascheland Trotignon have expressed the generating function for simple walks in the 3/4 plane as the solution to aboundary value problem [29].

Winding numbers

The majority of enumerative works on lattice walks to date use a last step recursive decomposition: Awalk is either empty, or it is a walk plus a step. Timothy Budd described a very promising alternativedecomposition [14]. He considers models with step sets that have a high amount of symmetry, and coneswhose boundaries on the angles that are integer multiples of π/4. He considers a new decomposition forwalks as a sequence of excursions. The work offers explicit formulas for 2D walks in a cone which avoid aboundary except at the start, and at the end. He considers the generating function where length and windingangle of the walk are tracked. The winding angle is the angle formed by the first and last step. This set upis sufficient to determine an explicit formula for the usual Gessel walks in the first quadrant. He gives someclear criteria for algebraicity, and the D-finiteness of the models that he considers is easy to establish.

This work raises a number of questions, and his intermediary objects may be useful to determine thecombinatorics behind the classification of the generating functions.

Question 19.0.13 Can his argument deduce the generating function for simple walks in the 3/4 plane trackingwinding number? How does it compare with the work of Raschel and Trotignon?

New Collaborations

Many participants have reported new collaborations as a result of this workshop, as per the intended goal.Several participants have launched collaborations to extend, and address many of the questions that werelisted here. Several participants noted the quality of the open questions presented, and the applicability oftechniques across domains.

Of particular note is an entirely new seminar series that was created at the historic Institut HenriPoincare (IHP): The Groupe de travail autour des marches dans le quart de plan1 is organized by LuciaDi Vizio and Alin Bostan.

Michel Drmota and Andrew Rechnitzer have begun discussions on analytic properties of solutions ofsystems of functional equations.

Thomas Dreyfus reported several new collaborations, including new work on the algebraic independenceof Mahler functions with Jason Bell, Boris Adamscewski and Michael Singer; and a new collaboration withCarlos Arreche on the elliptic hypergeometric functions.

Mark Wilson noted a boost to several projects in progress with Stephen Melczer.

1https://divizio.joomla.com/seminaires-et-gdt/gdt-marches-dans-le-quart-de-plan

Page 230: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

226 Five-day Workshop Reports

Miklos Bona started a new collaboration with Jay Pantone at Dartmouth college on the log-convexity ofcertain permutation classes. They have used lattice paths to settle one of the special cases. Some of theseclasses are counted by generating functions that are not algebraic, but differentiably finite, and Miklos Bonais trying to use what he learned from the talk of Bruno Salvy about such functions.

Jason Bell and Marni Mishna considered some of the questions on the cogrowth problem raised by IgorPak. Their work resulted in a preprint [2].

Michael Singer and Charlotte Hardouin have made progress towards Open Problem 6.

Participants

Adamczewski, Boris (CNRS, Lyon)Arreche, Carlos (The University of Texas at Dallas)Bagherzadeh, Fatemeh (University of Saskatchewan)Banderier, Cyril (Paris 13 University)Bell, Jason (University of Waterloo)Bona, Miklos (University of Florida)Bostan, Alin (INRIA)Budd, Timothy (Universite Paris-Saclay)Chen, Shaoshi (Chinese Academy of Sciences)Chyzak, Frederic (INRIA)Corteel, Sylvie (Berkeley)Courtiel, Julien (Universite de Caenn)Di Vizio, Lucia (Universite de Versailles-St Quentin)Dreyfus, Thomas (Universite Lyon 1)Drmota, Michael (Technische Universitaet Wien)Gessel, Ira (Brandeis University)Greenwood, Torin (Georgia Institute of Technology)Guo, John (Simon Fraser University)Guttmann, Tony (University of Melbourne)Hardouin, Charlotte (Institut de mathematiques de Toulouse)Huang, Hui (Johannes Kepler University Linz)Kauers, Manuel (Johannes Kepler University)Koutschan, Christoph (Austrian Academy of Sciences)Krattenthaler, Christian (University of Vienna)Labahn, George (University of Waterloo)Lipshitz, Leonard (Purdue University)Melczer, Stephen (University of Pennsylvania)Mishna, Marni (Mathematics, Simon Fraser University)Owczarek, Aleks (The University of Melbourne)Pak, Igor (University of California Los Angeles)Prellberg, Thomas (Queen Mary University of London)Raschel, Kilian (CNRS / Universite de Tours)Rechnitzer, Andrew (University of British Columbia)Salvy, Bruno (INRIA)Simon, Samuel (Simon Fraser University)Singer, Michael (North Carolina State University)Tevlin, Lenny (New York University)Trotignon, Amelie (Universite de Tours)Wallner, Michael (TU Wien)Wilson, Mark (University of Auckland)

Page 231: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] Axel Bacher, Manuel Kauers, and Rika Yatchak. Continued classification of 3d lattice walks in the positive octant.arXiv preprint arXiv:1511.05763, 2015.

[2] Jason Bell and Marni Mishna. On the complexity of the cogrowth sequence. arXiv:1805.08118 [math.CO], 2018.

[3] Olivier Bernardi, Mireille Bousquet-Melou, and Kilian Raschel. Counting quadrant walks via tutte’s invariantmethod. arXiv preprint arXiv:1708.08215, 2017.

[4] A. Bostan and M. Kauers. Automatic classification of restricted lattice walks. In Proceedings of FPSAC 2009,Discrete Math. Theor. Comput. Sci. Proc., AK, pages 201–215, 2009.

[5] A. Bostan and M. Kauers. The complete generating function for Gessel walks is algebraic. Proc. Amer. Math. Soc.,138(9):3063–3078, 2010. With an appendix by Mark van Hoeij.

[6] A. Bostan and K. Raschel. Compter les excursions sur un echiquier. Pour la Science, (449):40 – 46, 2015.

[7] A. Bostan, K. Raschel, and B. Salvy. Non-D-finite excursions in the quarter plane. J. Comb. Theory, Ser. A,121(0):45–63, 2014.

[8] Alin Bostan. Calcul formel pour la combinatoire des marches. https://specfun.inria.fr/bostan/HDR.pdf, 12 2017.Habilitation a Diriger des Recherches.

[9] Alin Bostan, Mireille Bousquet-Melou, Manuel Kauers, and Stephen Melczer. On 3-dimensional lattice walksconfined to the positive octant. Ann. Comb., 20(4):661–704, 2016.

[10] Alin Bostan, Mireille Bousquet-Melou, and Stephen Melczer. Counting walks with large steps in an orthant.Preprint.

[11] Alin Bostan, Frederic Chyzak, Mark van Hoeij, Manuel Kauers, and Lucien Pech. Hypergeometric expressionsfor generating functions of walks with small steps in the quarter plane. European J. Combin., 61:242–275, 2017.

[12] M. Bousquet-Melou and M. Mishna. Walks with small steps in the quarter plane. In Algorithmic Probability andCombinatorics, volume 520 of Contemp. Math., pages 1–40. Amer. Math. Soc., 2010.

[13] Mireille Bousquet-Melou. Square lattice walks avoiding a quadrant. J. Combin. Theory Ser. A, 144:37–79, 2016.

[14] Timothy Budd. Winding of simple walks on the square lattice. preprint.

[15] J. Courtiel, S. Melczer, M. Mishna, and K. Raschel. Weighted lattice walks and universality classes. J. Combin.Theory Ser. A, 152:255–302, 2017.

[16] Thomas Dreyfus, Charlotte Hardouin, Julien Roques, and Michael F. Singer. On the nature of the generating seriesof walks in the quarter plane. Invent. Math., 213(1):139–203, 2018.

[17] Andrew Elvey-Price and Anthony J. Guttmann. Counting planar Eulerian orientations. European J. Combin.,71:73–98, 2018.

[18] G. Fayolle, R. Iasnogorodski, and V. Malyshev. Random walks in the quarter-plane, volume 40 of Applications ofMathematics (New York). Springer-Verlag, Berlin, 1999.

[19] G. Fayolle and K. Raschel. Some exact asymptotics in the counting of walks in the quarter plane. In Proceedingsof AofA’12, Discrete Math. Theor. Comput. Sci. Proc., AQ, pages 109–124, 2012.

[20] P. Flajolet and R. Sedgewick. Analytic Combinatorics. Cambridge University Press, 2009.

[21] I. Gessel. A probabilistic method for lattice path enumeration. J. Statist. Plann. Inference, 14(1):49–58, 1986.

227

Page 232: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

228 Five-day Workshop Reports

[22] Anthony J Guttmann. Series extension: predicting approximate series coefficients from a finite number of exactcoefficients. Journal of Physics A: Mathematical and Theoretical, 49(41):415002, 2016.

[23] M. Kauers, C. Koutschan, and D. Zeilberger. Proof of Ira Gessel’s lattice path conjecture. Proc. Nat. Acad. Sci.USA, 106(28):11502–11505, 2009.

[24] G. Kreweras. Sur une classe de problemes lies au treillis des partitions d’entiers. Cahiers du B.U.R.O., 5–105,1965.

[25] I. Kurkova and K. Raschel. On the functions counting walks with small steps in the quarter plane. Publ. Math.Inst. Hautes Etudes Sci., 116:69–114, 2012.

[26] S. Melczer and M. Mishna. Asymptotic lattice path enumeration using diagonals. Algorithmica, 75:782–811, 2016.

[27] S. Melczer and M. Wilson. Asymptotics of lattice walks via analytic combinatorics in several variables. InProceedings of FPSAC’16, Discrete Math. Theor. Comput. Sci. Proc., 2016.

[28] R. Pemantle and M. Wilson. Analytic combinatorics in several variables, volume 140 of Cambridge Studies inAdvanced Mathematics. Cambridge University Press, Cambridge, 2013.

[29] Kilian Raschel and Amelie Trotignon. On walks avoiding a quadrant. arXiv preprint arXiv:1807.08610, 2018.

[30] B. Salvy and P. Zimmermann. GFUN: a maple package for the manipulation of generating and holonomic func-tions in one variable. ACM Trans. Math. Softw., 20(2):163–177, 1994.

[31] R Tabbara, AL Owczarek, and A Rechnitzer. An exact solution of three interacting friendly walks in the bulk.Journal of Physics A: Mathematical and Theoretical, 49(15):154004, 2016.

Page 233: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 20

Symmetries of Surfaces, Maps and Dessins(17w5162)

September 24 - 29, 2017Organizer(s): Marston Conder (University of Auckland, New Zealand), Nigel Boston(University of Wisconsin at Madison, USA), Gabino Gonzalez-Diez (Universidad Autonomade Madrid, Spain), Gareth Jones (University of Southampton, UK), Thomas Tucker (Col-gate University, USA)

Introduction

The main objective of this workshop was to bring together a number of leading and emerging researchers in the areaof study of actions of discrete groups on Riemann and Klein surfaces and algebraic curves, and related topics such assymmetric embeddings of graphs and dessins d’enfants on surfaces. It followed up on a meeting held at the CIEM inSpain in 2010.

The last two decades have seen a burgeoning of activity in these fields, with various strands coming together, exploit-ing the linkages established by Belyi and Grothendieck, and some increasingly useful techniques from combinatorial andcomputational group theory. In particular, computational experiments and searches have produced a wealth of exam-ples (either for small genera or infinite families of a particular type), and these serve as a useful test-bed for conjecturesand potential new approaches.

Workshop format

The workshop began with introductory lectures, one on each of the eight themes, presented by experts in the area. Theseaddressed recent developments and described important open problems, and possible approaches to answering them.Then the participants formed groups to work separately on the first four themes from late morning on the Tuesday tolunchtime on the Wednesday, and rearranged into new groups to work on the other four themes from the Wednesdayafternoon to the Friday morning.

Also during the week an opportunity was given for open problems to be presented, and ten short talks were pre-sented by workshop participants on topics related to the workshop themes, on the Monday, Tuesday and Wednesdayafternoons. A conference excursion was made to Lake Louise on the Thursday afternoon, and enjoyed by all who tookpart in that.

229

Page 234: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

230 Five-day Workshop Reports

Themes

Introductory talks on the main themes of the workshop were given on the first two days, as follows:

• Theme 1: Regular and edge-transitive maps,introduced by Marston Conder and Tom Tucker

• Theme 2: Dessins d’enfants, Belyi’s theorem and Galois action,introduced by Gabino Gonzalez-Diez and Ernesto Girondo

• Theme 3: Defining equations for Riemann surfaces,introduced by Allen Broughton and Aaron Wootton

• Theme 4: Polytopes, hypertopes, maps and maniplexes,introduced by Dimitri Leemans

• Theme 5: Cayley maps and skew morphisms,introduced by Robert Jajcay

• Theme 6: n-gonal automorphisms of Riemann surfaces,introduced by Mariela Carvacho

• Theme 7: External symmetries of regular maps,introduced by Jozef Siran

• Theme 8: Pseudo-real surfaces, and large automorphism groups of surfaces,introduced by Javier Cirre and Grzegorz Gromadzki.

Working groups

Below we report on the activities of the eight working groups that were formed during the workshop. The first fourconvened early in the week, with two of them merging together, and the last four convened in the second half of theworkshop.

Regular and Edge-Transitive Maps

[Workshop leaders: Marston Conder, Jozef Siran, and Tom Tucker]

By a ‘map’ we mean a 2-cell embedding of a connected graph or multigraph on a closed surface (which may be orientableor non-orientable). Such a map M is edge-transitive if its automorphism group Aut(M) acts transitively on edges.Graver and Watkins [5] introduced a classification of such maps into 14 types, according to the local effects of certaingenerators of the automorphism group. Then Siran, Tucker and Watkins [10] showed how to construct orientable mapsof each type from certain presentations satisfying an ‘index two subgroup and a ‘forbidden automorphism’ condition,and used this construction to give finite orientable maps of each type with Aut(M) ∼= Sn for certain n.

Recently Jones [7] extended this work to surfaces with boundary, and among other things showed that all but a smallnumber of simple groups occur as the automorphism group of various edge-transitive maps of all 14 types (with all ofthese maps being chiral or non-orientable). Gareth Jones was a co-organiser of this workshop, but sadly was unable toattend, and so we made available to participants a copy of a talk that he gave on edge-transitive maps at a BIRS meetingin Oaxaca in August 2017.

There have been only a few papers on edge-transitive maps published since 2001. Many questions have been sug-gested by some of the above people and others for further study: classifying all such maps for a given group or family ofgroups, or on a given surface, or for a given underlying graph or family of graphs; determining the relative frequencyof each type; and seeking generalisations, say to polytopes.

At this workshop, the working group spent time reviewing the 14 types of edge-transitive maps, including importantdetails about (a) the automorphism group G = Aut(M) with marked generators as a quotient of some universal groupU for the appropriate type, (b) the image G+ of the orientation-preserving subgroup U+ of U and its meaning for ori-entability of the map, (c) the nature of forbidden automorphisms permuting the marked generators and their inverses,and (d) the roles of duality and Petrie duality.

Page 235: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Symmetries of Surfaces, Maps and Dessins 231

The working group then focused on one of the open problems described in the opening presentation on the first day,namely Classifying the edge-transitive maps for which the subgroup G+ is abelian. This problem turned out to be agreat deal more complicated than the presenters had thought, but was highly illuminating. The discussion concentratedmostly on a small number of the 14 types, and eventually led to the discovery that there are no ET maps with abelianG+ for six of the 14 types, and there do exist such maps for six of the other eight types. Two of the 14 types remainedunresolved.

This problem provided an excellent introduction to the study of edge-transitive maps, and clarified a number ofissues for the main presenters — indeed so much so that a publication on the findings is likely (once the investigationsare completed).

Belyi Theory

[Workshop leaders: Allen Broughton, Gabino Gonzalez-Diez and Aaron Wootton]

This theme was combined with ‘Defining Equations for Surfaces’ — see the next subsection.

Defining Equations for Surfaces

[Workshop leaders: Allen Broughton, Gabino Gonzalez-Diez and Aaron Wootton]

(a) Defining a surface.In both of these themes, each surface S of interest typically has either a suitably ‘large’ group of automorphisms G,with quotient surface S/G having genus 0, or a ‘nicely structured’ branched covering π : S → P1 (n-gonal morphism).Such surfaces may be constructed in two different ways.

(b) Defining equations.Let f1, . . . , fN−1 be homogeneous polynomials in X ∈ PN of varying degrees. In general, the algebraic variety

V (f1, . . . , fN−1) =X ∈ PN : f1(X) = · · · = fN−1(X) = 0

(20.0.1)

will be an irreducible, complex algebraic curve, and its normalisation will be a smooth surface. The desirable case is aplane curve where N = 2.

(c) Group actions and monodromy groups.A conformal G-action on S is described by the set of points Q1, . . . , Qt ⊂ P1 over which π : S → S/G = P1 isramified, and a t-tuple (c1, . . . , ct) of elements of G, called a generating vector, satisfying

G = 〈c1, . . . , ct〉 and c1c2 · · · ct = 1. (20.0.2)

The t-tuple (c1, . . . , ct) is determined by the monodromy of π : S → S/G.

More generally, given any branched cover π : S → P1 of degree n, ramified over Q1, . . . , Qt, a transitive mon-odromy group Mπ ≤ Σn is determined by a monodromy system (p1, . . . , pt):

Mπ = 〈p1, . . . , pt〉 with p1p2 · · · pt = 1. (20.0.3)

The cycle structures of the permutations pi are determined by the cycle structures of the exceptional fibres π−1(Qi).Conversely, given points Q1, . . . , Qt and systems as in (20.0.2) or (20.0.3), a branched covering surface S may bedefined. This is a typical starting point for defining S, and we want to determine a defining equation for S as in (20.0.1).

(d) Quasi-platonic and Belyi surfaces.Currently there is intense interest in quasi-platonic surfaces, Belyi surfaces, and their dessins d’enfant. Quasi-platonicsurfaces form the G-action case with t = 3, and Belyi surfaces include all monodromy groups with t = 3. Both types of

Page 236: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

232 Five-day Workshop Reports

surfaces have equations defined over number fields, and the absolute Galois group acts on both the surfaces and theirdessins.

(e) Questions.At the workshop, the following questions were posed:

• Given a group action ofG or monodromy groupMπ with generating systems as in (20.0.2) and (20.0.3), is there analgorithm to find defining equations as in (20.0.1)? The cyclic case is easy and well known. In the important Belyisurface case, Monien [9] and Voight et al [12] have developed computationally intensive methods for determiningequations.

• In the quasi-platonic case, can the G-action be used to speed up computation in the methods of Monien andVoight? Are there estimates for the running time of these algorithms?

• Can a set of defining equations as (20.0.1) be found so that the G-action is induced by a linear action of G on theambient space PN? See [11].

• What are the defining equations of a curve S with cyclic group of automorphisms G such that S/G has genus 1?This generalises the cyclic n-gonal case.

(f) Discussion, outcomes, and further work.The easy case of cyclic n-gonal surfaces was presented and discussed. The easy examples of explicit equations of highergenus Belyi surfaces have almost all been determined and exploited. Further work on Galois action on dessins needs agood library of examples.

Hartmut Monien and John Voight gave extended presentations of their work (as mentioned above) on constructing adefining equation for any monodromy triple given in (20.0.3). The monodromy triple is used to compute a fundamentalregion for a Fuchsian group Π such that H/Π ∼= S. One then works with modular forms for Π, using either numericallinear algebra or numerical PDEs.

The working group enthusiastically supported the future construction of a widely available data base of equationsfor those surfaces of low genus or those defined by monodromy systems (generating vectors) of interesting groups.

Maniplexes and Incidence Geometries

[Workshop leader: Dimitri Leemans]

This working group aimed to gain a better understanding of the link between maniplexes and incidence geometries, andto develop a kind of ‘dictionary’ that would permit people working in these fields to understand each other’s research.

Steve Wilson introduced maniplexes as a means of constructing maps, either by drawing a structure on a surface,or by assembling polygons (just as a 4-cube can be constructed by gluing 3-cubes along faces).

An (n+ 1)-maniplex is a set Ω of flags, together with an ordered set R = [r0, . . . , rn] of sets of pairs of flags, whereeach ri may be seen as an involution permuting the flags. Then for 0 ≤ i < n, an i-face is a connected component ofRi = ∪j 6=irj , and the 0-faces are called the vertices, the 1-faces called the edges, and the n-faces called the facets. Inparticular, each facet is itself a maniplex.

The flag-graph of any polytope is a maniplex, but there are many other kinds of examples. A 1-maniplex is just aset Ω of cardinality 2 (and is the flag graph of the graph K2, while a 2-maniplex is a polygon, and a 3-maniplex may beviewed as a transitive action of a non-degenerate string group.

The working group focused on understanding maniplexes as chamber systems of incidence geometries obtainedfrom polytopes. The group also worked on ways of removing the ‘string’ condition from the definition of maniplexes,and developing a notion of hyperplexes which could generalise hypermaps. This work is still in progress, and is likely toenable people working on maniplexes to exploit research on incidence geometries, and vice-versa.

Cayley Maps and Skew Morphisms

Page 237: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Symmetries of Surfaces, Maps and Dessins 233

[Workshop leader: Robert Jajcay]

A skew-morphism of a group A is a permutation ϕ of A preserving the identity and a function π : A → Z|ϕ|, calledthe power function associated with ϕ, satisfying the property ϕ(ab) = ϕ(a)ϕπ(a)(b) for all a, b ∈ A. Skew-morphismswere originally introduced for the study of regular Cayley maps, but eventually became significant in the study ofcomplementary cyclic group extensions.

An orientable map M is called orientably-regular if for every pair of arcs of M there exists an orientation-preserving automorphism ofM that takes the first arc to the second. A Cayley map CM(A,X, p) is an embedding of aconnected Cayley graph C(A,X), that has the same local orientation p at each vertex. All left multiplications within theCayley group A induce automorphisms of the Cayley map, and many of the well-known families of orientably-regularmaps turn out to be Cayley maps. A Cayley map CM(A,X, p) is regular if and only if there exists a skew-morphismϕ of A with the property that ϕ(x) = p(x) for all x ∈ X . Thus regular Cayley maps on A correspond to orbits ofskew-morphisms of A that generate A and are closed under inverses.

In the context of group extensions, if A is a group and ϕ is a skew-morphism of A, then the skew-product of Aand 〈ϕ〉 is defined as G = (A × 〈ϕ〉 , ∗) under the operation (a, ϕi) ∗ (b, ϕj) = (aϕi(b), ϕσ(i,j)), for a suitably definedfunction σ(i, j). This product is a group, with a complementary factorisation G = AY . Conversely, if G is any finitegroup with a complementary factorisation G = AY where Y is cyclic, then the quotient G = G/CoreG(Y ) is a skewproduct group associated with the skew morphism ϕ. Hence the classification of skew-morphisms of a finite group Aallows for classification of all regular Cayley maps for A, as well as all complementary extensions of a group G in theform G = AY with Y cyclic.

Much effort has been devoted recently to classifying skew-morphisms of various classes of groups. Classificationshave been achieved for finite abelian groups of odd prime-power order, elementary abelian 2-groups, and groups whoseorder is a product of two primes. (Various product results have also been found.) It was announced at the workshopthat the classification of skew-morphisms of finite dihedral groups has been completed. The classification for all abelianfinite groups now seems within reach. On the other side of the spectrum, the classification of skew-morphisms of finitesimple groups was also announced.

Also discussed at the workshop was a generalised definition of skew-morphisms, related to the concept of a gen-eralised Cayley map (which is a map admitting a group of automorphisms that acts regularly on vertices). A suitabledefinition has been found, and basic properties have been determined. The concept appears to be related to the theoryof 2-extensions of groups.

n-gonal Surfaces

[Workshop leaders: Mariela Carvacho and Aaron Wootton]

An n-gonal morphism of a Riemann surface S is map φ : S → P1 of degree n, and an n-gonal surface is one that exhibitssuch a φ. When φ is regular, we say that S is regular with group of deck transformations deck(φ) = G. Literature onn-gonal morphisms is scant, most being limited to regular morphisms with G = 〈σ〉 cyclic, and the so-called cyclicn-gonal surfaces) Most of the discussion at and after this workshop focused on generalising known theory of cyclicn-gonal surfaces to other regular n-gonal surfaces.

(a) The cyclic n-gonal case.A complete picture is known for cyclic n-gonal surfaces and their automorphism groups when n is a prime number.This was made possible through a consequence of the Castelnuovo–Severi theorem, which states that if the genus g ofa cyclic p-gonal surface S satisfies g > (p − 1)2, then G = 〈σ〉 is normal in the full automorphism group of S. Whenn is not prime, additional assumptions on the automorphism group, or on the branching data of the quotient mapφ : S → S/〈σ〉 = P1 yield a similar inequality. In particular, a similar complete description appears to be tractable.

(b) Strong branching.Much of the above theory depends on the concept of ‘strong branching’ and the normality ofG. A covering f : S1 → S2

of Riemann surfaces of degree n is said to be strongly branched if g1 > n2g2 + (n − 1)2, where gi is the genus of Si

Page 238: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

234 Five-day Workshop Reports

(for i = 1, 2). Accola proved that when f is a strongly branched regular map with deck(f) = G and G is simple, thenGCAut(S1).

(c) Questions.At the workshop, the following questions were posed:

• What conditions can be imposed on G to ensure that GCAut(S), and in particular, what weakening of ‘stronglybranched’ can be used?

• Given a regular n-gonal surface S, how can Aut(S) be computed for non-normal G?

(d) Discussion, outcomes and further work.

• It was proved that if the core of G in Aut(S) is trivial, then GCAut(S) when φ is strongly branched.• It was agreed that a list should be built of examples of pairs G < Aut(S) where S/G has genus 0, to help provide

a conjectural picture.• Kay Magaard explained the work of Guralnick and Thompson on monodromy groups of rational maps. It was

envisioned that this could be used as a tool in answering the second question in (c) above.

External Symmetries of Regular Maps

[Workshop leader: Jozef Siran]

A map M on a surface is called regular if its automorphism group Aut(M) acts transitively on incident vertex-edge-face triples. Every such map has constant valency m and constant face-size `, and this pair of numbers definesthe type of M . Also every such M may be identified with a presentation for its automorphism group G in the formG = 〈 a, b, c | a2 = b2 = c2 = (ab)` = (bc)m = (ca)2 = . . . = 1 〉, as a quotient of the full (`,m, 2)-triangle group, anddenoted by the symbol M(G; a, b, c).

This working group explored duality, Petrie-duality and combinations of these, as well as exponents for self-invarianceunder the Coxeter ‘hole’ operators. Together, these concepts are sometimes colloquially called external symmetries ofregular maps, and a regular map of type (`,m) is said to be kaleidoscopic if it admits all exponents in the multiplicativegroup of units mod m.

A regular map M(G; a, b, c) is self-dual if its group G admits an automorphism that swaps a with c while fixing b,and self-Petrie-dual if G admits an automorphism that swaps a with ac while fixing b and c, and kaleidoscopic if forevery unit j modm the groupG has an automorphism that fixes a and c while taking b to (bc)jc. Then further,M is saidto have trinity symmetry (or to be completely self-dual, or ‘self-everything’) if it is both self-dual and self-Petrie-dual,and to be super-symmetric if it is kaleidoscopic and has trinity symmetry as well.

In his introductory lecture, Jozef Siran cited four research-driving questions:

(a) Does there exist a completely self-dual regular map of valency n for every odd n ≥ 5?(b) Does there exist a kaleidoscopic completely self-dual regular map of valency n for every odd n ≥ 5?(c) What kind of structure has the external symmetry group of a kaleidoscopic completely self-dual regular map?(d) Is it true that for every m ≥ 3 and every subgroup U of C∗m × C2, there exists a non-orientable regular map of

valency m with exponent group U?Affirmative answers to the first two questions for all even valencies are already known, and the analogue of (d) in

the orientable case was shown to be true by Conder and Siran in [3]. Also for valency 8 it was also shown by Conder,Kwon and Siran [4] that the order of the external symmetry group can be larger than any pre-assigned positive integer.

The working group focused on Question (a). Because explicit generating triples of involutions can be found in thecase whereG = PSL(2, q) orG = PGL(2, q), it was suggested that suitable candidates for completely self-dual regularmaps of arbitrary valency m ≥ 5 should be sought in these families of groups. Marston Conder checked and confirmedthe feasibility of such an approach for odd valencies 5 to 17 (with the help of the MAGMA computer system [2]), and heand Steve Wilson and Jozef Siran made further suggestions about how this problem might be approached algebraically.The group also had a brief discussion about a possible approach to Question (b), using parallel products (which weredeveloped several years ago by Steve Wilson and recently re-considered by Gareth Jones).

Some of the participants in this working group will take these investigations further.

Page 239: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Symmetries of Surfaces, Maps and Dessins 235

Pseudo-real Riemann Surfaces, and Large Group Actions on Surfaces

[Workshop leaders: Javier Cirre and Grzegorz Gromadzki]

A compact Riemann surface X is said to be pseudo-real if it admits anti-conformal automorphisms but no suchautomorphisms of order 2. Any such surface lies in the so-called real moduli of the moduli space of compact Riemannsurfaces, but cannot be defined by real polynomials. There is no pseudo-real surface of genus 0 or 1, but it has beenknown since 2010 that there exist pseudo-real surfaces of genus g for every g ≥ 2.

An important aspect of current research on pseudo-real surfaces has to do with the largest order M(g) of the fullautomorphism group of a pseudo-real surface of genus g ≥ 2. It is known that M(g) ≤ 12(g − 1), and that this upperbound is attained for infinitely many values of g, but before this workshop, relatively little was known about lowerbounds for M(g). Accordingly, several questions regarding potential lower bounds for M(g) were considered duringthe workshop. It was proved that M(g) ≥ 2g for all even g, and that M(g) ≥ 4(g − 1) for all odd g. This left thequestion of sharpness of these lower bounds.

In some work that took place after the workshop, a proof of sharpness of the bound M(g) ≥ 4(g− 1) for odd g wasobtained for a large (and likely infinite) set S of odd values of g. Sadly a similar theorem for even genera seems well outof reach at this stage.

In the case of conformal actions on all Riemann surfaces, the values of the corresponding parameter M(g) areknown to lie between 8(g + 1) and 84(g − 1), by the celebrated theorems of Accola–Maclachlan [1, 8] and Hurwitz [6]respectively. The precise values of M(g) are known for 2 ≤ g ≤ 300 and for some particular series of values for g, butdespite a lot of work on the topic, this function is still not well understood.

Accordingly, instead of looking at individual values of M(g), it makes sense to study the asymptotic properties ofthe function M , via the set Ad of accumulation points of the set A of values of the ratio M(g)/g. It was known beforethe workshop that for the second derived set A(2) = (Ad)d, we have 12 ∈ A(2) ⊆ 8, 12, and during the workshop wesucceeded in essentially reducing the problem of deciding whether or not 8 ∈ A(2) to deciding whether or not certainvery particular Belyi actions exist.

Articles on both of the above topics are planned to be written.

Short talks

The following 20-minute talks on topics related to the workshop themes were given by workshop participants, on theMonday, Tuesday and Wednesday afternoons:

• Jen Paulhus: A database of group actions• Alina Vdovina: Buildings and generalisations of dessins• Dimitri Leemans: Almost simple groups and polytopes• Alexander Zvonkin: Diophantine invariants of dessins d’enfants• Becca Winarski: Homomorphisms between mapping class groups of surfaces• Roman Nedela: Skew morphisms of cyclic groups and complete regular dessins• Charles Camacho: Counting quasiplatonic cyclic group actions of order n• Shaofei Du: Nilpotent primer hypermaps with hypervertices of prime valency• Milagros Izquierdo: Dessins d’enfants and a curve of Wiman• Dimitri Leemans: Almost simple groups and polytopes• David Torres: Teichmuller curves and Hilbert modular surfaces.

Organisers

Page 240: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

236 Five-day Workshop Reports

• Marston Conder (University of Auckland, New Zealand)• Nigel Boston (University of Wisconsin at Madison, USA)• Gabino Gonzalez-Diez (Universidad Autonoma de Madrid, Spain)• Gareth Jones (University of Southampton, UK)• Thomas Tucker (Colgate University, USA)— helpfully assisted by• Allen Broughton (Rose-Hulman Institute of Technology, USA)• Aaron Wootton (University of Portland, USA)

Participants

• James Anderson (University of Southampton, UK)• Antonio Breda d’Azevedo (University of Aveiro, Portugal)• Allen Broughton (Rose Hulman Institute of Technology, USA)• Emilio Bujalance (Universidad Nacional de Educacion a Distancia, Spain)• Charles Camacho (Oregon State University, USA)• Mariela Carvacho (UTFSM Valparaiso, Chile)• Javier Cirre (Universidad Nacional de Educacion a Distancia, Spain)• Marston Conder (University of Auckland, New Zealand)• Shaofei Du (Capital Normal University, China)• Jane Gilman (Rutgers University - Newark, USA)• Ernesto Girondo (Universidad Autonoma de Madrid, Spain)• Gabino Gonzalez-Diez (Universidad Autonoma de Madrid, Spain)• Grzegorz Gromadzki (University of Gdansk, Poland)• Asia Ivic Weiss (York University, Canada)• Milagros Izquierdo (University of Linkoping, Sweden)• Robert Jajcay (Comenius University Bratislava, Slovakia)• Young Soo Kwon (Yeungnam University, South Korea)• Dimitri Leemans (Universite Libre de Bruxelles, Belgium)• Stephen Lo (University of Auckland, New Zealand)• Kay Magaard (University Birmingham, UK)• Claude Marion (Universita degli Studi di Padova, Italy)• Hartmut Monien (Universitat Bonn, Germany)• Joy Morris (University of Lethbridge, Canada)• Roman Nedela (University of West Bohemia, Czech Republic)• Jen Paulhus (Grinnell College, USA)• Jozef Siran (Open University, UK)• Martin Skoviera (Comenius University Bratislava, Slovakia)• Klara Stokes (University of Skovde, Sweden)• David Torres (University of Saarbrucken, Germany)• Thomas Tucker (Colgate University, USA)• Alina Vdovina (Newcastle University, UK)• John Voight (Dartmouth College, USA)• Anthony Weaver (Bronx Community College, CUNY, USA)• Steve Wilson (Northern Arizona University, USA)• Becca Winarski (University of Wisconsin - Milwaukee, USA)• Aaron Wootton (University of Portland, USA)

Page 241: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Symmetries of Surfaces, Maps and Dessins 237

• Alexander Zvonkin (Universite de Bordeaux, France).

Acknowledgements

On behalf of all the participants, the organisers wish to thank the Banff International Research Station and the BanffCentre, and especially the people in those organisations who helped make this workshop possible and contributed to itssuccess.

Page 242: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] R.D.M. Accola, On the number of automorphisms of a closed Riemann surface, Trans. Amer. Math. Soc. 131(1968), 398–408.

[2] W. Bosma, J. Cannon and C. Playoust, The MAGMA algebra system. I. The user language, J. Symbolic Comput. 24(1997), 235–265.

[3] M. Conder and J. Siran, Orientably-regular maps with given exponent group, Bull. Lond. Math. Soc. 48 (2016),1013–1017.

[4] M.D.E Conder, Y.S. Kwon and J. Siran, On external symmetry groups of regular maps. In Rigidity and symmetry,Fields Inst. Commun. 70, 87–96, Springer, New York, 2014.

[5] J.E. Graver and M.E. Watkins, Locally finite, planar, edge-transitive graphs, Mem. Amer. Math. Soc. 126 (1997),no. 601, vi+75 pp.

[6] A. Hurwitz, Uber algebraische Gebilde mit eindeutigen Transformationen in sich, Math. Ann. 41 (1892), 403–442.

[7] G.A. Jones, Automorphism groups of edge-transitive maps, preprint.

[8] C. Maclachlan, A bound for the number of automorphisms of a compact Riemann surface, J. London Math. Soc.44 (1969), 265–272.

[9] H. Monien, The sporadic group J2, Hauptmodul and Belyi map, preprint (2017, arXiv:1703.05200).

[10] J. Siran, T.W. Tucker and M.E. Watkins, Realizing finite edge-transitive orientable maps, J. Graph Theory 37(2001), 1–34.

[11] David Swinarski, Equations of Riemann surfaces with automorphisms, preprint (2016, arXiv:1607.04778).

[12] M. Klug, M. Musty, S. Schiavone and J. Voight, Numerical calculation of three-point branched covers of the pro-jective line, preprint (2014, arXiv:1311.2081).

238

Page 243: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 21

p-adic cohomology and arithmeticapplications (17w5118)

October 1 - 6, 2017Organizer(s): Ambrus Pal (Imperial College London), Tomoyuki Abe (Kavli Institute forthe Physics and Mathematics of the Universe), Kiran Kedlaya (University of California,San Diego), Christopher Lazda (Universita di Padova)

Overview

The subject of p-adic cohomology, a Weil cohomology theory for algebraic varieties defined over fields of positive char-acteristic p, has a long and distinguished history. It began with Dwork’s proof in [Dwo60] of one of the most importantresults in arithmetic geometry, the rationality of zeta functions of algebraic varieties defined over finite fields, originallyconjectured by Weil. This proof in fact predated that given by Artin and Grothendiek, and, very surprisingly at the thetime, used p-adic analytic methods. Soon the cohomology theory that was hiding behind these methods was identified byMonsky and Washnitzer in [MW68]. At the same time, Grothendieck and Berthelot introduced crystalline cohomology[Ber74,Gro68], motivated by the desire to capture p-torsion phenomena in the cohomology of varieties in characteristicp. These two approaches were later unified, at least rationally, via Berthelot’s theory of rigid cohomology [Ber97b].After a period of intensive work, due to the efforts of Kedlaya, Caro, and others, the theory reached a new level of so-phistication with the proof of Grothendieck’s six functors formalism, replicating those properties of `-adic cohomologywhich occupied Grothendieck and his school for many years, and considered as a central achievement in arithmeticgeometry.

While a major driving factor in the subject is often expected analogies with other cohomology theories, such as`-adic etale cohomology or algebraic de Rham cohomology, in fact p-adic cohomology generally contains much moreinformation, and displays a wealth of extra subtlety that both complicates and enriches the subject. Indeed, as Ogushas long argued, crystalline cohomology is often much more closely analogous to Hodge theory for complex varieties,for example in how one can use p-adic invariants to control certain deformation problems in mixed characteristic orequicharacteristic p.

This added subtlety makes p-adic cohomology a more challenging and intricate subject of study than the `-adictheory, and is partially the reason why progress in the field has often been slower and more painstaking, usually laggingbehind results obtained for other Weil cohomology theories. When the p-adic world does catch up, however, the resultscan be far more significant that their `-adic counterparts. A good example of this is provided by the (full) Weil conjec-tures: although the `-adic proof preceded the purely p-adic proof by over 30 years, it is only by using p-adic cohomologythat we now have algorithms that can actually compute the zeta functions of varieties [Ked04, Lau06].

239

Page 244: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

240 Five-day Workshop Reports

Recent years have seen somewhat of a maturation of the subject; many basic foundational results are now settled,and interest has started to turn towards applications of these results to long standing arithmetic questions and conjec-tures, some of which were the original motivation for the development of p-adic cohomology. The most important ofthese are, in no particular order:

• various manifestations of p-adic Hodge theory;

• the p-adic Langlands correspondence both for number fields and function fields;

• applications of the theory of arithmetic D-modules to p-adic representation theory;

• the p-adic properties of special values of L-functions, including refined conjectures of the Birch–Swinnerton-Dyerand Beilinson–Bloch–Kato type;

• Iwasawa theory in positive characteristic.

Another important new research direction is to explore some more refined aspects of the theory such as the questionof integrality and p-torsion phenomena. New avenues are starting to be opened up in extending the scope of p-adiccohomology beyond the classical situation of perfect ground fields. There has also been a surge of interest in the moresubtle cousin of p-adic cohomology theory, namely p-adic homotopy theory, which attempts to apply p-adic methods tostudy non-abelian phenomena on varieties in characteristic p.

These two directions of research are deeply interwoven. For example a major motivation for the study of integralityand p-torsion in p-adic cohomology is to develop tools for attacking refined Birch–Swinnerton-Dyer-, Beilinson–Bloch–Kato- and Iwasawa-type conjectures. The desire to develop p-adic homotopy theory is partially driven by problemsencountered in the p-adic Langlands conjecture, and to have the usual homotopical tools available for the study of p-adicalgebraic cycles, such as oriented cobordism theory. Part of the motivation for exploring overconvergent cohomologyover non-perfect ground fields is to have an analogue of p-adic Hodge theory over Laurent series fields, especially forfamilies of coefficients, and to develop tools for p-adic cycle and period maps, similar to those appearing in the regulatorsof Gros [Gro90] and the integration theory of Besser [Bes00].

There are two more areas of mathematics which are closely connected to the main topic of the conference: p-adic or non-archimedean analytic geometry, via the study of p-adic differential equations, and function field arithmetic,especially its local and cohomological aspects (see for example [?, ?]). Non-archimedean analytic geometry went throughsomething of a revolution in the last few years, with the methodical introduction of a new concept of points, due toBerkovich and Huber. This strongly extended the scope of its foundational results, but at the same time streamlined theproof of many classical theorems. In the setting of p-adic differential equations these methods also found applications,and via the work of Baldassarri, Poineau–Pulita and Kedlaya [Bal10, PP13, Ked16] now we have beautiful sufficientconditions for the finiteness of cohomology for p-adic local systems on curves.

All these various strands of research were represented at the workshop, which drew together a wide range of re-searchers all with a broad interest in p-adic cohomology. There were a large number of excellent talks given by a diverseselection of speakers, all generating lively further discussions among participants. A lot of the work discussed at theworkshop concerned exciting progress towards major open problems in the area, as well as interesting new perspectivesthat will provide fertile ground for the continuing development of the subject.

Main Themes

As we have already explained, all these exciting new trends emerging in the field are of course deeply interwoven, andthe aim of the workshop was to encourage new progress in these areas by promoting both predictable and unpredictablesynergies between them. Here we describe in more details the specific topics that were covered during the workshop,and some of the interactions that were generated.

Foundations and theory over non-perfect fields

Traditionally, p-adic cohomology theories have been expressed for varieties over perfect ground fields of characteristicp. While much of the theory still works over non-perfect fields, arithmetic considerations (in particular the generalphenomenon of semistable reduction, as well as analogies with the `-adic theory) lead one to expect certain refinements

Page 245: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

p-adic cohomology and arithmetic applications 241

of existing p-adic cohomologies (such as rigid cohomology) when working over such non-perfect fields. There have beenseveral recent attempts to work out what such a more general theory might look like - for example in [LP16] the caseof Laurent series fields over perfect fields was considered, and in [LS17] B. Le Stum has given a completely generaldefinition of “overconvergent cohomology” for locally Noetherian schemes.

During the workshop, a major advance in these foundational matters was announced in the talk given by RichardCrew on Rings of arithmetic differential operators on tubes. He described an extension of Berthelot’s theory of arith-metic differential operators to a class of morphisms of adic formal schemes that are not necessarily of finite type, or evenadic, by introducing a new finiteness condition, that of being ‘universally Noetherian’. After an overview of the neces-sary commutative algebraic foundations, he went onto to explain how one can use this theory to give generalisations ofthe theory of convergent and overconvergent isocrystals.

There are also other approaches to generalising the theory of arithmetic D†-modules, as studied for example byCaro–Vauclair [CV15]. These are based on a slightly different finiteness condition - being ‘p-smooth’ rather than‘universally Noetherian’. These two conditions are somewhat orthogonal, and provide good frameworks in which tostudy Grothendieck’s 6 operations in different situations. One particularly interesting question would be to see whetheror not there is a natural way to unify these two approaches.

The Langlands program and links with representation theory

Berthelot’s theory of arithmetic D†-modules was created to try to give a good analogue of the theory of algebraicD-modules in mixed characteristic or characteristic p situations, and thanks to recent work of Caro, many of the foun-dations are now essentially in place.

In the classical case, over the complex numbers, or more generally fields of characteristic zero, one especially impor-tant area of application is to the study of representation theory, the most famous example of this being the Beilinson–Bernstein correspondence. It is similarly hoped that the theory of arithmetic D†-modules will prove to be a powerfultool in the study of representation theory, in particular that of p-adic Lie groups, and thus in the p-adic Langlandsprogram. A similar such application of D-module theory on rigid analytic spaces over p-adic fields has been alreadyfound by Ardakov–Wadsley [AW13], who used their theory to answer some representation theoretical problems whicharose in the local Langlands program.

There is also closely related work of Huyghe, Patel, Schmidt and Strauch on localisation theorems in the setting ofarithmetic D†-modules [HPSS15], in which they show that there is an equivalence of categories between the category oflocally analytic admissible representations of some split reductive group over a finite extension of Qp, and the categoryof coadmissible arithmeticD†-modules over the rigid analytic space attached to the flag variety of the group. The talk byMatthias Strauch entitled Arithmetic structures in sheaves of differential operators on formal schemes and D†-affinityreported on this exciting piece of work. First he defined certain integral structures, depending on a congruence level, forthe sheaves of differential operators on a formal scheme which is a blow-up of a formal scheme which itself is formallysmooth over a complete discrete valuation ring of mixed characteristic. When one takes the projective limit over allblow-ups, one obtains the sheaf of differential operators on the associated rigid space, introduced independently by K.Ardakov and S. Wadsley. In the second part he explained what it means for a formal model of a flag variety to beD†-affine (this concept is analogous to that of Beilinson–Bernstein and Brylinski–Kashiwara in the algebraic context),and he then explained an example illustrating how to use these results together with methods from rigid cohomology toanalyze locally analytic representations of p-adic groups.

Similarly, the 6 operations formalism has been used by Abe [Abe13] to prove a p-adic Langlands correspondence inthe function field setting, and thus verify Deligne’s “petits camarades cristallins” conjecture on the existence of p-adiccompanions to compatible systems of `-adic Galois representations (at least over curves). This has been the startingpoint of a lot of new research which we will talk about in more detail in the next section.

Higgs bundles, Simpson correspondence, Simpson’s conjecture and p-adic companions

Since the foundational work of Grothendieck, Deligne, and Tate, it has been clear that `-adic local systems (or moregenerally, complexes of `-adic constructible sheaves) should be the `-adic incarnation of a family of motives in charac-teristic p. The situation for p-adic coefficient objects was only more recently clarified, through foundational work ofP. Berthelot and R. Crew and, more recently, breakthroughs due to D. Caro and especially T. Abe.

Page 246: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

242 Five-day Workshop Reports

As we already mentioned above, Abe used his robust p-adic cohomology theory to resolve part of Deligne’s Com-panions Conjecture from Weil II for curves: he constructs, for every sufficiently irreducible lisse `-adic sheaf on asmooth curve X/Fq , a compatible overconvergent F -isocrystal. However the p-adic Companions Conjecture for higherdimensional bases remains wildly open. Morally, this is because p-adic coefficent objects contain dramatically moreinformation than their `-adic cousins; they look and act much more like complex variations of Hodge structures. Never-theless, in the talk by Kiran Kedlaya entitled Update on the companion problem, the current status of the problem wassurveyed. He first described the problem in detail, and the previous work of Drinfeld, Deligne, and Abe–Esnault, thenhe talked about his new result, which establishes a weak form of the conjecture.

In Simpson’s foundational work on non-abelian Hodge theory, he conjectures that rigid local systems on smooth,complex, quasi-projective varieties are of geometric origin. This problem was discussed in the talk given by HeleneEsnault entitled Rigid systems and integrality, where she reported on her remarkable joint work with M. Grochenig.She described how they proved that the monodromy of a cohomologically rigid integrable connection (E,∇) on asmooth complex projective variety X is integral, a statement sometimes known as Simpson’s integrality conjecture.In order to achieve this, they first proved that the mod p reduction of a rigid integrable connection (E,∇) has thestructure of an isocrystal with Frobenius structure. She also explained how they proved that rigid integrable connectionswith vanishing p-curvatures are unitary, and how they used the latter result to prove new cases of Grothendieck’s p-curvature conjecture. The result is both interesting for its philosophical implications and the link it creates betweenp-adic cohomology and Simpson’s original conjecture.

A fundamental tool used by Esnault–Grochenig is a certain p-adic analogue of Simpson’s correspondence, relatinglocal systems with Higgs bundles. The talk by Bernard Le Stum on A quantum Simpson correspondence was a report ona result of this type, but for ‘quantum’ local systems, i.e. non-commutative deformations through a formal parameter q.He first explained how the Simpson correspondence establishes an equivalence between a category of modules equippedwith a connexion and a category of Higgs bundles. Then he talked about his joint work with Michel Gros and AdolfoQuiros, where they describe such an equivalence in the case of modules equipped with a q-connexion when q is a p-throot of unity. This is modelled on the characteristic p case and relies on the notion of quantum divided powers that healso discussed.

In contrast with what happens over the complex number field, for normal varieties over a finite field, recent workof Deligne, Drinfeld, Abe–Esnault, and Kedlaya establishes that for fixed `, there are only finitely many `-adic coef-ficient objects with finite determinant, bounded degree, and ‘bounded ramification’. This suggests that they shouldexhibit some form of rigidity, and that some form of Simpson’s conjecture should be true for them. Indeed, in the talkby Subrahmanya Krishnamoorthy entitled Rank two F -isocrystals and abelian varieties the speaker reported on hisjoint work-in-progress with Ambrus Pal on a Simpson-like conjecture for F -isocrystals. This conjecture states that forsmooth varieties X over finite fields, any overconvergent F -isocrystal which ‘looks’ as though it should appear in thecohomology of a family of abelian varieties over X does in fact come from such a family. The methods are similar tothe work above: it starts with the curve case, which can be proved via a refined form of the Langlands correspondence,then uses tools from geometry to extend the abelian variety from a sub-curve to the whole variety, using Serre–Tate typedeformation theory and algebraization results.

Finally let us mention the talk by Paul Ziegler entitled Mirror symmetry for moduli spaces of Higgs bundles viap-adic integration, which was a report on a very surprising application of p-adic methods to the study of Higgs bundles.The speaker talked about a recent proof, joint with M. Grochenig and D. Wyss, of a conjecture of Hausel and Thaddeuswhich predicts the equality of suitably defined Hodge numbers of moduli spaces of Higgs bundles with SLn- and PGLn-structure. The proof, inspired by an argument of Batyrev, proceeds by comparing the number of points of these modulispaces over finite fields via p-adic integration. This then provides information about Hodge numbers through the Weilconjectures and p-adic Hodge theory. Particularly exciting is the prospect of using these methods to shed new light onaspects of the Langlands program.

The de Rham–Witt complex, Iwasawa theory and aspects of integral p-adic cohomology

One of the original motivations of Grothendieck and Berthelot for inventing crystalline cohomology, as a p-adic com-panion to the family of `-adic cohomologies produced by the etale theory, was to explain p-torsion phenomena. Whileintegral crystalline cohomology achieves this for smooth and proper varieties, the extension to a ‘good’ cohomologytheory for arbitrary varieties, which reached its zenith in the proof of the 6 operations formalism by Caro, has beenachieved only for rational coefficients, i.e. after tensoring with Q. This therefore still leaves open the question of what

Page 247: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

p-adic cohomology and arithmetic applications 243

an integral p-adic theory should look like for open or singular varieties, which has been the subject of much recent workin the field, in particular the study of the overconvergent de Rham–Witt complex by Davis, Langer and Zink [DLZ11].This now seems to provide a good candidate for smooth (but possibly open) varieties, although a priori it is not clear howit compares with other candidates such as integral Monsky–Washnitzer cohomology, or even if the latter is well-defined.

Such a comparison theorem between these two approaches was the subject of the talk given by Veronika Ertl,entitled Integral Monsky-Washnitzer and overconvergent de Rham-Witt cohomology. In it she reported on joint workwith Johannes Sprang in which they first of all show that Monsky–Washnitzer cohomology is well defined on the integrallevel, and secondly prove a comparison isomorphism between this and the overconvergent de Rham–Witt cohomology.This extends previous work of Davis and Zurieck-Brown by removing all restrictions on the cohomological degree.Interesting further questions in this direction would be to calculate the p-torsion of these groups in new and interestingcases, and to give a geometric interpretation of this torsion.

The study of integral properties of p-adic cohomology is very closely related to that of the p-adic properties of L-functions in characteristic p. Most of the work recently has been done on 1-dimensional families of abelian varieties,for example [KT03], [Pal10] and [TV15] which look at the refined Birch–Swinnerton-Dyer conjecture, the integrality ofp-adic L-functions and the equivariant Tamagawa number conjecture, respectively. What is common in these works isthe crucial use of integral p-adic cohomology theories predating the construction in [DLZ11], typically log crystallinecohomology. Therefore they are forced either to reduce the general case to the semi-stable one, or worse, restrict to thesituation when the abelian scheme is semi-stable and the considered Galois covers of the base are tame. This demon-strates the limitations of these methods, but with sufficient progress on the finiteness properties of the the overconvergentde Rham–Witt complex we expect that this area would start to develop very rapidly.

Another talk with connections to integral aspects of p-adic cohomology and Iwasawa theory was by Ambrus Pal onthe subject of Formal deformations of crystals and arithmetic applications. The speaker first described two problems inthe arithmetic of elliptic curves over function fields, the analogue of a classical conjecture of Stevens in the function fieldsetting, originally asked by Mazur, and his own conjecture on the integrality properties of Hecke eigenforms attachedto elliptic curves. He then spoke about his work in progress on these conjectures, using tools from p-adic cohomology.The first ingredient is a pro-representability theorem of what could be called arithmetic deformations of crystals andDieudonne crystals. The second ingredient is the application of the Taylor–Wiles method in this setting, which showsthat the first conjecture implies the second.

One exciting recent trend in integral p-adic cohomology, and in particular integral p-adic Hodge theory, has seenthe application of methods and techniques from higher category theory and higher algebra to the study of crystallinecohomology andK-theory. The talk given by Lars Hesselholt on Higher algebra and arithmetic provided an entertainingintroduction to some of these ideas, and how they can be used to circumvent problems in arithmetic geometry, inparticular in characteristic p, caused by the introduction of denominators.

As explained in the talk, these often arise because of the fact that the natural numbers record only the result ofcounting, and not the process of counting. The higher algebra of Joyal, Lurie and others replaces the initial ring ofalgebra Z by a more fundamental object, the sphere spectrum S, which also retains information about the countingprocess. The ‘brave new algebra’ long advocated by Waldhausen uses S as the basis for arithmetic, and doing so canoften lead to the elimination of ‘unwanted’ denominators.

Notable manifestations of this vision include the Bokstedt–Hsiang–Madsen topological cyclic homology, which re-ceives a denominator-free Chern character, and the related Bhatt–Morrow–Scholze integral p-adic Hodge theory, whichmakes it possible to exploit torsion cohomology classes in arithmetic geometry. In the talk, the speaker explained theconstruction of a certain ‘higher’ analogue of de Rham cohomology, and gave some calculations in p-adic and charac-teristic p settings. In the former case, one recovers quite naturally certain constructions that appear in integral p-adicHodge theory. In the latter, one can use this theory to give a cohomological interpretation of the Hasse-Weil zeta functionof smooth and proper varieties over finite fields by regularized determinants, as envisioned by Deninger.

The recent revolution in the higher category theory and higher algebra has the potential to become a powerful toolin all sorts of areas of mathematics. This talk gave some excellent concrete examples of this potential in the particularfields of algebraic K-theory and p-adic Hodge theory.

Page 248: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

244 Five-day Workshop Reports

F -isocrystals and p-adic representations, and homotopy theory

Most current research on the homotopy-theoretical aspects of p-adic cohomology concentrates on two particular strands:the theory of the p-adic, or crystalline, fundamental group via various Tannakian categories of isocrystals, and appli-cations of the latter to Diophantine problems. The talk by Atsushi Shiho entitled On de Jong conjecture concerned thefirst aspect, covering a fascinating conjecture due to de Jong predicting that any isocrystal on a geometrically simplyconnected smooth projective variety over a perfect field of characteristic p > 0 should be constant. The speaker re-ported on his joint work in progress with Helene Esnault where they proved several results related to this conjecture,including some special cases and a reduction to the case of convergent isocrystals.

On the other hand the talk by Ishai Dan-Cohen entitled Rational motivic path spaces concerned the second aspect.A central ingredient in Kim’s work on integral points of hyperbolic curves is the “unipotent Kummer map” which goesfrom integral points to certain torsors for the pro-unipotent completion of the fundamental group, and which, roughlyspeaking, sends an integral point to the torsor of homotopy classes of paths connecting it to a fixed base-point. Thespeaker reported on recent joint work of his with Tomer Schlank, which provides liftings of Kim’s construction to morerefined rational homotopy types, and thus gives rise to factorizations of Kim’s original period map. He explained theconstruction of a certain space Ω of rational motivic loops, and use this to provide a factorization of the unipotentKummer map which can be summarized schematically as

points→ rational motivic points→ Ω-torsors→ π1-torsors.

For affine curves, Ω-torsors are the same thing as π1-torsors, essentially because these spaces are K(π, 1)’s, however,their approach has the potential to give more refined information for higher dimensional varieties.

A key link between the p-adic, and the usual etale fundamental groups is given by the Katz correspondence, or thestudy of unit root F -isocrystals, and the associated p-adic Galois representations. The talk by Joe Kramer-Miller onSlope filtrations of F -isocrystals, log decay, and genus stability for towers of curves reported on work which shed newlight on this classical, but very important connection. The speaker introduced a notion of F -isocrystals with logarithmicdecay, gave a conjectural description of how this should relate to the slope filtrations, and sketched a proof of thisconjecture when the unit-root subcrystal has rank one. This then leads to a new proof of a recent theorem of Drinfeld–Kedlaya, as well as a generalized version of Wan’s conjecture on genus stability for towers of curves coming fromgeometry.

Other topics

Finally we report on those activities of the workshop which cannot be easily classified into one of the categories men-tioned above, but nevertheless demonstrate the richness of the subject and its many connections to other areas of alge-braic geometry and number theory.

The talk by Gebhard Bockle entitled Compatible systems of Galois representations of global function fields reportedon joint work of the speaker with W. Gajda and S. Petersen on the important open image conjecture. If K is a finitelygenerated infinite field, then it was shown recently by Cadoret, Hui and Tamagawa that for almost all `, the cohomologygroupHi

et(XKs ,F`) is semisimple as a representation of the geometric fundamental groupGgeoK = Gal(Ks/KFsp). This

has an important consequences for the image of GgeoK under its action on the adelic module Hi

et(XKs ,AQ). The speakerdescribed how to extend these results to the case where F` is replaced by the mod-` reduction of a compatible system ofGalois representations, using automorphic methods.

The talk given by Masha Vlasenko on Atkin and Swinnerton-Dyer congruences for toric hypersurfaces was about anew result on a classical application of p-adic cohomology to arithmetic. It concerned certain kinds of explicit crystalscalled Dwork modules, which in the 1990’s were used by V. Batyrev to describe the mixed Hodge structure on the middlecohomology of affine hypersurfaces in algebraic tori. In the talk the speaker reported on her work in progress donejointly with Frits Beukers, in which they they used these crystals to show several p-adic congruences for the coefficientsof powers of a Laurent polynomial. This then leads to congruences involving the L-function of toric exponential sums,and yields new p-adic unit-root formulas.

One of the early indications of the power and utility of p-adic methods was the Serre–Tate theorem on the defor-mation theory of abelian varieties, showing that this is in fact controlled by the deformation theory of its p-divisiblegroup, or equivalently its first crystalline cohomology. This basic idea of using p-adic invariants to control deformationproblems has proved to be extremely fruitful, and the talk given by Ananth Shankar provided an excellent example

Page 249: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

p-adic cohomology and arithmetic applications 245

of this. Entitled Serre-Tate theory for Shimura varieties of Hodge type, it was a report on work of the speaker andhis coauthor Rong Zhou in which they studied the formal neighbourhood of a point in µ-ordinary locus of an integralmodel of a Hodge type Shimura variety. They proved the existence of a structure analogous to the Serre–Tate structureon the deformation space of an ordinary abelian variety, in particular providing an analogue of ‘canonical lifts’ in thesesituations.

Finally, the talk by Edgar Costa entitled Computing zeta functions of nondegenerate toric hypersurfaces was a veryuseful update on the ongoing revolution on the point counting problem via p-adic methods. The speaker reported on anongoing joint project with Kiran Kedlaya and David Harvey on the computation of zeta functions of nondegenerate torichypersurfaces over finite fields, using p-adic cohomology. By exploiting a new way of computing sparse approximationsto overconvergent cohomology classes, they were able to drastically improve the feasibility of computing zeta functionsof certain smooth varieties, in particular for much larger values of p than were previously accessible.

Outcomes

The subject of p-adic cohomology is often characterised by a plethora of different approaches to the subject, each ofwhich has its own particular perspective and scope of application. The workshop successfully drew together peopleworking on all aspects of the theory, and the various topics generated many discussions, kick-starting new research andcollaborations across the whole breadth of the subject.

For example, the talk given by Veronika Ertl inspired much speculation on the expected properties and possiblescope of any integral theory of p-adic cohomology. Richard Crew in particular suggested that in fact a fully robusttheory cannot exist for finite coefficients, and sketched during discussions the reasons why.

Another topic that was constantly in the air during the workshop concerned problems surrounding the CompanionConjecture, the main remaining case now being how to produce p-adic companions on higher dimensional varieties.This is a topic that several of the participants at the workshop have been working on recently, and one in which we canhope to see some more progress in the near future.

Let us also mention several interesting discussions that took place concerning the p-adic homotopical properties ofvarieties in characteristic p, and to what extent these mirror or deviate from ‘classical’ topological behaviour. As well asbasic foundational matters, such as deformation invariance, or the existence of moduli spaces for p-adic local systems,there were also interesting problems posed on the behaviour of particular kinds of isocrystals, often inspired by Hodgetheory and Simpson-like conjectures. This reflects a general shift in focus in the subject, and is likely to become animportant area of future research. Some of these debates have been followed up since the conference, and have formedthe germ of potential new collaborations.

Participants

Abe, Tomoyuki (Kavli Institute for the Physics and Mathematics of the Universe)Balakrishnan, Jennifer (Boston University)Baldassarri, Francesco (Universita Degli Studi di Padova)Bellovin, Rebecca (Imperial College London)Boeckle, Gebhard (Universitat Heidelberg)Costa, Edgar (Dartmouth College)Crew, Richard (University of Florida)Dan-Cohen, Ishai (Ben Gurion University of the Negev)Di Proietto, Valentina (University of Exeter)Ertl, Veronika (University Regensburg)Esnault, Helene (Freie Universitat Berlin)Gregory, Oliver (Teschnische Universitat Munchen)Hansen, David (Columbia University)Hesselholt, Lars (Nagoya University and University of Copenhagen)Huyghe, Christine (IRMA Universite de Strasbourg)Kedlaya, Kiran (University of California, San Diego)

Page 250: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

246 Five-day Workshop Reports

Kobayashi, Shinichi (Kyushu University)Kramer-Miller, Joe (University College London)Langer, Andreas (University of Exeter)Lazda, Christopher (Universita di Padova)Le Stum, Bernard (IRMAR Universite de Rennes I)Liu, Ruochuan (Peking University)Marmora, Adriano (IRMA Universite de Strasbourg)Moon, Yong Suk (Purdue University)Niziol, Wiesława (ENS Lyon)Orlik, Sascha (Universitat Wuppertal)Pal, Ambrus (Imperial College London)Pulita, Andrea (Institut Fourier)Shankar, Ananth (MIT)Shiho, Atsushi (University of Tokyo)Strauch, Matthias (Indiana University)Taelman, Lenny (Universiteit van Amsterdam)Vlasenko, Masha (Institute of Mathematics of the Polish Academy of Sciences)Zhu, Xinwen (Caltech)Ziegler, Paul (University of Oxford)Zureick-Brown, David (Emory University)

[Abe13] T. Abe, Langlands correspondence for isocrystals and existence of crystalline companion for curves, preprint(2013), https://arXiv.org/abs/1310.0528.

[AW13] K. Ardakov and S. Wadsley, On irreducible representations of compact p-adic analytic groups, Ann. ofMath. (2) 178 (2013), no. 2, 453–557, https://doi.org/10.4007/annals.2013.178.2.3.

[Bal10] F. Baldassarri, Continuity of the radius of convergence of differential equations on padic analytic curves,Invent. Math. 182 (2010), no. 3, 513–584, https://doi.org/10. 1007/s00222-010-0266-7.

[Ber74] P. Berthelot, Cohomologie cristalline des schemas de caracteristique p > 0, Lecture Notes in Mathematics,vol. 407, Springer, 1974, https://dx.doi.org/10.1007/BFb0068636.

[Ber97] , Finitude et purete cohomologique en cohomologie rigide, Invent. Math. 128 (1997), no. 2, 329–377,https://doi.org/10.1007/s002220050143.

[Bes00] A. Besser, A generalization of Coleman’s p-adic integration theory, Invent. Math. 142 (2000), no. 2, 397–434,https://doi.org/10.1007/s002220000093.

[BP09] G. Bockle and R. Pink, Cohomological theory of crystals over function fields, EMS Tracts in Mathematics,vol. 9, European Mathematical Society (EMS), Zurich, 2009, https://doi.org/10.4171/074.

[CV15] D. Caro and D. Vauclair, Logarithmic p-bases and arithmetical differential modules, preprint (2015), https://arXiv.org/abs/1511.07797.

[DLZ11] C. Davis, A. Langer, and T. Zink, Overconvergent de Rham-Witt cohomology, Ann. Sci. Ec. Norm. Super.(4) 44 (2011), no. 2, 197–262, https://doi.org/10.24033/asens. 2143.

[Dwo60] B. Dwork, On the rationality of the zeta function of an algebraic variety, Amer. J. Math. 82 (1960), 631–648,https://doi.org/10.2307/2372974.

[Gro68] A. Grothendieck, Crystals and the de Rham cohomology of schemes, Dix Exposes sur la Cohomologie desSchemas, North-Holland, Amsterdam; Masson, Paris, 1968, pp. 306 - 358.

[Gro90] M. Gros, Regulateurs syntomiques et valeurs de fonctions L p-adiques. I, Invent. Math. 99 (1990), no. 2,293–320, With an appendix by Masato Kurihara, https://doi.org/10.1007/BF01234421.

[HPSS15] C. Huyghe, D. Patel, T. Schmidt, and M. Strauch, D+-affinity of formal models of flag varieties, preprint(2015), https://arxiv.org/abs/1501.05837.

[Ked04] K. S. Kedlaya, Computing zeta functions via p-adic cohomology, Algorithmic number theory, Lecture Notesin Comput. Sci., vol. 3076, Springer, Berlin, 2004, https://doi. org/10.1007/978-3-540-24847-7 1, pp. 1–17.

[Ked16] , Convergence polygons for connections on nonarchimedean curves, Nonarchimedean and Tropical Geome-

Page 251: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

p-adic cohomology and arithmetic applications 247

try (M. Baker and S. Payne, eds.), Simons Symposia, Springer International Publishing, 2016, https://doi.org/10.1007/978-3-319-30945-3, pp. 51–98.

[KT03] K. Kato and F. Trihan, On the conjectures of Birch and Swinnerton-Dyer in characteristic p ¿ 0, Invent.Math. 153 (2003), no. 3, 537–592, https://doi.org/10.1007/s00222-003-0299-2.

[Lau06] A. G. B. Lauder, A recursive method for computing zeta functions of varieties, LMS J. Comput. Math. 9(2006), 222–269, https://doi.org/10.1112/S1461157000001261.

[LP16] C. Lazda and A. Pal, Rigid Cohomology over Laurent Series Fields, Algebra and Applications, vol. 21,Springer, 2016, http://dx.doi.org/10.1007/978-3-319-30951-4.

[LS17] B. Le Stum, Rigid cohomology of locally noetherian schemes Part 1 : Geometry, preprint (2017), https://arxiv.org/abs/1707.02797.

[MW68] P. Monsky and G. Washnitzer, Formal Cohomology. I., Ann. of Math. 88 (1968), 181– 217, https://doi.org/10.2307/1970571.

[Pal10] A. Pal, The Manin constant of elliptic curves over function fields, Algebra Number Theory 4 (2010), no. 5,509–545, https://doi.org/10.2140/ant.2010.4.509.

[PP13] J. Poineau and A. Pulita, The convergence newton polygon of a p-adic differential equation iv : local andglobal index theorems, preprint (2013), https://arxiv.org/abs/1309.3940.

[Tae12] L. Taelman, Special L-values of Drinfeld modules, Ann. of Math. (2) 175 (2012), no. 1, 369–391, https://doi.org/10.4007/annals.2012.175.1.10.

[TV15] F. Trihan and D. Vauclair, A comparison theorem for semi-abelian schemes over a smooth curve, preprint(2015), https://arxiv.org/abs/1505.02942.

Page 252: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 22

New Perspectives in RepresentationTheory of Finite Groups (17w5003)

October 15 - 20, 2017Organizer(s): Gunter Malle (TU Kaiserslautern), Gabriel Navarro (University of Valen-cia), Britta Spath (Bergische Universitat Wuppertal), Pham Huu Tiep (Rutgers Univer-sity)

Overview of the Field

Group Theory is essentially the theory of symmetry for mathematical and physical systems, with major impact indiverse areas of mathematics. The Representation Theory of Finite Groups is a central area of Group Theory, withmany fascinating and deep open problems, and significant recent successes. In 1963 R. Brauer [B] formulated a list ofdeep conjectures about ordinary and modular representations of finite groups. These have led to many new conceptsand methods, but basically all of his main conjectures are still unsolved to the present day. A new wealth of difficultproblems, relating global and local properties of finite groups, was opened up in the seventies and eighties (of the 20thcentury) by subsequent conjectures of J. McKay [Mc], J. Alperin [Al], E. Dade [D1, D2], M. Broue [Br], and others, allremain open up to date.

The classification of finite simple groups raised the hope that one should be able to reduce some of the aforemen-tioned conjectures to statements about simple groups, and subsequently establish these statements by exploring deepknowledge about simple groups provided by the Deligne–Lusztig theory and other recent fundamental results in repre-sentation theory. In fact, this reduction to simple groups was accomplished by several of the organizers in [IMN] for theMcKay conjecture, in [NT1] for the Alperin weight conjecture and in [S1], [S2] and [S3] for the blocks versions. Thishope has recently materialized since the proof of the McKay conjecture for the prime 2 by two of our organizers: G.Malle and B. Spath [MS]. The road to prove the McKay conjecture for odd primes is now paved.

Recent Developments and Open Problems

There were two main themes in our meeting, with several remarkable highlights, which are discussed in more detailbelow. One of the directions that became even more apparent is the advent of powerful geometric methods to solveproblems in finite group representation theory. This was witnessed for example in the impressive talks by RaphaelRouquier and by Cedric Bonnafe. One of the most important problems in Representation Theory is to find Brauer’s

248

Page 253: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

New Perspectives in Representation Theory of Finite Groups 249

decomposition matrices of symmetric groups and finite groups of Lie type. R. Rouquier announced an astonishingconjecture, joint work with O. Dudas, that will have a lasting impact on our field. The other theme has been remarkableprogress on the so called McKay–Galois conjecture, proposed in 1994 by one of the organizers ([N]). Other themes arerelated to another open problem: how the absolute Galois group act on the irreducible characters of the simple groups.

Decomposition Numbers

Decomposition numbers are the data relating the modular representations of a finite group G over a field of positivecharacteristic to the generally better known complex characters Irr(G). The two representation theories (modular andover the complex numbers) are very different in nature. Complex characters were introduced by Frobenius at the endof the 19th century and from the start corresponding sets Irr(G) were quickly determined for many of the finite simplegroups known at the time, as for example for the symmetric and alternating groups. Modular representation theorywas only introduced much later by Brauer in the 1940’s and very simple questions about the representations of thesymmetric groups are still unanswered — and intensely investigated, including in the present meeting.

A related question is the one of basic sets, that are subsets of Irr(G) somehow standing for the much more mysteriousirreducible modules over fields of positive characteristic.

Decomposition numbers and basic sets are thus a vital information to tackle conjectures about modular represen-tations since their versions over the complex numbers are generally much simpler. Any progress in the knowledge ofdecomposition numbers for finite simple groups, i.e., now essentially groups of Lie type, is of foremost importance.

It was a big success of this meeting to arrange the presence of Raphael Rouquier (UCLA) one of the main con-tributors to the field of representation theory as a whole, and a very fruitful presence in our meeting as testified bynumerous participants (see testimonials below). The meeting was the place where it was announced what is probably abreakthrough in the knowledge on decomposition numbers. In joint work with Olivier Dudas (Paris), Rouquier givesa conjectural description of the decomposition matrix of finite unitary groups with respect to the so-called non linear(non-defining) primes. This is the main case left open by the first generation of work on the subject (Dipper–Gruber–Hiss, 1985–1997) treating essentially linear primes [Di], [GH]. This has been confirmed for dimensions up to n bycomparison with the known results. Though still a conjecture this description has the advantage of being very preciseand concrete. The matrix should correspond to an explicit operator∇ on the algebra of symmetric functions defined interms of Macdonald polynomials.

Rouquier adds even more perspective to the subject by relating this algebraic setup to algebraic geometry. Indeedhe provides a categorification of the operator ∇ in the category of coherent sheaves on Hilbert schemes on the complexplane. This interpretation should also open the door to (conjectural) generalizations to other types of classical groupsfor non linear primes.

The McKay Conjecture and refinements

If G is a finite group, we denote by Irr(G) the set of irreducible complex characters of G and by Irrp′(G) its subsetconsisting of characters having degree not divisible by p. The McKay conjecture [Mc] from 1972 is at the center ofrepresentation theory. If p is a prime and G is a finite group, then it asserts that

|Irrp′(G)| = |Irrp′(NG(P ))| ,

where P is a Sylow p-subgroup of G and NG(P ) its normalizer. That is to say, a certain fundamental information on Gis encoded in some local subgroup of G, namely the Sylow normalizer NG(P ).

This conjecture somehow gave a new direction to representation theory of finite groups. Many results and conjec-tures, even recently, would not have been possible without it. We have already mentioned the conjectures by Alperin–McKay, Alperin, Dade and Broue. In fact major contributions to the representation theory of finite groups of Lie typesuch as [BMM] originate in the same kind of ideas.

Since our previous meeting in 2014, there has been an absolute highlight in this subject: Malle and Spath (two ofour organizers) proved the McKay conjecture for p = 2 (see [MS]), the case that was originally conjectured. This cameafter the reduction theorem by I. M. Isaacs, G. Malle and G. Navarro in [IMN], proving that the the path that started in

Page 254: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

250 Five-day Workshop Reports

2007 is a successful one. Of course, now the big question is what happens if p is odd? In our meeting we had a fruitfulsession on a discussion on what remains to do next, and on several possible ideas to tackle this long-standing problem.

In 2004, Navarro proposed a deep refinement of the McKay conjecture [N] that takes into account fields of valuesover the p-adics. This conjecture has consequences in determining local structure of the group from global structureand vice versa. In our meeting, A. Schaeffer Fry announced a proof, by her and in part also in collaboration withJ. Taylor, of the self-normalizing 2-Sylow conjecture, which is one of the most remarkable consequences of this McKay-Galois conjecture. This proof comes after 14 years since the conjecture was announced (and ten years after the proof[NTT] of the self-normalizing Sylow conjecture for odd primes). Recent work of Schaeffer Fry and Taylor also ledto a proof of another conjecture of Navarro, Tiep, and Vallejo [NTV] concerning groups whose Sylow 2-normalizerscontain a single irreducible 2-Brauer character (which was recently reduced to simple groups by Navarro and Vallejo).O. Brunat announced a proof of the McKay–Galois conjecture for symmetric groups and odd primes (the case p = 2was announced in our 2014 meeting) and Lucas Ruhstorfer announced a proof of the conjecture for groups of Lie typein their defining characteristic.

In recent years, there has been considerable interest in fields of values of characters, and a main open problem inthis area is to determine how the absolute Galois group acts on the irreducible complex characters of a group. D. Rossiannounced a proof of one direction of the 3-rational conjecture (by two of the organizers, Navarro and Tiep) that assertsthat the finite groups with 3 rational conjugacy classes are the finite groups with 3 rational characters. For the otherdirection, he was also able to get a very tight control over a minimal nonsolvable counterexample (if any).

J. Tent announced hisQp-conjecture, that would extend work of J. Thompson, W. Feit and R. Gow. Almost since thebeginning of character theory, the connections between fields and characters have been studied. An object of interest isthe so called rational groups, or groups all of whose characters have values in the field of the rationals. While the non-abelian composition factors of rational groups were classified by W. Feit and G. Seitz long time ago, it is a conjectureof J. Thompson that the abelian composition factors are cyclic groups of prime order p, for p < 7. The conjecture isstill open. Tent conjectures that if the field of values of the characters are contained in the field Qp, then the abeliancomposition factors of G are also under absolute control. Perhaps, further relations hold for arbitrary fields.

Future directions

Recent progress on all these fundamental conjectures raises the hope that complete proofs of some of them may bepossible in the not too distant future.

It has become even clearer in the past few years that any further significant progress can only be achieved oncewe can resolve a number of basic questions on representations of finite groups of Lie type. With many experts in theDeligne–Lusztig theory present at the meeting, we had a discussion session on Tuesday afternoon to draw a roadmaptowards possible solutions of some of these principal obstacles.

Presentation Highlights

We already highlighted the spectacular announcement by Raphael Rouquier of his and Dudas’ conjecture on the de-composition numbers of the finite unitary groups, as well as the impressive talk of Cedric Bonnafe presenting his results,joint with P. Shan and R. Maksimau, that established a conjecture concerning equivariant cohomology and fixed pointsof Calogero–Moser spaces in the smooth case.

Jesper Grodal presented in Banff a novel approach to endo-trivial modules. These representations of finite groupsover fields of positive characteristic p were introduced by Dade in his study of the Glauberman correspondence. Whilemodules of a given finite groupG over a field k of characteristic p have been shown to be in general beyond classification,the subset of endo-trivial has been shown to be endowed with the structure of a finitely generated abelian group Tk(G)for the tensor product. This also relates with the Picard group of the stable category for the corresponding group algebrakG. Results by Alperin and Bouc–Carlson–Thevenaz (2001–2006, see [CT1]) have given a complete structure theoremin the case where G is a p-group.

In the general case of a finite group with Sylow p-subgroup S, one classically defines a restriction morphism

Tk(G)→ Tk(S)

Page 255: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

New Perspectives in Representation Theory of Finite Groups 251

whose kernel Tk(G,S) contains in fact all the relevant information to deduce Tk(G) from the case of p-groups thanksto work of Carlson–Thevenaz–Mazza–Nakano giving the image and proving the splitness of the above map. Grodalpresented spectacular theorems about Tk(G,S), identifying it with the first cohomology group of the orbit category Oon non-trivial p-subgroups with values in the units k×, viewed as a constant coefficient system. He uses homotopicaltechniques to give a number of formulas for Tk(G,S) in terms of one-dimensional representations of normalizers andcentralizers. This allows him to prove a conjecture by Carlson–Thevenaz (see [CT2]). He also provides strong restric-tions on when such representations of dimension greater than one can occur, in terms of the p-subgroup complex andp-fusion systems. He recovers and extends in a spectacularly simple fashion several computational results in the liter-ature. The computational potential of his methods is illustrated by calculating Tk(G,S) and therefore Tk(G) in othersample new cases, e.g., for the Monster at all primes.

N. Rizo introduced a generalization of Brauer p-blocks with respect to an invariant character of a normal subgroup,which might give a different perspective and help with the reduction in some of the block theory conjectures, includingthe k(B)-conjecture. These blocks, which are related to twisted group algebras, seem to unify several statements, likeBrauer’s Height Zero conjecture and the Gluck–Wolf–Navarro–Tiep theorem [NT2].

B. Sambale reported on the efforts to find a counterexample to the defect group Morita equivalent block problem:what is the exact relationship between the defect groups of two Morita equivalent blocks? It has been claimed, andproved in important cases, that these groups might be isomorphic. The experience in group algebra isomorphismproblems tell us, though, that there might exist counterexamples but that these might be too big to find, at least withcomputers.

P. Fong revisited his celebrated paper with Jon Alperin [AF] giving verifications of Alperin’s weight conjecture forsymmetric and general linear groups. This paved the way to subsequent verifications of the conjecture for other simplegroups but also to the Dade conjecture. The paper has a strong group theoretical side that has been seminal to studies ofdistinguished classes of nilpotent subgroups of simple groups. But the talk by Fong exemplified rather the program offinding deep connections between combinatorics arising from Alperin’s weight conjecture for the general linear groupwith the work of Lascoux–Leclerc–Thibon. Indeed a quite classical operation on partitions with a given r-core wasre-interpreted by them in terms of the action of quantum groups on Fock space, thus making a combinatorial bijectionmore natural. This is of great relevance given how central this combinatorics or their relatives are to representations offinite groups.

A. Paolini lectured on representations of maximal unipotent subgroupsU of finite groupsG of Lie type. He discussedsome history and methods about the problem of parametrizing the set Irr(U). He went on to mention some joint resultswith Goodwin, Le and Magaard about the parametrization of Irr(U) whenG is of small rank, including the exceptionaltypes F4 and E6.

H. Nguyen reported on several recent results, obtained by him and his collaborators, that utilize the average degreeover a subset of complex irreducible characters of a finite group and generalize various classical theorems in CharacterTheory of Finite Groups, including theorems of Thompson and Ito–Michler.

The concept of regular embeddings, first formulated by G. Lusztig, is nowadays a standard reduction technique todeal with finite connected reductive groups with disconnected centre. J. Taylor lectured on smooth regular embeddings,a strengthening of this concept which includes the usual embedding of the special linear group into the general lineargroup. He presented a construction of smooth regular embeddings for any algebraic group, which is done using rootdata, and then used smooth regular embeddings to provide new proofs of unpublished reduction techniques due to Asai.

Another recent highlight was explained in the talk of Z. Halasi. He reported on the final solution of a 20 yearold conjecture by L. Pyber on base sizes for permutation groups. Base sizes for permutation groups have outsideapplications to computer science. This has been a remarkable tour de force in which many mathematicians have beeninvolved, that required an incredible amount of delicate techniques.

Scientific Progress Made and Outcome of the Meeting

General Comments

The meeting featured 20 talks, given by well-known experts in the area as well as many younger participants (includingpostdocs and two Ph. D. students).

Page 256: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

252 Five-day Workshop Reports

Aside from the officially scheduled talks, ample time was allocated to informal discussions. As mentioned above,a discussion session took place on Tuesday afternoon to draw a roadmap towards possible solutions of some of theseprincipal obstacles on the way to complete proofs of some of the fundamental local-global conjectures in representationtheory.

Collaboration Started or Continued During the Meeting

We received an enthusiastic response to our request for feedback from the participants of our meeting which we thinkillustrates very clearly the many cooperations started or continued during the workshop and how the talks given fosterednew ideas and developments.

R. Boltje: I was impressed by the high quality of the talks and enjoyed them very much. Benjamin Sambale’s talkfor instance was very closely related to my own research and gave me some new perspectives. Besides that I would liketo mention three more specific developments.

— With Burkhard Kulshammer we proved that p-permutation equivalences between two block algebras A of agroup G and B of a group H preserve a certain family of invariants that is associated to the two blocks. It was alreadyknown that a p-permutation equivalence betweenA andB determines an isomorphismϕ between defect groups ofA andB and that this isomorphism is also an isomorphism between the fusion systems of A and B. To each centric A-Brauerpair (P, e), Kulshammer and Puig associated an extension of the group NG(P, e)/CG(P ) by Z(P ). This construction isrelated to the construction of a centric linking system for the fusion system of A. During the conference, Kulshammerand myself proved that if (P, e) is a centric A-Brauer pair and (Q, f) is the corresponding centric B-Brauer pair (viaϕ), then ϕ also induces an isomorphism between the extension groups associated to (P, e) and to (Q, f).

— With Susanne Danz we started the project to determine the group of perfect self-isometries of a block algebra Awith cyclic defect groups. This is still in the beginning stages.

— Jesper Grodal made some very interesting remarks related to my own talk, saying that the results with mycoauthors Kessar and Linckelmann that describe the Picard groups of a block algebra B use terms that can be derivedfrom homotopy invariants of the centric linking system of the fusion system of B. We had a very long discussion aboutthese connections that gave me enough incentive to start learning the necessary material from algebraic topology tounderstand these connections. Although nothing concrete came out of it so far, I’m very excited and hopeful that theseconnections will give me a much better insight into the topic of my own research.

C. Bonnafe: Raphael Rouquier and me worked on two different subjects. First, we tried to correct an error dis-covered in our common paper with Jean-Francois Dat on Jordan decomposition of representations of finite reductivegroups. This work is going on and will hopefully lead to a positive conclusion. Also, after Rouquier’s talk on Hilbertscheme and interesting properties of the canonical line bundle, this suggested to us to look at other blowing-up of vari-eties of the form (V ×V ∗)/W : we realized that, in some small examples, such blowing-up satisfies analogous properties(one can recover Lusztig’s fundamental a- and A-invariant geometrically). We intend to work in this direction in thenear future.

S. Danz: I would like to thank the organisers for organizing this inspiring workshop and for giving me the op-portunity to take part in it. As for concrete mathematical discussions and collaborations, I would like to mention thefollowing:

— Thanks to the free afternoon on Wednesday I was able to continue a collaboration with Robert Boltje that wehad started this summer. Together with Jurgen Muller we are studying perfect isometry groups of blocks with cyclicdefect groups, building on work done by Pornrat Ruengrot (student of Charles Eaton) in his PhD thesis. During theweek at BIRS we, in particular, discussed some ideas how to use Boltje’s recent work with Markus Linckelmann andRadha Kessar, which he presented in his talk, to make progress on describing these isometry groups.

— After my talk I had some interesting discussions with Benjamin Sambale concerning certain simple QG-modulesthat are related to 2-transitive permutation modules, in order to gain new information on integral forms of these mod-ules.

— In his talk on the AWC Paul Fong presented his bijection between the isomorphism classes of simple modulesof the symmetric group Sn in a p-block B and the conjugacy classes of B-weights of Sn. In doing so, every simpleB-module is, in particular, assigned a certain conjugacy class of p-subgroups of Sn. The natural question (also asked by

Page 257: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

New Perspectives in Representation Theory of Finite Groups 253

Gabriel Navarro) is whether these groups are related to the vertices of the respective simple modules. It turned out thatthe group occurring in a B-weight is not the vertex of the corresponding simple B-module, in general. But there mightstill be a connection, and we are in touch to discuss this.

C. Eaton: I think the balance between lectures and free time was about right.

With regards to outcomes: As a direct result of the workshop I’ll be visiting Gerhard Hiss to work on the problemof Donovan’s conjecture for defect groups C3 × C3. Whilst in Canada I also discussed this problem with Gunter Malleand Shigeo Koshitani. I discussed the problem of the structure of the Picard group for a block with Robert Boltje, whichamongst other things highlighted the importance of parts of his current work with Kessar and Linckelmann. Thanksagain for the invitation to the workshop - it was very useful.

P. Fong: The conference at BIRS two weeks ago was a very stimulating and enjoyable one — the particular mix ofinterests and people worked out well. I found the ”New Perspectives” in the title of the conference covered not only newmathematical insights, but also the promising young researchers in the representation theory of finite groups who werethere.

As for my particular interests: The natural bijections for the Alperin Weight Conjecture in symmetric groups andgeneral linear groups in my talk gave rise to some interesting questions. First, what is the relation of the vertex of anirreducible modular representation of Sn and the radical subgroup of the weight associated with it? Susanne Danz andBurkhard Kulshammer have computed such vertices. We looked at the case p = 2, but no obvious relation seems tohold. The matter may be different for odd p and this is being examined now. Another question is whether the naturalbijection for symmetric groups passes nicely to alternating groups. This may be a tricky matter - one only has to recallthe talk of Olivier Brunat and the inherent difficulties in passing from Sn to An. But it deserves study. Also of relevanceis whether the natural bijection for GL(n, q) behaves well with respect to automorphisms. Britta Spath and I had achance to talk about this — the bijection should behave well and I need to write this up. The question is also relevantto a result of Jiping Zhang and he and I had the opportunity to talk about this. A natural bijection should also exist forunitary groups and some of the ground work has been done by Jianbei An.

Banff also gave me the opportunity to talk with Michel Broue about resuming our collaboration (together withBhama Srinivasan) on the Dade conjecture for unipotent blocks in classical reductive groups. The discussion workshopthat Britta Spath ran on the inductive local-global conjectures provided an incentive to write-up this project next yearduring the period our stays at MSRI overlap.

One perspective on the representation theory of finite groups I find particularly intriguing is the application of therepresentation theory of algebras, be they affine Lie algebras, affine Hecke algebras, Ariki-Koike or any other algebras.That such theories can help settle special problems in finite groups is a fascinating phenomenon. The talk given byRaphael Rouquier illustrates this well, linking as it did a special case of the decomposition matrix of a unitary groupwith an algebra over a coefficient domain Q(q, t) in two transcendental variables. One may hope for more since therepresentation theory of algebras is still developing.

E. Giannelli: The week in Banff has been extremely productive.

In my talk I presented a conjecture concerning the number of linear constituents appearing in the restriction of anirreducible character of a finite group to a Sylow p-subgroup (this is part of a joint work with Gabriel Navarro). Duringmy stay in Banff I discussed this problem with many researchers that gave me valuable ideas to prove that statement.In particular Michel Broue and Joan Tent suggested different but very promising strategies of proof. This might lead tofuture collaborations on the topic.

Moreover, I received considerable feedback and suggestions that led me to sensibly improve my analysis of thedecomposition into irreducible constituents of the permutation character obtained by inducing the trivial character ofa Sylow p-subgroup of Sn to the full symmetric group. The paper containing this study is a joint work with Stacey Lawand I have been working on it during my stay in Banff. This will be submitted for publication in the next few weeks.The comments of Gerhard Hiss definitely contributed to improving the above mentioned work.

J. Grodal: A belated thank you for a great meeting. I really enjoyed it.

It was hugely inspiring to hear the many different approaches to the ”standard” conjectures in modular represen-tation theory. Even for the talks which were further away from me, it was interesting to hear what sort of problemspeople were working with.

In terms of interaction, I spent a good deal of time talking to Raphael Rouquier about the conjectural role of loop

Page 258: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

254 Five-day Workshop Reports

groups in modular representation theory, and with David Craven (and Rouquier) about conjectural ways to approachBroue’s conjecture using subgroup complexes and homotopy theory. We’ve kept up the dreaming after the conference,but it is still very speculative.

Concretely, I also benefited from Robert Boltje’s talk, which seemed to suggest relations to things I’ve been thinkingabout earlier. I hope to continue the discussions with him, as well as his collaborators Kessar and Linckelmann atMSRI...

It was also nice to be able to quiz the assembled experts about points in the literature, large and small.

S. Koshitani: It is all the time a great pleasure to stay and research Mathematics in BIRS. I have had wonderfulexperience this time as well.

Actually I was able to discuss the joint work with Caroline Lassueur quite a lot, and therefore our research hashad a big improvement. We are researching Puig’s Finiteness Conjecture (PFC), that is a more precise conjecturethan Donovan’s conjecture and that can be stated if we replace “a Morita equivalence” by “a splendid Morita (= Puig)equivalence”. The co-worker and me both were suggested by Michel Broue to change (to uniform) the name to a splendidMorita equivalence instead of Puig’s equivalence. This was also very helpful.

I could discuss Morita equivalent blocks when our blocks have elementary abelian defect groups of order 9, withGerhard Hiss and Charles Eaton. This is related to Donovan’s Conjecture and is really one of the global-local conjec-tures in the representation theory of finite groups.

I have learned quite a lot from the many interesting and nice talks held during the week. Finally I really wouldlike to thank the BIRS for giving me such a wonderful opportunity for our research, and also for the four organizers.Hopefully I would like to come back again here in near future.

C. Lassueur: The BIRS workshop was very helpful to my current research. In particular a current project withS. Koshitani could be finished and discussions with R. Boltje and C. Eaton brought us some new ideas for future de-velopments of this project. Furthermore, the talk by R. Boltje was particularly relevant to my own research. It let meunderstand that a result I have with J. Thevenaz on the lifting of endo-p-permutation modules can be extended to mod-ules with fusion-stable endo-permutation sources with a possible application to the lifting of endo-permutation Moritaequivalences considered by Boltje–Kessar–Linckelmann through alternative arguments.

A. Maroti: Gabriel Navarro and I began to work on a conjecture of his which concerns the possibility of inducingan irreducible character of a nilpotent subgroup of a group to an irreducible character of the group. More precisely,he conjectures that if α is a complex irreducible character of a nilpotent subgroup of a finite group G such that αG isirreducible, then the generalized Fitting subgroup of G is nilpotent. We think that we managed to prove a partial resultin this direction. Namely, that if G is a group having the property that it has an irreducible character which can beinduced from an irreducible character of a nilpotent subgroup, then the Fitting subgroup of G is non-trivial or G isalmost simple. There is good evidence that perhaps the second part of the conclusion of this latter claim may be omitted.

At the conference I was happy to see Hung Nguyen for the first time after almost ten years. We had lots of conver-sations about mathematics. One of these was with Hung and Zoltan Halasi. We had partial results (joint work also withJames Cossey) on Gluck’s conjecture. This conjecture states that ifG is a finite solvable group, b(G) is the largest degreeof a complex irreducible character of G, and F (G) is the Fitting subgroup of G, then |G : F (G)| ≤ b(G)2. We had longconversation about how to prove this conjecture. One important special case is when a group has the formHV where Vis an irreducible and faithfulH-module and V can only be viewed over a field of size 2, 3, or 4. The difficulty in this caseis that there may not exist two vectors v and w in V such that CH(v) ∩ CH(w) = 1. This would have been convenientfor this latter identity implies that there is a vector u in V with |CH(u)| at most |H|1/2, and so there is a large orbit ofH on V . We tried many ways to get around this difficulty but so far it seems that we will aim to find a (single) vectorin Irr(V ) with a large H-orbit, at least in as many cases as it is possible, giving us an irreducible character of G = HVhaving large degree.

G. Navarro: I have now new joint projects with B. Spath and C. Vallejo on the reduction of the McKay-Galoisconjecture to finite groups, and several new projects with Pham Huu Tiep, Gunter Malle, Benjamin Sambale andAttila Maroti. Overall, I was impressed by the Rouquier talk. It is quite impressive to witness the announcement of aconjecture of this depth and of this impact. BIRS has become one of the best places to do mathematics and interact withour colleagues in the perfect environment. I want to thank BIRS for making our visit the most enjoyable and fruitfulone.

Page 259: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

New Perspectives in Representation Theory of Finite Groups 255

H. Nguyen: This was my first time participating in a workshop in BIRS and I had enjoyed every moment of it. I hada chance to discuss some topics and problems that I am particularly interested in with my collaborators. Maroti, ZoltanHalasi and I have a hours-long conversation on a possible completed proof of an old conjecture of Gluck on bounding theindex of the Fitting subgroup in terms of the largest character degree. I also discussed with Vallejo about the relationbetween the average of p′-degrees of characters in the principal block and the p-nilpotency of finite groups. Duringthe workshop, Tiep and I were able to finish an ongoing project on an improvement of the Ito–Michler theorem usingthe notion of average degree. I also had a chance to learn various new perspectives and trends in group representationtheory from other participants. I am especially interested in a very recent conjecture proposed by Giannelli and Navarroon the number of distinct linear characters in the restriction of an irreducible character of degree divisible by p to Sylowp-subgroups.

A. Schaeffer Fry: The BIRS workshop was extremely helpful to my current research. The talks were closely relatedto my own research and provided me with important updates and new insights to the area.

During the workshop, I made significant progress on a project with J. Taylor, which resulted in finishing and sub-mitting a paper. We also began discussing our next projects.

I also had very fruitful conversations with the organizers and other participants of the workshop. For example, G.Malle suggested a new direction for my research, which led to one of the new projects with Taylor. I spoke with A. Turullabout an ongoing project and agreed to keep in touch regarding the problem. I also discussed a potential project withJ. Brough, which we have since began work on.

B. Spath: The conference was very useful and inspiring. I could meet and discuss the progress of various inductiveconditions of the so-called global-local conjectures. It was very interesting to meet Paul Fong and Jiping Zhang. WithPaul Fong I discussed his recent refinements of his deep but older results on the Alperin weight conjecture for symmetricand finite general linear groups. Also Jiping Zhang presented several results of his in this direction. The discussionsclarified the current difficulties with the inductive conditions. Also it was useful to get informed on recent results ofJianbei An and Gerhard Hiss. Meeting these experts and discussing their on-going research project gave me the feelingthat a new approach to the inductive conditions will be needed. I started thinking about that I further discussed in thiscontext with Cedric Bonnafe the action on unipotent characters that might simplify the proof of a recent result jointwith Marc Cabanes.

Several talks showed the interest in the Galois refinement of the McKay conjecture. This encouraged me to start aproject with Gabriel Navarro and Carolina Vallejo in this direction.

The workshop was made very comfortable thanks to the BIRS center providing a very convenient family suite andbabysitter time for my family, Marc and our baby son Maurice. The breakthrough I may remember most is the factthat Maurice started to walk for good that very week in the corridors of Corbett Hall!

J. Taylor: This present meeting at Banff was one of the most useful meetings I have attended in my career thus far.After my talk on root data I had a lengthy and interesting discussion with Frank Lubeck about how one could implementthese theoretical ideas computationally. I hope this will result in something useful. My collaborator, Mandi SchaefferFry, and I managed to complete our current project on principal 2-blocks, which appeared on the arXiv shortly afterthe meeting. In discussions with Gunter Malle and Pham Tiep we came up with some new ideas on our current jointproject, which will hopefully be successful. Alex Turull shared with me his work on the Glauberman correspondenceand pointed me towards an open problem on Green Functions for finite reductive groups, which I am still thinkingabout. Finally, I was really encouraged by the number of younger participants at the meeting. I think this is a greattestament to the depth and fortitude of finite groups as an area of study.

P. Tiep: I have had fruitful discussions with G. Malle and J. Taylor on our ongoing joint project. We have alsostarted new joint projects with G. Navarro and with H. N. Nguyen.

A. Turull: I was very impressed by the organization, and how well the facilities were administrated. The scientificaspects of the conferences were likewise excellent. The topics of the conference had a focus on new perspectives in therepresentation theory of finite groups. It was interesting for me to see and hear what other experts on this are thinkingcurrently. There is currently progress on the many important conjectures in representation theory, and it was veryuseful for me to hear the details of their perspectives. I was particularly interested to hear some work on the NavarroConjecture for particular groups. I think this may lead to the proof of the stronger version of this conjecture that Iproposed for those same groups. I was also interested to discuss some new research avenues with some of the youngerparticipants. It was an excellent and productive experience all around.

Page 260: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

256 Five-day Workshop Reports

C. Vallejo: Thanks for making possible such a great week at BIRS. It has been an especially enjoyable workshopfor me. Despite not having given a talk, I have found it very fruitful. I have talked with Nguyen about the possibility ofextending some of the results he presented in his talk by considering the average of degrees of characters in the principalblock and not in the whole group. Also, I have had the opportunity to talk with Britta Spath about Navarro’s conjecture.She has explained to me a new way, that she recently discovered, of relating central isomorphisms of character tripleswith Galois action. This might lead to a reduction theorem for Navarro’s conjecture in a near future.

J. Zhang: Banff workshop was just excellent.

I talked with many participants and discussed with Fong, Broue, Koshitani, etc. on parametrizations of charactersof groups of Lie type related to the Alperin–McKay conjecture. In my Banff talk I mentioned our verification of theInductive Blockwise Alperin weight condition for PSL(3, q). However there exists a small inaccuracy in a result onradical p-subgroups we cited, however this is ok for G =PSL(3, q). The paper based on my Banff talk is now acceptedby J. Alg. Encouraged by the Banff workshop we will continue to verify further Inductive Conditions. During the Banffworkshop I also discussed with Kulshammer, Nguyen Hung, etc. about further research collaborations. Kulshammerwill visit us for a month next year, and Nguyen Hung and I will possibly organize a research program on characterdegree problems in 2018–2019.

All the workshop’s participants agreed that Banff lived up to its promises of a quiet, inspiring and very comfortableplace to make science. We are all very grateful to the BIRS for providing such excellent facility for discussing and doingmathematics, and hope to return some time in the future.

Participants

Boltje, Robert (University of California, Santa Cruz)Bonnafe, Cedric (Universite Montpellier)Broue, Michel (Universite Paris Diderot)Brough, Julian (Bergische Universitaet)Brunat, Olivier (Universite Paris Diderot)Cabanes, Marc (Universite Paris Diderot)Chaneb, Reda (Universite Paris Diderot)Craven, David (University of Birmingham)Danz, Susanne (Catholic University of Eichstaett-Ingolstadt)Eaton, Charles (University of Manchester)Farrell, Niamh (City University London)Fong, Paul (University of Illinois at Chicago)Giannelli, Eugenio (TU Kaiserslautern)Gramain, Jean-Baptiste (University of Aberdeen)Grodal, Jesper (University of Copenhagen)Halasi, Zoltan (Eotvos Lorand University)Hiss, Gerhard (RWTH Aachen University)Hollenbach, Ruwen (TU Kaiserslautern)Klupsch, Matthias (RWTH Aachen)Koshitani, Shigeo (Chiba University)Kulshammer, Burkhard (University of Jena)Lassueur, Caroline (TU Kaiserslautern)Lubeck, Frank (RWTH Aachen)Malle, Gunter (Technische Universitat Kaiserslautern)Maroti, Attila (Alfred Renyi Institute of Mathematics)Navarro, Gabriel (University of Valencia)Nguyen, Hung (University of Akron)Paolini, Alessandro (TU Kaiserslautern)Rizo Carrion, Noelia (Universitat de Valencia)Rossi, Dan (University of Arizona)Rouquier, Raphael (University of California at Los Angeles)

Page 261: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

New Perspectives in Representation Theory of Finite Groups 257

Ruhstorfer, Lucas (Bergische Universitat Wuppertal)Sambale, Benjamin (TU Kaiserslautern)Schaeffer Fry, Amanda (Metropolitan State University of Denver)Spaeth, Britta (University of Wuppertal)Taylor, Jay (University of Arizona)Tent, Joan (Universitat de Valencia)Tiep, Pham (University of Arizona)Trefethen, Stephen (College of William and Mary)Turull, Alexandre (University of Florida)Vallejo, Carolina (ICMAT)Zhang, Jiping (Peking University)

Page 262: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[Al] J. L. ALPERIN, Weights for finite groups. Pp. 369–379 in: The Arcata Conference on Representations of FiniteGroups, Arcata, Calif. (1986), Proc. Sympos. Pure Math. 47, Part 1, Amer. Math. Soc., Providence, 1987.

[AF] J. L. ALPERIN AND P. FONG, Weights for symmetric groups and general linear groups. J. Algebra 131 (1990),2–22.

[B] R. BRAUER, Representations of finite groups. Pp. 133–175 in: Lectures on Modern Mathematics 1, Wiley NewYork, 1963.

[Br] M. BROUE, Rickard equivalences and block theory. Pp. 58–79 in: Groups ’93 Galway/St. Andrews, LondonMath. Soc. Lecture Note Series 211, Cambridge Univ. Press, 1995.

[BMM] M. BROUE, G. MALLE, AND J. MICHEL, Generic blocks of finite reductive groups. Asterisque No. 212 (1993),7–92.

[CT1] J. F. CARLSON AND J. THEVENAZ, The classification of endo-trivial modules. Invent. Math. 158 (2004), 389–411.

[CT2] J. F. CARLSON AND J. THEVENAZ, The torsion group of endotrivial modules. Algebra Number Theory 9 (2015),749–765.

[D1] E. C. DADE, Counting characters in blocks. I. Invent. Math. 109 (1992), no. 1, 187–210.[D2] E. C. DADE, Counting characters in blocks. II. J. Reine Angew. Math. 448 (1994), 97–190.[Di] R. DIPPER, On the decomposition numbers of the finite general linear groups, Trans. Amer. Math. Soc. 290

(1985), 315–343; II, 292 (1985), 123–133.[GH] J. GRUBER AND G. HISS, Decomposition numbers of finite classical groups for linear primes. J. Reine Angew.

Math. 485 (1997), 55–91.[IMN] I. M. ISAACS, G. MALLE, AND G. NAVARRO, A reduction theorem for the McKay conjecture. Invent. Math.

170 (2007), 33–101.[MS] G. MALLE AND B. SPATH, Characters of odd degree. Ann. of Math. 184 (2016), 869–908.[Mc] J. MCKAY, Irreducible representations of odd degree. J. Algebra 20 (1972), 416–418.[N] G. NAVARRO, The McKay conjecture and Galois automorphisms. Ann. of Math. (2) 160 (2004), 1129–1140.[NT1] G. NAVARRO AND PHAM HUU TIEP, A reduction theorem for the Alperin weight conjecture. Invent. Math.184

(2011), 529–565.[NT2] G. NAVARRO AND PHAM HUU TIEP, Characters of relative p′-degree with respect to a normal subgroup, Ann.

of Math. 178 (2) (2013), 1135–1171.[NTT] G. NAVARRO, PHAM HUU TIEP, AND A. TURULL, p-rational characters and self-normalizing Sylow p-

subgroups. Represent. Theory 11 (2007), 84–94.[NTV] G. NAVARRO, PHAM HUU TIEP, AND C. VALLEJO, Brauer correspondent blocks with one simple module. Trans.

Amer. Math. Soc. (to appear).[S1] B. SPATH, A reduction theorem for the Alperin-McKay conjecture. J. reine angew. Math. 680 (2013), 153–189.[S2] B. SPATH, A reduction theorem for the blockwise Alperin weight conjecture. J. Group Theory 16 (2013), 159–

220.[S3] B. SPATH, A reduction theorem for Dade’s projective conjecture. J. Eur. Math. Soc. 19 (2017), 1071–1126.

258

Page 263: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 23

Automorphic Forms, Mock ModularForms and String Theory (17w5097)

October 29 - November 3, 2017Organizer(s): Terry Gannon (University of Alberta), David Ginzburg (Tel Aviv Univer-sity), Axel Kleinschmidt (Max Planck Institute for Gravitational Physics), Stephen D.Miller (Rutgers University), Daniel Persson (Chalmers University of Technology), BorisPioline (LPTHE)

Overview

Automorphic forms in string theory

Over the last years there have been the beginnings of cross-interactions and cross-fertilizations between the theory ofautomorphic representations and string theory. The well-understood connections are mainly restricted to cases that instring theory go under the name of BPS-protected couplings with at least 8 residual supersymmetries. Such couplingsarise for example in the lowest orders in a low-energy expansion of certain string theory scattering amplitudes. Thesecan be represented by terms in an effective low-energy action that is of the schematic form

Seff =

∫M

dµ(R+ E(0,0)R

4 + E(1,0)∇4R4 + E(0,1)∇6R4 + . . .)

(23.0.1)

where R, R4, ∇4R4 and ∇6R4 denote certain invariants built from curvature tensors (or their covariant derivatives)on space-time M . The expansion of the effective action is ordered by the number of derivatives (any curvature tensorhas two derivatives) and since derivatives translate into momentum p after Fourier transform, and hence energy byEinstein’s relationE = p2, this is an expansion with later terms becoming more important when more energy is involvedin an interaction.

The coefficient functions E(p,q) appearing in the effective action (23.0.1) are constrained to be automorphic functionsas functions of the so-called string theory moduli. For type II string theory on toroidal backgrounds the moduli live onRiemannian symmetric spaces based on split real groups and the automorphy groups are the corresponding Chevalleygroups. Supersymmetry (a.k.a. BPS-protection) requires the coefficient functions E(p,q) to satisfy differential equationsand these differential equations can be translated into ideals of the universal envelopping algebra. This is the link toautomorphic representations that are also characterised by being annihilated by ideals in the universal enveloppingalgebra.

259

Page 264: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

260 Five-day Workshop Reports

This link was reviewed in the first introductory talk of the conference by Michael Green and further elaboratedon in the talk by Guillaume Bossard. For example, the first coefficient function E(0,0) is associated with the minimalrepresentation while the function E(1,0) is associated with the so-called next-to-minimal representation.

The cross-fertilization now arises from the fact that (i) having identified the right automorphic representations al-lows the use of representation theory to deduce constraints on the associated physics that is typically revealed by consid-ering the Fourier expansion of the automorphic function. Moreover, (ii) the analysis of the effective action (23.0.1) alsogives important impetus for mathematical research since the ideal that is determined by the coefficient function E(0,1) ofthe ∇6R4 term is not of any type normally considered in the theory of automorphic representations. In mathematicalterms, its maximal orbit in the wave-front set is not a single nilpotent orbit. This is a phenomenon that is not expectedto arise in the theory of automorphic functions and can be traced back to E(0,1) not being finite under the action of thecenter of the universal envelopping algebra. In physical terms, this new feature is due to the fact the BPS-protectionof this term is reduced to only four supersymmetries for which many new features arise (like Kontsevich–Soibelmanwall-crossing).

Both aspects (i) and (ii) have been discussed in talks at the workshop and also in many lively discussions outside thelecture hall. In particular, the generalisation of the standard definition of automorphic representations that is suggestedby (ii) was the subject of numerous discussions. Approaches discussed for understanding the associated automorphicforms were

• Spectral methods

• Poincare series

• Explicit constrained lattice sum constructions

The automorphic forms appearing (23.0.1) are associated with space-time dualities and, in the instances describedabove, are related to non-perturbative effects in the string coupling. In a different vein, one can also study string theoryin perturbation theory. This is the theory of computing certain integrals over the moduli spaces of (punctured) Riemannsurfaces that represent the world-sheet of the string. The integrands of these integrals are determined by conformal fieldtheory correlation functions and become complicated quickly as the genus and the number of punctures of the Riemannsurface increases. Again, one is often restricted to a low-energy expansion.

One case of particular interest is when the world-sheet of the string is torus, i.e. of genus one. The moduli space ofall such tori is SL(2,Z)\H whereH is the Poincare upper half-plane describing the ratio of the two periods of the torus.The integrand must be a doubly periodic function on the torus, in other words an automorphic function of SL(2,Z).As mentioned above, this automorphic function is determined by a field theory correlator that is in general not knownin closed form. Performing, however, a low-energy expansion one can (often) represent the integrand a given order inthis expansion in terms of a graph on the torus world-sheet where the punctures are connected by lines. The associatedintegrands have been dubbed modular graph functions and have recently a lot of attention as discussed in the talks ofMichael Green and Eric D’Hoker.

Modular graph functions on the torus are interesting since they contain interesting elliptic multiple zeta values whenone considers certain degeneration limits of the torus. (Elliptic) multiple zeta values represent an active current researchfield in number theory, in particular their single-valued projection at the one-loop level. The modular graph functionsthat contain them satisfy interesting systems of differential equations that can sometimes be used to extract relationsbetween modular graph functions and hence between the multiple zeta values they contain. A full understanding of thisis an open problem and was one of the topics discussed at the conference.

Automorphic representations

Mathematically, the functions E(0,0) and E(1,0), associated with R4- and∇4R4-couplings, are attached to certain specialautomorphic representations which have unusually small functional dimension. Such small automorphic represen-tations are usually representations of metaplectic groups. These small representations proved important in numbertheory as well as physics, where for example integral representations of L-functions, from Shimura’s 1975 constructionof the symmetric square L-function through constructions such as recent work of Cai-Friedberg-Ginzburg-Kaplan. Theprocess of unifying linear algebraic groups and their covers leads to helpful new insights, often tending in the directionof quantum groups. An example is the representation of Fourier coefficients of Eisenstein series as “Tokuyama formu-las” that are sums over crystal bases. Very recently work of Brubaker, Buciumas, Bump and Friedberg and others has

Page 265: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Automorphic Forms, Mock Modular Forms and String Theory 261

revealed further unexpected relations between quantum groups and automorphic forms on metaplectic groups. In par-ticular, the Fourier coefficients of metaplectic Eisenstein series are multiple Dirichlet series whose local factors can beinterpreted as partition functions of supersymmetric solvable lattice models. Functional equations of such Fourier co-efficients, which were studied by Kazhdan and Patterson and by Chinta and Gunnells can be interpreted as R-matricesof quantum groups. These ideas have equal origins in number theory and physics.

Approach of the workshop

Developments in recent years have made it clear that automorphic representations provide a crucial ingredient in ourunderstanding of non-perturbative aspects of string amplitudes. With the advent of Umbral moonshine we have alsoseen new cross-fertilisations between string theory, conformal field theory, finite groups and mock modular forms.These developments provided ample motivation for having a new channel of communication between researchers inboth communities, and the workshop succeeded in doing precisely this.

One of the main goals of the workshop was to provide an opportunity for interaction between mathematiciansand physicists working on different aspects of automorphic forms, mock modular forms and string theory. The talkswere selected in such a way as to give maximum exposition to current research topics from these fields and the livelydiscussions during the talks and in the breaks gave us the impression that our concept worked. The feedback we receivedfrom individual participants confirmed this impression and we think that the goal of creating new research ideas andprojects at this exciting interface between mathematics and physics was certainly met.

The workshop brought together many researchers interested in similar questions and the talks and discussions atthe workshop revealed many common questions that need to be researched further. There are some new collaborationsthat were initiated at the workshop as well as some ongoing collaborations that were propelled further as a consequenceof the workshop. We felt the workshop was very successful and the academic concept “works” such that we plan tocontinue similar events in the future. A follow-up program will take place at the Simons Center during the springof 2019 and we are aware of number of mutual visits between participants to further advance on research questionsidentified during the workshop.

The workshop was organized into 4 overall themes, and each theme was introduced during a 1-hour lecture andfollowed by a selection of shorter research seminars spread throughout the week.

Themes and talk descriptions

Automorphic forms in string theory

The keynote speakers were Michael B. Green and Eric D’Hoker, who each gave a one hour lecture. The first lecturereviewed two main instances of where automorphic forms and automorphic functions occur in string theory, namely thelow-energy expansion of graviton amplitudes in a U-duality invariant framework with automorphic forms on exceptionalduality groups and, secondly, the integrands of string scattering amplitudes at a fixed loop order. This latter subject hasmainly been developed at one-loop order corresponding to toroidal world-sheets and automorphy group SL(2,Z). Thiswas reviewed in the talk by D’Hoker and he also explained recent work on the extension two loops and Sp(4,Z) andeven higher loops, work that was considerably extended at the BIRS workshop (see outcome section).

Guillaume Bossard discussed a new approach to the automorphic couplings appearing in the low-energy expansion.This approach is based on considering so-called exceptional field theory amplitudes at loop order where certain classesof supersymmetric excitations (BPS states) circulate in the loop. The resulting sum over such states leads to automorphicforms expressed through constrained lattice sums, different from the usual coset sum description of Langlands. Thisapproach also provides expressions for automorphic functions that violate the usual Z-finiteness condition. Discussionsbetween physicists and mathematicians following this talk led to the proof of a conjecture of the rewriting of orbit sums.

Jeff Harvey presented his recent work with G. Moore on subtleties in the way Weyl reflections act in toroidal com-pactifications. The hitherto unnoted feature is that the combination of the usual geometric action of the reflection groupwith an action on the states leads to a cover (eight-fold in cases) of the standard Coxeter group.

Page 266: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

262 Five-day Workshop Reports

Automorphic representations

The keynote speaker was Gordan Savin who gave an overview of the field of small automorphic representations. Globaluniqueness of the minimal representation was recently shown by Kobayashi and Savin. He also explained the ubiquityof Weissman’s construction of the Fourier-Jacobi functor for the minimal representation. As already mentioned the im-portance of the minimal representation in string theory has become increasingly clear in recent years, thereby creatinga surge of interest in the field also from physics.

Marie-France Vigneras reported on here recent results on the existence of supercuspidal p-adic representations forG2, Dn and En, a project that was initiated as a direct result of her preparation for the workshop.

Dihua Jiang presented recent progress on the theory of endoscopic classification. He reviewed his conjecture withZhang on the large cuspidal spectrum, called the Global Large Cuspidal Packet Conjecture (GLCP-conjecture), and itsramifications.

Birgit Speh discussed her work on symmetry breaking of infinite-dimensional representations which deals withreal representations of a group G and their restriction to a subgroup H ⊂ G, in particular problems with estimatingmultiplicities of representations of H in the case of infinite-dimensional representations. The main focus was for thecase O(n, 1) ⊂ O(n+ 1, 1) and unitary representations.

Henry Kim reported on his joint work with Yamauchi on Ikeda-type lifts. Using the Ikeda-type lift they constructa higher level cusp form on E7,3 from any Hecke cusp form whose corresponding automorphic representation has nosupercuspidal local components.

Ben Brubaker gave a general overview on the connection between quantum groups and Whittaker functions ofmetaplectic Eisenstein series. Daniel Bump then proceeded to report on recent joint work with Brubaker, Buciumasand Gray. Whittaker functions on the n-fold metaplectic cover ofGL(r) over a nonarchimedean local field were studiedby Kazhdan and Patterson, who computed the scattering matrix of the intertwining integrals on the Whittaker models.It was shown in 2016 by Brubaker, Buciumas and Bump that this scattering matrix coincides with the R-matrix of aquantum group, a twist of quantum affine U√q(gl(n)), where q is the residue cardinality. Moreover, they showed thatthe spherical Whittaker functions could be expressed as partition functions of solvable lattice models, whose internalstructure is related to the quantum affine Lie superalgebra U√q(gl(n|1)). In recent work, Brubaker, Buciumas, Bumpand Gray proved that a second solvable lattice model has the same partition function using Yang-Baxter equations.

Automorphic forms on Kac-Moody groups

The keynote speaker was Manish Patnaik who reviewed the basic notions of Eisenstein series on loop groups, empha-sising the different versions (positive, negative, ...) that exist in the Kac–Moody case whereas they coincide in the usualfinite-dimensional Lie group case. He proved convergence results for these Eisenstein series in the number field andfunction field case, building on previous work by Garland. Another interesting aspect was the discussion of the affineversion of the Weil representation appearing in the theta functions and a corresponding Siegel–Weil theorem.

Kyu-Hwan Lee presented his work with Carbone and Liu on Eisenstein series on more general Kac–Moody groups,focussing on the results they have obtained for the rank 2 hyperbolic case. These results include convergence, functionalrelations, constant term formulas and holomorphy results for cusp forms.

Alexander Braverman presented his work with David Kazhdan on the affine Tamagawa number formula. The goalof this work is to provide an affine generalization of Langlands calculation of the volume of the fundamental domainG(Z)\G(R)) for G a split semi-simple simply-connected Lie group. Their proposal is to do this by computing thecohomology of the moduli space of G-bundles for G a loop group over a function field.

Indefinite theta series, moonshine and black holes

The keynote speaker was Stephen Kudla, who reported on the recent flurry of activity on theta series associated withindefinite quadratic forms. There is an extensive theory of theta series for positive definite quadratic forms, and con-nections with representation theory through theta correspondences and minimal representations. Sparked by resultsof Zwegers in relation with mock modular forms, and generalizations in string theory due to Alexandrov, Banerjee,

Page 267: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Automorphic Forms, Mock Modular Forms and String Theory 263

Manschot and Pioline, a general theory of indefinite theta series is now emerging. Kudla explained his recent resultswith Jens Funke which shows that indefinite theta series can be constructed using the method of Kudla-Millson.

Roberto Volpato reported on his work with Natalie Paquette and Max Zimet on counting 1/4-BPS states in N = 4string theory. The generating function of such BPS-states is famously given by the reciprocal of the Igusa cusp form Φ10

for Sp(4;Z). Their results combine consistency conditions on Φ10 with wall-crossing in string theory and has potentialrelevance for our understanding of Mathieu moonshine.

Katrin Wendland explained recent progress in constructing a module for Mathieu moonshine using K3 sigma mod-els. She reported on her results on the refined elliptic genus, and the work of Song on the cohomology of chiral de Rhamcomplex. The conclusion is that the generic space of states of K3-theories is modelled by this cohomology, and appearsto agree with what is expected from a Mathieu moonshine module.

Martin Raum reported on recent work with Michael Mertens on the skew-Maass lift. This is a generalization ofthe classical Maass lift to skew-holomorphic Jacobi forms. Part of the motivation for this work comes from recentdevelopments in umbral moonshine.

Christoph Keller reported on a series of works jointly with several people, including Belin, Maloney, Muhlmann,Castro, Gomes, Kachru and Paquette. This concerns an in-depth analysis of the spectrum of permutation orbifolds forgenus 1 and 2 conformal field theories. They investigate relations with black holes, Siegel modular forms and emergentspacetime.

Miscellaneous

Thomas Creutzig explained his recent work with Davide Gaiotto, which gives a new class of vertex algebras related byS-duality in GL-twistedN = 4 super-Yang-Mills. These vertex algebras arise in the intersection of pairs of 3d boundaryconditions in the 4d theory.

Siddhartha Sahi presented his work on multivariate hypergeometric functions for tube domains. This generalizesthe classical theory of hypergeometric functions to functions with matrix arguments. This theory is related to the theoryof Jack- and Macdonald-polynomials.

Howard Garland presented work in progress on the cohomology of arithmetic groups and relations to an arithmeticgeneralization of Riemannian curvature. He offered speculations on applications to automorphic forms and higher-derivative corrections in string theory.

Hermann Nicolai explained a surprising connection between conjectured Kac–Moody symmetries of N = 8 supe-gravity in D = 4 space-time dimensions and the standard model. Breaking supersymmetry (by an unknown mecha-nism) it is possible to assign quantum numbers under an SU(3)×U(1) group to theN = 8 fermions that exactly matchthe corresponding quantum numbers in the standard model. This subgroup requires the extension to the conjecturedKac–Moody symmetry K(E10) and cannot be accommodated in the standard R-symmetry SU(8).

Outcome of the Meeting

Publications linked to the workshop

Modular graph functions can also be defined in principle for higher genus Riemann surfaces, with additional subtletiesas was explained in the talk by Eric D’Hoker. Some of the ideas he sketched regarding higher string invariants wereclarified in subsequent discussions and the results have now been written up and published online as [1]. Anotherpreprint that was an immediate consequence of the workshop was [2] by Marie-France Vigneras who analysed certainsupercuspidal p-adic representations.

Page 268: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

264 Five-day Workshop Reports

Comments from workshop participants

“It is my view that mathematics has two very different main sources: physics and number theory. In automorphic forms, thehistorical development has been from number theory. Yet automorphic forms also appear in physics. So the Langlands programis a meeting ground for ideas coming from two different directions. I felt that the workshop was extremely helpful in gettingnumber theorists and physicists together for a common cause. The format of mainly 40 minute talks with a few survey lecturesworked very well.”

- Daniel Bump

“Many thanks again for organizing the recent BIRS workshop, and for inviting me. I enjoyed my stay very much, and I learneda lot.

I had discussions with Birgit Speh and with Martin Raum which may or may not lead to new collaborations.... New ideasare abundant after the worskhop; particularly the discussions with Roberto Volpato and with Jeff Harvey have helped.”

- Katrin Wendland

“Thank you organizing an inspiring conference. I found it useful to learn which examples and groups are useful. I intend tolook into automorphic forms for SO(4, 4) and SO(4, 20).”

- Birgit Speh

“I enjoyed the workshop a great deal. It was very exciting to hear from so many distinguished physicists about the way thatautomorphic forms appear in the study of string scattering amplitudes, and to hear from my colleagues in automorphic formsabout their latest progress. And the connections to VOAs and mock modular forms were very striking; this is an area I wouldappreciate the chance to learn more about.

The lectures were quite interesting and generally of very high quality. Let me add that I have already watched both Fridaylectures on-line, as I was not able to stay for them.

For specifics, Guillaume Bossard asked me some interesting questions, and I hope this will be the start of more discussions.I spent time talking to Martin Raum, with whom I am collaborating, and this was a helpful conversation in terms of movingour project forward. I spoke with Ben Brubaker and once again set in motion next steps on further joint work. I had a helpfulconversation with Gordan Savin which followed up a prior conversation from our last meeting in Boston.

I had some helpful discussions with Daniel Persson, Axel Kleinschmidt and Dan Bump concerning future steps in facilitat-ing collaborations between mathematicians and physicists.”

- Solomon Friedberg

“ I really enjoyed the workshop and would like to thank all of you for your hard work.

It was fantastic to have one half of the talks on physics and the other half on mathematics so that we could see the commongrounds and interactions of the two disciplines. The organization of talks was very coherent, focusing on one or two themeseach day. I could learn a lot each day.

I hope that there may be more workshops and conferences like this one in the future.”

Page 269: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Automorphic Forms, Mock Modular Forms and String Theory 265

- Kuy-Hwan Lee

“ First of all: congratulations on putting together a superb workshop in Banff. There was a good balance between time fortalks and free time to discuss, especially since everyone was physically staying in the same place, sharing lunch and dinner.

I did not initiate any new collaborations, but got a lot done with present collaborators Boris Pioline and Michael Green onone collaboration, as well as with Piotr Tourkine on another collaboration.

Having also attended last year’s workshop at Simons in Stony Brook, I think this year’s workshop had a more balanced setof topics and interests than last year’s. The rather informal but efficient organization was perfect for me. Let me add that I viewthe Banff Center as an *ideal* place for such meetings, so any time you care to invite me again, I will be there !”

- Eric D’Hoker

“My recent visit to BIRS was a great help in my research. I learned things from the talks of course and met several peoplewhose work I knew but whom I had never met personally including Stephen Kudla and Marie-France Vigneras. Most impor-tantly, I had conversations with Martin Westorholt-Raum and with Thomas Creutzig that had a direct bearing on a problem Iwas struggling with in a current research project. My conversations with them led me to a new approach and made me familiarwith parts of the literature that I would have had difficulty finding on my own. This is exactly the sort of thing one hopesmight happen at an interdisciplinary meeting like this. Thank you so much for providing such a pleasant atmosphere for theseproductive interactions.”

- Jeff Harvey

Participants

Berg, Marcus (Karlstad University)Bossard, Guillaume (Polytechnique, CNRS)Braverman, Alexander (University of Toronto)Brubaker, Benjamin (University of Minnesota)Bump, Daniel (Stanford University)Cosnier-Horeau, Charles (Jussieu Paris 6)Creutzig, Thomas (University of Alberta)D’Hoker, Eric (UCLA)Friedberg, Solomon (Boston College)Fuchs, Jurgen (Karlstad University)Gannon, Terry (University of Alberta)Garland, Howard (Yale University)Gerken, Jan (Max-Planck Institute for Gravitational Physics, Potsdam)Gourevitch, Dmitry (Weizmann Institute of Science,)Green, Michael (Cambridge University)Gurevich, Nadya (Ben Gurion University)Gustafsson, Henrik (Chalmers University of Technology)Harvey, Jeff (University of Chicago)Hohenegger, Stefan (Universite Claude Bernard Lyon 1)Jiang, Dihua (University of Minnesota)Keller, Christoph (ETH Zurich)Kim, Henry (Toronto)Kleinschmidt, Axel (Max Planck Institute for Gravitational Physics)Kudla, Stephen (University of Toronto)Lee, Kyu-Hwan (University of Connecticut)

Page 270: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

266 Five-day Workshop Reports

Leslie, Spencer (Boston College)Nicolai, Hermann (AEI Potsdam)Patnaik, Manish (University of Alberta)Persson, Daniel (Chalmers University of Technology)Pioline, Boris (LPTHE)Raum, Martin (Chalmers University of Technology)Sahi, Siddhartha (Rutgers University)Savin, Gordan (University of Utah)Sayag, Eitan (Ben Gurion U)Segal, Avner (University of British Columbia)Speh, Birgit (Cornell university)Tourkine, Piotr (University of Cambridge)Vigneras, Marie-France (Jussieu Paris 7)Volpato, Roberto (Padua University)Wendland, Katrin (Albert-Ludwigs-Universitat Freiburg)Zhang, Zhuohui (Rutgers)

Page 271: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] E. D’Hoker, M. B. Green and B. Pioline, Higher genus modular graph functions, string invariants, and their exactasymptotics, arXiv:1712.06135 [hep-th].

[2] M.-F. Vigneras, Existence of supersingular representations of p-adic reductive groups, arXiv:1712.10142 [math.NT].

267

Page 272: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 24

Forest and Wildland Fire Management: aRisk Management Perspective (17w5145)

November 5 - 10, 2017Organizer(s): Douglas G. Woolford (University of Western Ontario), David A. Brillinger(University of California at Berkeley), David L. Martell (University of Toronto), FredericP. Schoenberg (University of California at Los Angeles), B. Mike Wotton (University ofToronto/Canadian Forest Service)

Modern wildland fire management poses many challenging interdisciplinary problems that link forest ecology, forestmanagement, computer science, physics, mathematics, statistics, industrial engineering and operations research. Firemanagement was originally driven by a desire for fire exclusion. However, over the past century it has evolved from anactivity to an emerging paradigm of risk management, focused on multiple competing values and risks.

Throughout its history in Canada, fire managers have always looked to the best science available for assistance incharacterizing important and highly variable aspects of their fire environment and for providing key intelligence toinform their decision making. As wildland fire management has evolved, so too has the range of scientific disciplinesneeded to support decision making deepened to include many more quantitative disciplines. This BIRS workshop thatbrought together a mix of researchers and end-users was the ideal forum for furthering ongoing collaborative initiativesand for developing new ones.

The workshop focused on the design and implementation of science-backed, data-driven forest fire managementdecision support tools and their application by end-users (i.e., forest and wildland fire managers). Three main areaswere targeted: (1) The science underpinning decision support tools for fire management, (2) Fire regime modelling, and(3) Bridging gaps between researchers and fire managers. Emphasis was placed on a data science based approach foraddressing emerging questions at the forefront of fire science and fire management. Representatives from fire manage-ment agencies discussed their perspectives and needs as well as provided examples of operational decision support toolscurrently in use.

This report provides background on the field of quantitative wildland fire science and decision support for firemanagement with a particular emphasis on the outcomes of the 5-day workshop at BIRS and ongoing related initiatives.

268

Page 273: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Forest and Wildland Fire Management: a Risk Management Perspective 269

Overview of the Field

Decision Support Tools for Fire Management

In order to make progress on fire management objectives, mathematical and statistical expertise will need to be broughtto bear on the issues of uncertainty which complicate operational planning. There is a need to deliver “the right amountof right fire at the right place at the right time at the right cost”. For example, what is the best strategy for an individualairtanker? Is it better to have it return to its original home base where it was deployed at the start of the day, the basefrom which it was dispatched to its most recent fire after completing service on that fire, or should it fly to the basenearest to that fire? Although progress has been made on this kind of problem using tools developed by operationsresearchers, there is a very urgent need to develop and test better predictive fire occurrence models, especially ones thatincorporate the clustering of arrivals which can overwhelm agencies. There is also a need to further our understandingof the initial attack system, including estimating the probabilities for same day initial attack, and what factors contributeto initial attack success since the larger project fires that result when the initial attack system fails are a large drain oncrews and equipment and incur very large costs.

One strength of a probability-based model is that the variability in the data will be reflected in the standard er-rors associated with predictions or fitted values. However, one of he many challenges faced by fire managers and theresearchers that collaborate with them, is the difficulty of conveying such uncertainty to the users of the modellingproduct. Two types of uncertainties need to be addressed directly: 1) the standard error of the predicted probabilityand 2) the fact that predicted probabilities have an inherent uncertainty in the sense that although the model returns aprobability, the event of interest is binary. How can these uncertainties best be incorporated in decision support toolsaimed at informing fire managers?

Other key risk management issues to be discussed include harvest scheduling optimization, optimal allocation ofscarce fire suppression resources, and examining effects of changes in fire detection management and technology.

Fire Regime Modelling

The economic and social impacts of wildfire in Canada raise important challenging statistical questions. Once an areahas burned, does the most recent fire reduce the surrounding area’s risk of burning and if so, to what degree and forhow long and can such effects be replicated by active management of forest fuels? What ecological and forest manage-ment strategies are needed to maintain the current landscape mosaic and levels of biodiversity while protecting people,property and other values at risk? There is an interest in developing methods for simulating seasonal boreal forest fireregimes, comprising frequency, size, and severity, noting that there is a strong need to “backcast” characteristics of pastfire regimes if there had not been any fire suppression. In this context, the focus involves shifts to modelling aggregatebehaviour of fire regimes, say over an entire year and across a large study area (e.g., a province), rather than modellingat fine scale resolutions. Historical records of individual burn scars for many fire seasons are now becoming availablein digital format. Fires can be viewed as realizations of a marked spatio-temporal point process (where the points arethe ignitions and the marks are the characteristics of each individual fire, such as its duration, intensity, size, and soon). The fire ignition records combined with the burn scar data can be viewed as approximately independent, replicatedpoint pattern data. Important topics for discussion include developing tools for exploring and visualizing these data, forinvestigating inhibitory impacts of burn scars on the future intensity of points in that area, and for determining whethersuch marks are separable, and if not, how to jointly model points and marks.

Bridging Gaps Between Researchers and Fire Managers

Many factors, including climate change, changing land use patterns, fuel build-ups, government fiscal realities andthe fact that fire is a natural ecosystem process contribute to the emergence of more and increasingly complex andchallenging fire loads. Hence, there is a growing and urgent need for decision support systems that fire managerscan use to enhance their planning and decision-making processes. The development of such decision support systemscalls for bringing experts from the mathematical and decision sciences together with ecologists, fire scientists and firemanagers to develop collaborative team-based efforts to address such problems.

It is also recognized that it is simply not enough to produce advances in science. To have a significant impact,

Page 274: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

270 Five-day Workshop Reports

potential end-users of research results must be able to interpret and trust the output from these models. Tools areneeded to aid a manager who, with little lead time, must resolve complex decisions under considerable uncertaintyand be responsible for the outcomes of such decisions. For example, fire managers need to know about the potentialfor periods of extreme fire activity–situations when the fire management system can become overwhelmed. To planeffectively, they need tools that capture the scientific understanding of what causes multiple occurrences of large (andexpensive) fires and hence, allows them to quantify the probability that multiple regions will be overwhelmed at the sametime, preventing one region from assisting another region through intra and inter-agency resource sharing. Makingaccurate forecasts of such events requires models for the stages of the “lifetime” of a fire together with models whichcan predict where fires will be ignited; such knowledge can lead to improved prediction as to how fire load changes bothspatially and temporally.

Presentation Highlights

The objective of the workshop was to bring together researchers and end-users from a variety of disciplines (e.g., op-erations research, statistics, actuarial science, mathematics, computer science, ecology, forestry and the environmentaland health sciences) to provide a forum to discuss fire management needs and to initiate interdisciplinary team-basedcollaborations aimed at addressing important problems in forest fire management. Several themes were spread acrossthe 5 day workshop, an overview of highlights follows below.

Monday Morning: A High-Level Overview

In the opening session senior managers from two Canadian forest fire management organizations presented theiragency’s perspectives on the challenges they have experienced in recent years and the problems they would like re-searchers to address. Wally Born, the Executive Director of Wildfire Management Branch of the Province of Alberta’sAgriculture and Forest ministry discussed some of their challenges moving forward from the Fort MacMurray wildfirein May of 2016. He placed wildfire management within the context of its modern role of having to balance multiple val-ues on the forest landscape: 97% commitment of land with commercially viable forest for harvest, species as risk (withcaribou as a species of major public interest), 500 forest communities in their protection zone and major industrial de-velopment and infrastructure. Some of the key questions he identified as being highly relevant to his organization were:What resources are needed over the next 10 years (and moving forward) to maintain the performance of the organiza-tion, particularly within increasing population and commitments of land; how much of what kind of fuel managementof mitigation effort around communities will be needed to produce a measureable impact in reducing risk, suppressionneeds and costs; how do we set and define performance measures, in particular for the prevention program.

Tony Falco a senior manager from the Province of British Columbia’s (BC) Wildfire Management Branch gave thegroup a detailed update on the unprecedented 2017 fire season in BC and the multiple challenges that the province hadto deal with. Since that fire season was just wrapping up at the start of the workshop, Tony pointed out that therewere still many after action reviews to be carried out; however he identified some of their ongoing research prioritiesas understanding fuels mitigation and maintenance fr risk reduction as well as general fuel typing issues with BC foresttypes.

This morning session was wrapped up by a panel discussion lead by Al Tithecott (former Director of Ontario Min-istry of Natural Resources and Forestry’s Aviation, Forest Fire, and Emergency Services (OMNRF-AFFES) branch).The panelists included Wally Born, Tony Falco as well as Rob McAlpine (Manager of Wildfire Response and Opera-tions, OMNRF-AFFES). Discussion with the workshop group included greater elucidation on research needs identifiedin the formal presentations as well as other topics and challenges facing wildfire management organizations.

Monday Afternoon: Fire Operations

The afternoon session built on the high level perspective on wildfire management challenges presented by the senior op-erational managers in the morning. This session brought the discussion to a more operational level to provide examplesof some specific research needs and solutions being developed currently. David Martell from the University of Torontoopened the afternoon, placing his decision support focussed research activity into a framework of data analytics familiarto many today. Examples of completed and ongoing research that fit into the framework of Descriptive, Predictive and

Page 275: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Forest and Wildland Fire Management: a Risk Management Perspective 271

Prescriptive Analytics were presented and some of the opportunities and challenges of carrying out such research withoperational agencies were discussed. Key among these was was the critical need for researchers to develop close collab-orative relationships with fire managers to support not only the flow of the data they need for their analytics researchbut also, critically, to help them develop a sound understanding of the data and the caveats surrounding its observationand overall reliability. The session then included presentations from fire science specialists from the fire managementagencies in Ontario highlighting how the needs of operational decision maker are changing. To be able to managementwildfire in the future it is anticipated that a more defined risk management structure will be needed.

Keeping focused on bridging gaps between researchers and fire managers and with the objective of furtheringcollaborative research efforts, day one of the workshop wrapped up with a discussion of both the morning and afternoonsessions. Some key fire management agency needs and research opportunities were identified. This discussion focussedon the information and tools needed for risk analysis, risk reduction strategies and evaluating the effectiveness of thesuite of potential mitigation, preparedness, response and recovery strategies that are available.

Tuesday Morning: Operations Research

The morning session of day two sought to highlight mathematical approaches and solution to problems using the Oper-ations Research/Analytics methods that David Martell had spoken about on the previous day.

The session began with a keynote address by Mikael Ronnqvist of Laval University, who presented a talk enti-tled “Calibrated Route Finder: Improving the Safety, Environmental Consciousness, and Cost Effectiveness of TruckRouting in Sweden”, describing how he and his colleagues had used operations research methods to develop their award-winning research on transportation logistics in the forest sector in Sweden. This talk provided some concrete examplesof how modern analytics research methods can be exploited to develop decision support systems that had very significantdemonstrated award-winning impacts on the forest resource sector and other sectors other than fire management.

Yu Wei from Colorado State University presented a talk entitled “Building operations research models to improveour ability to address uncertainties in wildland fire management” to share his current and past experience at bringingadvanced mathematical programming methods to bear on fire management problems in the United States.

Matthew Thompson of the US Forest Service presented a talk entitled “Optimizing fire management strategies onthe basis of risk and control opportunities” in which he described how he and his colleagues are exploiting analytics todevelop spatially explicit decision support systems that fire managers can use when they develop and evaluate strate-gies for managing fires that have both beneficial and detrimental impacts and need not necessarily be managed usingtraditional fire exclusion strategies and tactics.

Jeremy Fried, also of the US Forest Service, presented a paper entitled “Modeling Stand Level Fuels ManagementEffectiveness and Economic Feasibility at Landscape Scale in the U.S.: a Forest Inventory Informed Approach.” He wasinvited because of his extensive experience at modelling the use of forest harvest residue for biomass to produce energyand at the same time, achieve fuel management objectives.

Tuesday Afternoon: Game Theory, Artificial Intelligence and Applied Analytics

Kate Larson, a computer scientist from the University of Waterloo presented a talk entitled “The Strengths and Limi-tations of Game Theory for Fire Management”, based in part on her research on resource sharing. Her talk was par-ticularly timely because of the needs of Canadian forest fire management agencies to share so many fire managementresources among themselves and with international fire management agencies in recent years.

Mark Crowley, another computer scientist from the University of Waterloo presented a talk entitled “Fighting Firewith AI: Using Artificial Intelligence to Improve Modelling and Decision Making in Wildfire Management” based on hiscollaborative research with colleagues in the United States. He was invited in part, because of the growing recognitionof the need for fire management agencies to exploit new methods being developed by artificial intelligence researchers.

David Martell of the University of Toronto presented a talk entitled “Prescriptive analytics to inform forest andwildland fire management” in which he outlined some of what he considered to be emerging fire research needs. Thiswas followed by Colin McFayden of the Ontario Ministry of Natural Resources and Forestry who served as a discussantand then facilitative an open discussion.

Page 276: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

272 Five-day Workshop Reports

Wednesday Morning: Risk

Greater understanding of risk management and the use of risk analysis techniques were identified by fire managersearlier in the workshop as needed to address the complexity of current and future wildfire management. as beingseen by operational fire management. Day three opened with a keynote speaker who has specialized in financial riskmanagement. Matt Davison of the University of Western Ontario described some of the analytical techniques he hasused in a variety of applied research settings and highlighted where these may be applicable in the environmental riskmanagement area. Ideas like diversification, leverage and hedging were presented and discussion ensued about howthey may be relevant in the wildfire context. For example, thinking about community protection in a diversificationcontext may allow novel framing of the risk reduction potential of a suite or “portfolio” of mitigation and preventionsolutions.

Talks that followed this keynote presented specific risk analysis and modelling studies. Jennifer Beverly of theUniversity of Alberta presented highlights from several studies of landscape fire risk and summarized up lessons learnsfrom these analyses for other researchers exploring these areas. Cordy Tymstra from Alberta’s Agriculture and ForestryWildfire Management Branch discussed enhancing situational awareness of spring wildfire danger in Alberta usingmethods similar to those used in biosurrvellience applications. The day ended with Cristina Vega-Garcia from theUniversity of Lleida presenting results from several studies using fire occurrence prediction models for improved wildfirerisk management in Spain.

Thursday Morning: Ecology, Mapping Fire Risk

Thursday’s session began with a focus on ecological aspects of fire activity with Steve Cumming from Laval Universitypresenting some work he had done on hierarchical modelling of the joint distribution of annual fire counts and firesize. Lori Daniels from the University of British Columbia presented a perspective on the 2017 wildfires situation inBC from her perspective as a fire ecologist and talked about potential solutions moving forward. Patrick James fromthe University of Montreal concluded the ecological session with some discussion of spatial modelling linking Sprucebudworm defoliation within changes in the probability of fire ignition. One of the themes emerging from the discussionsession after these talks was that the complex linkages between insect disturbance and subsequent fire activity are notwell agreed upon the scientific community. The relatively recent and large scale Mountain Pine Beetle epidemic in BCand the expanding spruce budworm outbreak in eastern Canada provide research opportunities; newer more advancedmethods of spatial data analysis may be useful to attempt to gain insight in this area.

Thursday morning ended with presentations on two ongoing different fire risk analysis and mapping projects thatwere both sponsored by the Canadian Safety and Security Program (Defense Research and Development Canada-Centrefor Science) and being led by research scientists at the Canadian Forest Service: Xianli Wang discussed the developmentof a national assessment of landscape fire risk in Canada. Steve Taylor talked about machine learning methods that arebeing used to develop a set of national (Canadian) daily fire occurrence models.

Thursday Afternoon: Examples Modelling Fire Risk and Beginning the Wrap-up

Thursday afternoon sessions carried on the theme of landscape ecology that began earlier that day, with a presentationfrom Geoff Cary of Australian National University on perspectives on quantifying the mitigation of wildfire risk fromboth an Australian and international modelling point of view. Ellen Whitman from the University of Alberta presentedwork conducted related to landscape patterns of burn severity in Canada’s boreal forest. Brett Moore from AlbertaAgriculture and Forestry discussed projection of potential fire growth using ensemble weather forecasts to provideprobabilistic estimates of potential wildfire locations from an operational viewpoint. The last of the talks was given byFrederic Schoenberg of the University of California, a statistician, who discussed his experience modelling using pointprocess frameworks to talk about model building and the issues of missing variables and the inclusion of variables thathad little real influence on the overall process being studied.

Thursday ended with a session led by Rob McAlpine (Manager of Wildfire Response and Operations, OMNRF-AFFES) who gave a closing discussion that emphasized future collaboration going forward. Participants discussedpotential future research projects including working on various aspects of appropriate response, optimizing helicopterdeployment, fire risk modelling and risk in general, quantifying uncertainty, as well as other areas. It was clear thatsome new teams were beginning to form and that collaborative research, both existing teams and new initiatives, would

Page 277: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Forest and Wildland Fire Management: a Risk Management Perspective 273

continue beyond the workshop.

Friday Morning: The Connection to Banff and Parks Canada

Similar to a 2013 BIRS 5-day workshop “Managing Fire on Populated Landscapes” (Braun et al. 2013), we tookadvantage of the opportunity to engage Parks Canada. Jane Park, a fire and vegetation management specialist atParks Canada who is stationed in Banff National Park was able to attend as a participant of the meeting. On Fridaymorning, there was a field trip to visit fuel management and prescribed burning operations sites in Banff NationalPark. Prescribed fires, which are fires that are intentionally set, planned and managed by fire specialist in order to helpmaintain forest health and biodiversity (Parks Canada 2018b). For an example including further information aboutrecent prescribed burning during 2017 in Banff National Park see Parks Canada (2018a). This gave our BIRS workshopparticipants the unique chance to “get some dirt on their boots”, which, in general, is not a common occurrence forquantitative researchers in the Mathematical and Statistical Sciences. Participants were able to see actual burn scarsand burn sites up close and witness the forest succession that follows after such prescribed burn events.

Scientific Progress Made

Several members of the organizing committee are also involved as investigators/collaborators on a CANSSI (CanadianStatistical Sciences Institute) Collaborative Research Team grant. This BIRS meeting provided a forum for some mem-bers of that team, including both academic researchers and their graduate students and postdoctoral fellow to networkand interact with other Canadian and international researchers who work in this area along with individuals from firemanagement agencies.

This BIRS 5-day workshop was a strategic workshop. It brought together researchers from a variety of disciplines(e.g., operations research, statistics, actuarial science, mathematics, computer science, forestry and the environmentaland health sciences) and provided a forum to discuss fire management needs, quantitative methods for solving problemsin the fire science and fire management context, and foster ongoing and initiate new team-based collaborations aimed ataddressing important problems in these areas. Building on the momentum provided by this BIRS workshop, a tacticalmeeting of a smaller group of individuals (many of whom were at this BIRS meeting) is planned for February 2018 inLondon, Ontario, Canada. The targeted areas identified, in part, through the discussions leading up to and at this BIRSworkshop include:

(i) Appropriate response, which is to minimize the total cost plus net loss on a fire-by-fire basis considering theimpacts (positive and negative) and the cost of alternative approaches of fire response.

(ii) Fire risk modelling, which includes the development and application of probabilistic fire growth models, quanti-fying socio/economic impact and accounting for risk tolerance and preference.

(iii) Quantifying or accounting for uncertainty, which is prevalent in all fire management decisions. Uncertainty entersthe decision systems in a) the state of the knowledge, b) data quality, c) human dimensions (biases/heuristics) andd) natural variability.

Outcome of the Meeting

Fire management agencies have the difficult task of addressing two competing concerns: the importance of fire from anecological perspective and the danger of fire from a human perspective. Faced with the need to deliver “the right amountof right fire at the right place at the right time at the right cost” forest fire managers must make difficult decisions on aregular basis.

There is a growing need for quantitative expertise in data science-based decision support models for wildland firemanagement. The organizing committee recognized this need and, in particular, the need to transfer knowledge fromthe mathematical and statistical sciences to managers. This BIRS workshop, which brought together a wide mix ofresearchers and end-users from across Canada along with international participants was another step forward in ac-complishing such objectives.

Page 278: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

274 Five-day Workshop Reports

The proposed workshop was timely in today’s environment where interdisciplinary, collaborative teams–that in-clude researchers from quantitative fields–are an absolute necessity for tackling high-impact applied problems. Ithighlighted the growing community of quantitative researchers and exposed some newer researchers from the com-putational, mathematical and statistical sciences to the wide variety of problems

Through presentations, panel and informal discussions that took place at this meeting, it is clear that any suchmodels must not only account for a wide range of specific information, such as values at risk, the current fire load andthe probability of fire occurrence and detection, but also incorporate information about the current locations of crewsand equipment and their associated dispatch and travel constraints. Another message that was clearly conveyed wasthat it is not enough to produce advances in science. To have an impact, end-users must be able to interpret and trustthe output from these models. Tools are needed to aid a manager who, with little lead time, must make decisions and beresponsible for the outcomes of such decisions. For example, fire managers need to know about the potential for periodsof extreme fire activity–situations when the fire management system can become overwhelmed. To plan effectively,they need tools that capture the scientific understanding of what causes multiple occurrences of large (and expensive)fires and hence, allows them to quantify the probability that multiple regions will be overwhelmed at the same time,preventing one region from assisting another region through intra and inter-agency resource sharing. Making accurateforecasts of such events requires models for the stages of the “lifetime” of a fire together with models which can predictwhere fires will be ignited; such knowledge can lead to improved prediction as to how fire load changes both spatially andtemporally. Joint modelling methodology from the Statistical Sciences has the potential to take what are predominantlymarginal models for individual components of wildland fire regimes and couple them together in a joint framework.One presenter (Cumming) discussed his preliminary forays into this arena and other participants and researchers areactively pursuing work in this area.

This workshop addressed fire management decision making from an operations research perspective by providinga forum for decision makers in fire management agencies to form tangible working relationships with ecologists, firescientists, industrial engineers, mathematical modellers and statisticians. Participants engaged in discussions about theuse of mathematical and statistical models to develop decision support tools. Examples of such tools currently in oper-ational use by fire management agencies that result from interdisciplinary collaborations with quantitative researcherswere on display. This BIRS meeting represents a culmination of a long-term initiative engaging quantitative researcherswith other researchers working in wildland fire and individuals from fire management agencies. It is our opinion thatthe success of this workshop along with the new scientific initiatives and research collaborations that continue to followare a direct result of the ongoing support of BIRS as well as other Canadian mathematical and statistical institutes,which is gratefully acknowledged.

Participants

Albert-Green, Alisha (University of Toronto)Becker, Devan (University of Western Ontario)Beverly, Jennifer (University of Alberta)Born, Wally (Alberta Agriculture and Forestry)Braun, W. John (University of British Columbia)Cary, Geoff (The Australian National University)Cole, Bill (Ontario Ministry of Natural Resources & Forestry)Crowley, Mark (University of Waterloo)Cumming, Steve (Universite Laval)Daniels, Lori (University of British Columbia)Davison, Matthew (University of Western Ontario)Evens, Jordan (Ontario Ministry of Natural Resources & Forestry)Falcao, Tony (BC Wildfire Service)Flannigan, Mike (University of Alberta)Fried, Jeremy (Pacific Northwest Research Station)James, Patrick (Universite de Montreal)Kruus, Rob (Saskatchewan MNR)Larson, Kate (University of Waterloo)Martell, David (University of Toronto)McAlpine, Rob (Ontario Ministry of Natural Resources)

Page 279: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Forest and Wildland Fire Management: a Risk Management Perspective 275

McFayden, Colin (Ontario Ministry of Natural Resources & Forestry)McLoughlin, Neal (Government of Alberta)Moore, Brett (Alberta Agriculture & Forestry)Park, Jane (Parks Canada)Podur, Justin (York University)Ronnqvist, Mikael (Laval University)Schoenberg, Frederic (University of California, Los Angeles)Taylor, Steve (Simon Fraser University)Thompson, Matthew (United States Forest Service)Thompson, John (University of Western Ontario)Thomson, Trevor (Simon Fraser University)Tithecott, Al (Tithecott Management Services)Tymstra, Cordy (Sustainable Resource Development)Vega-Garcia, Cristina (University of Lleida)Wang, Xianli (Canadian Forest Service)Wei, Yu (Colorado State University)Whitman, Ellen (University of Alberta)Wotton, Mike (Canadian Forest Service)Xi, Dexen (Western University)Xiong, Yi (Simon Fraser University)

Page 280: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] Braun, W.J., Dean, C.B., Guttorp, P., Martell, D.L., Woolford, D.G. and Wotton, B.M. (2013). 13w5125 (BIRS):Managing fire on populated forest landscapes. BIRS Workshop Reports.http://www.birs.ca/workshops//2013/13w5125/report13w5125.pdf

[2] Parks Canada. (2018a). 2017 Prescribed Fires - Banff National Park.https://www.pc.gc.ca/en/nature/science/conservation/feu-fire/feuveg-fireveg/dirige-prescribed/projet-projects#banff(Accessed January 4, 2018)

[3] Parks Canada. (2018b). Prescribed Fires.https://www.pc.gc.ca/en/nature/science/conservation/feu-fire/feuveg-fireveg/dirige-prescribed(Accessed January 4, 2018)

276

Page 281: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 25

Approximation algorithms and thehardness of approximation (17w5133)

November 12 - 17, 2017Organizer(s): Julia Chuzhoy (University of Chicago), Jochen Koenemann (University ofWaterloo), Mohammad R. Salavatipour (University of Alberta), Nisheeth Vishnoi (EPFL),David Williamson(Cornell University)

Overview of the Field

Most of the discrete optimization problems arising in the sciences, engineering, and mathematics are NP-hard. Thismeans that there exist no efficient algorithms to solve them optimally, assuming the P 6= NP conjecture. The area ofapproximation algorithms focuses on the design and analysis of efficient algorithms that find solutions of cost withina guaranteed factor of the optimal cost. The area of hardness of approximation focuses on proving lower bounds onthe guarantees that any efficient approximation algorithm can obtain for given problems assuming that P 6= NP(or a similar complexity assumption). Over the last two decades, there have been major advances in the design andanalysis of approximation algorithms, and in the complementary topic of hardness of approximation, see Vazirani [53],or Williamson and Shmoys [56].

The long-term agenda of our area is to classify all of the fundamental NP-hard problems according to their approx-imability and hardness thresholds. This agenda may seem far-fetched, but remarkable progress has been made over thelast two decades. Approximation guarantees and matching hardness thresholds have been established for several keyproblems; e.g.

• covering and partitioning (the set covering problem, Feige [19]),

• algebra (overdetermined system of equations, Hastad [25])

• graphs (clique, colouring, Zuckerman [58]),

• optimization (maximum cut, Goemans and Williamson [22], Khot et al. [30]),

• constraint satisfaction (maximum SAT problems, Hastad [25]).

More significant than these specific successes is the impact of novel techniques on related areas of mathematics. Weprovide some examples.

277

Page 282: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

278 Five-day Workshop Reports

Combinatorial Optimization:

The method of iterative rounding has been developed in the area of approximation algorithms to give remarkably goodresults for problems beyond the reach of classical combinatorial optimization, see Jain [27], and Lau et al. [35]. Thetechnique has recently yielded elegant new proofs for a number classic results in combinatorial optimization.

Metric Embeddings:

Structure-preserving embeddings between various geometric spaces have been studied intensively for decades, in fieldssuch as differential geometry and functional analysis. There are many applications of metric embeddings in the area ofapproximation algorithms. Moreover, the interaction between these fields has increased recently (see [37], [8], [7]).

Analysis of Boolean Functions:

Recent progress on hardness of approximation has come with the development of new tools in the area of “analysis ofBoolean functions”. This area combines techniques from harmonic analysis, probability theory, and functional analysisto study basic properties of Boolean functions. One recently developed tool, the Invariance Principle, has led to fruitfulconnections between hardness of approximation and the geometry of Gaussian space, see Mossel et al. [42].

Objectives of the Workshop Proposal

The goals of the workshop were as follows:

(i) To bring together leading researchers in the fields of approximation algorithms and complexity theory, and tostimulate the exchange of ideas and techniques between the two groups.

(ii) To focus on a few key topics that could lead to deep new results in the areas of approximation algorithms, combi-natorial optimization, hardness of approximation, and proof complexity. We describe a few topics below.

(a) The most famous problem in all of discrete optimization is perhaps the Traveling Salesman Problem (TSP).Yet despite the attention paid to this problem, its approximability remains poorly understood. The bestknown approximation algorithm for the symmetric case is a classic 3/2-approximation algorithm due toChristofides from 1976. On the other hand, the known hardness-of-approximation results are very weak.Over the last few years, there has been remarkable progress on several special cases of the TSP and onsome closely related problems. Many of these advances are introducing new and very interesting connec-tions between different areas such as probability, structural graph theory, coupled with technically difficultyet powerful new methods such as interlacing families of polynomials. In 2011 Oveis Gharan et al. [43]used properties of strongly Rayleigh measures together with an elaborate analysis of the structure of near-minimum cuts to obtain the first improvement on the 3/2-approximation guarantee for a key special caseof TSP called the graphic TSP. Since then, there has been a series of more work on this special case andrelated questions. The most recent result on this special case is a 7/5-approximation algorithm of Sebo andVygen [49] that hinges on a key probabilistic lemma of Momke and Svensson [41] coupled with an in-depthand novel analysis of structures that are well known in Combinatorial Optimization. An et al. [1] improvedon a 20-year old 5/3-approximation guarantee of Hoogeveen [26] for the s-t path TSP. Subsequently, Sebo[48] and Vygen [54] have improved on these results to obtain an 1.599-approximation guarantee, by usingfurther probabilistic insights. More recently, Gottschalk and Vygen [23] and Sebo and van Zuylen obtained1.566 and 1.529 approximations for the s,t-path TSP problem, respectively. Relying on (and extending) themajor result by Marcus, Spielman, and Srivastava [39] that proves a conjecture of Kadison-Singer, Anariand Oveis Gharan [2] recently showed the existence of O(poly(log log)n)-thin spanning trees. The resultimplies an O(poly(log log)n) upper bound on the integrality gap of the Held-Karp LP relaxation for theasymmetric TSP (improving the O(logn/ log log n) bound from 2010 [9]). The LP is long conjectured tohave an O(1) integrality gap. The result of [2], however, does not imply an approximation algorithm, itonly provides an estimate of the optimum value. Almost concurrently, Svensson [51] showed that for the

Page 283: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Approximation algorithms and the hardness of approximation 279

case of shortest path metrics of directed graphs (graphic ATSP), the integrality gap of the Held-Karp LPis O(1) and provides an efficient algorithm for it. The two biggest open problems in this area remain toimprove upon the 3/2-approximation for TSP and to obtain a constant factor approximation for ATSP. Byre-focusing attention on this problem, our goal is to continue the momentum from the past two workshops.Two notable new results in this area were found very recently, after our BIRS workshop: Vygen and Traub[55] recently presented a 1.5 + ε approximation for s,t-path TSP, nearly matching the performance ratioof Christofides’ algorithm for metric TSP. Svensson, Tarnawski, and Vegh [52] presented a constant fac-tor approximation for the asymmetric TSP problem; their bounds are relative to the standard ATSP LPrelaxation, confirming the conjecture that it has constant integrality gap.

(b) The Unique Games Conjecture (UGC) which was posed in 2002 by Khot [29] and the implications of it haveattracted a lot of attention over the last 13 years. The conjecture states that a certain type of constraint sat-isfaction problem is hard to approximate. If the conjecture is true, it shows that many of the approximationalgorithms we have (in particular SDP based algorithms) are best possible ([45, 31, 30]). More specificallyUGC implies near tight approximatibility thresholds for a large class of constraint satisfaction problems(CSPs) among others (see Raghavendra [46]). In a sense, UGC predicts that there is a ”meta-algorithm”that is optimal for those problems and this meta-algorithm is based on SDP [10]. Refuting the conjecturewould most likely require designing new algorithmic techniques that could potentially lead to improvedapproximation algorithms for many other problems. One component of the workshop will focus on thisconjecture and surrounding issues in the complexity of optimization problems.Lasserre hierarchy / Sum-of-Squares algorithms:Use of semidefinite programming (SDP) relaxations and the lift-and-project strengthening of them has at-tracted a lot of attention in the field in the last decade or so. The Lasserre hierarchy is a systematic methodof strengthening SDP relaxations by adding more constraints. In some instances these methods have beensuccessfully applied to obtain improved approximation algorithms for some classical results (e.g. [15]).More importantly, although some results support the UGC, some recent works have cast more doubts on itusing Lasserre SDP based algorithms. For example, Arora et al. [6] shown that the powerful Lasserre SDPhierarchy of algorithms could be used to obtain a subexponential-time algorithm for Unique Games (UG).More recently, Barak et al. [11] have used the connection between Lasserre algorithms and Sum-of-Squares(SOS) proof complexity, and have shown that the known hard instances of the UG problem can be analyzedby constant-degree SOS proofs, and thus be solved efficiently.Extended formulations and their complexity:Feasible solutions to instances of combinatorial optimization problems often naturally correspond to thevertices of certain polyhedra. One way of designing an efficient algorithm for a given optimization problemis therefore to find a compact description for the associated polyhedron, and to then apply an efficientLP algorithm. In ground-breaking work Yannakakis [57] first showed that for the TSP, every symmetricLP formulation must have an exponential number of constraints. Symmetry here means that for everypermutation of cities there is a corresponding permutation of the variables that leaves the LP invariant.Fiorini et al. [21] recently resolved Yannakakis’ main open problem and showed that TSP has no symmetricor asymmetric polynomial-sized formulation. In another breakthrough, Rothvoss [47] recently showed thatno subexponential-size extended formulation can exist for the matching polyhedron either.

(c) Routing problems in graphs arise in many areas of computers science, from VLSI design to Robotics. Theyhave also been extensively studied in the graph theory community. Two of the most basic graph routingproblems are the Edge Disjoint Paths (EDP) problem and Congestion Minimization. In EDP, we have toroute a maximum number of demand pairs from a given collection in a graph via disjoint paths. In Con-gestion Minimization, all demand pairs must be routed, while minimizing the maximum load on any edge.Both problems are still poorly understood: for EDP the best upper and lower bounds have ratios O(n1/2)[32] and Ω(log1/2 n) [4, 3], while the upper and lower bounds for congestion minimization have ratiosO(logn/ log log n) [44] and (roughly) Ω(log logn) [5], respectively. If one allows up to 2 paths to share anedge, a polylogarithmic approximation was recently shown [16, 17]. Graph routing problems are naturallyclosely related to network cuts and flows. The new techniques for graph decomposition introduced in [16]have lead to new results in several other areas, such as a polynomial bound for the Excluded Grid Theoremof Robertson and Seymour [14]. Another closely related topic is graph sparsification: given a graph G anda small subset T of its vertices, called terminals, we would like to ”compress” G into a much smaller graphH that contains the vertices of T , so that H behaves similarly to G with respect to the terminals. Graphsparsifiers naturally arise in approximation algorithms, graph theory, and fixed parameter tractability, andthey have been studied in all these communities, often independently. If we require that the sparsifier H

Page 284: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

280 Five-day Workshop Reports

only contains the terminals, then there are known constructions that achieve quality (approximation factor)O(log k/ log log k) for both the cut and the flow sparsifiers [40, 36, 13, 38, 18], and it is known that no betterthan Ω(

√logn)-quality is achievable for this setting [20, 38, 13, 18]. If H is allowed to contain additional

vertices, better results (namely constant-quality) are known. Unfortunately, we still do not know whetherit is possible to construct constant-quality cut and flow sparsifiers whose size only depends on k. Some ofthese recent results rely on graph decomposition techniques that were developed in the area of approxima-tion algorithms for graph routing problems. Quality-1 cut sparsifiers were introduced under the name ofmimicking networks by Hagerup et al. [24], and they have been used to provide kernels for various cutproblems, such as, for example, minimum multiway cut [33]. However, the best current upper and lowerbounds on the size of a mimicking network are 22O(k)

[24, 28, 12] and 2Ω(k) [34, 28], respectively, whichshow that even this very basic problem is still not well understood.

Presentation Highlights

Bimodular Integer Linear Programming

The first plenary talk was by Rico Zenklusen. He showed how one can solve any integer linear program (ILP) defined bya constraint matrix whose sub-determinants are all within −2,−1, 0, 1, 2 in strongly polynomial time. This is a verynice extension of the well-known fact that ILP’s with totally unimodular (TU) constraint matrix are solvable in stronglypolynomial time. This result uses several techniques. They first reduce the problem to a particular parity-constrainedILP over a TU constraint matrix and then use Seymour’s decomposition of TU matrices to break this ILP into simplerbase problems. Then they show how these simpler problems can be solved using combinatorial optimization techniques.He also highlighted some open problems in this field and some possible extensions to larger classes of ILPs.

A Simply Exponential Upper Bound on the Maximum Number of Stable Matchings

In the 2nd plenary talk, Shayan Oveis Gharan highlighted their recent result on the number of stable matchings, whichis a classical problem. They show that for any stable matching instance with n men and n women the number of stablematchings is at most Cn for some universal constant C > 1. The proof is based on a reduction to counting the numberof down-sets of a family of posets that we call mixing.

Approximating spanners and distance oracles

Despite significant recent progress on approximating graph spanners (subgraphs which approximately- preserve dis-tances), there are still several large gaps in our understanding. In the 3rd plenary talk, Michael Dinitz did a survey ofsome recent results, with a focus on low-stretch spanners (from SODA ’16) and on spanners with demands (from SODA’17). He described the gaps and open problems which remain, including open questions on the power of linear andsemidefinite relaxations for these kinds of network design problems. He also presented some recent results on approx-imation algorithms for optimizing distance oracles (the natural data-structure version of spanners). The talk mostlyfocused on the many interesting open problems remaining in approximation algorithms for optimizing data structures.

Approximation Schemes for Clustering Problems: Now With Outliers

Recent developments in local search analysis have yielded the first polynomial-time approximation schemes for thek-Means, k-Median, and Uncapacitated Facility Location problems (among others) in a variety of specific classes ofmetrics . An important extension of these problems is to the setting with outliers. That is, we we are given an additionalparameter Z and may discard up to Z points/clients in the input. This is especially important in the setting of k-Meansclustering where even a small fraction of outliers may cause a noticeable deviation in the centroids of near-optimumsolutions. In the 4th plenary talk, Zac Friggstad started with a brief review of their recent work from last year inlocal search analysis for k-Means clustering in Euclidean metrics. Then he presented a more recent development: ageneral framework for adapting local search analysis for clustering problems to get approximations for their variants

Page 285: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Approximation algorithms and the hardness of approximation 281

with outliers. In particular, how they obtain the following results for clustering in doubling metrics (including constant-dimensional Euclidean metrics) and shortest path metrics of bounded genus graphs. 1) PTASes for uniform openingcost UFL with outliers and 2) bicriteria PTASes that open (1 + ε)k centres for k-Median and k-Means clustering withoutliers. There is no violation on the given bound for outliers in any of these approximations.

The Paulsen problem, continuous operator scaling, and smoothed analysis

The Paulsen problem is a basic open problem in operator theory: Given vectors u1, ..., un in Rd that are eps-close tosatisfying the Parseval’s condition and the equal norm condition, is it close to a set of vectors v1, ..., vn inRd that exactlysatisfy the Parseval’s condition and the equal norm condition. Given u1, ..., un, the squared distance (to the set of exactsolutions) is defined as infv

∑ni=1 normui − vi

22 where the infimum is over the set of exact solutions. Previous results

show that the squared distance of any eps-close solution is at most O(poly(d, n, ε)) and there are eps-close solutionswith squared distance at least Omega(dε). The fundamental open question is whether the squared distance can beindependent of the number of vectors n.

In the last plenary talk, Lap Chi Lau showed how they answer this question affirmatively by proving that thesquared distance of any eps-close solution is O(d7ε). Their approach is based on a continuous version of the operatorscaling algorithm and consists of two parts. First, they define a dynamical system based on operator scaling and use itto prove that the squared distance of any eps-close solution is O(d2nε). Then, they show that by randomly perturbingthe input vectors, the dynamical system will converge faster and the squared distance of an eps-close solution is O(d3ε)when n is large enough and eps is small enough. To analyze the convergence of the dynamical system, they develop somenew techniques in lower bounding the operator capacity, a concept introduced by Gurvits to analyzing the operatorscaling algorithm.

Scientific progress made and outcome of the meeting

The schedule of the workshop provided ample free time for participants to work on joint research projects. A numberof new research projects were initiated during the workshop, while some other researchers used the opportunity tocontinue to work on projects started earlier. The research talks and the plenary talks were very well received.

Rico Zenklusen reports that during the workshop he and his colleagues (Chaitanya Swamy, Andre Linhares, andNeil Olver) made progress on a project on optimizing over the intersection of matroids. In particular, at BIRS he andSwamy spent time during the workshop on this project and they found an interesting extension and application of atechnique we developed earlier. In particular, further discussions with Lap-Chi that he had at BIRS proved very helpfulto find this connection. This revived a project on which they worked for some time already, but had trouble to identifya good way to fully exploit their techniques.

Also, Mohammad Salavatipour and his Ph.D. student Mirmahdi Rahgoshay (who participated at the workshop)started a project with Rico Zenklusen on a problem related to resource management on a network (with application tofire containment and spread of other harmful events). So far they have been able to improve the previously best knownresults (a 12-approximation by Zenklusen in SODA17) to an asymptotic approximation scheme that runs in quasi-polynomial time. The project is on-going with the goal of obtaining a true polynomial time approximation scheme.

Sam Hopkins (another participant) reports that he made significant progress on two projects as a result of theworkshop. At the workshop he gave the first public talk on his recent work with Jerry Li on clustering via sum of squaresproofs; discussions with colleagues afterwards and questions during the talk helped clarify a number of points in the newpaper and its relationship to previous work. In particular Aravindan Vijayaraghavan and Sam had some substantialdiscussions on these matters. He also had the opportunity to continue an existing project with Tselil Schramm onintegrality gaps for linear programming hierarchies, on which we made substantial progress.

Andreas Wiese reports that he and Fabrizio Grandoni continued their collaboration on a problem related to theUnsplittable Flow on a Path problem. Their discussions were very fruitful, in particular they were able to find the bestpossible approximation factor for an important special case. Moreover, they could simplify some of their argumentationswhich will yield a cleaner presentation in the paper they plan to write on the topic.

Fabrizio Grandoni reports: “I continued my collaboration with Parinya Chalermsook and Bundit Laekhanukitabout the Group Steiner Tree and related problems. The discussions that we had during the workshop were very

Page 286: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

282 Five-day Workshop Reports

fruitful and might eventually lead to some concrete progress on the problems that we are studying.”

Tselil Schramm tells us: “During the workshop, Sam Hopkins and myself continued working on an ongoing project,trying to use the recent ”pseudocalibration” technique to improve Sherali-Adams lower bounds for max-cut and otherconstraint satisfaction problems. I was also exposed to the Paulsen problem by Lap Chi Lau’s excellent talk, and laterbegan working on the problem (trying to improve on Lap Chi and coauthors’ result).”

Also, Viswanath Nagarajan reports: “I continued my collaboration with Anupam Gupta on designing approxima-tion algorithms for stochastic load balancing. The breakout times during the workshop were very useful in continuingour research discussions on this topic. We were able to come up with a counter-example for one of approaches. This hasbeen useful for us in identifying an alternative approach, which is still work in progress. “

The above are only a few examples of the research progress made during or after the workshop, and there are otherongoing projects that started at the workshop.

Acknowledgment: It is a pleasure to thank the BIRS staff for their support; this contributed to the success of theworkshop.

Participants

An, Hyung-Chan (Yonsei University)Blumenstock, Markus (Johannes Gutenberg University Mainz)Chalermsook, Parinya (Aalto University)Chawla, Shuchi (University of Wisconsin-Madison)Cheriyan, Joseph (University of Waterloo)Dinitz, Michael (Johns Hopkins University)Ene, Alina (Boston University)Feige, Uriel (Weizmann Institute)Friggstad, Zachary (University of Alberta)Goemans, Michel (Massachusetts Institute of Technology)Grandoni, Fabrizio (IDSIA, University of Lugano)Gupta, Anupam (Carnegie Mellon University)Gutekunst, Sam (Cornell University)Hopkins, Samuel (Cornell University)Khodamoradi, Kamyar (University of Alberta)Koenemann, Jochen (University of Waterloo)Koh, Cedric (University of Waterloo)Laekhanukit, Bundit (Simons Institute for the Theory of Computing & Shanghai University of Finance and Economics)Lau, Lap Chi (University of Waterloo)Lee, James R. (University of Washington)Lee, Euiwoong (Simons Institute)Mahboub, Maryam (U. of Alberta)Makarychev, Konstantin (Northwestern University)Makarychev, Yury (Toyota Technological Institute at Chicago)Megow, Nicole (University of Bremen)Nagarajan, Viswanath (University of Michigan)Naor, Seffi (Technion)Nimavat, Rachit (Toyota Technological Institute at Chicago)Oveis Gharan, Shayan (University of Washington)Paul, Alice (Brown University)Rahgoshay, Mirmahdi (University of Alberta)Saberi, Amin (Stanford University)Salavatipour, Mohammad R. (University of Alberta)Sanita, Laura (Comb. and Opt. – University of Waterloo)Schramm, Tselil (Harvard/MIT)Shmoys, David B. (Cornell University)

Page 287: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Approximation algorithms and the hardness of approximation 283

Swamy, Chaitanya (University of Waterloo)Trevisan, Luca (U.C. Berkeley)Tulsiani, Madhur (Toyota Technological Institute at Chicago)Vijayaraghavan, Aravindan (Northwestern University)Wiese, Andreas (U. of Chile)Zenklusen, Rico (ETH Zurich)

Page 288: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] H.-C.An, R.Kleinberg, and D.B.Shmoys: Improving Christofides’ algorithm for the s-t path TSP. In Proc. of ACMSTOC (2012): 875–886.

[2] A. Anari and S. Oveis Gharan, Effective-Resistance-Reducing Flows, Spectrally Thin Trees, and Asymmetric TSP,In Proc. of IEEE FOCS 2015.

[3] M. Andrews, J. Chuzhoy, V. Guruswami, S. Khanna, K. Talwar, and L. Zhang, Inapproximability of edge-disjointpaths and low congestion routing on undirected graphs. Combinatorica, 30(5):485–520 (2010).

[4] M. Andrews and L. Zhang, Hardness of the undirected edge-disjoint paths problem, In Proc. of ACM STOC (2005):276–283.

[5] M. Andrews and L. Zhang, Hardness of the undirected congestion minimization problem, SIAM J. Comput.,37(1):112–131 (2007).

[6] S.Arora, B.Barak, and D.Steurer, Subexponential algorithms for Unique Games and related problems, In Proc. ofIEEE FOCS (2010): 563–572.

[7] S.Arora, J.R.Lee, and A.Naor, Fre’chet embeddings of negative type metrics, Discrete & Computational Geometry38(4): 726–739 (2007).

[8] S.Arora, S.Rao, and U.V.Vazirani, Expander flows, geometric embeddings and graph partitioning. J.ACM: 56(2)(2009)

[9] A.Asadpour, M.Goemans, A.Madry, S.Oveis Gharan, and A.Saberi, An O(logn/ log logn)-approximation algo-rithm for the asymmetric Traveling Salesman Problem, In Proc. of ACM-SIAM SODA (2010): 379–389.

[10] B.Barak and D.Steurer, Sum-of-squares proofs and the quest toward optimal algorithms, In Proceedings of Inter-national Congress of Mathematicians (ICM), 2014.

[11] B.Barak, F.Brandao, A.Harrow, J.Kelner, D.Steurer, and Y.Zhou, Hypercontractivity, sum-of-squares proofs, andtheir applications, In Proc. of ACM STOC (2012): 307–326.

[12] E. W. Chambers and D. Eppstein, Flows in one-crossing-minor-free graphs. In Proc. of ISAAC (21), volume 6506LNCS, pages 241-252 (2010).

[13] M. Charikar, T. Leighton, S. Li, and A. Moitra, Vertex sparsifiers and abstract rounding algorithms, In Proc. ofIEEE FOCS (2010): 265–274.

[14] C. Chekuri and J. Chuzhoy, Polynomial bounds for the grid-minor theorem, In Proc. of ACM STOC (2014).

[15] E.Chlamtac, Approximation algorithms using hierarchies of semidefinite programming relaxations, In Proc. IEEEFOCS (2007): 691–701.

[16] J. Chuzhoy, Routing in undirected graphs with constant congestion, In Proc. of ACM STOC (2012): 855–874.

[17] J. Chuzhoy and S. Li, A polylogarithimic approximation algorithm for edge-disjoint paths with congestion 2, InProc. of IEEE FOCS (2012).

[18] M. Englert, A. Gupta, R. Krauthgamer, H. Raecke, I. Talgam-Cohen, and K. Talwar, Vertex sparsifiers: new resultsfrom old techniques. In Proc. of APPROX/RANDOM (2010): 152–165.

[19] U.Feige: A Threshold of lnn for Approximating Set Cover, J. ACM 45(4): 634-652 (1998).

[20] T. Figiel, W. B. Johnson, and F. Schechtman, Factorizations of natural embeddings of lnp into lr , I. Studia Math.,89:79–103 (1988).

284

Page 289: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Approximation algorithms and the hardness of approximation 285

[21] S. Fiorini, S. Massar, S. Pokutta, H. Tiwary, and R. de Wolf, Linear vs. semidefinite extended formulations: expo-nential separation and strong lower bounds, In Proc. of STOC (2012): 95–106.

[22] M.X.Goemans, and D.P.Williamson, Improved Approximation Algorithms for Maximum Cut and SatisfiabilityProblems Using Semidefinite Programming, J. ACM 42(6): 1115-1145 (1995).

[23] C. Gottschalk, and J. Vygen, Better s-t-Tours by Gao Trees, In Proc. of IPCO (2016): 126–137.

[24] T. Hagerup, N. Nishimura, J. Katajainen, and P. Ragde, Characterizing multiterminal flow networks and comput-ing flows in networks of bounded treewidth, J. Comput. Syst. Sci., 57 (1998).

[25] J.Hastad, Some optimal inapproximability results, J.ACM 48(4): 798-859 (2001).

[26] J.A.Hoogeveen, Analysis of Christofides’ heuristic: Some paths are more difficult than cycles, Operations ResearchLetters 10:291–295 (1991).

[27] K.Jain, A factor 2 approximation algorithm for the generalized Steiner network problem, Combinatorica 21(1):39-60 (2001).

[28] A. Khan, P. Raghavendra, P. Tetali, and L. A. Vegh, On mimicking networks representing minimum terminal cuts,CoRR, abs/1207.6371, (2012).

[29] S.Khot, On the power of unique 2-prover 1-round games, In Proc. of STOC (2002): 767–775.

[30] S.Khot, G.Kindler, E.Mossel, and R.O’Donnell, Optimal Inapproximability Results for MAX-CUT and Other 2-Variable CSPs? SIAM J. Comput. 37(1): 319-357 (2007).

[31] S.Khot, and O.Regev, Vertex cover might be hard to approximate to within 2-epsilon, J. Comput. Syst. Sci. 74(3):335-349 (2008).

[32] S. G. Kolliopoulos and C. Stein, Approximating disjoint-path problems using packing integer programs,Mathematical Programming 99:63–87, (2004).

[33] S. Kratsch and M. Wahlstrom, Representative sets and irrelevant vertices: New tools for kernelization, In Proc. ofIEEE FOCS (2012).

[34] R. Krauthgamer and I. Rika, Mimicking networks and succinct representations of terminal cuts, CoRR,abs/1207.6246, (2012).

[35] L.Lau, R.Ravi, and M.Singh, Iterative Methods in Combinatorial Optimization, Cambridge University Press, 2011.

[36] F. T. Leighton and A. Moitra, Extensions and limits to vertex sparsification, In Proc. of ACM STOC (2010): 47–56.

[37] N.Linial, E.London, and Y.Rabinovich, The geometry of graphs and some of its algorithmic applications,Combinatorica 15(2): 215-245 (1995)

[38] K. Makarychev and Y. Makarychev, Metric extension operators, vertex sparsifiers and Lipschitz extendability, InProc. of IEEE FOCS (2010): 255–264.

[39] A. W. Marcus, D. A. Spielman, and N. Srivastava, Interlacing families II: mixed characteristic polynomials andthe Kadison-Singer problem, Ann. of Math. 182-1 (2015), 327–350.

[40] A. Moitra, Approximation algorithms for multicommodity-type problems with guarantees independent of thegraph size, In Proc. of IEEE FOCS (2009): 2–12.

[41] T.Momke, and O.Svensson, Approximating graphic TSP by matchings, In Proc. of IEEE FOCS (2011): 560–569.

[42] E.Mossel, R.O’Donnell, and K.Oleszkiewicz, Noise stability of functions with low influences: invariance and opti-mality, In Proc. of FOCS (2005): 21–30.

[43] S.Oveis Gharan, A.Saberi, and M.Singh, A randomized rounding approach to the Traveling Salesman Problem, InProc. of IEEE FOCS (2011): 550–559.

[44] P. Raghavan and C. D. Tompson, Randomized rounding: a technique for provably good algorithms and algorithmicproofs, Combinatorica, 7:365-374, (1987).

[45] P.Raghavendra: Optimal algorithms and inapproximability results for every CSP? In Proc. of STOC (2008): 245–254 (2008).

[46] P.Raghavendra, Approximating NP-hard problems: efficient algorithms and their limits, Ph.D. thesis, Universityof Washington, 2009.

[47] T. Rothvoss, The matching polytope has exponential extension complexity, In Proc. of STOC (2014): 263–272.

[48] A.Sebo, Eight fifth approximation for TSP paths, In Proc. of IPCO (2013): 362–374.

Page 290: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

286 Five-day Workshop Reports

[49] A.Sebo and J.Vygen, Shorter tours by nicer ears: 7/5-approximation for graphic TSP, 3/2 for the path version,and 4/3 for two-edge-connected subgraphs, CoRR, abs/1201.1870v2 (2012).

[50] A.Sebo and A. van Zuylen, The Salesman’s Improved Paths: A 3/2 + 1/34 Approximation, In Proc. of FOCS(2016): 118–127,

[51] O. Svensson, Approximating ATSP by Relaxing Connectivity, In Proc. of IEEE FOCS 2015.

[52] O. Svensson, and J. Tarnawski, and L. Vegh, A Constant-Factor Approximation Algorithm for the AsymmetricTraveling Salesman Problem, arXiv:1708.04215.

[53] V.V.Vazirani, Approximation Algorithms. Springer-Verlag, Berlin, (2001).

[54] J.Vygen, Reassembling trees for the traveling salesman, CoRR, abs/1502.03715.

[55] J. Vygen and V. Traub, Approaching for the s-t-path TSP, In Proc. of SODA (2018): 1854–1864.

[56] D.P. Williamson, and D.B. Shmoys, The Design of Approximation Algorithms, Cambridge University Press, 2011.

[57] M. Yanakakis, Expressing combinatorial optimization problems by linear programs, Journal of Computer andSystem Sciences 43(3):441-466, 1991.

[58] D.Zuckerman, Linear Degree Extractors and the Inapproximability of Max Clique and Chromatic Number,Theory of Computing Volume 3, Article 6: 103-128 (2007)

Page 291: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 26

Nonlinear and Stochastic Problems inAtmospheric and Oceanic Prediction(17w5061)

November 19 - 24, 2017Organizer(s): Youmin Tang (University of Northern British), Wansuo Duan (Instituteof Atmospheric Physics, Chinese Academy of Sciences), Adam Monaha (University ofVictoria), Shouhong Wang (University of Indianan), Siraj Islam (University of NorthernBritish)

Overview of the Field

Advances on the theory of nonlinear dynamical systems, either deterministic or stochastic, as well as advances on thetheory of nonlinear partial differential equations (PDEs) and their numerical treatment, have had a profound impact onthe modeling and understanding of atmospheric and oceanic dynamics. With the rapid development of nonlinear andstochastic methods and their successful application in atmospheric and oceanic sciences, our capability of atmosphericand oceanic prediction has obtained significant improvement in the past decades. The success of numerical weatherforecast is recognized as one of the most important achievements in science, technology and society in the 20th century.

The application of nonlinear and stochastic methods in atmospheric and oceanic prediction is ubiquitous. Forexample, the development of both control theory of nonlinear dynamic systems and estimation theory of stochasticsystems makes it possible to optimally combine observations with numerical models (i.e., data assimilation), resulting inthe revolutionary improvement in prediction initializations. The second example is the ensemble prediction that changesthe traditional concept of prediction from single, deterministic prediction to multiple, probabilistic prediction, whichdeems from the Monte Carlo theory and the optimal unsteady theory of nonlinear dynamics systems. The ensembleprediction greatly addresses the uncertainties of predictions, and has been widely applied in operational predictioncenters.

Recent Developments and Open Problems

Over past decades, significant progress has been made in applying nonlinear and stochastic methods onto atmosphericand oceanic prediction. A typical example is the ensemble prediction, which considers well the high degree of non-

287

Page 292: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

288 Five-day Workshop Reports

linearity and stochastic forcing of the atmosphere and ocean systems. The development of nonlinear and stochasticmethods make the optimal perturbations available, which is fundamental in the construction of ensemble predictions.The other example is the data assimilation, which optimally use observations to initialize prediction and to constrainmodel parameter. Our success in weather forecast and climate prediction achieved in recent years can be to a greatextent attributed to the advance of data assimilation. However, some specific challenges still emerge in atmospheric andoceanic predictions inherent to nonlinear and stochastic theory, which can be summarized as following major scientificproblems: (i) How can we estimate the model states for high-dimensional nonlinear and non-Gaussian systems, andwhich nonlinear and stochastic method will be best for the particular type of problem? (ii) What new nonlinear optimalmethods or strategies need to be developed to detect the optimal prediction error growth? (iii) What new measurementmetrics should be developed to optimally quantify the prediction utility for a nonlinear and stochastic system? (iv) Howcan we measure and differ the contribution of different error sources to predictability, and further reduce the errors? (v)How should nonlinear and stochastic methods complement the dynamical models used in the atmospheric and oceanicprediction community? (vi) How can full advantage be taken, in atmospheric and oceanic sciences, of the powerful toolsthat have been developed by mathematicians in the framework of the theory of dynamical systems (dynamic transitions,Lyapunov exponents and vectors, fluctuation-dissipation theorem, invariant and parameterizing manifolds).

It is clear that the atmospheric and oceanic predictions are cross-disciplinary and especially embodied in appli-cations of mathematics in atmospheric and oceanic predictions. To push the collaboration of the researchers fromatmosphere, ocean, and mathematics and explore the above major scientific problems in predictions and predictabilitiesof atmospheric and oceanic motions, The Banff International Research Station (BIRS) hosted the workshop “Nonlinearand Stochastic Problems in Atmospheric and Oceanic Prediction” during November 19 - 24, 2017. This workshop wasco-convened by Prof. Youmin Tang (from Canada), Prof. Shouhong Wang (from America), Prof. Adam Mohana (fromCanada), and Prof. Wansuo Duan (from China). And twenty-seven experts from China, Canada, US, and UK attendedthe workshop and presented talks on the problems mentioned above.

Presentation Highlights

According to the pre-mentioned major scientific problems, the workshop classified the talks in three themes: (i) nonlin-ear geophysical fluid dynamics, (ii) data assimilation, and (iii) predictability and prediction.

Nonlinear Atmosphere and Ocean Dynamics

The atmosphere and ocean experiences significant changes in their nonlinear dynamics. To understand these changes,their mechanisms, and their regional implications a quantitative understanding of processes in the coupled ocean andatmosphere system is required. The presentations delivered under this theme shed light on such nonlinear dynamics ofseveral atmospheric/ocean systems such as El-Nino Southern Oscillation (ENSO), Madan Julian Oscillation (MJO), tur-bulent systems and large geophysical flows. For example, Dr. Hai Lin discussed ensemble integrations using a primitiveequation atmospheric model to investigate the atmospheric transient response to tropical thermal forcings that resembleEl Nino and La Nina. He showed that such response develops in the North Pacific within one week after the integrationand the signal in the North Atlantic and Europe is established by the end of the second week. Dr. Entcho Demirov talkedabout the role of dominant patterns of ocean and atmospheric variability which define coherent variations in physicalcharacteristics over large areas. He applied Bayesian Gaussian mixture models to define four dominant subseasonalweather regimes. Based on his analysis, he developed a computationally efficient stochastic weather generator for anal-ysis and prediction of the Subpolar North Atlantic atmospheric decadal variability. Dr. Chun-Hsiung Hsia applied asemi-discretized Euler scheme to solve three dimensional primitive equations. With suitable assumptions on the initialdata, he showed the long time stability of the proposed scheme.

Some interesting talks were delivered on dynamics of geophysical flows highlighting recent scientific progress. Forexample, Dr. Corentin Herbert discussed construction of a perturbative theory which describes the large-scale flow,relying on a technique known as stochastic averaging. Using direct numerical simulations in an idealized situation, heshowed the validity and limitations of this approach by testing analytical predictions for the structure of the large-scaleflow, such as the velocity profile or the eddy momentum flux profile. He further discussed the long term dynamics oflarge-scale geophysical flows and the abrupt transitions they undergo, using tools from large deviation theory. Dr. DavidStraub examined simulations of freely decaying turbulence in a non-hydrostatic Boussinesq model for which the basestate stratification contains a tropopause. Dr. Xiaoming Wang discussed numerical algorithms that are able to capture

Page 293: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Nonlinear and Stochastic Problems in Atmospheric and Oceanic Prediction 289

the long-time statisitcal properties such as the climate, of large dissipative turbulent systems and applications to certainprototype Geophysical fluid dynamics models. Dr. Shouhong Wang presented an overview of dynamic transition theoryand its applications to various geophysical fluid flows.

Data Assimilations

In this theme, the difficulties of the assimilation approaches of ensemble Kalman filter (EnKF) and Particle filters (PF)were mainly discussed. These two approaches are both the numerical realization of the Bayes’ Theorem. Nevertheless,the former adopts the linear approximation to the observational operator and assume that the observational errorsobey Gaussian distribution in statics while the latter avoids these linear approximations and assumptions and is muchrealistic. The former has been widely used in predictions of atmosphere and ocean but there are still limitations and dif-ficulties in realistic predictions. The latter is most advanced in theory but presently under exploration. Clear, in-depthinvestigations are urgently needed for these two approaches. The presentations in this workshop provided some pro-gresses on these two approaches. For example, to reduce the huge computational resource of EnKF in high resolutioncoupled earth systems, Prof. Shaoqing Zhang considered the combination of stationary, slow-varying and fast-varyingfilters and designed a high efficiency approximates EnKF (Hea-EnKF) to dramatically enhance the computational ef-ficiency. He showed that it is the improved representation on stationary and slow-varying background statistics thatmakes the Hea-EnKF while only requiring a small fraction of computer resources be better than the standard EnKFthat uses finite ensemble statistics. This also makes the Hea-EnKF practical to assimilate multi-source observations intoany high-resolution coupled model intractable with current super-computing power for weather-climate analysis andpredictions. For PF, Dr. Nan Chen introduced a conditional Gaussian framework for prediction of nonlinear turbulentdynamical systems, which allows efficient and analytically solvable conditional statistics that facilitates the real-timedata assimilation and prediction. Especially, this method is able to beat the curse of dimensions in traditional particlemethods. In addition, a challenge of PF is to avoid the filter degeneracy with limited computational resources, where thefilter degeneracy is a situation in which almost every particle has negligible weight and contributes little to representingthe PDFs. The filter degeneracy is still an open question. Dr. Zheqi Shen focused on this question and tried to giveseveral strategies to overcome this filter degeneracy phenomenon, include using a hybrid scheme of PF and EnKF witha tunable parameter, and using vector weights to achieve localization, and finally thought that it is great potential todevelop the feasible PF for large geophysical model systems with nonlinear/non-Gaussian observational systems.

Predictability and Prediction

The “predictability” is a fundamental issue in both atmospheric and oceanic research, as well as numerical weather andclimate prediction. It indicates the extent to which even minor imperfections in the knowledge of the current state orof the representation of the system limits knowledge of subsequent states. Study of predictability could provide usefulinformation on reducing the prediction uncertainties for the atmospheric and oceanic motions. Study of predictabilitycan include two aspects. The first is studying the reason and mechanism of the uncertainty of forecast result (e.g.,forecast error) and the second is studying the way or methodology to reduce such kind of uncertainty. In the workshop,quite a few presentations focused on these two aspects and addressed progresses.

For the first aspect, it is important to estimate the predictability limit. In this field, Prof. Frank Kwasniok showedthat the predictability of large-scale atmospheric flow is characterized by finite-time Lyapunov exponents and covariantLyapunov vectors. Prof. Jianping Li considered this limitation and reported a nonlinear local Lyapunov exponentapproach (NLLE). He explored the basic theory of the NLLE and then applied this approach to the predictabilitystudies, and finally showed the usefulness of the NLLE in estimating the predictability time for weather and climateevents and determining the sensitive area for targeting observations. Subsequently, Prof. Ruiqiang Ding presentedanother application of NLLE in ensemble forecast and reported the approach of orthogonal nonlinear local Lyapunovvector (NLLV). He used the NLLV to generate the initial perturbations of the ensemble forecast and demonstratedthe applicability of NLLV in ensemble forecast. Singular vectors (SVs) is also an approach to estimating the limit ofpredictability. SVs have been successfully applied in operational weather forecast in ECWMF. In the presentation ofProf. Youmin Tang, he proposed the concept of climatological SVs. The competing aspect of this approach is withoutusing tangent model in calculating SVs. Dr. Siraj Ul Islam investigated the error growth dynamics of SVs and showedthat when the SVs are used as an initial perturbation, the forecast skill of key atmospheric variables over South AsianMonsoon region is significantly improved. Further, he demonstrated that the predictions with the singular vector havea more reliable ensemble spread, suggesting a potential merit for a probabilistic forecast. The linearity of SVs still

Page 294: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

290 Five-day Workshop Reports

limits the forecast skill. If considering effect of nonlinearity in SVs, the forecast skill generated by the SVs may begreatly improved. In the workshop, Prof. Wansuo Duan focused on this limitation of linear singular vector (LSV)and presented an approach of conditional nonlinear optimal perturbation (CNOP) to estimate the maximum predictionerror. Prof. Duan applied this approach to the predictability of ENSO and revealed the difference between CNOP andLSV. Then he used this approach to present the optimal observational array for dealing with the challenge of ENSOpredictions due to diversities of El Nino. By hindcast experiments, the target observations determined by the CNOPwere illustrated to be useful in distinguishing the types of El Nino events in predictions. In addition, Dr. Guodong Sunalso showed the usefulness of the CNOP in dealing with the effect of model parametric errors on predictability andemphasized the nonlinear interaction of multiply physical parameters is beneficial to improve the simulation ability andprediction skill of the soil moisture.

For the second aspect of predictability study, the presentations in the workshop reported several interesting ap-proaches to improving the forecast skill for weather and climate. For example, Prof. Fei Zheng suggested a newstochastic perturbation technique to improve the prediction skills of ENSO through using an intermediate coupledmodel. Similar idea was existed in the presentation of Dr. Aneesh Subramanian. Dr. Aneesh Subramanian comparedthe stochastic perturbation scheme representing model errors and the super-parameterization strategy for including theeffects of moist convection through explicit turbulent fluxes calculated from a cloud-resolving model (CRM) embeddedwithin a global climate model (GCM), finally showing that the combination of the two approaches helps improve thereliability of forecasts of certain tropical phenomena, especially in regions that are affected by deep convective systems.

Another aspect to predictability is to identify the most predictable mode. Prof. Timothy M. Delsole focused onthe degree to which interactive ocean circulations are important for making useful predictions of the next decade andidentified the most predictable patterns of global sea surface temperature in coupled atmosphere-ocean models. He sug-gested that interactive ocean circulation is not essential for the spatial structure of multi-year predictability previouslyidentified in coupled models and observations, but the time scale of predictability, and the relation of these predictablepatterns to other climate variables, is sensitive to whether the model supports interactive ocean circulations or not,especially over the North Atlantic.

Scientific Progress Made

The participants learned several new techniques and scientific progress to understand and resolve atmospheric andoceanic dynamics. The group discussion touched on most of the challenges and scientific issues related to nonlineargeophysical fluid dynamics, data assimilation, and predictability and prediction. Participants discussed various waysto make dynamical ensemble prediction more skillful on season to decadal time scales along with the discussion ofdeficiencies in numerical modeling. Such discussion helped participants to design better experiments or better datacollection protocols. All these discussions were very productive especially for the early career young scientists learningnew and advance scientific techniques from the leading scientist working in climate dynamics.

Outcome of the Meeting

This meeting brought together various mathematical and atmospheric backgrounds. The meeting was very timely anduseful, and will surely have a strong impact on the further developments of predictability and data assimilation andrelated subjects. The organizers were in particular pleased to observe the many young people interested in the subject;some of them have already made substantial contributions (as witnessed by the talks given by junior researchers) andwill surely continue to advance the subject.

The meeting provided a great overview of the field, with a number of excellent talks describing some recent excitingresults in predictability and data assimilation. It is quite certain that some new interesting results will emerge as a resultof the workshop.

Page 295: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Nonlinear and Stochastic Problems in Atmospheric and Oceanic Prediction 291

Participants

Chen, Nan (New York University)Delsole, Timothy (George Mason U.)Demirov, Entcho (Memorial University)Ding, Ruiqiang (Institute of Atmospheric Physics, Chinese Academy of Sciences)Duan, Wansuo (Chinese Academy of Sciences)Frediani, Maria E. B. (University of British Columbia)Herbert, Corentin (Ecole Normale Superieure de Lyon)Hsia, Chun-Hsiung (National Taiwan University)Islam, Siraj ul (University of Northern British Columbia)Kieu, Chanh (Indiana University)Kwasniok, Frank (University of Exeter)Li, Jianping (Beijing normal university)Lin, Hai (Environment and Climate Change Canada)Liu, Ting (Second Institute of Oceanography)Lu, Fei (Johns Hopkins University)Mao, Yiwen (University of Victoria)Monahan, Adam (University of Victoria)Pimentel, Sam (Trinity Western University)Shen, Zheqi (Second Institute of Oceanography)Straub, David (McGill University)Subramanian, Aneesh (University of California, San Diego)Sun, Guodong (Chinese Academy of Sciences)Tang, Youmin (University of Northern British Columbia)Wang, Shouhong (Indiana University)Wang, Xiaoming (Florida State Uni. & Fudan Uni.)Zhang, Shaoqing (Ocean University of China)Zheng, Fei (Chinese Academy of Sciences)

Page 296: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

292

Page 297: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Two-dayWorkshop

Reports

Page 298: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

294 Two-day Workshop Reports

Page 299: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 27

Special Western Canada Linear AlgebraMeeting (17w2668)

July 7 - 9, 2017Organizer(s): Hadi Kharaghani (University of Lethbridge), Shaun Fallat (University ofRegina), Pauline van den Driessche (University of Victoria)

History of this Conference Series

The Western Canada Linear Algebra meeting, or WCLAM, is a regular series of meetings held roughly once every 2years since 1993. Professor Peter Lancaster from the University of Calgary initiated the conference series and has beeninvolved as a mentor, an organizer, a speaker and participant in all of the ten or more WCLAM meetings. Two broadgoals of the WCLAM series are a) to keep the community abreast of current research in this discipline, and b) to bringtogether well-established and early-career researchers to talk about their work in a relaxed academic setting.

This particular conference brought together researchers working in a wide-range of mathematical fields includingmatrix analysis, combinatorial matrix theory, applied and numerical linear algebra, combinatorics, operator theory andoperator algebras. It attracted participants from most of the PIMS Universities, as well as Ontario, Quebec, Indiana,Iowa, Minnesota, North Dakota, Virginia, Iran, Japan, the Netherlands and Spain. The highlight of this meeting wasthe recognition of the outstanding and numerous contributions to the areas of research of Professor Peter Lancaster.Peter’s work in mathematics is legendary and in addition his footprint on mathematics in Canada is very significant.

Presentation Highlights

The conference began with a presentation from Prof. Chi-Kwong Li surveying the numerical range and connections todilation. During this lecture the following conjecture was stated: If A ∈ M3 has no reducing eigenvalues, then thereis a T ∈ B(H) such that the numerical range of T is contained in that of A, and T has no dilation of the form I ⊗ A.This statement led to interesting discussion among various participants after the lecture. Prof. Ion Zaballa from Spaingave an inspiring lecture on reducing matrix polynomials, which is a topic of interest to our guest of honour, Prof.Peter Lancaster. The morning talks were followed by two insightful lectures given by graduate students, Jane Breen(University of Manitoba) and Xavier Martinez-Rivera (Iowa State University). Both presented recent work, one onclustering in Markov chains, and the other on a signed principal rank characteristic sequence that has connections tothe famous principal minor assignment problem. In the afternoon the focus of the workshop switched to matrix theory,

295

Page 300: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

296 Two-day Workshop Reports

especially the spectra of matrices. Prof. David Watkins, a former doctoral student of Prof. Peter Lancaster, discussedfast and stable computation of certain matrices, taking sparsity into account. This talk was based on a paper that hadwon a SIAM outstanding paper prize. The first day concluded with an interesting lecture on the localization of thespectrum of a matrix. This topic is classical when considering the numerical range or Gersgorin’s disc theorem. In thispresentation, Prof. Michael Tsatsomeros, adapted a formulation for drawing the numerical range to constructing a newobject called the envelope of A, which does a better job enclosing the spectrum of a matrix. The lecture was highlightedby some fascinating movies constructing the various regions associated with the envelopes of certain matrices.

On Sunday, the first three lectures were concerned with combinatorial matrix theory, with two of them deliveringinteresting connections to the classical topic of Hadamard matrices. The other, by Prof. Colin Garnett, provided insightson the patterns that allow all possible spectra (SAPs), and proved the 2n conjecture for irreducible patterns of order6, and order 7 over the reals. Participants suggested ways in which the techniques presented might be applicable forreducible patterns. After the break, the next lecture was concerned with Bezout equations of stable matrix functions.This topic has deep roots with the research of Prof. Peter Lancaster and his work was featured prominently throughoutthe talk.

Our conference concluded with a heart-warming lecture delivered by the distinguished mathematician, friend, andcolleague, Prof. Richard Guy. Prof. Guy provided a wonderful perspective on the life and celebrated career of Prof.Peter Lancaster.

Many thanks to BIRS and PIMS for the opportunity and funding for this very successful special WCLAM.

Some Open Problems and Questions

We list here a sample of the questions and unresolved problems that led to interesting discussions and constructivethought.

(i) (Chi-Kwong Li, College of William and Mary) If A ∈M3 has no reducing eigenvalues, then there is a T ∈ B(H)such that the numerical range of T is contained in that of A, and T has no dilation of the form I ⊗A.

(ii) (Sarah Plosker, Brandon University)What graphs other than chains exhibit PST? What if we do not require thatthe state transfer be perfect? What about weighted chains?

(iii) (Colin Garnett, Black Hills State University) Is it possible to randomly select a pattern that does not satisfy thealgebraic condition set forth with large enough n that is in fact spectrally arbitrary? What can we say about the2n conjecture over R versus over C?

(iv) (Behruz Tayfeh-Rezaie, Institute for Research in Fundamental Sciences (IPM)) Classify all Hadamard matriceswith only two distinct types for 4-tuples of rows. Find another infinite family of Hadamard matrices with onlytwo distinct types for 4-tuples of rows.

(v) (Michael Tsatsomeros, Washington State University) Can the envelope of a matrix be refined by considering moreeigenvalues and/or eigenvectors of its Hermitian part?

Special Occasion Associated with the Meeting

One other important feature of our meeting was a birthday celebration to honour the 88th birthday for Prof. PeterLancaster. The occasion was held in the BIRS lounge on Saturday evening with all participants being involved. Therewas a cake brought by Peter’s daughter and the event included reading well-wishes from friends and colleagues ofPeter’s that could not attend the meeting at BIRS.

Participants

Adm, Mohammad (University of Regina)Breen, Jane (University of Manitoba)

Page 301: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Special Western Canada Linear Algebra Meeting 297

Craigen, Robert (University of Manitoba)Fallat, Shaun (University of Regina)Guy, Richard (The University of Calgary)Hassani Monfared, Keivan (University of Calgary)Holzmann, Wolfgang (University of Lethbridge)Kaashoek, Rien (Vrije Universiteit)Kharaghani, Hadi (University of Lethbridge)Kim, In-Jae (Minnesota State University)Kirkland, Stephen (University of Manitoba)Lancaster, Peter (University of Calgary)Li, Chi-Kwong (College of William and Mary)Martinez-Rivera, Xavier (Iowa State University)Masgreghi, Javad (Laval University)McDonald, Judith (Washington State University)Nasserasr, Shahla (Nova Southeastern University)Pereira, Rajesh (University of Guelph)Plosker, Sarah (Brandon University)Raouafi, Samir (University of Regina)Sasani, Sara (University of Lethbridge)Suda, Sho (Aichi University of Education)Tayfeh-Rezaie, Behruz (Institute for Research in Fundamental Sciences (IPM))Tsatsomeros, Michael (Washington State University)Van den Driessche, Pauline (University of Victoria)Vander Meulen, Kevin (Redeemer University College)Watkins, David (Washington State University)Zaballa, Ion (Euskal Herriko Unibertsitatea)

Page 302: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] J.L. Aurentz, T. Mach, R. Vandebril, D.S. Watkins, Fast and backward stable computation of roots of polynomialsSIAM J. Matrix Anal. Appl. 36 (2015), 942 - 973.

[2] I. Gohberg, P. Lancaster, L. Rodman, Matrix Polynomials, SIAM, Philadelphia, 2009.

[3] G. Graves, S. Suda, Symmetric and skew-symmetric 0,±1-matrices with large determinants, to appear in J.Combin. Des. arXiv:1601:02769.

[4] N. Johnston, S. Kirkland, S. Plosker, R. Storey, X. Zhang, Perfect quantum state transfer using Hadamard diago-nalizable graphs, Linear Algebra and its Applications 531 (2017), 375–398.

[5] P.J. Psarrakos, M.J. Tsatsomeros, On the spectrum envelope of a matrix, Applied Mathematics and Computation244 (2014), 132 - 141.

[6] F. Tisseur , I. Zaballa, Triangularizing quadratic matrix polynomials, SIAM J. Matrix Analysis and Applications 34(2013), 312–337.

298

Page 303: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 28

Modeling and Simulation: PracticalEngineering Applications (17w2689)

September 1 - 3, 2017Organizer(s): Zhangxing John Chen (University of Calgary), Jesse Zhu (Western Univer-sity), Zhenghe Xu (University of Alberta)

Overview of the Field of Modeling and Simulation

Computational Science has been called a third branch of science, along with theory and experiment. In truth, it ispart of theory and part of experiment, but it is different from either. Many theoretical problems can only be solvedusing a high performance computer. Modeling and simulation, the major component of computational science, is muchmore than simply elaborating pure theory. It is a critical tool used to analyze important and varied phenomena suchas fluid flow and transport, chemical processing, air pollution, water contamination, nano material design, information,computational chemistry, and phase interfaces. In September 2011, a workshop on Modelling and Simulation washeld at the Banff International Research Station (BIRS) in Banff, Alberta. The major focus was the presentation oftheoretical modelling and simulation work by researchers and software developers from around the world. It alsohighlighted activities in the high performance computing area.

Recent Developments and Open Problems of Modeling and Simulation

Since 2011, the rapid development of computer power and sophisticated computational techniques has resulted in theapplication of high performance computing to modelling and simulation with unprecedented accuracy. Their scopehas also impacted a wide range of important science and engineering challenges. These challenges are comprehensiveemerging issues requiring a thorough understanding of the underlining principles of physics, chemistry, mathematicalmodelling, numerical solution techniques and computing infrastructure. Their understanding will have profound im-plications and applications in mathematics, science, engineering, medicine and industry. Rapid economic developmentsand social issues have significantly increased research in modelling and simulation in these areas. For example, theneed to study and understand the complex physical and chemical processes occurring in and around the earth, such asgroundwater contamination, oil and gas reservoir production, discovering new oil reserves, ocean hydrodynamics, CO2storage and sequestration, and air quality control, is all vital to our living environment, economic development, naturalresource management and national security. The study of these problems through laboratory experiments, modellingtheories and simulation techniques requires interdisciplinary collaborations between engineers, mathematicians, com-

299

Page 304: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

300 Two-day Workshop Reports

putational scientists and researchers working in industry, government laboratories and academy. Bringing togetherthese experts for a unified focused effort will advance predicting, understanding and optimizing many complex phe-nomena. The goal of this two-day BIRS workshop on September 1-3, 2017 was to hold lectures that emphasize the rapiddevelopment of modelling and simulation theories and their practical engineering applications [1, 2, 3].

Workshop Presentation Highlights

The workshop themes include: (A) Mathematics of Multiphase Fluid Flow and Transport; (B) High-Quality Discretiza-tion of Flow and Transport; (C) Computational Modeling of Multiscale Phenomena; (D) Parallel Computing; (E) Non-linear Effects on Propagation Properties of Numerical Models; (F) Interfacial Phenomena in Mineral and MaterialProcessing; (G) Spatially Explicit Carbon and Water Cycle Modelling; (H) Air Pollution and Industrial Wastes.

24 presentations in this workshop have dealt with these topics. Speakers have been carefully selected to ensure thata range of modeling and simulation techniques can be explored. This diversity is necessary in order to address variousphenomena arising from emerging issues in the energy and environment sectors [4, 5].

Some of the major presentations are summarized as follows:

The presentation given by Dr. Jesse Zhu covered research related to aspects of particle technology from particleformation, characterization, particle flow to many mathematical and practical applications. His fundamental studiesincluded development of fluidization theory and identification of new regimes, the expansion to liquid-solid fluidization,inverse fluidization and untrafine powder fluidization, and the detailed studies on many fluidized bed reactors.

The presentation given by Dr. Jinyu Sheng covered many modeling and simulation problems on physical oceanog-raphy, modelling and prediction of extreme marine events, interaction of ocean waves and currents, air-sea interaction,and tidal and storm-induced circulation.

The presentation given by Dr. Liu Zheng covered his research on mathematical, chemical and biomolecular funda-mentals of bioprocessing engineering with special interests in protein conformational transition, nanostructured enzymecatalysts, and formulation of natural product with biological functions.

The presentation given by Dr. Frank Cheng covered mathematical problems in corrosion science and engineeringof pipelines, including coating failure mode and effect analysis, cathodic protection shielding, high-voltage alternatingcurrent interference, CO2 corrosion, microbial corrosion, flow-accelerated corrosion, and under-deposit pitting corro-sion.

The presentation given by Dr. Haibo Niu covered his research related to the fate/transport process of contaminantsin marine environment, with a focus on oil spills and development of mathematical models to predict the fate/behavioursof oil to support emergency response.

The presentation given by Dr. Johnny Chen covered modeling and simulation issues in electrochemical energystorage and conversion technologies due to decreasing global fossil fuel supplies and increasing environmental concerns.In order to provide sustainable energy infrastructures and resources for future generations, significant improvementsto the current state of these technologies is imperative.

The presentation given by Dr. John Chen covered his scientific research on laboratory experiments, mathematicalmodeling, and the supporting analyses for recovery of oil and gas resources. In particular, his team work has beenfocused on lab, modeling and simulation studies of fluid flow and transport in unconventional oil and gas reservoirs(heavy oil, oil sands, and tight and shale oil and gas reservoirs). He shared his industrial collaboration experience withthe participants.

The presentation given by Dr. Charles Xu covered his modeling approach in development of highly efficient andcost-effective hydrothermal liquefaction (HTL) processes for energy recovery from various solid wastes/residues anddevelopment of catalytic processes and novel catalysts for the production ”green” fuels, chemicals from renewable feed-stock, and air emissions control.

The presentation given by Dr. Qiao Sun covered her current research activities around building a modeling frame-work for mechanical systems using wind turbine as an example. The goal is to be able to combine physics based anddata driven models to assist machine health condition assessment and failure prediction. A system level model thatincludes component models based on underlying physical principles can provide the key to a solution with the much

Page 305: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Modeling and Simulation: Practical Engineering Applications 301

needed fault prediction capabilities. In recent years, her focus is to establish an integrated wind turbine system modelthat can represent the digital copy of an actual system particularly in terms of its health condition.

The presentation given by Dr. Ying Zheng covered her research related to aspects of chemical reaction Engineering,catalyst synthesis and catalytic processes, her recent interests on catalytic process intensification and her fundamentalmathematical studies on new material synthesis, catalysis, reaction mechanism and mass/heat transfer in heterogenouscatalysis, reactor and process design.

The presentation given by Dr. Hongbo Zeng covered his research interests in colloid and interface science, func-tional materials and nanotechnology, with a special focus on intermolecular and surface interactions in soft matter (e.g.,polymers, biopolymers, biological systems, surfactants, and emulsions) and mathematical and engineering processes,and development of functional materials with engineering/bioengineering/ environmental applications.

The presentation given by Dr. Laurence Yang covered his current interest focused on cyber-physical-social systems(CPSS) design, data analytics on parallel and distributed (cloud) platforms. The booming growth and rapid developmentin embedded systems, wireless communications, sensing techniques and emerging support for cloud computing andsocial networks have enabled researchers and practitioners to create a wide variety of Cyber-Physical-Social Systems(CPSS) that reason intelligently, act autonomously, and respond to the users needs in a context and situation-awaremanner. It, as a novel emerging paradigm, has gained popularity within the research community and industry due tothe fact that it enables deep fusion among humans, computers, and things.

The presentation given by Dr. Dong Liang covered numerical problems in development of conservative high-ordercharacteristics methods for atmospheric aerosol transports; development of efficient Moving-cut HDMR approach foraerosol chemical process; study of energy laws of electromagnetic waves in metamaterials, and development of energy-conserved S-FDTD schemes for computational electromagnetics; development of efficient mass conservative domaindecomposition parallel computing methods for environmental computation and for contamination flow in porous media;and development of optimal control approaches for environmental pollution in atmosphere and in groundwater.

The presentation given by Dr. Huaxiong Huang covered numerical solutions for partial differential equations, math-ematical models and scientific computing for problems in science and engineering, and modeling real world problemsfrom industry and medicine.

The presentation given by Dr. Jing Chen covered development of regional and global carbon cycle models using thesetraits as inputs; development of instruments and methods for the ground measurement of these traits; development ofa distributed hydrological model using these traits and other inputs; development of canopy radiative transfer modelsfor remote sensing applications; development of biosphere stable isotope models for global carbon cycle research; de-velopment of atmospheric inversion and global carbon assimilation systems for estimating the global carbon cycle usingatmospheric CO2 concentration; and development of a long-term forest carbon cycle model with consideration of bothdisturbance and non-disturbance effects.

The presentation given by Dr. Nancy Chen covered her research in modeling the primary, secondary, and tertiaryrecovery processes; simulating complex recovery process in unconventional heavy oil bitumen and tight shale forma-tions; and optimizing reservoir development strategies to maximize oil recovery or net present values.

The presentation given by Dr. Yaushu Wong covered numerical solutions for partial differential equations, mathe-matical model and scientific computing for problems in science and engineering, and data sciences and its applicationsto real world problems.

The presentation given by Dr. Xianguo Li covered his research centred in thermofluid sciences, including fluid flow,heat and mass transfer, thermodynamics, combustion, energy and power systems.

The presentation given by Dr. Gemma Lu covered her current research focused on the interfacial and surfacescience, biomimetic materials, nanocomposites/nanomaterials/nanotechnology, and microfluidic systems applicable inenergy, environmental and biomedical fields such as sustainable energy, oil recovery enhancement, wastewater andtailings treatment, environmental sensing, CO2 capture and utilizations, and soil remediation.

In addition to the regular workshop presentations, there were also considerable time for interactions between par-ticipants. The workshop organizers coordinated round-table sessions where questions and answers were shared. Dueto the high-level and interdisciplinary nature of the participants invited, those who attended this proposed workshopwere exposed to novel ideas and will build new relationships for future collaborations. The workshop presentations,discussion at the round-table sessions and possible future collaborations among the participants will support significantprogress of mathematics, science and engineering modelling and simulation.

Page 306: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

302 Two-day Workshop Reports

Scientific Progress Made in Modeling and Simulation

The objective of this workshop is to bring together the worlds top active researchers (and their more junior counterparts)who study energy and environmental modeling and simulation to discuss past, recent, and prospective advances in thisarea. The speakers have summarized important advances from the past two decades and have discussed the currentunderstandings, the state-of-the-art techniques, and the current major challenges. Each session of this workshop hasprovided a vehicle for participants to learn novel techniques and new advances in this area of work. The content hasbeen academic in nature while addressing the many significant applications for industry [6, 7].

The ultimate goal of the workshop is to expose workshop participants (in particular, junior researchers) to the latestdevelopments in the field of modeling and simulation, while emphasizing the impact of this field on science, engineering,and industry.

The study of the diverse topics presented in the workshop through laboratory experiments, mathematical theory,and computational techniques requires interdisciplinary collaboration between engineers, mathematicians, computa-tional scientists, and researchers working in industry, government laboratories, and academy. The collaborative workof researchers in this workshop will create meaningful progress in predicting, understanding, and optimizing manycomplex phenomena. The rational for this two-day BIRS workshop is to hold lectures that pull together the major ideasand recent research results, chart future directions, and address newly emerging issues for energy and environmentmodeling and simulation. It is anticipated that the participants have left the workshop knowing the future researchdirections and the needed potential applications.

Outcome of the Workshop

The Banff International Research Station is a beautiful location for learning and building relationships. Its commonareas have supported our goals to have researchers engaged in discussion throughout the workshop event. We havebrought researchers from around the world to share their perspectives, test ideas, and create new connections bothintellectually and socially while exploring the latest developments in modeling and simulation. This workshop has pro-moted, enhanced, and stimulated cross-continental research interactions and collaborations in mathematical sciencesand will shape changes in the research work completed with modeling and simulation.

Participants

Chen, Zhangxing John (University of Calgary)Chen, Guohua (Hong Kong Polytechnic University)Chen, Jingming (University of Toronto)Chen, Zhongwei (University of Waterloo)Chen, Nancy Shengnan (University of Calgary)Cheng, Frank (University of Calgary)Huang, Huaxiong (York University)Le, X. Chris (University of Alberta)Li, Xianguo (University of Waterloo)Liang, Dong (York University)Liao, Wenyuan (University of Calgary)Liu, Zheng (Tsinghua University)Lu, Gemma (University of Calgary)Niu, Haibo (Dalhousie University)Sheng, Jinyu (Dalhousie University)Sun, Qiao (University of Calgary)Wong, Yaushu (University of Alberta)Xu, Charles (Western University)Yang, Laurence T. (St Francis Xavier University)Zeng, Fanhua (University of Regina)

Page 307: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Modeling and Simulation: Practical Engineering Applications 303

Zeng, Hongbo (University of Alberta)Zhao, Boxin (University of Waterloo)Zheng, Ying (University of Edinburgh)Zhu, Jesse (Western University)Zu, Jean (Stevens Institute of Technology (US))

Page 308: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] K. Aziz and A. Settari, Petroleum Reservoir Simulation, Applied Science Publishers, London, 1979.

[2] Z. Chen, Reservoir Simulation: Mathematical Techniques in Oil Recovery, in the CBMS-NSF Regional ConferenceSeries in Applied Mathematics, Vol. 77, SIAM, Philadelphia, 2007.

[3] Z. Chen, G. Huan, and Y. Ma, Computational Methods for Multiphase Flows in Porous Media, in the ComputationalScience and Engineering Series, Vol. 2, SIAM, Philadelphia, 2006.

[4] H. Zeng, K. Wu, X. Cui and Z. Chen, A practical model of wettability effect on nanoconfined water flow: Insightsand perspectives, Nano Today, doi.org/10.1016/j.nantod.2017.05.001

[5] K. Wang, H. Liu, J. Luo, K. Wu and Z. Chen, A comprehensive model coupling embedded discrete fractures,multiple interacting continua and geomechanics in shale gas reservoirs with multi-scale fractures, Energy & Fuel, 31(2017), 7758 - 7776.

[6] K. Wu, Z. Chen, J. Li, X. Li, J. Xu and X. Dong, Wettability effect on nanoconfined water flow, Proceedings of theNational Academy of Sciences, 114 (2017), 3358-3363.

[7] K. Wu, Z. Chen, X. Li and X. Dong, Equation of state for methane in nanoporous material at supercritical temper-ature over a wide range of pressures, Nature: Scientific Reports 6, 33461 (2016), 1-10.

304

Page 309: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 29

Transdisciplinary Approaches toIntegrating Policy and Science forSustainability (17w2688)

October 6 - 8, 2017Organizer(s): Gabriela Alonso Yanez (University of Calgary), Kathleen Halvorsen (Michi-gan Technological University), Marcella Ohira (Inter American Institute for Global ChangeResearch)

Overview

In Canada and in many other countries, there is a clear commitment to environmental sustainability through meaning-ful, effective collaboration between discipline specialists, resource-users, communities, and funding agencies. Recognis-ing that effective cross sector collaboration and transdisciplinary (TD) integration is rare, the workshop overarchinggoal was to train future leaders in transdisciplinary approaches for policy and science for sustainability in the Amer-icas. The workshop enabled 25 participants to be part of several educational and training activities designed to teachelements for successful teamwork; multi-disciplinary method and methods to engage stakeholders; project manage-ment and criteria for teamwork evaluation; science communication to different audiences; and revising and finalizingtransdisciplinary team proposals.

Recent Developments and Open Problems

A growing body of scholarship identifies obstacles that hinder truly transdisciplinary collaborations. For instance,the lack of active and sustained interaction of scientists with stakeholders, the absence of collaboration for framingproblems and setting shared agendas and goals, and a shortage of effective communication pathways among researchersand policy-makers serve to negatively impact the transfer, dissemination, and use of scientific knowledge ([1], [2], [3],[4]). This is particularly relevant for teams currently addressing global socioecological problems, which feature highdiversity of team membership and high task-interdependency.

305

Page 310: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

306 Two-day Workshop Reports

Presentation Highlights(i) Members of First Nations communities led several discussions and support participants in better understanding

the perspectives and views of non-academic actors in transdisciplinary projects.(ii) The Improvisational workshop to help participating scientists better communicate their findings.

(iii) Parks Canada staff led discussions on innovative strategies for protecting natural resources informed by a socioe-cological approach.

(iv) In collaboration with the Calgary Public Library and the Latin American Research Center, the seminar includeda multidisciplinary seminar on Global Change and cross sector collaboration with representatives from varioussectors of Canada’s First Nations, Academics, Calgary legislative and private sectors.

(v) Session on traditional land and indigenous ways of knowing including a visit to two Sacred sites (Sundial HillMedicine Wheel and Women’s Buffalo Jump) of the Black Foot First Nation led by First Nations contrasting withLatin American indigenous communities.

Scientific Progress Made

Recent research concludes that training interventions offer a promising opportunity to improve performance of in-ternational TD teams by teaching skills and competencies necessary to enable multi-cultural and multi-institutionalcollaborations ([5], [6]). These training interventions are particularly relevant for a variety of audiences working onpracticing and promoting TD ”namely, those who make policy, who fund science, who carry out research and who or-ganize and implement institutional systems” ([7], p. 62). Professional development designed specifically to improve andsustain TD work such as the one that took place in BIRS Banff are just beginning to emerge ([8], [9], [10]). However,these capacity building initiatives are critical for addressing policy relevant global change issues of the 21st century.

Outcome of the Meeting

Four transdisciplinary project proposals developed at the Seminar are now in the early stages of advancement andimplementation. They include 25 participants from 14 countries representing universities, various tiers of government,NGOs, and the private sector.

Group 1. One group project concerns work to identify and characterize features of community governance thataffect water provision in four locations in the Latin American countries of Chile, Brazil, Colombia and Argentina. Theteam’s main goal is to gather information about the ways in which people organize themselves and make decisions aboutwater provision at a local scale in four different Latin American sites. Knowledge about water provision at local scale iscentral to plan for strategies to address extreme droughts in the region.

Group 2. Another team is focusing on developing a framework to manage temperate grassland ecosystems in Ar-gentina, Brazil, Canada and Uruguay. Working in close collaboration with scientists, government staff, NGOs andfarmers, the team plans to create a framework that identifies and considers the ways in which various social groups andsectors currently adopt sustainable practices in temperate grasslands to design policy strategies that responds to localneeds.

Group 3. By researching influences of livelihood and governance mechanisms on resource management, this teamhopes to provide concrete recommendations that could help strengthen conditions in which people address stressorsaffecting coupled-human natural systems, particularly in smallholder contexts. Through this project, they aim to assessthe influence that smallholder livelihoods and governance mechanisms have on the management of both Peruvian small-scale shark fisheries and the Bolivian smallholder agriculture. In each, they will identify main livelihoods and sourcesof income within target communities, reliance on common-pool resources, main formal and non-formal governancemechanisms and develop recommendations to enhance the management of common-pool resources in regards to locallivelihood and governance structures. Their project will use mixed methods, combining quantitative and qualitativemethodologies specific to each case study.

Group 4. This project will carry out a comparative case study in two countries in the LAC region: Colombia andthe Dominican Republic. In addition to the case studies, secondary information comes from another three countries

Page 311: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Transdisciplinary Approaches to Integrating Policy and Science for Sustainability 307

in the region: Jamaica, Cuba and Guatemala. This will allow a better understanding of the general situation in theregion, in order to identify the best solutions and alternatives for cocoa producers to adapt to the adverse effects ofclimate change. Besides capacity building for cocoa producers to address climate change, the project will contributeto the literature and practice of the measure and value of ecosystem services In addition; an important outcome isthe modelling of future climatic scenario within the cocoa production cycle. Using systems dynamics to integrate thedifferent variables involved, such as the production, climate, social and economic variables. This will allow integratingan innovative transdisciplinary perspective to the results of the process a perspective.

The outcomes of these projects go beyond developing reports and scientific journal articles, and include directinvolvement with communities in an effort to co-produce knowledge that is applicable to real-world current situations.

As a result of the seminar, the IAI has offered to provide US$ 5,000 to three teams, which received positive proposalreviews from the seminar steering committee. The four teams are in the process of submitting their transdisciplinaryproposals to national and international funding agencies to raise additional funds.

For further information about the seminar program, participants, lectures and outcomes, please access

http://www.iai.int/?p=24364&lang=en

Participants

Alfaro, Gabriela (Universidad del Valle de Guatemala)Alonso-Yanez, Gabriela (University of Calgary)Bazely, Dawn (York University)Big Head, Ramona (Kainai Board of Education)Birthwright, Anne-Teresa (The University of the West Indies)Carrizo, Erica (Ministry of Science Technology and Productive Innovation)del Callejo, Ivan (Universidad Mayor de San Simon)Donald, Dwayne (University of Alberta)Duchicela, Jessica (Universidad de Las Fuerzas Armadas - ESPE)Fernandez Velazquez, Alexander (Province Delegation Ministry of Science Technology and Environment)Forsberg, Kerstin (Planeta Oceano)Halvorsen, Kathleen (Michigan Technological University)Hanna Collado, Jeniffer (National Council for Climate Change and Clean Development Mechanism of the DominicanRepublic)House-Peters, Lily (California State University Long Beach)Krogman, Naomi (University of Alberta)Liebman, Alexander Max (1989)Lui, Gabriel (Ministry of the Environment)Modernel, Pablo (Universidad de la Republica)Ohira, Marcella (Inter-American Institute for Global Change Research)Onate, Bastian (Corporacion para la Conservacion y Uso Sustentable de los bosques de Alerce)Padgurschi, Maıra (Brazilian Platform on Biodiversity and Ecosystem Services/State Unviersity of Campinas)Quarrington, Caitlin (MindFuel (Science Alberta Foundation))Rincon, Alexander (National University of Colombia)Souza, Tatiana (Conservation International -Brazil)Toledo, Mariana (Inter-American Institute for Global Change Research)Velasco, Alejandro (Quality Energy Solutions)Vergara, Jorge (Comunidad Indıgena Quechua de Quipisca)Vilarino, Maria Virginia (Argentinean Business Council for Sustainable Development)Zaldua, Natalia (Vida Silvestre Uruguay)

Page 312: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] K. Borner, et al., A multi-level systems perspective for the science of team science, Science Translational Medicine2(49) (2010), 24–49.

[2] S. L. Collins, et al., An integrated conceptual framework for long-term social?ecological research. Frontiers inEcology and the Environment 9 (2011), 351–357.

[3] S. J. Goring, et al., Improving the culture of interdisciplinary collaboration in ecology by expanding measures ofsuccess, Frontiers in Ecology and the Environment 12(1) (2014), 39–47.

[4] M. Mahdizadeh, A. Heydari, H. K. Moonaghi, Clinical interdisciplinary collaboration models and frameworks fromsimilarities to differences: A systematic review, Global Journal of Health Science 7(6) (2015), 170–180.

[5] B. Baker, Space tilings and local isomorphism, BioScience 65(7) (2015), 639–644.

[6] V. Gewin, Collaborations: recipe for a team, Nature 523(7559) (2015), 245–247.

[7] S. Cornell, et al., Opening up knowledge systems for better responses to global environmental change,Environmental Science & Policy 28 (2013), 60–70.

[8] D. M. Aronoff and M. Wolff, A Review of the Website TeamScience.net, Clinical medicine & research 10(1) (2012),38–39.

[9] Inter American Institute for Global Change Research, Capacity Building, Training Events, (2017), Available athttp://www.iai.int/?page_id=962&lang=en

[10] M. O’Rourke and S. J. Crowley, Philosophical intervention and cross-disciplinary science: The story of the ToolboxProject, Synthese (2013), 1–18.

308

Page 313: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 30

Retreat for Young Researchers inStochastics (17w2695)

October 20 - 22, 2017Organizer(s): Martin Barlow (UBC), Chris Hoffman (U. Washington), Mike Kouritzin (U.Alberta)

Overview

This was the third annual meeting of the PIMS Postdoctoral Training Centre in Stochastics (PTCS). The Retreat offersan opportunity for young probabilists or users of probabability from Western Canada and Washington state to interact,communicate their recent results and ongoing research programs and initiate new collaborations. The participants were9 postdocs, either supported by PTCS or members of the PTCS network, 5 junior faculty at PIMS Universities, and 9senior faculty from PIMS Universities; many of those in the last group are advisors for PTCS postdocs.

Presentation Highlights

The range of topics presented covered many of the active areas in modern probability, and included stochastic differ-ential equations and stochastic partial differential equations, stochastic perturbations of dynamical systems, ergodictheory, compressed sensing and epidemic models in mathematical biology.

Noah Forman (PTCS, UW) presented his work on dynamic random trees. These arise in models in populationgenetics. The main result gives an extension to random trees of the classic characterization of exchangeable randomvariables on N as a mixture of independent sequences. The lifting of this result to the tree case requires a numberof pruning and re-grafting operations on random trees. This work raises several interesting problems concerning thelimiting object, which is the continuum random tree of Aldous.

Eric Foxall described an epidemic model (the ‘stochastic logistic model’) on a network. His talk presented thesimplest case, that of a complete graph, and gave a detailed analysis of the structure and limits of the model near thecritical point. The complete graph has little geometric structure, and a long term goal of this research is to look atnetworks with more interesting and realistic geometry.

The talks by Gerrardo Barrera Vargas and Zhonwei Shen looked at small stochastic perturbations of dynamicalsystems. This area has quite classical roots – the pioneering work of Ventsel and Freidlin was in the late 1960s, but

309

Page 314: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

310 Two-day Workshop Reports

our understanding of the connections between ‘chaotic dynamics’ and ‘stochastic dynamics’ is very incomplete. Theresearch group of Y. Yi at U. Alberta is now very active in this area, and this workshop was helpful in building linksbetween this group and the probability groups at UBC and UW.

Shirou Wang described new work in ergodic theory, which gives that the entropy map upper semi-continuous for aclass of dynamical systems.

Wenning Wei described her work with S. Tang (Fudan U.) on Holder regularity in space of solutions to BackwardsStochastic PDE. (BSPDE). BSPDE’s arise for example in optimal control problems for (forward) SPDE’s. For a par-ticular class of linear BSPDE’s they are able to show the solution has sufficient smoothness in space to give the BSPDEa classical interpretation. A key idea is to consider spatial regularity into the space of L2 functions of time, which is anovel perspective and gives new results, even for the deterministic setting.

Simone Brugiapaglia discussed his recent work with Ben Adcock (SFU) and Casie Bau (Grad student now at UBC)on sparsity-based approaches for high dimensional function approximation.

Participants

Barlow, Martin (University of British Columbia)Barrera Vargas, Gerado (University of Alberta)Brugiapaglia, Simone (Simon Fraser University)Burdzy, Chris (University of Washington)Chen, Zhen-Qing (University of Washington)Forman, Noah (University of Washington)Foxall, Eric (U. Alberta)Gauthier, Carl-Erik (U Washington)Hoffman, Christopher (University of Washington)Hu, Yaozhong (University of Alberta)Kouritzin, Michael (Univerisity of Alberta)Lohmann, Martin (University of British Columbia)Murugan, Mathav (University of British Columbia)Perkins, Edwin (University of British Columbia)Qiu, Jinniao (Department of Mathematics & Statistics, University of Calgary)Ray, Gourab (University of Victoria)Sezer, Deniz (University of Calgary)Shen, Zhongwei (University of Alberta)Swishchuk, Anatoliy (University of Calgary)Wang, Shirou (University of Alberta)Wang, Jiun-Chau (University of Saskatchewan)Ware, Tony (The University of Calgary)Wei, Wenning (U. Calgary)

Page 315: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

311

Page 316: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

312 Two-day Workshop Reports

Page 317: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

FocusedResearch

GroupReports

Page 318: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

314 Focused Research Group Reports

Page 319: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 31

Extensions of index theory inspired byquantum scattering theory (17frg668)

June 18 - 25, 2017Organizer(s): Alan Carey (Australian National University), Fritz Gesztesy (Baylor Uni-versity)

Overview of the Field

In this section we describe the background to the meeting as it stood prior to the results announced by the participants.

The part of quantum theory that is relevant to this proposal is scattering theory. We approach scattering theory viathe spectral shift function theory of Krein, Koplienko, and Potapov–Skripka–Sukochev [12, 13, 14, 15].

On the geometric side following from [6] the work of Carey, Grosse and Kaad has introduced a new spectral invari-ant the ‘homological index’ [9, 7]. This homological index provides the appropriate framework for a major extensionof older ideas of R. Carey and J. Pincus [10] on almost commuting operators. It is expressed as a pairing of a newcyclic homology theory developed in [9] with its dual cohomology theory. An open question is whether we can relatethe homological index to the spectral shift function. In addition we want to know if it is possible to use cyclic theory toinvestigate the latter and conversely use the spectral shift function to calculate the homological index.

One major obstacle is the lack of computable examples. We expect to be able to understand these spectral functionsin the case of scattering for supersymmetric quantum Hamiltonians, Dirac-type operators on non-compact manifoldsand their perturbations. In particular, our aim is to connect the spectral properties of the operators under investigationto invariants such as spectral flow, the Fredholm index, Witten index and generalisations. Previous work in this direction[1] uses the de Rham complex viewed a la Witten as a supersymmetric system. This work, though it shows connectionsto spectral geometry, is incomplete and the conjectures made there about extensions to more general situations werenever published.

The work on the homological index in examples [7] shows that there is a deeper extension of index theory to the non-Fredholm case not understood in the 80s. In addition, our previous work has already led to a remarkable new insightinto a question first raised by Atiyah-Patodi-Singer on the relationship of spectral flow for paths of Dirac operators onodd-dimensional manifolds to the Fredholm index for a Dirac operator on the suspension of the manifold. We summarisethis aspect now because during the BIRS meeting progress was made.

315

Page 320: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

316 Focused Research Group Reports

This line of research started in [2] where it was shown that, when both exist, spectral flow is given by Krein’s spectralshift function (even in semi-finite von Neumann algebras). Related work for operators with essential spectrum appearedin [11] in 2011 (a long paper in Advances in Math.) Two main results were obtained in this latter paper, the first being atrace formula and the second, a generalised Pushnitski formula [15].

The focus then however, was on Fredholm theory inspired by the fact that spectral flow relates directly to thetopology of the underlying manifold on which the path of operators is defined. The surprising discovery made in [8] isthat the trace formula due originally to Pushnitiski [15] and generalised in [11] is in fact an operator trace identity thathas as a corollary, in the Fredholm case, the relationship between spectral flow and the Fredholm index first discoveredby Atiyah-Patodi-Singer. The results of [8] do not depend on a Fredholm assumption and thus go far beyond the resultsof Robbin and Salamon [16].

The main aim of this focussed research group proposal is to investigate generalisations of [11] to situations involvingnon-Fredholm operators inspired by the main result of [8]. One specific question is whether the spectral shift functioncan be really thought of as a generalisation of spectral flow to non-Fredholm situations. As an adjunct to this we willinvestigate the Witten index as a method of calculating this generalised spectral flow.

In [11] and [6] it is assumed that we have self adjoint operators D and perturbations A such that for complex z notin the spectrum of D the product A(D − z)−1 is trace class (i.e., one employs a relatively trace class condition). Thisrules out geometric examples. In work underway we have shown that the relatively Schatten class condition such as,A(D − z)−n, n = 2, 3, . . . , is trace class, is what is needed for the geometric case (and PDEs in general).

Our primary technical objective is to relax this (severe) relatively trace class perturbation restriction so as to havevery widespread applications in mathematical physics and geometry. There are some papers on the arxiv already ongraphene and some unpublished work of Moore and Witten that contain examples of great interest. Moreover condensedmatter theorists have speculated about generalisations of the Robbin-Salaman result for the study of spectral flow intheir models.

Thus the study of examples in two and three dimensions is already important and it will assist to provide insight intogeneral theory and applications. We anticipate interesting connections with condensed matter theory where spectralflow has played a role in the study of many model Hamiltonians.

Recent Developments and Open Problems

2.1 The spectral shift function. For a pair of self-adjoint not necessarily bounded operators A0 and A1 on a Hilbertspace H such that their difference, A1 − A0, is trace class, there exists a unique function ξ ∈ L1(R) (known as theKrein–Lifshitz spectral shift function) satisfying the fundamental trace formula, Tr(φ(A1)−φ(A0)) =

∫R ξ(x)φ′(x)dx,

whenever φ belongs to a class of suitably admissible functions.

More precisely, we are considering the following situation. Take a path of self adjoint operators A(t), t ∈ R on aHilbert space H0 and form DA = (d/dt) + A acting on H+ = L2(R; dt;H0). Here (DAu)(t) = u′(t) + A(t)u(t)u ∈ L2(R; dt;H0), t ∈ R, and A is a direct integral of the family A(t)t∈R (with asymptotes A(t) −→

t→±∞A± in the

norm resolvent sense) over R. If we let D+ = DA, D− = D∗+ and D be the 2 × 2 matrix-valued operator introducedabove acting onH = H+ ⊕H+, then under a relatively trace class assumption on A± we can compute the relationshipbetween spectral flow for the path A(t), the spectral shift function for the pair A±.

It is important to note that the spectral shift function also exists when no Fredholm condition is assumed and henceno standard notion of spectral flow exists. This raises the interesting question of what the spectral shift function ismeasuring geometrically when the Fredholm condition is absent.

2.2 Supersymmetry and graded spaces

We are given a Hilbert spaceH equipped with aZ2 grading γ, (that is, γ∗ = γ, γ2 = I), and a self adjoint unbounded

Page 321: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Extensions of index theory inspired by quantum scattering theory 317

operator D such that Dγ + γD = 0. Then we may write,

D =

(0 D−D+ 0

).

In addition, we are given self-adjoint operators H0 and H1 (think of H0 = D+D− and H1 = D−D+) acting on aseparable graded Hilbert space H, and ask for which scalar functions f , the difference f(H1) − f(H0) lies in the nth

Schatten–von Neumann ideal Ln(H). This question has been topical in perturbation theory for over 60 years startingin 1953 for the case where the difference V := H1 −H0 belongs to the trace class. In fact, Krein proved that

ξ(λ;H1, H0) =1

πlimε→0+

=(ln(DH1/H0(λ+ iε))) for a.e. λ ∈ R, (31.0.1)

or, equivalently, in terms of the perturbation determinant,

det((H0 − zI)(H1 − zI)−1) = exp

(∫R

ξH0,H1(λ)

(λ− z) dλ

), z ∈ C\R. (31.0.2)

Based on what we know about the relationship between Krein’s spectral shift function and standard spectral flow, wehave conjectured that there is a generalised spectral flow formula, for a relatively Schatten class perturbation conditionin the non-Fredholm case.

2.4 Details on the higher Schatten conjectures.

Recalling the notation of subsection 2.2, we remark that Atiyah, Patodi, and Singer studied operators of the formDA that we defined there. This class of operators form models for more complex situations. They arise in connectionwith Dirac-type operators (on compact and noncompact manifolds), the Maslov index, Morse theory (index), Floerhomology, Sturm oscillation theory, etc.

One of the principal aims of such a study is to compute the Fredholm index of DA. The most general publishedcomputation of this kind to date is contained in [11] under the assumption that A+−A− is of trace class relative to A−.Note that prior to [11] and [15] it was required to assume that operators in the path A(t) had discrete spectrum.

Invoking the spectral shift function ξ(λ;A+, A−) and the perturbation determinant associated to the pair (A+, A−),DA+/A−(z) = detH((A+−zI)(A−−zI)−1) corresponding to the pair (A+, A−), the following was established in [11],

Index(DA) = SpFlow(A(t)∞t=−∞) = ξ(0;A+, A−)

= π−1 limε↓0=(

ln(detH

((A+ − iε)(A− − iε)−1))

= ξ(0+;H1, H0).

We conjectured that these results will generalise when we have relatively Schatten class perturbations.

Work underway

We have a complete theory for the case of one dimension that has just been published. With A− = didt

, A+ = A− + Fwhere F is a self adjoint N ×N matrix acting onH = L2(R;CN ), we were able to establish the following:

(i) A Pushnitksi formula (cf. [15]), implying that

ξA−,A+ = ξH1,H2 ,

where H1, H2 are as above acting on K = L2(R,H);

(ii) explicit formulas for ξA−,A+ for these examples;

(iii) the trace formula:

2zTrK[(z +H1)−1 − (z +H2)−1] = TrH[gz(A−)− gz(A+)],

Page 322: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

318 Focused Research Group Reports

where gz(x) = x(z + x2)−1/2 (a smoothed version of the sign-function for z < 0);

(iv) an abstract framework for relatively Hilbert-Schmidt perturbations A+−A− in which the Pushnitski formulaand hence the result mentioned in (i) could be proved;

(v) the existence of a class of examples of pseudo-differential operators in any dimension to which the methoddeveloped in [11] applies.

The main gap

To utilise the methods we have employed previously, we need to study scattering for Dirac-type operators in two andthree dimensions. The existing literature is not adequate for our purposes. The objective then is to extend the one-dimensional analysis described above point by point to higher dimensions.

Presentation Highlights

(i) Alan Carey summarized the state of the art of spectral flow as it pertained to our meeting.

(ii) Galina Levitina explained the proof of the principal trace formula and the generalisations of Pushnitski’s work,applicable to higher dimensions.

(iii) Roger Nichols and Fritz Gesztesy described in detail the proof of absence of singular continuous spectrum forflat space, massless Dirac-type operators based on the strong limiting absorption principle in all space dimensions ≥ 2.In addition, a global limiting absorption principle (for the entire real line) for free, massless Dirac-type operators, againin all space dimensions ≥ 2, was presented.

(iv) Jens Kaad explained the recent work on the homological index and the resolvent version of the principal traceformula (PTF).

Scientific Progress Made

As a result of Galina Levitina’s presentation we can announce an end point to the work begun in [11] so that therelationship between the Witten index and generalised spectral and between the associated spectral shift functions hasbeen completely settled [5].

In interactions between the participants, the outstanding example of two-dimensional systems using Dirac-typemassless Hamiltonians was resolved in the affirmative. In fact, a first attempt prior to our Banff meeting required thelimitation of space dimension ≥ 3, but due to interactions at the meeting this could be extended to the important two-dimensional case (with potential interest in connection with graphene applications). Thus, a proof of the global limitingabsorption principle in all space dimensions ≥ 2 emerged during the meeting and will result in paper [4]. The latteralso contains essential self-adjointness results of Dirac-type operators which are of interest in their own right. No doubt,without the opportunity afforded by BIRS, this research would not have been done as it involved essential contributionsby nearly every participant at the meeting.

In addition, a proof of the limiting absorption principle for all nonzero energies for massless Dirac-type operatorswith matrix-valued potentials (applicable to electromagnetic potentials decaying at infinity like |x|−ρ, with ρ > 1) in alldimensions ≥ 2, has now been established [3].

A plan was formulated for publications from the meeting and we have identified at least three lengthy papers (oreven monographs), reporting our results being completed over the next six months.

Page 323: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Extensions of index theory inspired by quantum scattering theory 319

Outcome of the Meeting

The principal outcomes were,

(i) the finalisation of the proof of the principle trace formula as an operator trace identity (cf. [5]),

and,

(ii) establishing the limiting absorption principle for certain Dirac-type operators on flat space in all dimensions≥ 2(cf. [3], [4]).

Participants

Carey, Alan (Australian National University)Gesztesy, Fritz (Baylor University)Kaad, Jens (University of Southern Denmark)Levitina, Galina (UNSW Sydney)Nichols, Roger (Univ. of Tennessee at Chattanooga)Sukochev, Fedor (University of NSW)

Page 324: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] N. V. Borisov, W. Muller, and R. Schrader, Relative index theorems and supersymmetric scattering theory,Commun. Math. Phys. 114 (1988), 475–513.

[2] N. Azamov, A.L. Carey, and F.A. Sukochev, The spectral shift function and spectral flow, Comm. Math. Phys.276 (2007), no.1, 51–91; and N.A. Azamov, A.L. Carey, P.G. Dodds, and F.A. Sukochev, Operator integrals,spectral shift, and spectral flow, Canad. J. Math. 61 (2009), 241–263.

[3] A. Carey, F. Gesztesy, G. Levitina, R. Nichols, F. Sukochev, and D. Zanin, On the limiting absorption principlefor massless Dirac operators, to be submitted to SpringerBriefs in Mathematical Physics.

[4] A. Carey, F. Gesztesy, J. Kaad, G. Levitina, R. Nichols, D. Potapov, F. Sukochev, and D. Zanin, On the globallimiting absorption principle for massless Dirac operators, to be submitted for publication.

[5] A. Carey, F. Gesztesy, G. Levitina, D. Potapov, and F. Sukochev, On the relationship of spectral flow to theFredholm index and its extension to non-Fredholm operators, to be submitted for publication.

[6] A. Carey, F. Gesztesy, D. Potapov, F. Sukochev, and Y. Tomilov, A Jost–Pais-type reduction of Fredholm deter-minants and some applications, Integral Eq. Operator Theory 79 (2014), 389–447.

[7] A. Carey, H. Grosse, and J. Kaad, Anomalies of Dirac type operators on Euclidean space Commun. Math.Phys. 335 (2015) 445 - 475,

[8] A. Carey, H. Grosse, J. Kaad, On a spectral flow formula for the homological index, Adv. Math. 289 (2016),1106–1156.

[9] A. Carey and J. Kaad, Topological invariance of the homological index, J. reine angew. Math. 729 (2017),229–261.

[10] R. W. Carey and J. D. Pincus Index theory for operator ranges and geometric measure theory, in GeometricMeasure Theory and the Calculus of Variations, W. K. Allard and F. J. Almgren (eds.), Proc. of Symposia inPure Math., Vol. 44, 1986, pp. 149–161.

[11] F. Gesztesy, Y. Latushkin, K.A. Makarov, F. Sukochev, and Y. Tomilov, The index formula and spectral shiftfunction for relatively trace class perturbations, Advances in Math. 227 (2011), 319–420.

[12] F. Gesztesy and K.A. Makarov, The Ξ operator and its relation to Krein’s spectral shift function, J. Anal. Math.81 (2000), 139–183.

[13] F. Gesztesy, A. Pushnitski, and B. Simon, On the Koplienko spectral shift function. I. Basics, J. Math. Phys.,Anal., Geometry, 4 (2008), 63–107.

[14] D. Potapov, A. Skripka, F. Sukochev, Spectral shift function of higher-order, Invent. Math. 193 (2013), 501–538.

[15] A. Pushnitski, The spectral flow, the Fredholm index, and the spectral shift function, in Spectral Theory of Differ-ential Operators: M. Sh. Birman 80th Anniversary Collection, T. Suslina and D. Yafaev (eds.), AMS Transla-tions, Ser. 2, Advances in the Mathematical Sciences, Vol. 225, Amer. Math. Soc., Providence, RI, 2008, pp.141–155.

[16] J. Robbin and D. Salamon, The spectral flow and the Maslov index, Bull. London Math. Soc. 27 (1995), 1–33.

320

Page 325: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 32

Stochastic lattice differential equationsand applications (17frg671)

September 10 - 17, 2017Organizer(s): Xiaoying Han (Auburn University), Hakima Bessaih (Department of Math-ematics, University of Wyoming), Maria J. Garrido-Atienza (University of Sevilla), BjornSchmalfuss (Institut for Mathematics)

Introduction

Lattice dynamical systems arise naturally in a wide range of applications where the spatial structure has a discrete char-acter, such as image processing, pattern recognition, and chemical reaction theory. For some cases, lattice dynamicalsystems arise as discretization of partial differential equations, while they can be interpreted as ordinary differentialequations in Banach spaces which are often simpler to analyze.

In particular, lattice systems have been used in biological systems to describe the dynamics of pulses in myelinatedaxons where the membrane is excitable only at spatially discrete sites. For example, dynamics of excitable cells can bedescribed by the following lattice system:

∂vn∂t

= d(vn+1 − 2vn + vn−1) + f(un, vn), (32.0.1)

∂un∂t

= g(un, vn), (32.0.2)

where v represents the membrane potential of the cell, u comprises additional variables such as gating variables, chemi-cal concentrations, necessary to the model, the subscript n indicates the nth cell in a string of cells, and d is the couplingcoefficient d = 1/R, where R is the inter-cellular resistance. Typical models that can be fit in the scheme of (32.0.1) -(32.0.2) include Beeler-Reuter, Hodgkin-Huxley, FitzHugh-Nagumo, and many other models.

Lattice systems have also been used in fluid dynamics to describe the fluid turbulence in shell models. For example,the GOY and Sabra models:(

∂t+ νk2

n

)un = i(anun+1un+2 + bnun−1un+1 + cnun−1un−2)∗ + fn, (32.0.3)

where u represents the complex modes, n denotes the “shell index” that runs from 1 to ∞, and ∗ stands for complexconjugation.

321

Page 326: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

322 Focused Research Group Reports

Most systems in natural sciences and engineering are affected by uncertainty. From modeling point of view, totake into account the uncertainty, random effects have to be included. These random effects are considered not only ascompensations for the defects in some deterministic models, but rather essential phenomena. Stochastic lattice systemscome into play to describe systems with discrete spatial structure and random spatio-temporal forcing, i.e., noise. Morespecially, the term f(un, vn) ca be described as follows:

f(un, vn) = σ(un, un)dBHn (t) (32.0.4)

where BHn (t) is Brownian (or fractional) Brownian motion of Hurst parameter H .

Summary of activities held at the BIRS station

Our research team was scheduled to meet on September 11-18, 2016 but that date got re-scheduled because of timeconflicts with some members of the team. The meeting was held a year later: September 10-17, 2017. Hence, part of theresearch proposed initially was already finished before the meeting, see [1].

The team had lively discussion during their stay at the BIRS station and brainstorming ended up very positive andwe have already a few new projects lined up. We will inform BIRS when manuscripts are ready and will acknowledgeBIRS support. In particular, we have 3 defined projects outlined below:

Problem I. Synchronization of lattice equations

The synchronization of systems in physics and other sciences is an important issue. For an overview we refer to A.Pikovsky et al. First time that synchronization was considered from the point of view of physics goes back to C. Huygens.He observed in 1665 that two pendulum clocks which hang at the same wall or another structure oscillate synchronizedafter some time. Similar phenomena can be observed in other systems. To forecast synchronization in physical systems itis necessary to formulate mathematical models and to find mathematical tools which allow to interpret these phenomenain the sense of dynamical systems. An appropriate model for our physical system should contain random parameter.Hence the theory of random dynamical systems seems to provide the right tools to deal with the systems mentionedabove.

The subgroup of the Focused Research Team by Bessaih, Garrido-Atienza and Schmalfuß dealt to formulate aresearch plan for the following system. We consider the following lattice system

dui(t) = (ν(ui−1 − 2ui + ui+1)− λui + fi(ui)) dt+ σihi(ui)dωi(t), i ∈ Z. (32.0.5)

We consider any of these equation as a nonlinear model of a physical grid. Any grid is coupled with the neighborelements of the grid by a linear operator. The subsystems of the grid is perturbed by a noise dωi (either Brownian orfractional Brownian motion).

Consider now two or a finite number of parallel grid. The mathematical model is a system of equations like (32.0.5)where maybe the nonlinear parts can vary form equation to equation. These models also contain an operator which isresponsible for the coupling of the systems of the different grids. In the case of two systems we consider

du =(νAu− Λu− κK1(u− v) + F (u))dt+ dω

dv =(νAv − Λv − κK2(v − u) +G(v))dt+ dω(32.0.6)

which is now a system of stochastic evolution equations in the space of square integrable functions such that u, v ∈l2. Here A denotes a (generalized) linear operator responsible for the interaction of neighbored elements in each of thegrids. The coupling between different grids is given by the operators Ki.

The first step we had to overcome is to ensure that the system of equations generates a random dynamical systemφκ which depends on the intensity κ of the coupling.

One of our goals is to proof the following theorem:

Page 327: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Stochastic lattice differential equations and applications 323

Theorem 32.0.1 Assume appropriate assumptions such that (32.0.6) generates a random dynamical system on l2 × l2.Assume that Kj is appropriate.

• Then for κ > κ0 the random dynamical system has a random attractor Aκ.

• We havelimκ→∞

dl2×l2(Aκ,A∞) = 0

where A∞ = A0 ×A0 to be the attractor of the (random) dynamical system generated by

du = (νAu− Λu+1

2(F (u) +G(v))dt+

1

2d(ω − ω).

• For κ > κ0 and the additional assumption F = G we have synchronization on the level of trajectories, namely

limt→∞

‖u(t)− v(t)‖l2 = 0.

To interpret this theorem we note at first that a (random) attractor is a small often finite dimensional set in a infinitedimensional phase space which is in our case l2 × l2. This set contains essential information about the long term be-havior of a system. The second statement of the above theorem allows us to conclude that for large intensity parameterthe motion of the coupled system takes place in a neighborhood of the product of the attractor of an averaged system.The third statement teaches us that the long time states of the subsystems u, v are synchronized. We are convinced thatthere are many applications for our results.

Problem II. Dynamics of lattices generated by partial differential equations with nonlocal terms

T. Caraballo proposed and discussed with the organizers some details about the idea of studying the asymptoticbehaviour of the lattice dynamical system generated by the discretization of this problem:

du

dt− a(l(u))∆u = f t > 0, x ∈ R

u(0) = u0 ∈ L2(R).(32.0.7)

We consider l(u) =∫R u(x)dx, the function a is continuous and bounded by two positive constants:

0 < m ≤ a(s) ≤M ∀s ∈ R.

A tentative discretized problem could beduidt

= a(l(−→u ))(ui−1 − 2ui + ui+1) + f t > 0, x ∈ Rui(0) = (u0)i,

(32.0.8)

where, for instance, we could use the trapezium formula, l(u) ≈ l(−→u ) to discretize the nonlocal term, although otheralternatives could be considered as well.

The main objectives could be:

(i) To prove that (32.0.8) can be rewritten like a system of ordinary differential equations (lattice) d−→udt

= a(l(−→u ))A−→u + f = F (−→u )

u(0) = u0 ∈ l2,(32.0.9)

where −→u = (ui)i∈Z.

(ii) To prove existence and (eventually) uniqueness of solution in l2 for the problem (32.0.9). As far as we know, thisis a new problem still not analyzed in the literature.

(iii) To study the existence of attractors.

(iv) To analyze more general frameworks (considering more general terms like f(u), N > 1).

Page 328: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

324 Focused Research Group Reports

During the discussions, it was emphasized that, before carrying out this program, it would be necessary a deeperstudy about the PDE model with nonlocal term and all of its dynamical aspects. In fact, there exists an extensive litera-ture already published on this model with interesting applications to some problems of the applied sciences. This meansthat this is a long term program which cannot be developed in a short stay, but that it is worth being analyzed in future.

Problem III. Neural oscillator networks models

Han proposed a stochastic lattice dynamical system arising from neural oscillator networks. The system reads:

θi = fi + z(θi)(∑j 6=i

ajig(θj) + εiI(t)). (32.0.10)

Here the unknown variables θi are the states of the phase oscillators, i.e., angles parametrized by numbers in theinterval [0, 1] with periodic boundary conditions. The coupling matrix aji describes the network structure, and thefunction z(θ) measures how big an effect an input has on a neuron in state θ. I(t) in an external input and εi is theintensity of external forcing current at the ith neuron.

To align with the topic of the focused group research, Han proposed to study the dynamics of the above latticesystem where I(t) is a random input, such as, Brownian motion or fractional Brownian motion. The group had a livelydiscussion on what the exact meaning of the model was, and what useful information could be obtained by studying sucha system. In the end the group concluded that deeper understanding is needed to further consider this system.

During the rest of their stay, the subgroup formed by T. Caraballo and X. Han started an extensive literature reviewon the system (32.0.10), and gained an overview of system (32.0.10) as well as lattice models for neural networks ingeneral. They initiated a detailed plan to study the stability and attractivity of system (32.0.10) and its generalizations.The plan involves T. Caraballo, X. Han and one of X. Han’s current Ph.D. student. Two weeks after the closure of thisfocused research group, T. Caraballo visited X. Han at Auburn University, when the three of them were able to carryout the plan and obtain necessary analysis. Primitive calculations indicate that system (32.0.10) is stable and attractivein a proper sequence space under certain assumptions. More solid results are expected next year.

Participants

Bessaih, Hakima (University of Wyoming)Caraballo, Tomas (Universidad de Sevilla)Garrido-Atienza, Maria J. (University of Sevilla)Han, Xiaoying (Auburn University)Schmalfuss, Bjorn (Friedrich-Schiller-University Jena)

Page 329: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] H. Bessaih, M.J. Garrido-tienza, X. Han, B. Schmalfuß, Stochastic Lattice Dynamical Systems with Fractional Noise,SIAM Journal Anal. Math., 49(2):1495–1518, 2017.

325

Page 330: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

326 Focused Research Group Reports

Page 331: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Researchin TeamsReports

Page 332: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

328 Research in Teams Reports

Page 333: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 33

Derivative Free and Black BoxOptimization (17rit681)

June 4 - 11, 2017Organizer(s): Warren Hare (University of British Columbia), Charles Audet (Polytech-nique Montreal)

Overview of the Field

Derivative-free optimization (DFO) is the mathematical study of the optimization algorithms that do not use derivatives.While a DFO algorithm was used to test one of the worlds first computers (the MANIAC in 1952), it was not until the1990s that DFO algorithms were studied mathematically.

Blackbox optimization (BBO) is the study of optimization problems where the objective function is a blackbox. Thatis, no analytic description of the function is available, but given an arbitrary input the blackbox returns a function value.As BBO naturally arises whenever a computer simulation is involved in an optimization problem, BBO is one of the mostrapidly expanding areas of applied optimization. BBO can naturally be approached by DFO.

DFO algorithms have principally fallen into one of two categories: direct search methods and model-based methods.Direct search methods work from an incumbent solution and examine a collection of trial points; if improvement isfound, then the incumbent solution is updated, otherwise a search radius parameter is decreased and a new collectionof trial points is examined. Model-based methods approximate the objective function with a model function, and usethe gradients or even second derivatives of the model function to help guide optimization. (Note that while DFO studiesalgorithms that do not use derivatives, this does not mean that the objective function is nondifferentiable – for examplethe objective could be a computer simulation using numerical integration.) It was not until very recently that researchersbegan mixing direct search and model-based methods to create hybrid methods with improved performance.

Workshop Principle Objective

Derivative-free and blackbox optimization have made massive advances over the past decade and, in our opinion, rep-resent one of the most rapidly expanding fields of nonlinear optimization research. We also feel that DFO and BBOrepresent one of the most important areas in nonlinear optimization for solving future applications in real-world prob-lems. The foundational concepts in DFO and BBO have become sufficiently mature that it is now possible to teach them

329

Page 334: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

330 Research in Teams Reports

at a senior undergraduate level. The principle objective of this workshop was to provide Drs. Audet and Hare a week offocused activity to finalize a book titled “Derivative-free and blackbox optimization”.

This book is designed to provide a clear grasp of the foundational concepts in DFO and BBO, in order to push theseareas into the mainstream of nonlinear optimization.

The book is targeted at two broad audiences, and hope that both will find value. The first audience is individu-als interested in entering (or better understanding) the fascinating world of DFO and BBO. The second audience ispractitioners who have real-world problems to solve that cannot be approached by traditional gradient based methods.

The book does not present the absolute state-of-the-art in modern algorithms and theory. Instead it focuses onthe foundational material required to understand and appreciate the state-of-the-art in DFO and BBO. To this end, inaddition of studying several optimization methods, this book includes an elementary introduction to parts of nonsmoothanalysis that have proved useful in conceiving and analyzing the methods within. The book also presents rigorousconvergence theory for the algorithms in a way suitable for students in the mathematical sciences or in engineering.

In the past, practitioners faced with BBO problem have often fallen back on ad hoc methods, resulting in a plethoraof papers publishing incremental improvements to solution quality. The methods covered in the book have provenconvergence results, mathematically supported stopping criterion, and a track-record of practical success. Yet, for allof this, the methods are nonetheless easy to use and elegant in their simplicity. We hope the book can provide better andmore consistent starting point for future applications.

Workshop Side Objectives

Development of the book mentioned in the principle objective has prompted several new research directions for Drs.Audet and Hare. The side objective of this workshop was to continue discussion and collaboration of these questions.

Outcome of the Meeting

A final version of “Derivative-free and blackbox optimization” was submitted to Springer on June 9th, 2017 (the lastday of the workshop). The preface of the final version includes the much deserved line

Special thanks to the Banff International Research Station (BIRS) for hosting us during the final stages ofcompleting this book.

It is therefore clear that the principle objective of the workshop was accomplished.

The side objective of the workshop was addressed through several discussions about current projects and students.Notable progress was made in how to approximate a subdifferential using numerical directional derivatives and analysisof the quality of regression gradient approximation. Both problems are works in progress, but likely to result in papersin the near future.

Participants

Audet, Charles (Professor)Hare, Warren (University of British Columbia)

Page 335: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] C. Audet and W. Hare, Derivative-free and blackbox optimization, in press, Springer, Springer-Verlag, New York.

331

Page 336: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 34

The Structure of Finite Algebras and theConstraint Satisfaction Problem (17rit685)

August 6 - 13, 2017Organizer(s): Andrei Bulatov (Simon Fraser University), Marcin Kozik (Jagiellonian Uni-versity)

The Constraint Satisfaction Problem (CSP) appears in computer science in many forms. In the most general ver-sion a CSP instance consists of variables and constraints, and the goal is to evaluate the variables so that that all theconstraints are satisfied. In a version of CSP particularly amenable to theoretical approach each constraint takes a formof a pair: a tuple of variables and a list of allowed evaluations. This restriction immediately makes the problem com-plete for NP. However, by further restricting the constraints allowed to appear in instances, we arrive at a very generalframework capturing many natural problems in computer science.

The list of allowed evaluations in a constraint can be viewed as a relation of appropriate arity. In the parametrized(a.k.a non-uniform) version of CSP the set of relations allowed to appear in a constraint is restricted and called thelanguage of the CSP. The famous dichotomy conjecture of Feder and Vardi [3] postulates that every finite languagedefines a CSP which is NPC or solvable in a polynomial time. The classical theorem of Schaefer [6], classifying the CSPson two-element sets, is one of the most prominent results supporting the conjecture.

For a few years now the strongest partial results supporting the dichotomy conjecture use the algebraic approachpioneered in [5, 4]. It relies on a Galois correspondence [1] pairing a CSP language with an algebra. The algebracaptures the computational complexity of the associated CSP up to a LOGSPACE reduction, but the connection goesdeeper. The structure of the algebra is intrinsically connected with the types of obstacles the algorithm needs to overcomein order to verify an existence of a solution.

Many structural properties of algebras are captured by identities i.e. universally quantified equalities of operations.For example the identity f(x, y) ≈ f(y, x) states that an operation f is binary and commutative. Among the first, andmost important, results of the algebraic approach is the fact that an algebra satisfies identities which cannot be satisfiedby projections (such algebras are called Taylor algebras) or the associated CSP is NPC.

This year two independent papers introduced two algorithms solving CSPs for all the Taylor algebras and thusconfirming the dichotomy conjecture. In each case the algorithm requires strong structural properties of the algebrato function correctly. In the paper of Andrei Bulatov [2] the structure is described by a colored graph defined on theelements of an algebra; in the other publication Dmitriy Zhuk [8] relies on the more “global” properties of being anabsorbing subuniverse or a center. The goal of the workshop was to study properties of Taylor algebras using the toolsdeveloped for these proofs.

332

Page 337: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

The Structure of Finite Algebras and the Constraint Satisfaction Problem 333

Specifying the problem

In both proofs one of the very first steps of the analysis is to refine the Taylor algebra. This reduces the number ofoperations, and at the same time (via the Galois correspondence) allows new relations into the constraint language. Aslong as the reduction produces a Taylor algebra the transformation has no impact on the correctness of the proof.

The first obstacle to comparing the approaches is the fact that each paper performs a different reduction. More pre-cisely, starting with the same language, each proof allows a different set of extra relations, which makes the approachesincompatible without impacting their correctness. In order to overcome this obstacle the work during the meeting wasfocused on so called Taylor minimal algebras.

Taylor minimal algebras are the algebras corresponding to maximal tractable languages i.e. languages such thateach new relation is either trivially definable (primitively positively definable) in the languages or makes in NPC. Thefact that every Taylor algebra can be reduced to a minimal Taylor algebra requires some proof [7], but is true.

More importantly such an algebra cannot be further reduced without loosing the property of being a Taylor algebraand thus no reductions performed by Bulatov or Zhuk can modify it. This allows to perform a direct comparison of thestructural results used in both proofs (albeit in the restricted case of Taylor minimal algebras).

The goal of the meeting can be concisely described as understanding the structure of Taylor minimal algebras bymeans of the proofs of the CSP dichotomy conjecture.

Results obtained during the meeting

Currently, there are three approaches to analyzing the structure of finite algebras that are relevant to the complexityof the CSP. Two of them are used in the dichotomy proofs mentioned above: Bulatov’s local underlaying structureof the algebra captured by a directed graph (on the elements of the algebra) where each edge is a semilattice, affineor majority edge, and Zhuk’s more global structure based on so-called centers and affine fragments of algebras. Thethird approach has been developed by Barto and Kozik and relies on the concept of absorbing subuniverse i.e. a set ofelements of the algebra closed with respect to all the operations of the algebra and such that for some operation f thevalue f(a1, . . . , an) is in the set as long as at most one ai is outside.

During the meeting we were able to connect (to some extent) the three approaches. Most the results fall in the scopeof at least one of the following broad groups:

• Refining, expanding, and reproving, using different techniques, some of the existing structural results. E.g. prov-ing through colored graphs results that previously required using absorbing subuniverses, and the other wayround.

• Comparing the constructions from the three approaches in Taylor minimal algebras. In particular, it turns outthat these constructions are equivalent for the absorbing subuniverses where f can be taken binary or ternaryand are not equivalent in general.

• Obtaining characterizations of algebras whose directed graph has only restricted set of colors. These characteri-zations can be obtained in different languages including Maltsev conditions (through identities) and through thestructure of absorption.

Further research directions

The meeting initiated a line of research devoted to developing a systematic understanding of the structure of Taylorminimal algebras. The tools used to prove the CSP dichotomy conjecture are applicable and provide a guideline. Thelong–reaching goal is to construct a semblance of a structural theory for Taylor minimal algebras with a hope of even-tually lifting it to all finite Taylor algebras.

Page 338: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

334 Research in Teams Reports

Participants

Barto, Libor (Charles University)Bulatov, Andrei (Simon Fraser University)Kozik, Marcin (Jagiellonian University)Zhuk, Dimitriy (Moscow State University)

Page 339: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] V. G. Bodnarcuk, L. A. Kaluznin, V. N. Kotov, and B. A. Romov. Galois theory for Post algebras. I, II. Kibernetika(Kiev), (3):1–10; ibid. 1969, no. 5, 1–9, 1969.

[2] Andrei Bulatov. A dichotomy theorem for nonuniform csps. In 58th Annual IEEE Symposium on Foundations ofComputer Science, 2017.

[3] Tomas Feder and Moshe Y. Vardi. The computational structure of monotone monadic SNP and constraint satisfac-tion: A study through datalog and group theory. SIAM J. Comput., 28(1):57–104, 1998.

[4] Peter Jeavons. On the algebraic structure of combinatorial problems. Theoretical Computer Science, 200(1–2):185– 204, 1998.

[5] Peter Jeavons, David Cohen, and Marc Gyssens. Closure properties of constraints. J. ACM, 44(4):527–548, 1997.

[6] Thomas J. Schaefer. On the complexity of some two-person perfect-information games. J. Comput. Syst. Sci.,16(2):185–225, 1978.

[7] Mark H. Siggers. A strong mal’cev condition for locally finite varieties omitting the unary type. Algebra universalis,64(1-2):15–20, 2010.

[8] Dmitriy Zhuk. The proof of csp dichotomy conjecture. In 58th Annual IEEE Symposium on Foundations ofComputer Science, 2017.

335

Page 340: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Chapter 35

Nikol’skii inequalities and theirapplications (17rit679)

December 3 - 10, 2017Organizer(s): Feng Dai (University of Alberta), Andriy Prymak (University of Manitoba),Vladimir Temlyakov (University of South Carolina), Sergey Tikhonov (ICREA, CRM,UAB)

During our stay in Banff, we have studied the following questions:

• Remez and Nikol’skii’s inequalities for trigonometric polynomials and spherical harmonics.

• Marcinkiewicz type inequalities for finite dimensional vector spaces of continuous functions.

Before discussing the obtained results, let us give a brief overview of the research field.

Overview of the Field

In the recent paper [4], a close connection between Remez and Nikol’skii’s inequalities was shown. Here we restrictourselves only to the questions related to Remez’s inequalities.

Let XN denote a linear subspace of functions in Lp(Øg) with 0 < p ≤ ∞ and Øg being a probability space. Thegeneral form of the Remez inequality for a function f ∈ XN ⊂ Lp(Ω), 0 < p ≤ ∞, reads as follows: for any measurableB ⊂ Ω with measure |B| ≤ b < 1,

‖f‖Lp(Ω) ≤ C(N, |B|, p)‖f‖Lp(Ω\B).

Applications of Remez type inequalities include many different results in approximation theory and harmonic analysis;see [4] for more details and references.

For trigonometric polynomials T (Q) with frequencies from Q ⊂ Zd (here XN = T (Q) and Ω = Td) the followingresult is well known [4]: For d ≥ 1 and

Q = Π(N) := k ∈ Zd : |kj | ≤ Nj , j = 1, . . . , d,

where Nj ∈ Nd, for any p ∈ (0,∞], we have that C(N, |B|, p) = C(d, p) provided that

|B| ≤ C∏dj=1 Nj

.

336

Page 341: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Nikol’skii inequalities and their applications 337

The investigation of the Remez-type inequalities for the hyperbolic cross trigonometric polynomials with

Q =k ∈ Zd :

d∏j=1

max|kj |, 1 ≤ N

(35.0.1)

has been recently initiated in [4]. It turns out that for such polynomials the problem of establishing the optimal Remezinequalities has different solutions when p <∞ and p =∞. If p <∞, then

C(N, |B|, p) = C(d, p)

provided that

|B| ≤ C

N.

The known result in the case p =∞ reads as follows.

Theorem 35.0.1 [4] There exist two positive constants C1(d) and C2(d) such that for any set B ⊂ Td of normalizedmeasure

|B| ≤ C2(d)

N(logN)d−1

and for any f ∈ T (Q), where Q is given by (35.0.1), we have

‖f‖∞ ≤ C1(d)(logN)d−1 supu∈Td\B

|f(u)|. (35.0.2)

It is worth mentioning that this result is sharp with respect to the logarithmic factor. This is because the followingstatement is false.

There exist δ > 0, A, c, and C such that for any f ∈ T (N) and any set B ⊂ Td of measure |B| ≤ (cN(logN)A)−1 theRemez-type inequality holds

‖f‖∞ ≤ C(logN)(d−1)(1−δ) supu∈Td\B

|f(u)|.

Scientific Progress Made

We obtain a nontrivial Remez inequality for the hyperbolic cross trigonometric polynomials with no logarithmic factorin (35.0.2).

Theorem 35.0.2 For d ≥ 2, let

αd =

d∑j=1

1

jand βd = d− α(d).

There exist two positive constants C1(d) and C2(d) such that for any set B ⊂ Td of normalized measure

|B| ≤ C2(d)

Nα(d)(logN)β(d)

and for any f ∈ T (Q), where Q is given by (35.0.1), we have

‖f‖∞ ≤ C1(d) supu∈Td\B

|f(u)|.

The proof is based on the the discretization inequality and the following Marcinkiewicz-type inequality for thehyperbolic cross polynomials. The latter result is interesting by itself and it reads as follows.

Page 342: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

338 Research in Teams Reports

Theorem 35.0.3 For d ≥ 1, let

α(d) =

d∑j=1

1

jand βd = d− α(d).

There exists a set Λ of at most CdNα(d)(logN)β(d) points in [0, 2π)d such that for all f ∈ T (N),

‖f‖∞ ∼ maxω∈Λ|f(ω)|.

Note that Theorem 35.0.3 for d = 1 is well known. More about Marcinkiewicz-type discretization can be found in[1]-[3].

Outcome of the Meeting

Currently we ( F. Dai, A. Prymak, S. Tikhonov, and V.N. Temlyakov) are working on the Marcinkiewicz-type discretiza-tion problems. We have made significant progress on this problem during our discussions in Banff. We are currentlypreparing for a joint paper, which will include several interesting results in this direction. We believe that the meetingserved as an excellent start for the new collaboration on various problems in approximation theory related to polynomialinequalities and discretizations.

Participants

Dai, Feng (University of Alberta)Prymak, Andriy (University of Manitoba)Temlyakov, Vladimir (University of South Carolina)Tikhonov, Sergey (Catalan Institution for Research and Advanced Studies)

Page 343: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

Bibliography

[1] V.N. Temlyakov, The Marcinkiewicz-type discretization theorems for the hyperbolic cross polynomials, JaenJournal on Approximation 9 (2017), No. 1; arXiv: 1702.01617v2 [math.NA] 26 May 2017.

[2] V.N. Temlyakov, The Marcinkiewicz-type discretization theorems, arXiv:1703.03743v1 [math.NA], 10 Mar 2017.

[3] V.N. Temlyakov, Universal discretization, arXiv:1708.08544v1 [math.NA], 28 Aug 2017.

[4] V.N. Temlyakov, S. Tikhonov. Remez-type and Nikolskii-type inequalities: general relations and the hyperbolic crosspolynomials, Constr. Appr. 46 (2017), 593–615.

339

Page 344: Banff International Research Station Proceedings 2017rather than heuristic arguments. Such structures, sometimes called Lagrangian Coherent Structures (LCSs [23, 38]), identify crucial

340 Research in Teams Reports