Approximation Theory of Deep Learning for Sequence Modelling

Time: 2023-09-01 Views: Published By: CMLR

Speaker(s): Qianxiao Li(National University of Singapore)

Time: 16:00-17:00 September 5, 2023

Venue: Room 1303, Sciences Building No. 1(理科一号楼1303室)

Abstract:


In this talk, we present some recent results on the approximation theory of deep learning architectures for sequence modelling. In particular, we formulate a basic mathematical framework, under which different popular architectures such as recurrent neural networks, dilated convolutional networks (e.g. WaveNet), encoder-decoder structures, and most recently - transformers - can be rigorously compared. These analyses reveal some interesting connections between approximation, memory, sparsity/low-rank, graphical structures that may guide the practical selection and design of these network architectures.


Tencent:

Link: https://meeting.tencent.com/dm/eFKeKFIJuToz

ID:227-124-893



Brief bio:


d44e99463aca4c52aa918a836c3e5b59.jpeg


Qianxiao Li is an assistant professor in the Department of Mathematics, and a principal investigator in the Institute for Functional Intelligent Materials, National University of Singapore. He graduated with a BA in mathematics from the University of Cambridge and a PhD in applied mathematics from Princeton University. His research interests include the interplay of machine learning and dynamical systems, control theory, stochastic optimisation algorithms and data-driven methods for science and engineering.