Hong Kong Workshop on Information Theory, Coding and Learning

30 November, 2025 · The Chinese University of Hong Kong

About the Workshop

The workshop is hosted by the Department of Information Engineering at The Chinese University of Hong Kong, and the IEEE Information Theory Society Hong Kong Chapter. We welcome you to join us for a series of talks by invited experts in the field. The workshop will be held from 9am-1pm on 30 November, 2025, at Yasumoto International Academic Park (YIA) LT7, The Chinese University of Hong Kong.

Please register via the following link by 26 November, 2025. The venue has a capacity of at most 150 attendees (first register first served)
https://forms.office.com/r/U4f66DwarT?origin=lprLink
For enquiries, please contact Cheuk Ting Li (email: ctli [at] ie.cuhk.edu.hk).

Keynote speakers

Abbas El Gamal

Hitachi America Professor

Department of Electrical Engineering,Stanford University

Rüdiger Urbanke

Professor

School of Computer and Communication Sciences, EPFL

Invited Speakers

Amin Gohari

Vice-Chancellor Associate Professor

Department of Information Engineering, CUHK

Guangyue Han

Interim Head and Professor

Department of Mathematics, HKU

Masahito Hayashi

Presidential Chair Professor

School of Data Science, CUHK(SZ)

Linqi, Song

Associate Professor

Department of Computer Science, CityUHK

Raymond W. Yeung

Choh-Ming Li Professor

Department of Information Engineering, CUHK

Talk Details

9:10am - 9:40am

Abbas El Gamal

Stanford University

Title: Yao meets Shannon

Bio: Abbas El Gamal is the Hitachi America Professor in the School of Engineering and Senior Fellow at the Precourt Institute for Energy at Stanford University. He received his M.S. in Statistics and Ph.D. in Electrical Engineering both from Stanford University in 1977 and 1978, respectively. From 1978 to 1980, he was an Assistant Professor of Electrical Engineering at USC. He has been on the faculty of the Department of Electrical Engineering at Stanford University since 1981. From 2003 to 2012, he was Director of the Information Systems Laboratory at Stanford University. From 2012-2017 he was Chair of the Department of Electrical Engineering at Stanford University. His research contributions have been in network information theory, FPGAs, digital imaging devices and systems, and smart grid modeling and control. He has authored or coauthored over 230 papers and holds over 35 patents in these areas. He is coauthor of the book Network Information Theory (Cambridge Press 2011). He is a member of the US National Academy of Engineering and a Life Fellow of the IEEE. He received several honors and awards for his research contributions, including the 2016 IEEE Richard Hamming Medal and the 2012 Claude E. Shannon Award. He served on the Board of Governors of the Information Theory Society from 2009 to 2016 and was President in 2014. He cofounded and served on the board of directors and technical advisory boards of several Silicon Vally semiconductor, EDA, and biotech companies.

Abstract

I will briefly describe early and recent connections between Shannon's information theory and Yao’s communication complexity. The talk is based on a chapter in an upcoming third edition of Cover and Thomas’s Element of Information Theory.

9:40am - 10:10am

Rüdiger Urbanke

EPFL

Title: Information-Theoretic Framework for Understanding Learning in Modern Machine-Learning Architectures

Bio: Ruediger L. Urbanke received his Dipl. Ing. degree from the Vienna University of Technology, Austria, in 1990, followed by an M.Sc. and a Ph.D. in Electrical Engineering from Washington University in St. Louis, MO, in 1992 and 1995, respectively. From 1995 to 1999, he served in the Mathematics of Communications Department at Bell Labs, after which he joined the School of Computer & Communication Sciences (I&C) at EPFL, where he is part of the Information Processing Group. He was an Associate Editor of the IEEE Transactions on Information Theory from 2000 to 2004, served as President of the IEEE Information Theory Society in 2017, and is currently the Dean of I&C. Dr. Urbanke is co-author of Modern Coding Theory (Cambridge University Press). His research spans the design and analysis of error-correcting codes for both classical and quantum communication, as well as the theoretical foundations of modern machine learning.

Abstract

I will present an information-theoretic framework that casts learning as universal prediction under log loss, with performance characterized by regret bounds. At the heart of this framework is a principled notion of architecture-dependent model complexity, defined in terms of the probability mass or volume of models near the data-generating process. This volume can be related to spectral properties of the Hessian or Fisher Information Matrix, enabling practical approximations. I will argue that successful learning architectures exhibit a broad range of effective complexities, which allows them to adaptively fit a wide class of functions—even in highly over-parameterized settings. This perspective offers new insights into the role of inductive biases, the effectiveness of stochastic gradient descent, and phenomena such as flat minima. The framework unifies online and batch learning, supervised and generative modeling, and applies across the stochastic realizable, agnostic, and even individual sequence settings. It also sheds light on why modern architectures—such as deep neural networks and transformers—are so successful, suggesting that their layered structure naturally leads to a broad, adaptive complexity profile. These insights point toward the possibility of designing new architectures with comparable or superior performance. Joint work with Meir Feder and Yaniv Fogel and with the help of Ido Atlas, all Tel Aviv University.

10:15am - 10:45am

Raymond W. Yeung

CUHK

Title: Characterization of Conditional Mutual Independence

Bio: Raymond W. Yeung received the BS, MEng and PhD degrees in electrical engineering from Cornell University. He joined AT&T Bell Laboratories in 1988. Since 1991, he has been with CUHK, where he is currently Choh-Ming Li Professor of Information Engineering. His research interest is in information theory and network coding.

Abstract

Let K and K' be two conditional mutual independencies of discrete random variables. In this work, we give complete characterizations of:

K is equivalent to K';
K implies K'.

10:45am - 11:15am

Linqi Song

CityUHK

Title: Enhancing the Reasoning Abilities of Large Language Models (LLMs)

Bio: Prof. Linqi Song is an Associate Professor with the Department of Computer Science, City University of Hong Kong and a Research Scientist with the InnoHK AIFT Lab. He received the Ph.D. degree in electrical engineering from the University of California, Los Angeles (UCLA), USA and the B.S. and M.S. degrees from Tsinghua University, China. He was a Postdoctoral Scholar with the Department of Electrical and Computer Engineering, UCLA. His research interests include LLMs, reinforcement learning, federated learning, and information theory. He has published more than 170 journal and conference papers at main AI and IT venues, such as IEEE TIT, TPAMI, JSAC, TMC, JSTSP, NeurIPS, ICLR, and ACL. He has been the Associate Editor/Guest Editor for several journals, such as DSP, ICN, and he has been serving as TPC members for IEEE ISIT, ITW, Mobihoc, Infocom, etc. He has received the Best Paper Awards of IEEE MIPR 2020, the Best Paper Award of China Communications 2023, and three Silver Medals of Geneva International Exhibition of Inventions.

Abstract

While modern LLMs exhibit impressive language understanding and generation capabilities, their reasoning abilities—particularly for complex, multi-step problems—remain a critical challenge. In this talk, we explore a suite of advanced techniques to systematically improve LLM reasoning performance. First, we discuss tool augmentation, where LLMs leverage external resources (e.g., Python interpreters) to improve the reasoning abilities. Next, we examine fine-tuning strategies, including supervised learning on interpolated reasoning datasets (e.g., via MATH or GSM8K). We highlight trade-offs between task-specific specialization and generalizability. We then present multi-agent frameworks, where multiple LLMs collaborate or debate to refine outputs. Techniques like self-consistency voting, iterative refinement, and role-specialized agents (e.g., "verifier" or "critic" models) mitigate individual model biases and errors. Our last focus is explicit reasoning traceability: structuring outputs as step-by-step chain-of-thoughts (CoT), where we introduce a learning from correctness way to trace the reasoning steps and correct possible wrong paths in the reasoning. Our results show promising solutions towards future LLMs with better reasoning abilities.

11:15am - 11:45am

Masahito HAYASHI

CUHK (SZ)

Title: General Optimization Algorithm Based on Information-Theoretic Quantity

Bio: Masahito Hayashi was born in Japan, in 1971. He received the B.S. degree from the Faculty of Sciences, Kyoto University, Japan, in 1994, and the M.S. and Ph.D. degrees in mathematics from Kyoto University in 1996 and 1999, respectively. He was with Kyoto University as a Research Fellow of Japan Society of the Promotion of Science (JSPS) from 1998 to 2000 and the Laboratory for Mathematical Neuroscience, Brain Science Institute, RIKEN, as a Researcher, from 2000 to 2003. He was with ERATO Quantum Computation and Information Project, Japan Science and Technology Agency (JST), as the Research Head, from 2003 to 2006, and ERATO-SORST Quantum Computation and Information Project, JST, as the Group Leader, from 2006 to 2007. He was also with the Superrobust Computation Project Information Science and Technology Strategic Core (21st Century COE by MEXT) Graduate School of Information Science and Technology, The University of Tokyo, as an Adjunct Associate Professor, from 2004 to 2007. He was with the Graduate School of Information Sciences, Tohoku University, as an Associate Professor, from 2007 to 2012. In 2012, he joined the Graduate School of Mathematics, Nagoya University, as a Full Professor. He was with Shenzhen Institute for Quantum Science and Engineering, Southern University of Science and Technology, Shenzhen, China, as the Chief Research Scientist, from 2020 to 2023. In 2023, he joined the School of Data Science, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), as a Full Professor, and joined International Quantum Academy (SIQA) as the Chief Research Scientist. He also has the title of Presidential Chair Professor in CUHK-Shenzhen from 2023. Also, he was with the Centre for Quantum Technologies, National University of Singapore, as a Visiting Research Associate Professor, from 2009 to 2012, and a Visiting Research Professor from 2012 to 2024. He was with the Center for Advanced Intelligence Project, RIKEN, as a Visiting Scientist, from 2017 to 2020. He was with Shenzhen Institute for Quantum Science and Engineering, Southern University of Science and Technology, as a Visiting Professor, from 2018 to 2020, and Center for Quantum Computing, Peng Cheng Laboratory, Shenzhen, as a Visiting Professor, from 2019 to 2020. He is currently with the Department of Information Engineering, The Chinese University of Hong Kong as an Adjunct Professor since 2024. In 2006, he published the book Quantum Information: An Introduction (Springer), whose revised version was published as Quantum Information Theory: Mathematical Foundation from Graduate Texts in Physics (Springer) in 2016. In 2016, he published other two books Group Representation for Quantum Theory and A Group Theoretic Approach to Quantum Information (Springer). He is on the editorial board of New Journal of Physics and International Journal of Quantum Information. His research interests include classical and quantum information theory and classical and quantum statistical inference. He received the Information Theory Society Paper Award for Information-Spectrum Approach to Second Order Coding Rate in Channel Coding in 2011. In 2016, he received the Japan Academy Medal from Japan Academy and the JSPS Prize from Japan Society for the Promotion of Science. In 2022, he was elected as an IMS Fellow and an IEEE Fellow.

Abstract

Iterative minimization algorithms appear in various areas including machine learning, neural networks, and information theory. The em algorithm is one of the famous iterative minimization algorithms in the area of machine learning, and the Arimoto– Blahut algorithm is a typical iterative algorithm in the area of information theory. However, these two topics had been separately studied for a long time. In this talk, we propose a generalized framework that was recently proposed in the context of the Arimoto–Blahut algorithm. Then, we show various convergence theorems, one of which covers the case when each iterative step is done approximately. Also, we apply this algorithm to the target problem of the em algorithm, and propose its improvement. In addition, we apply it to other various problems in information theory.

This talk will cover the contents of:

M. Hayashi, "Bregman-divergence-based Arimoto-Blahut algorithm," IEEE Transactions on Information Theory, vol. 71, no. 10, pp. 7788-7801, Oct. (2025).
M. Hayashi, "Iterative minimization algorithm on a mixture family," Information Geometry, 2024 _Special Issue: Half a Century of Information Geometry, Part 2 (2004). https://doi.org/10.1007/s41884-024-00140-5
M. Hayashi, "Bregman divergence based em algorithm and its application to classical and quantum rate distortion theory," IEEE Transactions on Information Theory, vol. 69, 3460 -- 3492 (2023).

11:45am - 12:15pm

Guangyue, Han

HKU

Title: A Stochastic Calculus Approach to Continuous-Time Gaussian Channels

Bio: Guangyue Han received the B.S. and M.S. degrees in mathematics from Peking University, China, in 1997 and 2000, respectively, and the Ph.D. degree in mathematics from the University of Notre Dame, USA, in 2004. After three years with the Department of Mathematics, the University of British Columbia, Canada, he joined the Department of Mathematics, the University of Hong Kong, China, in 2007. His main research areas are coding and information theory.

Abstract

Continuous-time Gaussian channels have traditionally been formulated and investigated using functional analysis. Although this approach is intuitively appealing, it presents several challenges. This talk will focus on the stochastic calculus approach, which, despite being less widely known, can effectively address many of the issues associated with the classical functional analysis method. It turns out that with some recent enhancement, the stochastic calculus approach can recover numerous existing results on continuous-time Gaussian channels, quantitatively strengthen some well-established findings in the literature and further rigorously derive new results.

12:15pm - 12:45pm

Amin Gohari

CUHK

Title: The Auxiliary Receiver Approach in Network Information Theory

Bio: Amin Gohari received his B.Sc. degree from Sharif University, Iran, in 2004 and his Ph.D. degree in electrical engineering from the University of California, Berkeley in 2010. Dr. Gohari received the 2010 Eli Jury Award from UC Berkeley, Department of Electrical Engineering and Computer Sciences, for "outstanding achievement in the area of communication networks," and the 2009-2010 Bernard Friedman Memorial Prize in Applied Mathematics from UC Berkeley, Department of Mathematics, for "demonstrated ability to do research in applied mathematics." He also received the Gold Medal from the 41st International Mathematical Olympiad (IMO 2000) and the First Prize from the 9th International Mathematical Competition for University Students (IMC 2002). He received the IEEE Iran Section Young Researcher Award in 2021. Dr. Gohari served as an Associate Editor for the IEEE Transactions on Information Theory from 2018-2021. He was also a finalist for the IEEE Jack Keil Wolf ISIT Student Paper Award for three consecutive years from 2008-2010 during his PhD.

Abstract

In this talk, I will discuss recent advancements in the auxiliary receiver approach. This approach serves as a mathematical tool for deriving outer bounds in network information theory. It has proven to be highly effective, providing state-of-the-art outer bounds for a variety of settings in network information theory, including relay channels, interference channels, broadcast channels, key agreement, distributed hypothesis testing, and more.