1st ACM SIGCOMM Workshop on Formal Methods Aided Network Operation (FMANO)

Program

9:00am - 9:10am

Opening remarks

9:10am - 9:45am

Keynote 1: Networks above the Network: Opportunities for Formal Methods in The World of Cloud Native Computing

Brighten Godfrey (UIUC)

Brighten Godfrey is a Professor in the Department of Computer Science at the University of Illinois Urbana-Champaign. He co-founded and served as CTO of network verification pioneer Veriflow through its 2019 acquisition by VMware (now Broadcom), where he serves as a Technical Director. He received his Ph.D. at UC Berkeley in 2009, and his B.S. at Carnegie Mellon University in 2002. His research interests lie in the design of networked systems and algorithms. He has served as co-chair of ACM SIGCOMM 2022, SOSR 2016 and ACM HotNets 2014, and currently serves on the ACM SIGCOMM Steering Committee. He is a winner of the ACM SIGCOMM Rising Star Award, the Sloan Research Fellowship, and the National Science Foundation CAREER Award.

Abstract:

The field of network verification has achieved significant success in improving correctness of deployed networks. This success is due in part to the focused scope of network data and control planes, making the problem tractable. However, modern cloud infrastructure is evolving and broadening. Above the physical network, a rich ecosystem of virtual infrastructure helps support so-called “cloud native” applications, including cluster orchestrators, containers and microservices, service meshes, and their associated virtual network control and data planes. In this talk, we’ll explore new opportunities for formal methods to help the operations of these “networks above the network” — and how this domain presents new challenges characterized by dynamic closed-loop control, complex custom software elements, and closer association with applications.

9:45am - 10:30am

Session 1: Analyzing Programmable Networks

  • 9:45am-10:00am

    P4-Ace: Resource-Efficient Optimization and Verification for Programmable Switches

    Zixi Cui, Saifeng Hou, Le Tian, Yu Wang, Peng Yi, Hongchang Chen (PLA Information Engineering University); Xiaoyu Yi, Yongjie Wang (Zhejiang Lab)

  • 10:00am-10:15am

    Scaver: A Scalable Verification System for Programmable Network

    Ying Yao, Zixi Cui, Le Tian, Menglong Li, Fan Pan, Yuxiang Hu (PLA Information Engineering University)

  • 10:15am-10:30am

    P4CGO: Control Plane Guided P4 Program Optimization

    Chenan Wen, Zhuocong Li, Syed Usman Jafri, Xiaokang Qiu, Sanjay Rao (Purdue University)

10:30am - 10:45am

Coffee Break

Coffee Break
10:45am - 11:20am

Keynote 2: Modeling and Simulation-Based System for Large AI Model Deployment Planning, Problem Localization, and Optimization

Xiaoliang Wang (Tencent/Fudan University)

Xiaoliang Wang a Professor at Fudan University. Before that, he was a visiting scholar of Microsoft Research and worked at the Tencent network platform department. He published multiple papers at SIGCOMM, NSDI, OSDI. His research focuses on distributed systems and cloud networks.

Abstract:

Corporations worldwide are investing heavily in the infrastructure for training large AI models. However, the service providers have met great challenges in the planning, construction, operation, and optimization of the ultra-large-scale AI clusters. In this talk, we report a performance model that organizes key elements of the LLM, and optimizes the deployment and operation of AI clusters. The performance model, called Astral-GOA (Tencent Astral AI Large Model Global Optimization Analysis System), has been applied in Tencent AI clusters for years to support large AI model training and inference. We provide case studies of Astral-GOA to verify its effectiveness for planning, problem localization, and optimization. We further demonstrate the operational experience and discuss the difference to the current modeling tools like NVIDIA Calculon, Astra-sim, etc. The overall design of Astral-GOA is useful to the community and raises new research problems. We hope to attract more practitioners and researchers to discuss and improve the tool together.

11:20am - 12:05pm

Session 2: Verifying Large-Scale Networks

  • 11:20am-11:35am

    A Lightweight and Fast Network Verification Platform for Cloud Data Center Networks

    Lin Shen, Yang Wang, Haihua Li, Wei Song, Ziyang Yao, Zhou Yuan, Feng Dong, Ronghua Sun(Huawei Technologies); Lizhao You (Xiamen University)

  • 11:35am-11:50am

    Heracles: A Novel State-based Distributed Verification Framework for DNS Configurations

    Kaiqiang Hu, Haizhou Du (Shanghai University of Electric Power); Yao Wang (Xiamen University)

  • 11:50am-12:05pm

    A Hybrid Testing System on P4 Cloud Gateway

    Xin Yang(Northeastern University and Alibaba Cloud); Enge Song, Wanli Cao, Tian Pan, Yang Song, Ye Yang, Yongwang Wu, Jianyuan Lu, Bowen Yang, Rong Wen(Alibaba Cloud); Fuliang Li, Xingwei Wang (Northeastern University); Xing Li, Biao Lyu (Zhejiang University and Alibaba Cloud); Shunmin Zhu (Hangzhou Feitian Cloud and Alibaba Cloud)

12:05pm - 1:30pm

Lunch Break

Lunch Break
1:30pm - 2:05pm

Keynote 3: The bane of network operations: Networks are evolved, not (intelligently) designed

Ratul Mahajan (University of Washington)

Ratul Mahajan is an Associate Professor at the University of Washington's Paul G. Allen School of Computer Science. He is also the co-director of UW FOCI (Future of Cloud Infrastructure) and an Amazon Scholar. In earlier lives, he was a Co-founder and CEO of Intentionet, a company that pioneered network verification, and a Principal Researcher at Microsoft Research. Ratul is a computer systems researcher with a networking focus and has worked on a broad range of topics, including network verification, connected homes, optical networks, Internet routing and measurements, and mobile systems. Many of the technologies that he has helped develop are part of real-world systems at Microsoft and other companies. Ratul has been recognized as an ACM Distinguished Scientist, an ACM SIGCOMM Rising Star, and a Microsoft Research Graduate Fellow. His papers have won the ACM SIGCOMM Test-of-Time Award, the IEEE William R. Bennett Prize, the ACM SIGCOMM Best Paper Awards (twice), the HVC Best Paper Award, and the IETF Applied Networking Research Prize.

Abstract:

In this talk, I will argue that methods to simplify network operations, be they based on synthesis, verification, or testing, must account for network evolution. Otherwise, the abstractions upon which these methods are built can go stale over time and even leave the network in a worse state than not having those abstractions in the first place. Using examples from my recent work, evolution-friendliness is not something that can be layered atop basic techniques, but it can require a ground up thinking that leads to methods of a different kind.

2:05pm - 2:30pm

Session 3: Planning Network Updates

  • 2:05pm-2:13pm

    Openlab: A One-Stop Service Platform for Software-Defined Networks Emulation

    Siyu Han, Shuo Wang, Chiliang Zhong, Dong Zhou, Tao Huang, Liang Wei, Chaohong Tan (Beijing University of Posts and Telecommunications)

  • 2:13pm-2:20pm

    Accelerating ACL Configuration Update through Data Plane Analysis

    Tianyi Kou, Haifeng Sun, Zirui Zhuang, Qi Qi, Jingyu Wang, Jianxin Liao (Beijing University of Posts and Telecommunications)

2:20pm - 2:45pm

Coffee Break

Coffee Break
2:45pm - 3:15pm

Session 4: Going Beyond Verification

  • 2:45pm-2:53pm

    Interpretable Network Synthesis via Localized Specifications

    Haoxian Chen (ShanghaiTech University)

  • 2:53pm-3:00pm

    DNS Misconfigurations Diagnosis and Repair

    Ziyi Wang, Letian Zhu, Zhenghao Su, Qiao Xiang (Xiamen University)

  • 3:00pm-3:15pm

    ConfigHub: A Network Configuration Sharing Platform

    Gao Han, Hanyang Shao, Ruiqin Duan, Changqi Zhuang, Qiao Xiang (Xiamen University)

3:15pm - 4:00pm

Panel: Network Formal Methods: Challenges and Opportunities

  • Panelists

    Brighten Godfrey (UIUC), Xiaoliang Wang (Tencent/NJU), Ratul Mahajan (UW), Peng Zhang (XJTU), Yifei Yuan (Alibaba)

Call for Papers

There has been a growing interest from networking researchers, equipment vendors, and Internet service providers to apply formal methods in network operation to improve the availability, reliability, and performance of networks. Starting from the pioneering work on data plane verification (i.e., Anteater, HSA and Veriflow), researchers have invented many novel approaches to utilize formal methods in a wide range of aspects of network operation, including but not limited to routing configuration verification, diagnosis and synthesis, verification and testing of programmable networks, DNS configuration and implementation verification, and congestion control and scheduling verification. Many of these innovations have been published in top-tier venues such as SIGCOMM, NSDI, SOSP, and OSDI. The percentage of papers on network formal methods in SIGCOMM and NSDI for the past ten years shows a steady interest in network formal methods in our community (e.g., up to 16.4% in SIGCOMM 2021 and up to 16.9% in NSDI 2020).

Although it is a relatively new research area, the impact of this line of research has already gone beyond publications. Not only several startups (e.g., Intentionet) have emerged, but multiple mega-scale Internet companies (e.g., Microsoft, Amazon, and Alibaba) have deployed formal-methods-based operation tools (e.g., Batfish, RCDC, and Hoyan) in their production networks.

The goal of the formal methods aided network operation (FMANO) workshop is to (1) provide a venue for the community to engage in a lively discussion on the challenges, opportunities, and future research agendas in the intersection of networking and formal methods and (2) make this emerging and exciting research area accessible to a broader audience.

In this workshop, we seek contributions to the design principles, implementations, and operation experiences of formal-methods-based network operation systems. Selected keynotes and a panel of experts from both academia and industry will be included to complement more traditional paper sessions, to set the research agenda, debate the issues, and share the most recent progress.

Topics of Interest

We welcome submissions on all aspects of formal-methods-based network operation systems. Topics include, but are not limited to:

Submission Instructions

Submissions must be original, unpublished work, and not under consideration at another conference or journal. We accept two types of submissions: regular papers that are at most six (6) pages long, including all figures, tables, references, and appendices in two-column 10pt ACM format; and extended abstracts that are at most two (2) pages long with a maximum of one additional page for references only.

Papers and extended abstracts must include author names and affiliations for single-blind peer reviewing by the PC. Authors of accepted submissions are expected to present and discuss their work at the workshop. Please submit your paper via https://fmano24.hotcrp.com/

Important Dates

Organizers

Workshop Chairs
Ying Zhang Meta
Qiao Xiang Xiamen University
Programe Committee
Ryan Beckett Microsoft
Lin Huang Antgroup
Xin Jin Peking University
Franck Le IBM
Geng Li Huawei
Mengqi Liu Alibaba
Miguel Rio UCL
Chang Lou University of Virginia
Alan Liu UMD
Ning Luo Northwestern University
Congcong Miao Tencent
Ruzica Piskac Yale
Qin Wu Huawei
Ennan Zhai Alibaba
Peng Zhang XJTU