1st ACM SIGCOMM Workshop on Formal Methods Aided Network Operation (FMANO)

Program

9:00am - 9:10am

Opening remarks

9:10am - 9:45am

Keynote 1: Networks above the Network: Opportunities for Formal Methods in The World of Cloud Native Computing

Brighten Godfrey (UIUC)

Brighten Godfrey is a Professor in the Department of Computer Science at the University of Illinois Urbana-Champaign. He co-founded and served as CTO of network verification pioneer Veriflow through its 2019 acquisition by VMware (now Broadcom), where he serves as a Technical Director. He received his Ph.D. at UC Berkeley in 2009, and his B.S. at Carnegie Mellon University in 2002. His research interests lie in the design of networked systems and algorithms. He has served as co-chair of ACM SIGCOMM 2022, SOSR 2016 and ACM HotNets 2014, and currently serves on the ACM SIGCOMM Steering Committee. He is a winner of the ACM SIGCOMM Rising Star Award, the Sloan Research Fellowship, and the National Science Foundation CAREER Award.

Abstract:

The field of network verification has achieved significant success in improving correctness of deployed networks. This success is due in part to the focused scope of network data and control planes, making the problem tractable. However, modern cloud infrastructure is evolving and broadening. Above the physical network, a rich ecosystem of virtual infrastructure helps support so-called “cloud native” applications, including cluster orchestrators, containers and microservices, service meshes, and their associated virtual network control and data planes. In this talk, we’ll explore new opportunities for formal methods to help the operations of these “networks above the network” — and how this domain presents new challenges characterized by dynamic closed-loop control, complex custom software elements, and closer association with applications.

9:45am - 10:30am

Session 1: Analyzing Programmable Networks

9:45am-10:00am
P4-Ace: Resource-Efficient Optimization and Verification for Programmable Switches
Zixi Cui, Saifeng Hou, Le Tian, Yu Wang, Peng Yi, Hongchang Chen (PLA Information Engineering University); Xiaoyu Yi, Yongjie Wang (Zhejiang Lab)
10:00am-10:15am
Scaver: A Scalable Verification System for Programmable Network
Ying Yao, Zixi Cui, Le Tian, Menglong Li, Fan Pan, Yuxiang Hu (PLA Information Engineering University)
10:15am-10:30am
P4CGO: Control Plane Guided P4 Program Optimization
Chenan Wen, Zhuocong Li, Syed Usman Jafri, Xiaokang Qiu, Sanjay Rao (Purdue University)

10:30am - 10:45am

Coffee Break

10:45am - 11:20am

Keynote 2: Modeling and Simulation-Based System for Large AI Model Deployment Planning, Problem Localization, and Optimization

Xiaoliang Wang (Tencent/Fudan University)

Xiaoliang Wang a Professor at Fudan University. Before that, he was a visiting scholar of Microsoft Research and worked at the Tencent network platform department. He published multiple papers at SIGCOMM, NSDI, OSDI. His research focuses on distributed systems and cloud networks.

Abstract:

Corporations worldwide are investing heavily in the infrastructure for training large AI models. However, the service providers have met great challenges in the planning, construction, operation, and optimization of the ultra-large-scale AI clusters. In this talk, we report a performance model that organizes key elements of the LLM, and optimizes the deployment and operation of AI clusters. The performance model, called Astral-GOA (Tencent Astral AI Large Model Global Optimization Analysis System), has been applied in Tencent AI clusters for years to support large AI model training and inference. We provide case studies of Astral-GOA to verify its effectiveness for planning, problem localization, and optimization. We further demonstrate the operational experience and discuss the difference to the current modeling tools like NVIDIA Calculon, Astra-sim, etc. The overall design of Astral-GOA is useful to the community and raises new research problems. We hope to attract more practitioners and researchers to discuss and improve the tool together.

11:20am - 12:05pm

Session 2: Verifying Large-Scale Networks

11:20am-11:35am
A Lightweight and Fast Network Verification Platform for Cloud Data Center Networks
Lin Shen, Yang Wang, Haihua Li, Wei Song, Ziyang Yao, Zhou Yuan, Feng Dong, Ronghua Sun(Huawei Technologies); Lizhao You (Xiamen University)
11:35am-11:50am
Heracles: A Novel State-based Distributed Verification Framework for DNS Configurations
Kaiqiang Hu, Haizhou Du (Shanghai University of Electric Power); Yao Wang (Xiamen University)
11:50am-12:05pm
A Hybrid Testing System on P4 Cloud Gateway
Xin Yang(Northeastern University and Alibaba Cloud); Enge Song, Wanli Cao, Tian Pan, Yang Song, Ye Yang, Yongwang Wu, Jianyuan Lu, Bowen Yang, Rong Wen(Alibaba Cloud); Fuliang Li, Xingwei Wang (Northeastern University); Xing Li, Biao Lyu (Zhejiang University and Alibaba Cloud); Shunmin Zhu (Hangzhou Feitian Cloud and Alibaba Cloud)

12:05pm - 1:30pm

Lunch Break

1:30pm - 2:05pm

Keynote 3: The bane of network operations: Networks are evolved, not (intelligently) designed

Ratul Mahajan (University of Washington)

Ratul Mahajan is an Associate Professor at the University of Washington's Paul G. Allen School of Computer Science. He is also the co-director of UW FOCI (Future of Cloud Infrastructure) and an Amazon Scholar. In earlier lives, he was a Co-founder and CEO of Intentionet, a company that pioneered network verification, and a Principal Researcher at Microsoft Research. Ratul is a computer systems researcher with a networking focus and has worked on a broad range of topics, including network verification, connected homes, optical networks, Internet routing and measurements, and mobile systems. Many of the technologies that he has helped develop are part of real-world systems at Microsoft and other companies. Ratul has been recognized as an ACM Distinguished Scientist, an ACM SIGCOMM Rising Star, and a Microsoft Research Graduate Fellow. His papers have won the ACM SIGCOMM Test-of-Time Award, the IEEE William R. Bennett Prize, the ACM SIGCOMM Best Paper Awards (twice), the HVC Best Paper Award, and the IETF Applied Networking Research Prize.

Abstract:

In this talk, I will argue that methods to simplify network operations, be they based on synthesis, verification, or testing, must account for network evolution. Otherwise, the abstractions upon which these methods are built can go stale over time and even leave the network in a worse state than not having those abstractions in the first place. Using examples from my recent work, evolution-friendliness is not something that can be layered atop basic techniques, but it can require a ground up thinking that leads to methods of a different kind.

2:05pm - 2:30pm

Session 3: Planning Network Updates

2:05pm-2:13pm
Openlab: A One-Stop Service Platform for Software-Defined Networks Emulation
Siyu Han, Shuo Wang, Chiliang Zhong, Dong Zhou, Tao Huang, Liang Wei, Chaohong Tan (Beijing University of Posts and Telecommunications)
2:13pm-2:20pm
Accelerating ACL Configuration Update through Data Plane Analysis
Tianyi Kou, Haifeng Sun, Zirui Zhuang, Qi Qi, Jingyu Wang, Jianxin Liao (Beijing University of Posts and Telecommunications)

2:20pm - 2:45pm

Coffee Break

2:45pm - 3:15pm

Session 4: Going Beyond Verification

2:45pm-2:53pm
Interpretable Network Synthesis via Localized Specifications
Haoxian Chen (ShanghaiTech University)
2:53pm-3:00pm
DNS Misconfigurations Diagnosis and Repair
Ziyi Wang, Letian Zhu, Zhenghao Su, Qiao Xiang (Xiamen University)
3:00pm-3:15pm
ConfigHub: A Network Configuration Sharing Platform
Gao Han, Hanyang Shao, Ruiqin Duan, Changqi Zhuang, Qiao Xiang (Xiamen University)

3:15pm - 4:00pm

Panel: Network Formal Methods: Challenges and Opportunities

Panelists
Brighten Godfrey (UIUC), Xiaoliang Wang (Tencent/NJU), Ratul Mahajan (UW), Peng Zhang (XJTU), Yifei Yuan (Alibaba)

Call for Papers

There has been a growing interest from networking researchers, equipment vendors, and Internet service providers to apply formal methods in network operation to improve the availability, reliability, and performance of networks. Starting from the pioneering work on data plane verification (i.e., Anteater, HSA and Veriflow), researchers have invented many novel approaches to utilize formal methods in a wide range of aspects of network operation, including but not limited to routing configuration verification, diagnosis and synthesis, verification and testing of programmable networks, DNS configuration and implementation verification, and congestion control and scheduling verification. Many of these innovations have been published in top-tier venues such as SIGCOMM, NSDI, SOSP, and OSDI. The percentage of papers on network formal methods in SIGCOMM and NSDI for the past ten years shows a steady interest in network formal methods in our community (e.g., up to 16.4% in SIGCOMM 2021 and up to 16.9% in NSDI 2020).

Although it is a relatively new research area, the impact of this line of research has already gone beyond publications. Not only several startups (e.g., Intentionet) have emerged, but multiple mega-scale Internet companies (e.g., Microsoft, Amazon, and Alibaba) have deployed formal-methods-based operation tools (e.g., Batfish, RCDC, and Hoyan) in their production networks.

The goal of the formal methods aided network operation (FMANO) workshop is to (1) provide a venue for the community to engage in a lively discussion on the challenges, opportunities, and future research agendas in the intersection of networking and formal methods and (2) make this emerging and exciting research area accessible to a broader audience.

In this workshop, we seek contributions to the design principles, implementations, and operation experiences of formal-methods-based network operation systems. Selected keynotes and a panel of experts from both academia and industry will be included to complement more traditional paper sessions, to set the research agenda, debate the issues, and share the most recent progress.

Topics of Interest

We welcome submissions on all aspects of formal-methods-based network operation systems. Topics include, but are not limited to:

Deployment and operation requirements
Network design
Network diagnosis
Network modeling
Network monitoring
Network protocol design and analysis
Network repair
Network security
Network simulation
Network synthesis
Network verification
Operation across multiple domains

Submission Instructions

Submissions must be original, unpublished work, and not under consideration at another conference or journal. We accept two types of submissions: regular papers that are at most six (6) pages long, including all figures, tables, references, and appendices in two-column 10pt ACM format; and extended abstracts that are at most two (2) pages long with a maximum of one additional page for references only.

Papers and extended abstracts must include author names and affiliations for single-blind peer reviewing by the PC. Authors of accepted submissions are expected to present and discuss their work at the workshop. Please submit your paper via https://fmano24.hotcrp.com/

Important Dates

Submissions deadline
May 24, 2024, AOE (Updated)
Acceptance notification
June 3, 2024
Camera-ready deadline
June 14, 2024
Workshop
Aug 4, 2024

Organizers

Workshop Chairs
Ying Zhang	Meta
Qiao Xiang	Xiamen University
Programe Committee
Ryan Beckett	Microsoft
Lin Huang	Antgroup
Xin Jin	Peking University
Franck Le	IBM
Geng Li	Huawei
Mengqi Liu	Alibaba
Miguel Rio	UCL
Chang Lou	University of Virginia
Alan Liu	UMD
Ning Luo	Northwestern University
Congcong Miao	Tencent
Ruzica Piskac	Yale
Qin Wu	Huawei
Ennan Zhai	Alibaba
Peng Zhang	XJTU