Presentation

當 AI 走向邊緣運算:網路維運人員該知道的事

schedule10:40 - 11:00
Abstract / Overview

Artificial intelligence is rapidly becoming a critical component of modern digital systems. While much attention has focused on model accuracy and large-scale training in centralized data centers, an equally important factor is often overlooked: latency. In many real-world applications, the value of AI depends not only on how accurate the model is, but also on how quickly it can produce a decision. This talk explores how low-latency AI inference is emerging as a new workload class for Internet infrastructure. We begin with a simple observation: in the age of AI, outcomes increasingly depend on both accuracy and response time. This principle can be seen across domains. In the public sector, modern unmanned systems combine remote human control with elements of autonomy and AI-assisted targeting, where decision speed can be critical. In the private sector, applications such as visual product search in mobile eCommerce rely on machine vision models that must return results quickly to maintain user engagement. These requirements raise important questions for network operators. If AI inference must operate with very low latency, where should it run? While model training will continue to reside in centralized data centers, inference workloads may increasingly move closer to users at CDN points of presence or ISP edge facilities. From an operator’s perspective, this shift introduces several new considerations. Edge environments that historically relied on CPU-based workloads may begin to incorporate GPUs as new infrastructure components. Power consumption, while manageable for inference workloads, becomes an operational factor. Network architecture—such as IPv6 deployment and latency optimization—directly influences AI performance. In addition, edge software stacks designed around CPUs must evolve to coordinate scarce GPU resources efficiently. Finally, as AI becomes embedded in operational and decision-making systems, traditional security principles remain essential. AI models are only as reliable as their training data, and infrastructure security continues to play a critical role in ensuring trustworthy outcomes. This talk discusses the emerging Age of AI Inference at the Edge and its implications for network operators, CDN platforms, and Internet infrastructure.

Personnel / Bio
Alex Leung

梁增偉

Akamai

Alex Leung is Akamai's Senior Enterprise Architect where he serves as a trusted advisor for leading broadcasters, helping them to transform their services in adaptation to the OTT delivery trend, and navigate the complex array of technologies therein. Over the past years at Akamai, he has led media consultancy projects that helped regional broadcasters optimize their media streaming operations in preparation for large events, including World Cup 2018 and Indian Premier League 2019. Prior to joining Akamai, Alex led a number of challenging projects through his 20-year career, ranging from a video-on-demand e-learning platform for Hong Kong Police Force to an image search engine based on Apache SOLR. He holds a Master in Applied Physics from Stanford University and a Bachelor Degree in Engineering Physics from Cornell University.

Sequence / 2026.05.15

...

08:3009:00

...

Session Interval
09:0009:05
Keynote

...

host / Jack Wang

主持人 / 王彥傑

09:0509:20
Keynote

...

資訊局局長/趙式隆、Nicole、TWNIC若凡、Jack Wang、Jack Kwok

資訊局局長/趙式隆、詹婷怡、余若凡、王彥傑、郭奕豪

09:2009:40
Keynote

...

Jack Kwok、Achie

Jack Kwok、Achie

09:4010:05
Keynote

...

Steve Crocker

Steve Crocker

Edgemoor Research Institute

10:0510:25
Keynote

...

Tony Smith

Tony Smith

APNIC

10:2510:40

...

Session Interval
10:4011:00
Presentation

...

Alex Leung

梁增偉

Akamai

11:0011:20
Presentation

...

Bastien Claeys

Bastien Claeys

Nokia

11:2011:35
Presentation

...

Stanley Chen

Stanley Chen

11:3511:55
Presentation

...

Tomoki Yoshikawa

Tomoki Yoshikawa

Home NOC Operators Group

11:5512:15
Presentation

...

Philip Paeps

Philip Paeps

Alternative Enterprises

12:1513:35

...

Session Interval
13:3513:55
Presentation

...

Masataka Mawatari

Masataka Mawatari

JPIX

13:5514:15
Presentation

...

Yoshinobu Matsuzaki

Yoshinobu Matsuzaki

IIJ

14:1514:35
Presentation

...

Tashi Phuntsho

Tashi Phuntsho

FLEXOPTIX

14:3514:50
Presentation

...

Sam Sham

岑育霖

RETN

14:5015:10
Presentation

...

Taisuke Sato

Taisuke Sato

Seiko Solutions

15:1015:25

...

Session Interval
15:2515:45
Networking

...

15:4516:05
Presentation

...

Scott Fisher

Scott Fisher

Team Cymru

16:0516:25
Presentation

...

Pavel Odintsov

Pavel Odintsov

FastNetMon LTD

16:2516:50
Presentation

...

16:5017:00
Keynote

...