☆ 4.7 Article

Employ AI to Improve AI Services : Q-Learning Based Holistic Traffic Control for Distributed Co-Inference in Deep Learning

IEEE TRANSACTIONS ON SERVICES COMPUTING (2022)

期刊

IEEE TRANSACTIONS ON SERVICES COMPUTING

卷 15, 期 2, 页码 627-639

出版社

IEEE COMPUTER SOC

DOI: 10.1109/TSC.2021.3113184

关键词

Task analysis; Cloud computing; Artificial intelligence; Internet of Things; Optimization; Feature extraction; Deep learning; Machine learning; network management; Internet of Things; service-oriented systems; resource management

类别

Computer Science, Information Systems Computer Science, Software Engineering

资金

JSPS KAKENHI [JP19K20250, JP20F20080, JP20H04174]
Leading Initiative for Excellent Young Researchers (LEADER), MEXT, Japan
JST, PRESTO, Japan [JPMJPR21P3]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This article introduces an AI-based network optimization solution for scheduling AI services. Through deep Q-learning algorithm and priority-based data forwarding strategy, the overall throughput and task processing rate are maximized.

As the inevitable part of intelligent service in the new era, the services for AI tasks themselves have received significant attention, which due to the urgency of energy and computing resources, is difficult to implement in a stable and widely distributed system and coordinately utilize remote edge devices and cloud. In this article, we introduce an AI-based holistic network optimization solution to schedule AI services. Our proposed deep Q-learning algorithm optimizes the overall throughput of AI co-inference tasks themselves by balancing the uneven computation resources and traffic conditions. We use a multi-hop DAG (Directed Acyclic Graph) to describe a deep neural network (DNN) based co-inference network structure and introduce the virtual queue to analyze the Lyapunov stability for the system. Then, a priority-based data forwarding strategy is proposed to maximize the bandwidth efficiency, and we develop a Real-time Deep Q-learning based Edge Forwarding Scheme Optimization Algorithm (RDFO) to maximize the overall task processing rate. Finally, we conduct the platform simulation for the distributed co-inference system. Through the comparison with other benchmarks, we testify to the optimality of our proposal.

Employ AI to Improve AI Services : Q-Learning Based Holistic Traffic Control for Distributed Co-Inference in Deep Learning

期刊

IEEE TRANSACTIONS ON SERVICES COMPUTING

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Employ AI to Improve AI Services : Q-Learning Based Holistic Traffic Control for Distributed Co-Inference in Deep Learning

期刊

IEEE TRANSACTIONS ON SERVICES COMPUTING

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文