☆ 3.8 Proceedings Paper

Accelerating Deep Learning Classification with Error-controlled Approximate-key Caching

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2022) (2022)

期刊

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2022)

卷 -, 期 -, 页码 2118-2127

出版社

IEEE

DOI: 10.1109/INFOCOM48880.2022.9796677

关键词

类别

Computer Science, Hardware & Architecture Engineering, Electrical & Electronic Telecommunications

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This article discusses the current situation and challenges of using Deep Learning (DL) technologies to solve networking problems. It proposes a novel caching paradigm to reduce computational complexity and introduces an error-correction algorithm to improve the efficacy of approximate caching.

While Deep Learning (DL) technologies are a promising tool to solve networking problems that map to classification tasks, their computational complexity is still too high with respect to real-time traffic measurements requirements. To reduce the DL inference cost, we propose a novel caching paradigm, that we named approximate-key caching, which returns approximate results for lookups of selected input based on cached DL inference results. While approximate cache hits alleviate DL inference workload and increase the system throughput, they however introduce an approximation error. As such, we couple approximate-key caching with an error-correction principled algorithm, that we named auto-refresh. We analytically model our caching system performance for classic LRU and ideal caches, we perform a trace-driven evaluation of the expected performance, and we compare the benefits of our proposed approach with the state-of-the-art similarity caching - this testifies the practical interest of our proposal.

Accelerating Deep Learning Classification with Error-controlled Approximate-key Caching

期刊

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2022)

出版社

IEEE

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Accelerating Deep Learning Classification with Error-controlled Approximate-key Caching

期刊

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2022)

出版社

IEEE

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文