Comyco: Quality-Aware Adaptive Video Streaming via Imitation Learning

Learning-based Adaptive Bit Rate (ABR) methods, which aim to learn strong strategies without any presumptions, have become a research hotspot in adaptive streaming. However, they still suffer from several issues, notably low sample efficiency and a lack of awareness of video quality information. In this paper, we propose Comyco, a video quality-aware ABR approach that substantially improves learning-based methods by tackling these issues. Comyco trains its policy by imitating expert trajectories given by an instant solver, which both avoids redundant exploration and makes better use of the collected samples. Meanwhile, Comyco attempts to pick chunks with higher perceptual video quality rather than higher video bitrate. To achieve this, we build Comyco's neural network architecture, video datasets, and QoE metrics around video quality features. Using trace-driven and real-world experiments, we demonstrate significant improvements in Comyco's sample efficiency compared with prior work: a 1700x improvement in the number of samples required and a 16x improvement in training time. Moreover, the results show that Comyco outperforms previously proposed methods, improving average QoE by 7.5% to 16.79%. In particular, Comyco surpasses the state-of-the-art approach Pensieve by 7.37% in average video quality under the same rebuffering time.
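To make the idea of imitating an expert concrete, the following is a minimal sketch of supervised imitation (behavioral cloning) on a toy ABR problem. It is not Comyco's solver, network, or QoE metric: the bitrate ladder, chunk length, expert rule, feature choice, and every function name here are assumptions made for illustration. The toy expert only avoids rebuffering and ignores perceptual video quality.

```python
import math
import random

# Hypothetical bitrate ladder (Mbps) and chunk length (s); not taken from the paper.
BITRATES = [0.5, 1.5, 4.0]
CHUNK_SECONDS = 4.0


def expert_action(throughput, buffer_s):
    """Toy stand-in for an expert solver: choose the highest level whose
    download time does not exceed the current buffer (no rebuffering)."""
    best = 0
    for level, rate in enumerate(BITRATES):
        if CHUNK_SECONDS * rate / throughput <= buffer_s:
            best = level
    return best


def features(throughput, buffer_s):
    # Log of the bitrate that could be fetched within the buffer, plus a bias term.
    return [math.log(buffer_s * throughput / CHUNK_SECONDS), 1.0]


def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy_probs(weights, feats):
    return softmax([sum(w * f for w, f in zip(row, feats)) for row in weights])


def mean_loss(weights, data):
    # Average cross-entropy between the policy and the expert's chosen actions.
    return -sum(math.log(policy_probs(weights, f)[a]) for f, a in data) / len(data)


def train(data, epochs=300, lr=0.1):
    """Full-batch gradient descent on the cross-entropy to the expert labels."""
    weights = [[0.0, 0.0] for _ in BITRATES]
    for _ in range(epochs):
        grads = [[0.0, 0.0] for _ in BITRATES]
        for feats, action in data:
            probs = policy_probs(weights, feats)
            for k in range(len(BITRATES)):
                err = probs[k] - (1.0 if k == action else 0.0)
                for j, f in enumerate(feats):
                    grads[k][j] += err * f / len(data)
        for k in range(len(BITRATES)):
            for j in range(2):
                weights[k][j] -= lr * grads[k][j]
    return weights


# Sample states, label each with the expert's action, then fit the policy.
rng = random.Random(0)
states = [(rng.uniform(0.3, 6.0), rng.uniform(1.0, 20.0)) for _ in range(500)]
dataset = [(features(t, b), expert_action(t, b)) for t, b in states]

initial_loss = mean_loss([[0.0, 0.0] for _ in BITRATES], dataset)
trained = train(dataset)
final_loss = mean_loss(trained, dataset)
print(f"cross-entropy: {initial_loss:.3f} -> {final_loss:.3f}")
```

Every sampled state gets a direct supervised label from the expert, so the learner does not need to discover good actions through trial-and-error exploration. This is the intuition behind the sample-efficiency argument above.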
