DOI: 10.1145/2018602.2018615

Identifying performance bottlenecks in CDNs through TCP-level monitoring

Published: 19 August 2011

Abstract

Content distribution networks (CDNs) need to make decisions, such as server selection and routing, to improve performance for their clients. The performance may be limited by various factors such as packet loss in the network, a small receive buffer at the client, or constrained server CPU and disk resources. Conventional measurement techniques are not effective for distinguishing these performance problems: application-layer logs are too coarse-grained, while network-level traces are too expensive to collect all the time. We argue that passively monitoring the transport-level statistics in the server's network stack is a better approach.
This paper presents a tool for monitoring and analyzing TCP statistics, and an analysis of a CoralCDN node in PlanetLab for six weeks. Our analysis shows that more than 10% of connections are server-limited at least 40% of the time, and many connections are limited by the congestion window despite no packet loss. Still, we see that clients in 377 Autonomous Systems (ASes) experience persistent packet loss. By separating network congestion from other performance problems, our analysis provides a much more accurate view of the performance of the network paths than what is possible with server logs alone.
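The idea of attributing each connection's slowdown to the server, the client's receive buffer, the congestion window, or packet loss can be illustrated with a minimal sketch. This is not the paper's tool or its classification rules: the `TcpSnapshot` fields, the thresholds, and the functions `classify_interval` and `limit_fractions` are hypothetical names chosen here, assuming the server periodically polls per-connection TCP statistics of the kind defined in RFC 4898.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class TcpSnapshot:
    """One polled sample of a connection's TCP state (hypothetical fields)."""
    cwnd_bytes: int        # congestion window
    rwnd_bytes: int        # receive window advertised by the client
    inflight_bytes: int    # bytes sent but not yet acknowledged
    unsent_bytes: int      # bytes queued by the application, not yet sent
    retransmits: int       # cumulative count of retransmitted segments
    mss_bytes: int = 1460  # maximum segment size


def classify_interval(prev: TcpSnapshot, cur: TcpSnapshot) -> str:
    """Label the interval ending at `cur` with one assumed limiting factor."""
    # Assumption: any new retransmission in the interval indicates loss.
    if cur.retransmits > prev.retransmits:
        return "network-loss"
    # Assumption: an empty send queue means the server application
    # (CPU, disk, or simply no more data) is not supplying bytes.
    if cur.unsent_bytes == 0:
        return "server"
    # Data is waiting; check whether a window blocks one more segment.
    window = min(cur.cwnd_bytes, cur.rwnd_bytes)
    if cur.inflight_bytes + cur.mss_bytes > window:
        if cur.rwnd_bytes <= cur.cwnd_bytes:
            return "receive-window"
        return "congestion-window"
    return "unclassified"


def limit_fractions(snapshots: List[TcpSnapshot]) -> Dict[str, float]:
    """Fraction of polling intervals attributed to each limiting factor."""
    labels = [classify_interval(a, b) for a, b in zip(snapshots, snapshots[1:])]
    if not labels:
        return {}
    counts = Counter(labels)
    return {label: n / len(labels) for label, n in counts.items()}


if __name__ == "__main__":
    samples = [
        TcpSnapshot(14600, 65535, 0, 0, 0),
        TcpSnapshot(14600, 65535, 0, 0, 0),         # nothing queued
        TcpSnapshot(14600, 65535, 14600, 5000, 0),  # cwnd is full
        TcpSnapshot(14600, 8000, 8000, 5000, 0),    # rwnd is full
        TcpSnapshot(7300, 65535, 2000, 5000, 1),    # new retransmission
    ]
    print(limit_fractions(samples))
```

Checking for loss first and the send queue second is a design choice of this sketch; a connection that is congestion-window-limited without any retransmissions, as the abstract reports for many connections, falls through to the window comparison.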


        Published In

        W-MUST '11: Proceedings of the first ACM SIGCOMM workshop on Measurements up the stack
        August 2011
        74 pages
        ISBN:9781450308007
        DOI:10.1145/2018602

        Publisher

        Association for Computing Machinery

        New York, NY, United States


        Author Tags

        1. content distribution network
        2. performance bottleneck
        3. tcp

        Qualifiers

        • Research-article

        Conference

        SIGCOMM '11: ACM SIGCOMM 2011 Conference
        August 19, 2011
        Toronto, Ontario, Canada

        Cited By

        • (2023) FlowPinpoint: Localizing Anomalies in Cloud-client Services for Cloud Providers. IEEE Transactions on Cloud Computing, pp. 1-15. DOI: 10.1109/TCC.2023.3257162. Online publication date: 2023
        • (2023) A Performance-Based Assessment Approach for Cloud Service Provider Selection. New Sustainable Horizons in Artificial Intelligence and Digital Solutions, pp. 250-264. DOI: 10.1007/978-3-031-50040-4_19. Online publication date: 9-Nov-2023
        • (2021) RBA: Adaptive TCP Receive Buffer Sizing. 2021 IEEE Symposium on Computers and Communications (ISCC), pp. 1-7. DOI: 10.1109/ISCC53001.2021.9631487. Online publication date: 5-Sep-2021
        • (2020) Online Monitoring of TCP Throughput Limitations. NOMS 2020 - 2020 IEEE/IFIP Network Operations and Management Symposium, pp. 1-9. DOI: 10.1109/NOMS47738.2020.9110324. Online publication date: Apr-2020
        • (2019) Understanding the Networking Performance of Wear OS. Proceedings of the ACM on Measurement and Analysis of Computing Systems, 3:1, pp. 1-25. DOI: 10.1145/3322205.3311074. Online publication date: 26-Mar-2019
        • (2019) I Sent It. Proceedings of the Fourteenth EuroSys Conference 2019, pp. 1-15. DOI: 10.1145/3302424.3303961. Online publication date: 25-Mar-2019
        • (2019) TCP Stalls at the Server Side. IEEE/ACM Transactions on Networking, 27:1, pp. 272-287. DOI: 10.1109/TNET.2018.2886282. Online publication date: 1-Feb-2019
        • (2019) An assessment model for identifying maintenance priorities strategy for bridges. Ain Shams Engineering Journal. DOI: 10.1016/j.asej.2019.06.003. Online publication date: Aug-2019
        • (2018) Optimizing Internet Transit Routing for Content Delivery Networks. IEEE/ACM Transactions on Networking, 26:1, pp. 76-89. DOI: 10.1109/TNET.2017.2761752. Online publication date: 1-Feb-2018
        • (2017) Accelerating Mobile Web Loading Using Cellular Link Information. Proceedings of the 15th Annual International Conference on Mobile Systems, Applications, and Services, pp. 427-439. DOI: 10.1145/3081333.3081367. Online publication date: 16-Jun-2017
