Welcome to Journal of Beijing Institute of Technology
Volume 27Issue 3
.
Turn off MathJax
Article Contents
Longfei Li, Zhanzhuang He, Jianfeng Wang, Yangchun Shi, Haiqiang Feng. Small Segment Coalescing: A Hardware Acceleration Method of Receive Side for TCP/IP Processing[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2018, 27(3): 406-418. doi: 10.15918/j.jbit1004-0579.17169
Citation: Longfei Li, Zhanzhuang He, Jianfeng Wang, Yangchun Shi, Haiqiang Feng. Small Segment Coalescing: A Hardware Acceleration Method of Receive Side for TCP/IP Processing[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2018, 27(3): 406-418.doi:10.15918/j.jbit1004-0579.17169

Small Segment Coalescing: A Hardware Acceleration Method of Receive Side for TCP/IP Processing

doi:10.15918/j.jbit1004-0579.17169
  • Received Date:2017-11-20
  • Dealing with a hardware acceleration method, small segment coalescing (SSC) was proposed to achieve the acceleration of TCP/IP processing in the receiving process. To reduce the number of data copy, CPU interruptions and TCP/IP processing, SSC combines small received TCP segments that belong to the same TCP/IP connection into a large TCP package in Network Interface Card (NIC). The whole process is implemented by hardware in NIC so that SSC remains transparent to upper drivers. Based on the intensive study on TCP/IP protocol and NIC mechanism, the coalescing policy is carefully designed to make sure that SSC can make a reasonable decision on whether or when to start or finish coalescing without delay. In addition, SSC is implemented and integrated into LCE5718, which is a totally self-designed dual-port Gigabit Ethernet controller. Finally, the simulation environment is constructed to verify the function of the design. A field programmable gate array (FPGA) prototype is set up, and experiments are conducted to show the performance of SSC in different configurations.
  • loading
  • [1]
    Kaur K, Syed A, Mohammad A, et al. An evaluation of major threats in cloud computing associated with big data[C]//2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA), IEEE, 2017:368-372.
    [2]
    Baiquan X. TCP/IP acceleration stack based on multi-core platform[C]//2014 Sixth International Conference on Measuring Technology and Mechatronics Automation (ICMTMA), IEEE, 2014:651-655.
    [3]
    Li P, Luo Y. P4 GPU:accelerate packet processing of a P4 program with a CPU-GPU heterogeneous architecture[C]//2016 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS), IEEE, 2016:125-126.
    [4]
    Ren Y, Li T, Yu D, et al. Protocols for wide-area data-intensive applications:design and performance issues[C]//Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, IEEE Computer Society Press, 2012:34-45.
    [5]
    Kumar A, Huggahalli R, Makineni S. Characterization of direct cache access on multi-core systems and 10gbe[C]//IEEE 15th International Symposium on High Performance Computer Architecture, IEEE, 2009:341-352.
    [6]
    Kimura B Y L, Lima D C S F, Loureiro A A F. Alternative scheduling decisions for multipath TCP[J]. IEEE Communications Letters, 2017, 21(11):2412-2415.
    [7]
    Liao G, Zhu X, Bnuyan L. A new server I/O architecture for high speed networks[C]//2011 IEEE 17th International Symposium on High Performance Computer Architecture (HPCA), IEEE, 2011:255-265.
    [8]
    MacArthur P, Russell R D. An efficient method for stream semantics over RDMA[C]//2014 IEEE 28th International Parallel and Distributed Processing Symposium, IEEE, 2014:841-851.
    [9]
    Ren Y, Li T, Yu D, et al. Design and testbed evaluation of RDMA-based middleware for high-performance data transfer applications[J]. Journal of Systems and Software, 2013, 86(7):1850-1863.
    [10]
    Li L, Shi Y, Wang J, et al. A VM-friendly NIC architecture for cloud computing[C]//2017 IEEE 2nd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA), IEEE, 2017:196-200.
    [11]
    Lee D, Subramanian L, Ausavarungnirun R, et al. Decoupled direct memory access:isolating CPU and IO traffic by leveraging a dual-data-port DRAM[C]//2015 International Conference on Parallel Architecture and Compilation (PACT), IEEE, 2015:174-187.
    [12]
    Domsch M L, Hernandez H M, Winter R L, et al. Method of handling network traffic through optimization of receive side scaling:US Patent 8,842,562[P]. 2014-09-23.
    [13]
    Muelas D, de Vergara J E L, Ramos J, et al. On the impact of TCP segmentation:experience in VoIP monitoring[C]//2017 IFIP/IEEE Symposium on Integrated Network and Service Management (IM), IEEE, 2017:708-713.
    [14]
    Ding L, Kang P, Yin W, et al. Hardware TCP offload engine based on 10-Gbps ethernet for low-latency network communication[C]//2016 International Conference on Field-Programmable Technology (FPT), IEEE, 2016:269-272.
    [15]
    Chang E H, Wang C C, Liu C T, et al. Virtualization technology for TCP/IP offload engine[J]. IEEE Transactions on Cloud Computing, 2014, 2(2):117-129.
    [16]
    Emmerich P, Raumer D, Wohlfart F, et al. Performance characteristics of virtual switching[C]//2014 IEEE 3rd International Conference on Cloud Networking (CloudNet), IEEE, 2014:120-125.
    [17]
    Gebert S, Zinner T, Lange S, et al. Performance modeling of softwarized network functions using discrete-time analysis[C]//2016 28th International Teletraffic Congress (ITC 28), IEEE, 2016:234-242.
    [18]
    Li L, He Z, Wang J, et al. Implementation of gigabit ethernet controller with fault tolerance and prevention mechanism[C]//2017 Prognostics and System Health Management Conference (PHM-Harbin), IEEE, 2017:1-8.
    [19]
    Hayashi Y, Zhen J Y, Nishiyama S, et al. Method for detecting low-rate attacks on basis of burst-state duration using quick packet-matching function[C]//2017 IEEE International Symposium on Local and Metropolitan Area Networks (LANMAN), IEEE, 2017:1-2.
    [20]
    Sharma K, Badarla V. Curtailing latency in data center network by adopting jumbo frames[C]//2016 IEEE International Conference on Advanced Networks and Telecommunications Systems (ANTS), IEEE, 2016:1-6.
    [21]
    Das A, Debbarma S. Performance of jumbo sized data on jumbo frame and Ethernet frame using UDP over IPv4/IPv6[C]//2013 2nd International Conference on Advanced Computing, Networking and Security (ADCONS), IEEE, 2013:204-207.
    [22]
    Park M Y, Chung S H, Lee Y S. A sender-based packet loss differentiation algorithm based on estimating the queue usage between a TCP sender/receiver[J]. Journal of the Korea Society of Computer and Information, 2011, 16(1):133-142.
  • 加载中

Catalog

    通讯作者:陈斌, bchen63@163.com
    • 1.

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (494) PDF downloads(474) Cited by()
    Proportional views
    Related

    /

      Return
      Return
        Baidu
        map