• Không có kết quả nào được tìm thấy

Business Challenge: Get the Fastest Image Processing Capability with the Least Computing Resources

N/A
N/A
Protected

Academic year: 2022

Chia sẻ "Business Challenge: Get the Fastest Image Processing Capability with the Least Computing Resources "

Copied!
4
0
0

Loading.... (view fulltext now)

Văn bản

(1)

Executive Summary

Applications that feature streaming images, processing, and storage need transcoding and image processing that keeps up with users’ demands. The high-performance, powerful FPGA-based CTAccel Image Processor (CIP) benefits data centers by increasing image processing throughput, reducing computational latency, and reducing total cost of ownership (TCO). CIP redefines data center image processing with an FPGA architecture that uses massively parallel algorithms to increase computational performance.

Business Challenge: Get the Fastest Image Processing Capability with the Least Computing Resources

Internet traffic is increasing dramatically (by 24% annually), and images comprise a large portion of internet data. Many companies are dealing with huge quantities of images in the data center and performing various image processing tasks (Table 1).

For example, image decoding, resizing, cropping, and encoding are typical tasks that use large numbers of servers. These functions are resource intensive, and CPU performance per core is not keeping pace with demand.

The CIP accelerator accelerates JPEG, WebP, and Lepton decoding and encoding, as well as image resizing and cropped pixel processing. The CIP accelerator effectively accelerates thumbnail generation and image transcoding, common image processing functions such as sharpening and watermarking, and image analytics.

This solution brief describes JPEG to Lepton and JPEG to WebP CIP accelerator functions.

Data Center FPGA Heterogeneous Computing

Data Center FPGA-Based Image Processing Accelerator

CTAccel Image Processor (CIP) Running on an Intel® FPGA Greatly Improves Image

Processing Performance in the Data Center

Accelerating JPEG, WebP and Lepton decoding, encoding and resizing on Intel®

Xeon®-based servers by offloading all functions to the Intel FPGA.

Application Sample Enterprises

Social Network Facebook, Instagram, Twitter Cloud Storage iCloud, Dropbox, Microsoft OneDrive*

Mobile Instant Message WhatsApp*, Snapchat*

CDN Provider Akamai, Verizon Digital Media Services

E-Commerce Amazon, eBay, Google

Table 1. Example Enterprises Using Image Processing

Accelerated Functions

• JPEG to Lepton

• JPEG to WebP (M6)

• JPEG to JPEG

• JPEG to HEIF

Use Cases

• Thumbnail generation

• Resizing / cropping

• Watermarking

• Brightness / Contrast

• Sharpening

• Maincolor

• Image storage

Solution Brief

(2)

Solution Brief | CTAccel Image Processing Accelerator

Accelerating JPEG to Lepton Image Format Conversion with CIP Lepton

Lepton is an open source tool and file format for lossless compression of JPEG images. It reduces file sizes an average of 22% yet preserves the original JPEG file, including all its metadata.

Lepton compression and decompression represent heavy computational workloads for CPUs. A conventional x86 server with dual E5-2630 processors can compress JPEG files into Lepton format at only 20 megabytes per second.

CTAccel Image Processor Lepton (CIP Lepton) is an FPGA-based heterogeneous computing solution that offloads Lepton compression to the FPGA, increasing throughput by 3X, shortening latency by 4X, and reducing TCO. CIP redefines image processing in state-of-the-art data centers utilizing massively parallel data algorithms by significantly increasing their computational performance.

Accelerating JPEG to WebP Image Format Conversion with CIP WebP

WebP is a new image format for either lossless or lossy compression of web images. WebP allows web developers to create smaller, richer images for faster web pages. WebP lossy images are 25-34% smaller than comparable JPEG images of equivalent quality.

CTAccel Image Processor WebP (CIP WebP) is an FPGA-based image processing acceleration solution that greatly improves image processing and analytics performance by transfering the CPU workload to the FPGA. CIP’s powerful processing capability increases data center image processing throughput 3-4X, reduces computational latency by 3X, and reduces TCO.

High Performance

Reduces latency and increases throughput

Low Power

Software Compatible

Ease of Maintenance

20 Watts per card reduces TCO

Compatible with ImageMagick, OpenCV, GraphicsMagick, Lepton

Upgrade remotely and switch context without rebooting

High Performance

Reduces latency and increases throughput

Low Power

Software Compatible

Ease of Maintenance

20 Watts per card reduces TCO

Compatible with ImageMagick, OpenCV, GraphicsMagick, Lepton

Upgrade remotely and switch context without rebooting

High Performance

Reduces latency and increases throughput

Low Power

Software Compatible

Ease of Maintenance

20 Watts per card reduces TCO

Compatible with ImageMagick, OpenCV, GraphicsMagick, Lepton

Upgrade remotely and switch context without rebooting

High Performance

Reduces latency and increases throughput

Low Power

Software Compatible

Ease of Maintenance

20 Watts per card reduces TCO

Compatible with ImageMagick, OpenCV, GraphicsMagick, Lepton

Upgrade remotely and switch context without rebooting

High Performance

Increased server throughput up to 4-10X while reducing

latency is made possible by offloading

all JPEG decoding, pixel-level processing,

and encoding to the FPGA

Low Power With the FPGA consuming only 20

watts, the CIP can drastically increase

compute density, translating to reduced TCO

Software Compatible CIP is compatible with the most popular image

processing software:

ImageMagick*, OpenCV*, GraphicsMagick, Lepton*, and

HEIF; without the need for any system software modifications

Ease of Maintenance The FPGA accelerator can be reconfigured

remotely, to easily optimize the performance of any custom usage scenario CTAccel Image Processor (CIP)

High-Performance FPGA-Based Image Processing Accelerator

UGC Web Portals News Apps

Cloud Storage

Cloud Album

Scenarios

Values

LeptonJPEG Resizing Crop

JPEGWebP Lepton HEIF

CIP

Pixel

Processing Encode Decode

Functions

Software Compatibility

Integrating CIP in user applications

OpenCV

ImageMagick*

GraphicsMagick

Lepton Enhanced

Throughput Latency Reduction ReductionTCO

3X 4-10X 4-10X

Social Network E-Commerce

2

(3)

Server

FPGA: Intel® PAC

Host

Resize

Decode

(Other formats) Pixel Processing (Crop/Sharpen/...)

Encode (Other formats)

Lepton

WebP(M6) HEIF

JPEG

Other Formats WebP(M6)

Encode Lepton Encode

EncodeJPEG DecodeHEIF

DecodeJPEG

Input Images

System Configuration

The CIP accelerator features an Intel® Programmable Acceleration Card with Intel Arria® 10 GX FPGA.

CTAccel used the system configurations shown in Figure 1 to generate the metrics cited in this solution brief.

Figure 1. System Configuration

Solution Brief | CTAccel Image Processing Accelerator

Figure 2. Intel Programmable Acceleration Card with Intel Arria GX FPGA

Function JPEG to WebP JPEG to Lepton JPEG to JPEG

CPU Two Intel® Xeon® E5-2680 v4 Two Intel® Xeon® E5-2680 v4 Two Intel® Xeon® E5-2680 v4

RAM 128 GB 128 GB 128 GB

Operating System CentOS v7.2.1511 CentOS v7.2.1511 CentOS v7.2.1511

Kernel 3.10.0-514.2.2.el7.x86_64 3.10.0-514.2.2.el7.x86_64 3.10.0-514.2.2.el7.x86_64 Input 10,000 - 1024 x 768 images

10,000 - 4096 x 2160 images 10,000 - 1024 x 768 images 10,000 images

1024 x 768 input, 240 x 180 output Table 2. System Configuration for Metrics

3

(4)

Intel does not control or audit third-party data. You should review this content, consult other sources, and confirm whether referenced data are accurate.

Cost reduction scenarios described are intended as examples of how a given Intel-based product, in the specified circumstances and configurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does not guarantee any costs or cost reduction.

Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to intel.com/performance.

Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software, or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Check with your system manufacturer or retailer, or learn more at intel.com.

† Tests measure performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit www.intel.com/benchmarks. Intel does not control or audit third-party data. You should review this content, consult other sources, and confirm weather referenced data are accurate.

© Intel Corporation. Intel, the Intel logo, the Intel Inside mark and logo, Altera, Arria, Cyclone, Enpirion, Experience What’s Inside, Intel Atom, Intel Core, Intel Xeon, MAX, Nios, Quartus and Stratix words and logos are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other countries. See Trademarks on intel.com for full list of Intel trademarks. *Other marks and brands may be claimed as the property of others.

DS-1008-1.2 Solution Brief | CTAccel Image Processing Accelerator

Solution Value

Adding an FPGA to an already powerful CPU provides compelling advantages. While a graphics processing unit (GPU) architecture can be ideal for some video applications, it is not as effective a computing platform for image processing.

The CIPs high-performance processing capabilities increase image processing throughput, reduce

computational latency, and TCO. The customer benefits from the CIP include:

• Reduces TCO by 3X

• Increases image processing throughput by 4-10X

• Reduces computational latency by 4-10X

Figure 3 illustrates the CIP’s overall solution architecture.

CTAccel provides powerful functions for customers who have heavy image processing loads and want to accelerate functions, such as image transcoding, JPEG thumbnail generation, sharpening, main color, watermarking, and brightness and contrast adjustments.

Conclusion

The high-performance FPGA-based CIP helps customers improve image processing performance by transferring computational workloads from CPUs to FPGAs. This increases image processing throughput, reduces computational latency, and TCO.

Find the solution that is right for you. Contact your Intel representative or visit http://www.ct-accel.com.

Smart Phone, PAD Camera PC

E-Commerce Social Network News App End User

Application

JPEG Decode Resize CIP

FPGA

WebP Encode

Figure 3. Solution Architecture (Example)

Learn More

• Learn more about the CIP at http://www.ct-accel.com

• Learn more about Intel® IoT Gateway solutions at http://

www.intel.com/content/www/us/en/internet-of-things/

gateway-solutions.html

4

Tài liệu tham khảo

Tài liệu liên quan

The Centre for Genetic Manipulation of Crop Plants (CGMCP) was established in 1996 at the South Campus of the University of Delhi with funding from the National Dairy

Configurations for (1) “Up to 2x more inference throughput improvement on Intel® Xeon® Platinum 9282 processor with Intel® DL Boost” + (2) “Up to 30X AI performance with Intel®

As a complement to data center and cloud computing systems, edge computing places compute resources and data processing capabilities at the edge of the network, closer to the

The silica nanoparticle size distribution can be determined by either dynamic light scattering (DSL) [24], or estimating from SEM/TEM image data, using image analysis software- Image

levels of wavelet's decomposition, edge detection using wavelet can be worked well with the noisy image and images of large size.. The image is used here for the

• Intel is integrating competitive packet processing and data plane features in the Intel® Xeon® Scalable processor, to complement the infrastructure acceleration capabilities

In [8], authors propose several methods of noise removal from degraded images with Gaussian noise by using adaptive wavelet threshold (Bayes Shrink, Modified Bayes

“stakeholder” is any individual or group of people who are directly or indirectly affected by the actions of the organization. This theory assumes that, in addition