Accelerating ray tracing engine of BLENDER on the new Sunway architecture
Abstract With the increasing popularity of high‐resolution displays, there is a growing demand for more realistic rendered images. Ray tracing has become the most effective algorithm for image rendering, but its complexity and large amount of computing data require sophisticated HPC solutions. In th...
Main Authors: | Zhaoqi Sun, Zhen Wang, Mengyuan Hua, Puyu Xiong, Wubing Wan, Ping Gao, Wenlai Zhao, Zhenchun Huang, Lin Han |
---|---|
Format: | Article |
Language: | English |
Published: | Wiley, 2025-01-01 |
Series: | Engineering Reports |
Subjects: | Blender; CYCLES; many‐core architecture; rendering; Sunway supercomputer |
Online Access: | https://doi.org/10.1002/eng2.12789 |
_version_ | 1832576645326700544 |
---|---|
author | Zhaoqi Sun Zhen Wang Mengyuan Hua Puyu Xiong Wubing Wan Ping Gao Wenlai Zhao Zhenchun Huang Lin Han |
collection | DOAJ |
description | Abstract With the increasing popularity of high‐resolution displays, there is a growing demand for more realistic rendered images. Ray tracing has become the most effective algorithm for image rendering, but its complexity and large amount of computing data require sophisticated HPC solutions. In this article, we present our efforts to port the ray tracing engine CYCLES of Blender to the new generation of Sunway supercomputers. We propose optimizations that are tailored to the new hardware architecture, including a multi‐level parallel scheme that efficiently maps and scales Blender onto the novel Sunway architecture, strategies to address memory bottlenecks, a revised task dispatching method that achieves excellent load balancing, and a pipeline approach that maximizes computation and communication overlap. By combining all these optimizations, we achieve a significant reduction in rendering time for a single‐frame image, from 2260 s using the single‐core serial version to 71 s using 48 processes, which is a speedup of about 128×. Accelerating the ray tracing engine CYCLES of Blender in the new generation of Sunway supercomputers. |
format | Article |
id | doaj-art-c4817401b2664b2f9e928b6f771e1ffb |
institution | Kabale University |
issn | 2577-8196 |
language | English |
publishDate | 2025-01-01 |
publisher | Wiley |
record_format | Article |
series | Engineering Reports |
spelling | doaj-art-c4817401b2664b2f9e928b6f771e1ffb; 2025-01-31T00:22:48Z; eng; Wiley; Engineering Reports; 2577-8196; 2025-01-01; 7; 1; n/a; n/a; 10.1002/eng2.12789. Accelerating ray tracing engine of BLENDER on the new Sunway architecture. Zhaoqi Sun (0), Zhen Wang (1): School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou, China. Mengyuan Hua (2), Puyu Xiong (3), Wubing Wan (4), Ping Gao (5), Wenlai Zhao (6), Zhenchun Huang (7): National Supercomputing Center in Wuxi, Wuxi, China. Lin Han (8): National Supercomputing Center in Zhengzhou, Zhengzhou, China. Abstract With the increasing popularity of high‐resolution displays, there is a growing demand for more realistic rendered images. Ray tracing has become the most effective algorithm for image rendering, but its complexity and large amount of computing data require sophisticated HPC solutions. In this article, we present our efforts to port the ray tracing engine CYCLES of Blender to the new generation of Sunway supercomputers. We propose optimizations that are tailored to the new hardware architecture, including a multi‐level parallel scheme that efficiently maps and scales Blender onto the novel Sunway architecture, strategies to address memory bottlenecks, a revised task dispatching method that achieves excellent load balancing, and a pipeline approach that maximizes computation and communication overlap. By combining all these optimizations, we achieve a significant reduction in rendering time for a single‐frame image, from 2260 s using the single‐core serial version to 71 s using 48 processes, which is a speedup of about 128×. Accelerating the ray tracing engine CYCLES of Blender in the new generation of Sunway supercomputers. https://doi.org/10.1002/eng2.12789. Blender; CYCLES; many‐core architecture; rendering; Sunway supercomputer |
title | Accelerating ray tracing engine of BLENDER on the new Sunway architecture |
topic | Blender CYCLES many‐core architecture rendering Sunway supercomputer |
url | https://doi.org/10.1002/eng2.12789 |