Accelerating ray tracing engine of BLENDER on the new Sunway architecture

Abstract With the increasing popularity of high‐resolution displays, there is a growing demand for more realistic rendered images. Ray tracing has become the most effective algorithm for image rendering, but its complexity and large amount of computing data require sophisticated HPC solutions. In th...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhaoqi Sun, Zhen Wang, Mengyuan Hua, Puyu Xiong, Wubing Wan, Ping Gao, Wenlai Zhao, Zhenchun Huang, Lin Han
Format: Article
Language:English
Published: Wiley 2025-01-01
Series:Engineering Reports
Subjects:
Online Access:https://doi.org/10.1002/eng2.12789
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832576645326700544
author Zhaoqi Sun
Zhen Wang
Mengyuan Hua
Puyu Xiong
Wubing Wan
Ping Gao
Wenlai Zhao
Zhenchun Huang
Lin Han
author_facet Zhaoqi Sun
Zhen Wang
Mengyuan Hua
Puyu Xiong
Wubing Wan
Ping Gao
Wenlai Zhao
Zhenchun Huang
Lin Han
author_sort Zhaoqi Sun
collection DOAJ
description Abstract With the increasing popularity of high‐resolution displays, there is a growing demand for more realistic rendered images. Ray tracing has become the most effective algorithm for image rendering, but its complexity and large amount of computing data require sophisticated HPC solutions. In this article, we present our efforts to port the ray tracing engine CYCLES of Blender to the new generation of Sunway supercomputers. We propose optimizations that are tailored to the new hardware architecture, including a multi‐level parallel scheme that efficiently maps and scales Blender onto the novel Sunway architecture, strategies to address memory bottlenecks, a revised task dispatching method that achieves excellent load balancing, and a pipeline approach that maximizes computation and communication overlap. By combining all these optimizations, we achieve a significant reduction in rendering time for a single‐frame image, from 2260 s using the single‐core serial version to 71 s using 48 processes, which is a speedup of about 128×. Accelerating the ray tracing engine CYCLES of Blender in the new generation of Sunway supercomputers.
format Article
id doaj-art-c4817401b2664b2f9e928b6f771e1ffb
institution Kabale University
issn 2577-8196
language English
publishDate 2025-01-01
publisher Wiley
record_format Article
series Engineering Reports
spelling doaj-art-c4817401b2664b2f9e928b6f771e1ffb2025-01-31T00:22:48ZengWileyEngineering Reports2577-81962025-01-0171n/an/a10.1002/eng2.12789Accelerating ray tracing engine of BLENDER on the new Sunway architectureZhaoqi Sun0Zhen Wang1Mengyuan Hua2Puyu Xiong3Wubing Wan4Ping Gao5Wenlai Zhao6Zhenchun Huang7Lin Han8School of Computer and Artificial Intelligence Zhengzhou University Zhengzhou ChinaSchool of Computer and Artificial Intelligence Zhengzhou University Zhengzhou ChinaNational Supercomputing Center in Wuxi Wuxi ChinaNational Supercomputing Center in Wuxi Wuxi ChinaNational Supercomputing Center in Wuxi Wuxi ChinaNational Supercomputing Center in Wuxi Wuxi ChinaNational Supercomputing Center in Wuxi Wuxi ChinaNational Supercomputing Center in Wuxi Wuxi ChinaNational Supercomputing Center in Zhengzhou Zhengzhou ChinaAbstract With the increasing popularity of high‐resolution displays, there is a growing demand for more realistic rendered images. Ray tracing has become the most effective algorithm for image rendering, but its complexity and large amount of computing data require sophisticated HPC solutions. In this article, we present our efforts to port the ray tracing engine CYCLES of Blender to the new generation of Sunway supercomputers. We propose optimizations that are tailored to the new hardware architecture, including a multi‐level parallel scheme that efficiently maps and scales Blender onto the novel Sunway architecture, strategies to address memory bottlenecks, a revised task dispatching method that achieves excellent load balancing, and a pipeline approach that maximizes computation and communication overlap. By combining all these optimizations, we achieve a significant reduction in rendering time for a single‐frame image, from 2260 s using the single‐core serial version to 71 s using 48 processes, which is a speedup of about 128×. Accelerating the ray tracing engine CYCLES of Blender in the new generation of Sunway supercomputers.https://doi.org/10.1002/eng2.12789BlenderCYCLESmany‐core architecturerenderingSunway supercomputer
spellingShingle Zhaoqi Sun
Zhen Wang
Mengyuan Hua
Puyu Xiong
Wubing Wan
Ping Gao
Wenlai Zhao
Zhenchun Huang
Lin Han
Accelerating ray tracing engine of BLENDER on the new Sunway architecture
Engineering Reports
Blender
CYCLES
many‐core architecture
rendering
Sunway supercomputer
title Accelerating ray tracing engine of BLENDER on the new Sunway architecture
title_full Accelerating ray tracing engine of BLENDER on the new Sunway architecture
title_fullStr Accelerating ray tracing engine of BLENDER on the new Sunway architecture
title_full_unstemmed Accelerating ray tracing engine of BLENDER on the new Sunway architecture
title_short Accelerating ray tracing engine of BLENDER on the new Sunway architecture
title_sort accelerating ray tracing engine of blender on the new sunway architecture
topic Blender
CYCLES
many‐core architecture
rendering
Sunway supercomputer
url https://doi.org/10.1002/eng2.12789
work_keys_str_mv AT zhaoqisun acceleratingraytracingengineofblenderonthenewsunwayarchitecture
AT zhenwang acceleratingraytracingengineofblenderonthenewsunwayarchitecture
AT mengyuanhua acceleratingraytracingengineofblenderonthenewsunwayarchitecture
AT puyuxiong acceleratingraytracingengineofblenderonthenewsunwayarchitecture
AT wubingwan acceleratingraytracingengineofblenderonthenewsunwayarchitecture
AT pinggao acceleratingraytracingengineofblenderonthenewsunwayarchitecture
AT wenlaizhao acceleratingraytracingengineofblenderonthenewsunwayarchitecture
AT zhenchunhuang acceleratingraytracingengineofblenderonthenewsunwayarchitecture
AT linhan acceleratingraytracingengineofblenderonthenewsunwayarchitecture