When benchmarking GPU algorithms, it is important to measure kernel execution time separately from memory transfer times (CPU->GPU and GPU->CPU). Is it possible to measure these times separately for existing algorithms (e.g., GPUBinaryThresholdImageFilter)? Thanks in advance.
Hi @Rafael_Palomar,
Welcome to the ITK community!
To measure the memory transfer times separately, call UpdateBuffers() on the images.
Here is an example:
# ==========================================================================
#
# Copyright NumFOCUS
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0.txt
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
#
# ==========================================================================*/
import itk
import sys
This file has been truncated.
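Since the preview above is cut off, here is a minimal sketch of the timing pattern itself. It is not taken from the linked file. The helper `time_stages` and the stage names are hypothetical, and the stages below are stand-ins so that the snippet runs without a GPU build of ITK. The ITK wiring shown in the comments is an assumption about how you might hook up your own images and filter.

```python
import time


def time_stages(stages):
    """Run each (name, callable) pair in order and return {name: seconds}.

    Hypothetical helper for illustration, not part of ITK.
    """
    timings = {}
    for name, fn in stages:
        start = time.perf_counter()
        fn()
        timings[name] = time.perf_counter() - start
    return timings


# Stand-in stages that only record the call order.
# With ITK you would pass your own callables instead, for example
# (assumed wiring, adapt to your pipeline):
#   ("upload",   input_image.UpdateBuffers)   # force CPU->GPU sync first
#   ("kernel",   gpu_filter.Update)           # run the filter
#   ("download", output_image.UpdateBuffers)  # force GPU->CPU sync last
log = []
timings = time_stages(
    [
        ("upload", lambda: log.append("upload")),
        ("kernel", lambda: log.append("kernel")),
        ("download", lambda: log.append("download")),
    ]
)

for name, seconds in timings.items():
    print(f"{name}: {seconds:.6f} s")
```

The idea is that forcing the buffer synchronization before and after the filter update keeps the transfers out of the kernel measurement. If the kernel launch returns before the GPU has finished, the wait may show up in the following stage, so it is worth checking that the numbers look plausible for your setup.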
dzenanz (Dženan Zukić), July 14, 2021, 2:22pm: 2 posts were split to a new topic: GPU support in Python packages
Thank you, @matt.mccormick! That solved the issue.