/docs/MyDocs : contents of Development/libraries/cuda/performance.txt at revision 1

1. Memory allocations

 for () {
    cudaMalloc()
    ....
    cudaFree()
 } 
 
 is significantly faster than
 
 for () {
    cudaMalloc()
    ...
 }
 
 for () {
    cudaFree()
 }
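
 The difference between the two orderings depends on the driver and GPU, so it is worth measuring locally. Below is a minimal timing sketch, not a rigorous benchmark. The iteration count, the buffer size and the helper time_ms are assumptions made for illustration only.

```cuda
#include <chrono>
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical helper: run `work` once and return the wall-clock time in ms.
template <typename F>
static double time_ms(F &&work) {
    auto t0 = std::chrono::steady_clock::now();
    work();
    auto t1 = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(t1 - t0).count();
}

int main() {
    const int    n     = 256;       // assumed iteration count
    const size_t bytes = 1 << 20;   // assumed buffer size (1 MiB)

    cudaFree(0);  // force context creation so it is not part of the timings

    // Ordering 1: allocate and free inside the same iteration.
    double interleaved = time_ms([&] {
        for (int i = 0; i < n; ++i) {
            void *p = nullptr;
            cudaMalloc(&p, bytes);
            cudaFree(p);
        }
    });

    // Ordering 2: allocate everything first, free everything afterwards.
    std::vector<void *> ptrs(n, nullptr);
    double separated = time_ms([&] {
        for (int i = 0; i < n; ++i) cudaMalloc(&ptrs[i], bytes);
        for (int i = 0; i < n; ++i) cudaFree(ptrs[i]);
    });

    cudaError_t err = cudaGetLastError();
    if (err != cudaSuccess) {
        fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }
    printf("malloc/free per iteration:   %.3f ms\n", interleaved);
    printf("all mallocs, then all frees: %.3f ms\n", separated);
    return 0;
}
```

 Compile with nvcc and run it a few times; single runs can be noisy.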
 
 However, it is significantly better still to allocate one large buffer
 up front and have each iteration use its own segment of that buffer.
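
 A minimal sketch of the single-allocation approach follows, assuming all buffers have the same size. The buffer count, the per-buffer size, the 256-byte alignment and the helper align_up are assumptions chosen for illustration; cudaMemset only stands in for the real per-buffer work.

```cuda
#include <cstddef>
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: round `bytes` up to a multiple of `align` so that
// every segment starts on an aligned address. 256 is an assumed value.
static size_t align_up(size_t bytes, size_t align = 256) {
    return (bytes + align - 1) / align * align;
}

int main() {
    const int    n_buffers = 64;                              // assumed count
    const size_t seg_bytes = align_up(1000 * sizeof(float));  // assumed size

    // One allocation for all buffers instead of n_buffers cudaMalloc calls.
    char *pool = nullptr;
    cudaError_t err = cudaMalloc((void **)&pool, n_buffers * seg_bytes);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaMalloc failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    for (int i = 0; i < n_buffers; ++i) {
        // Segment i is only an offset into the pool; no allocator call here.
        float *buf = reinterpret_cast<float *>(pool + i * seg_bytes);
        cudaMemset(buf, 0, seg_bytes);  // stand-in for the real work on buf
    }

    cudaDeviceSynchronize();
    cudaFree(pool);  // one free releases every segment
    return 0;
}
```

 Rounding the segment size up keeps every segment aligned, at the cost of a little padding per buffer.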