Expressive Algorithmic Programming with Thrust
Thrust is a parallel algorithms library which resembles the C++ Standard Template Library (STL). Thrust's High-Level interface greatly enhances programmer Productivity while enabling performance...
View ArticleNVIDIA Nsight Tegra
Develop native applications for the world's most popular mobile platform, using the world’s most popular integrated development environment on the world's first mobile super processor.
View ArticleThinking Parallel, Part III: Tree Construction on the GPU
In part II of this series, we looked at hierarchical tree traversal as a means of quickly identifying pairs of potentially colliding 3D objects and we demonstrated how optimizing for low divergence can...
View ArticleCUDA Pro Tip: Flush Denormals with Confidence
I want to keep this post fairly brief, so I will only give minimal background on floating point numbers. If you need a refresher on floating point representation, I recommend starting with the...
View ArticleCUDA Pro Tip: Kepler Texture Objects Improve Performance and Flexibility
The Kepler architecture introduces texture objects, a new feature that makes textures easier to use and higher performance.
View ArticleDeveloping Portable CUDA C/C++ Code with Hemi
Software development is as much about writing code fast as it is about writing fast code, and central to rapid development is software reuse and portability. When building heterogeneous applications,...
View ArticleNumbaPro: High-Performance Python with CUDA Acceleration
Python is a high-productivity dynamic programming language that is widely used in science, engineering, and data analytics applications. There are a number of factors influencing the popularity of...
View Article
More Pages to Explore .....