Apple and Intel) and lots of ways to parallel program without OpenCL (to compare to). This field is much wider, as you have different OSes, even different OpenCL SDKs for the same processors (e.g. I intentionally didn't write anything about the other use cases of OpenCL, e.g. So they definitely play in the same league, and even if one is 5% faster than the other for some specific problem at the moment, I thing it would not make a difference in a general view. are you able to make use of all the compute unitsįrom my tests, the answer to these questions - will my code use the hardware optimally - is yes for both frameworks.will the compiler create efficient code.will optimal code use all available memory bandwidth.What remains to be tested are things like: Thread scheduling is done in hardware, so they have the same performance there. Maybe you should approach this problem from another angle: what can you do with one of the frameworks that you can't do with the other? They both use the same drivers, so both will support fancy technologies that come out with new hardware. I dont think that that would be time well spent, though. You could choose a few and compare your results. There are the NVIDIA Code Examples, done in both CUDA and OpenCL. This is what you ask for, but I don't know anything like that in existence - people will choose either technology for their bigger projects and won't write everything twice. Results will differ, so you would have to have a big test suite. But you won't be able to deduce from that experiment that you'll get similar results on another program, or with different hardware. The problem with benchmarks is obviously that you only can objectively evaluate very specific things - say, the same program done in CUDA and OpenCL, on the same hardware (as you named a source). Fair warning though, I might get some stuff wrong. Not being a performance/benchmarking expert I can only try to give you a few general thoughts on OpenCL vs. Just to stop speculating about this or that, I'd like to check everything by myself, but I need you to help me! Is there an OpenCL benchmark set, universally accepted, I can use to compare with native code? Is there an analogous of CUDA SDK written in OpenCL code? In the case of Pthread or Windows threads, I really have no idea, but I think that "generality" and multi-architecture approach will always have something to "pay". That is good, ok, but is there a loss of performance by migrating from a native programming library to OpenCL? In the case of nVidia GeForces, I've already found an article were two realizations of the same program - CUDA vs OpenCL code - were compared and the first one seemed to be more performant. OpenCL should be the first parallel architecture programming standard, and it'll be eventually adopted by the most part of programmers. We'll be keeping an eye on how the PC graphics card space develops this year, so stay tuned to for future updates on this.I can read around a lot about OpenCL, and it seems to be the most promising (the only one?) multi-architecture library. With AMD's RDNA2-based GPUs expected to launch within 2020, it makes sense for Nvidia to prepare new GPUs to keep its dominance in the market. Of course, all we have now is speculation and we won't know more till the Nvidia GPU Technology Conference later this month. Again, even the same program with different workloads will yield different winners, but also, performance can vary substantially between performance tiers or pro-grade models. A single benchmark, including a real-world use case, is inconclusive. Turing-based Nvidia GeForce GPUs could reach this level of specs, though we expect to see higher clock speeds than 1.1GHz. AMD has some catching up to do with Vulkan but like I said, performance varies drastically with OpenCL. However, Nvidia does use the same GPU architecture for both its flagship-level Quadro and GeForce GPUs. Judging from the specs so far, these could be early engineering samples of Quadro flagship GPUs. The most powerful of the three GPUs had a crazy benchmark score For reference, the Nvidia GeForce RTX 2080 Ti has a score of 129084. The GeekBench OpenCL scores for the three cards were 222377, 184096, and 141654 points - all very high scores that beat even the GeForce RTX 2080 Ti. It's uncertain what the configurations are, but the folks at TechSpot believe that the GPUs discovered are the next generation of Nvidia's graphics card.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |