- 易迪拓培训,专注于微波、射频、天线设计工程师的培养
feko 6.0 将支持GPU 用于MOM法
录入:edatop.com 点击:
GPU Acceleration of Electromagnetic Solutions in FEKO
Document ActionsPrint this Send this
Complex solution algorithms as available in FEKO (e.g. MoM, PO, UTD, FEM, or MLFMM) were historically not well suited to acceleration with GPUs (graphical processing units). These worked best for FDTD or similar, where the same operation is repeated over and over again. However, recent advances in software technology have made it possible to accelerate run-time intensive solution phases in FEKO (to be available in the next release FEKO Suite 6.0).
GPU acceleration hardware
Recently high-performance computing has seen a boost through the usage of parallel processing capabilities of multi-core GPUs (graphics processing units). Even fairly standard graphics cards found in typical engineering desktop PCs have computing capabilities exceeding those of high-end CPUs.
EM Software & Systems – S.A. (Pty) Ltd (EMSS) is proud to announce that in the next release of its electromagnetic solver FEKO Suite 6.0 GPU and mixed CPU/GPU processing will be integrated for run-time critical phases of the solution process of some of the solvers integrated in FEKO.
For moderate to large size Method of Moments (MoM) problems, the most run-time critical solution phase is the solution of the system of linear equations. Even though FEKO employs highly optimised libraries for this phase for the various CPUs, the GPU capabilities can beat this performance by more than one order of magnitude.
As example, we have studied the radar scattering from a metallic object over a frequency band with frequency dependent meshing (i.e. using between 2 700 and 10 000 unknowns) and compare the time for the matrix solution phase (i.e. excluding times for matrix setup, near- and far-field etc.). Fig. 1 shows that for an NVidia GTX295 graphics card (using one GPU only) the time is 13.9 sec, while on the three CPUs tested for comparison the times vary between 122 and 397 seconds (i.e. GPU is a factor of 9 to 29 faster than these CPUs).
Figure 1: FEKO run-time for the MoM solution phase of solving the system of linear
equations using different CPUs (Intel and AMD)and GPUs (NVidia Graphics Cards).
Often the performance is measured in GFLOPS (billion floating point operations per second), and for the same radar scattering problem the achieved FEKO performance for this solution phase of solving the MoM system of linear equations is depicted in Fig. 2. Also from this graph the superiority of GPU processing for such types of calculations is obvious.
Figure 2: GFLOPS performance in FEKO for the MoM solution phase of solving the system of
linear equations using different CPUs (Intel and AMD) and GPUs (two NVidia graphics cards).
Note that the results presented here might further improve (optimisations) by the time of the release of FEKO Suite 6.0. It is also expected that the release of new GPU hardware will greatly impact performance (for instance for double precision algebra a significant boost is expected by NVidia’s Fermi architecture). It is also planned to have other solution phases of FEKO for other techniques accelerated by GPUs.
Document ActionsPrint this Send this
Complex solution algorithms as available in FEKO (e.g. MoM, PO, UTD, FEM, or MLFMM) were historically not well suited to acceleration with GPUs (graphical processing units). These worked best for FDTD or similar, where the same operation is repeated over and over again. However, recent advances in software technology have made it possible to accelerate run-time intensive solution phases in FEKO (to be available in the next release FEKO Suite 6.0).
GPU acceleration hardware
Recently high-performance computing has seen a boost through the usage of parallel processing capabilities of multi-core GPUs (graphics processing units). Even fairly standard graphics cards found in typical engineering desktop PCs have computing capabilities exceeding those of high-end CPUs.
EM Software & Systems – S.A. (Pty) Ltd (EMSS) is proud to announce that in the next release of its electromagnetic solver FEKO Suite 6.0 GPU and mixed CPU/GPU processing will be integrated for run-time critical phases of the solution process of some of the solvers integrated in FEKO.
For moderate to large size Method of Moments (MoM) problems, the most run-time critical solution phase is the solution of the system of linear equations. Even though FEKO employs highly optimised libraries for this phase for the various CPUs, the GPU capabilities can beat this performance by more than one order of magnitude.
As example, we have studied the radar scattering from a metallic object over a frequency band with frequency dependent meshing (i.e. using between 2 700 and 10 000 unknowns) and compare the time for the matrix solution phase (i.e. excluding times for matrix setup, near- and far-field etc.). Fig. 1 shows that for an NVidia GTX295 graphics card (using one GPU only) the time is 13.9 sec, while on the three CPUs tested for comparison the times vary between 122 and 397 seconds (i.e. GPU is a factor of 9 to 29 faster than these CPUs).
Figure 1: FEKO run-time for the MoM solution phase of solving the system of linear
equations using different CPUs (Intel and AMD)and GPUs (NVidia Graphics Cards).
Often the performance is measured in GFLOPS (billion floating point operations per second), and for the same radar scattering problem the achieved FEKO performance for this solution phase of solving the MoM system of linear equations is depicted in Fig. 2. Also from this graph the superiority of GPU processing for such types of calculations is obvious.
Figure 2: GFLOPS performance in FEKO for the MoM solution phase of solving the system of
linear equations using different CPUs (Intel and AMD) and GPUs (two NVidia graphics cards).
Note that the results presented here might further improve (optimisations) by the time of the release of FEKO Suite 6.0. It is also expected that the release of new GPU hardware will greatly impact performance (for instance for double precision algebra a significant boost is expected by NVidia’s Fermi architecture). It is also planned to have other solution phases of FEKO for other techniques accelerated by GPUs.
牛逼的,支持一下啊
顶顶顶顶顶顶顶顶顶顶顶顶顶顶顶顶顶顶顶顶顶!
期待feko6.0的到来!
这功能很有吸引力,几十倍的加速啊…………啧啧……
看这个题目让人感到心动啊,期待feko6.0
对网格数量到好几百万时的加速性该如何处理.我想这会是更大的挑战.
很期待..
CST微波工作室培训课程套装,专家讲解,视频教学,帮助您快速学习掌握CST设计应用
上一篇:POSTFEKO显示输入阻抗、S参数、VSWR等参数问题
下一篇:Feko中有无根据需要随意设置入射和散射方向之间关系的功能