小黑屋
原帖由 u571 于 2011-11-23 12:07 发表 543MP4浮点shader性能连RSX十分之一都没有,RSX的十分之一能做什么特效?
查看详细资料
TOP
原帖由 hourousha 于 2011-11-23 13:50 发表 USSE2什么时候有9 FP32 Flops per pipe了? 你要说它可以Vec4 + Scalar并发那没问题,但是Scalar是FP32,Vec4可是FX10 x4而不是FP32 x4的哦。 所以,在Vec4做FMADD的情况下,它也只是1 FP32 Flops+ 8 FX10 ops而 ...
Yes it is FP32 ops, as Arun indicated each pipeline is VEC4 F32 FMAD, in addition we can run a parallel (real) floatingpoint op giving 9 F32 flops per pipe or 36 per core.
543 has 4 USSE2 pipes, each capable of 4 FP32 MADDs and one additional FP32 operation which IMG has never really talked about (best case it's an extra ADD, worst case it only does stupid stuff like format conversions, I don't know). FP16 and INT8 performance is higher but not massively so.