±¨ ¸æ ÈË£ºÁÖлª ÍÆËãÒµÎñ²¿Ö÷ÈÎ ÉϺ£½»Í¨´óѧ£¬ÍøÂçÐÅÏ¢ÖÐÐÄ
»ã±¨¹¦·ò£º06ÔÂ06ÈÕ£¨ÖÜÈý£©11:00¡«12:00
»ã±¨µØÖ·£ºÐ£±¾²¿¶«ÇøÍÆËã»ú´óÂ¥402ÊÒ
Ñû Çë ÈË£ºÍ¯Î¬ÇÚ ½ÌÊÚ
»ã±¨ÌáÒª£º
The inadequate public information of China's SW26010 processor's micro-architecture prevents global researchers from improving application performances on the TaihuLight supercomputer. This study aims to illuminate the uncharted area of SW26010 in order to provide important information for performance optimizations and modeling. We developed a micro-benchmark suite, swCandle, to evaluate the key micro-architectural features. The benchmark results revealed some unanticipated findings beyond the publicly available data. For instance, the broadcast mode of register communications has the same latency as the peer-to-peer mode.
»ã±¨È˼ò½é:
ÁÖлªÓÚ¶«¾©¹¤Òµ´óѧ»ñµÃÀíѧ²©Ê¿Ñ§Î»£¬ÖØÒª×êÑз½ÏòΪ¸ß»úÄÜÍÆËãºÍ»úÄÜÓÅ»¯£¬ÔÚ¹úÄÚ±í¸ßˮƽ»áÒéºÍÆÚ¿¯°ä·¢ÂÛÎÄ20ÓàÆª¡£ÏÖÈÎÉϺ£½»Í¨´óÑ§ÍøÂçÐÅÏ¢ÖÐÐÄÍÆËãÒµÎñ²¿Ö÷ÈΣ¬ÕƹÜѧÌõÄÏȽøÔÆ¡¢GPUƽ̨ºÍ³¬ËãÆ½Ì¨¶þÆÚ½¨Éè¡£Áìµ¼ÍÆËã»úϵ˶ʿÉú30Ãû¡£