A 28nm 29.2TFLOPS/W BF16 and 36.5TOPS/W INT8 Reconfigurable Digital CIM Processor with Unified FP/INT Pipeline and Bitwise In-Memory Booth Multiplication for Cloud Deep Learning Acceleration.
Fengbin Tu, Yiqi Wang, Zihan Wu, Ling Liang, Yufei Ding, Bongjin Kim, Leibo Liu, Shaojun Wei, Yuan Xie, Shouyi Yin
Published in: ISSCC (2022)