Unleashing the Potential of PIM: Accelerating Large Batched Inference of Transformer-Based Generative Models

标题
Unleashing the Potential of PIM: Accelerating Large Batched Inference of Transformer-Based Generative Models
作者
关键词
-
出版物
IEEE Computer Architecture Letters
Volume 22, Issue 2, Pages 113-116
出版商
Institute of Electrical and Electronics Engineers (IEEE)
发表日期
2023-08-16
DOI
10.1109/lca.2023.3305386

向作者/读者发起求助以获取更多资源

Find Funding. Review Successful Grants.

Explore over 25,000 new funding opportunities and over 6,000,000 successful grants.

Explore

Add your recorded webinar

Do you already have a recorded webinar? Grow your audience and get more views by easily listing your recording on Peeref.

Upload Now