"LATTE: Low-Precision Approximate Attention with Head-wise Trainable ..."

Jiing-Ping Wang, Ming-Guang Lin, An-Yeu Wu (2024)

Details and statistics

DOI: 10.48550/ARXIV.2404.07519

access: open

type: Informal or Other Publication

metadata version: 2024-05-29

a service of  Schloss Dagstuhl - Leibniz Center for Informatics