"MergeDistill: Merging Language Models using Pre-trained Distillation."

Simran Khanuja, Melvin Johnson, Partha P. Talukdar (2021)

DOI: 10.18653/v1/2021.findings-acl.254

access: open

type: Conference or Workshop Paper

metadata version: 2021-08-10
