default search action
Search dblp
Full-text search
- > Home
Please enter a search query
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal" - exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics" - boolean and: separate words by space
e.g., codd model - boolean or: connect words by pipe symbol (|)
e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search result, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
Author search results
no matches
Venue search results
no matches
Refine list
refine by author
- no options
- temporarily not available
refine by venue
- no options
- temporarily not available
refine by type
- no options
- temporarily not available
refine by access
- no options
- temporarily not available
refine by year
- no options
- temporarily not available
Publication search results
found 38 matches
- 2024
- Heming Xia, Zhe Yang, Qingxiu Dong, Peiyi Wang, Yongqi Li, Tao Ge, Tianyu Liu, Wenjie Li, Zhifang Sui:
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding. ACL (Findings) 2024: 7655-7671 - Jun Zhang, Jue Wang, Huan Li, Lidan Shou, Ke Chen, Gang Chen, Sharad Mehrotra:
Draft& Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding. ACL (1) 2024: 11263-11282 - Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Zeyu Wang, Zhengxin Zhang, Rae Ying Yee Wong, Alan Zhu, Lijie Yang, Xiaoxiang Shi, Chunan Shi, Zhuoming Chen, Daiyaan Arfeen, Reyna Abhyankar, Zhihao Jia:
SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification. ASPLOS (3) 2024: 932-949 - Heming Xia, Zhe Yang, Qingxiu Dong, Peiyi Wang, Yongqi Li, Tao Ge, Tianyu Liu, Wenjie Li, Zhifang Sui:
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding. CoRR abs/2401.07851 (2024) - Siqi Wang, Hailong Yang, Xuezhu Wang, Tongxuan Liu, Pengbo Wang, Xuning Liang, Kejie Ma, Tianyu Feng, Xin You, Yongjun Bao, Yi Liu, Zhongzhi Luan, Depei Qian:
Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding. CoRR abs/2402.15678 (2024) - Aonan Zhang, Chong Wang, Yi Wang, Xuanyu Zhang, Yunfei Cheng:
Recurrent Drafter for Fast Speculative Decoding in Large Language Models. CoRR abs/2403.09919 (2024) - Mukul Gagrani, Raghavv Goel, Wonseok Jeon, Junyoung Park, Mingu Lee, Christopher Lott:
On Speculative Decoding for Multimodal Large Language Models. CoRR abs/2404.08856 (2024) - Chen Zhang, Zhuorui Liu, Dawei Song:
Beyond the Speculative Game: A Survey of Speculative Execution in Large Language Models. CoRR abs/2404.14897 (2024) - Yunsheng Ni, Chuanjian Liu, Yehui Tang, Kai Han, Yunhe Wang:
EMS-SD: Efficient Multi-sample Speculative Decoding for Accelerating Large Language Models. CoRR abs/2405.07542 (2024) - Nadav Timor, Jonathan Mamou, Daniel Korat, Moshe Berchansky, Oren Pereg, Moshe Wasserblat, Tomer Galanti, Michal Gordon, David Harel:
Distributed Speculative Inference of Large Language Models. CoRR abs/2405.14105 (2024) - Xiaoxuan Liu, Cade Daniel, Langxiang Hu, Woosuk Kwon, Zhuohan Li, Xiangxi Mo, Alvin Cheung, Zhijie Deng, Ion Stoica, Hao Zhang:
Optimizing Speculative Decoding for Serving Large Language Models Using Goodput. CoRR abs/2406.14066 (2024) - Parsa Kavehzadeh, Mohammadreza Pourreza, Mojtaba Valipour, Tinashu Zhu, Haoli Bai, Ali Ghodsi, Boxing Chen, Mehdi Rezagholizadeh:
S2D: Sorted Speculative Decoding For More Efficient Deployment of Nested Large Language Models. CoRR abs/2407.01955 (2024) - Bolaji Yusuf, Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran:
Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models. CoRR abs/2407.04641 (2024) - Zongyue Qin, Ziniu Hu, Zifan He, Neha Prakriya, Jason Cong, Yizhou Sun:
Multi-Token Joint Speculative Decoding for Accelerating Large Language Model Inference. CoRR abs/2407.09722 (2024) - Jacob K. Christopher, Brian R. Bartoldson, Bhavya Kailkhura, Ferdinando Fioretto:
Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion. CoRR abs/2408.05636 (2024) - 2023
- Najeeb Gambo Abdulhamid, Millicent Ochieng, Kalika Bali, Elizabeth A. Ankrah, Naveena Karusala, Keshet Ronen, Jacki O'Neill:
Can Large Language Models Support Medical Facilitation Work? A Speculative Analysis. AfriCHI 2023: 64-70 - Charlie Chen, Sebastian Borgeaud, Geoffrey Irving, Jean-Baptiste Lespiau, Laurent Sifre, John Jumper:
Accelerating Large Language Model Decoding with Speculative Sampling. CoRR abs/2302.01318 (2023) - Jun Zhang, Jue Wang, Huan Li, Lidan Shou, Ke Chen, Gang Chen, Sharad Mehrotra:
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding. CoRR abs/2309.08168 (2023) - Qidong Su, Christina Giannoula, Gennady Pekhimenko:
The Synergy of Speculative Decoding and Batching in Serving Large Language Models. CoRR abs/2310.18813 (2023) - 2021
- Jacob Kreindl, Daniele Bonetta, Lukas Stadler, David Leopoldseder, Hanspeter Mössenböck:
Low-overhead multi-language dynamic taint analysis on managed runtimes through speculative optimization. MPLR 2021: 70-87 - 2014
- Ben Kovitz, Jerry Swan:
Structural stigmergy: a speculative pattern language for metaheuristics. GECCO (Companion) 2014: 1407-1410 - Matthew Le, Matthew Fluet:
Combining Shared State with Speculative Parallelism in a Functional Language. IFL 2014: 2:1-2:10 - 2013
- Guillermo Moncecchi:
Recognizing Speculative Language in Research Texts. (Détection du langage spéculatif dans la littérature scientifique). Paris West University Nanterre La Défense, France, 2013 - 2012
- Guillermo Moncecchi, Jean-Luc Minel, Dina Wonsever:
Improving Speculative Language Detection using Linguistic Knowledge. ExProM@ACL 2012: 37-46 - Hui Yang, Anne N. De Roeck, Vincenzo Gervasi, Alistair Willis, Bashar Nuseibeh:
Speculative requirements: Automatic detection of uncertainty in natural language requirements. RE 2012: 11-20 - 2011
- Benjamin Thielmann, Jens Huthmann, Andreas Koch:
Evaluation of speculative execution techniques for high-level language to hardware compilation. ReCoSoC 2011: 1-8 - 2010
- Andreas Vlachos, Mark Craven:
Detecting Speculative Language Using Syntactic Dependencies and Logistic Regression. CoNLL Shared Task 2010: 18-25 - 2008
- Halil Kilicoglu, Sabine Bergler:
Recognizing speculative language in biomedical research articles: a linguistically motivated perspective. BMC Bioinform. 9(S-11) (2008) - Halil Kilicoglu, Sabine Bergler:
Recognizing Speculative Language in Biomedical Research Articles: A Linguistically Motivated Perspective. BioNLP 2008: 46-53 - 2005
- Alberto de la Encina, Ismael Rodríguez, Fernando Rubio:
Testing Speculative Work in a Lazy/Eager Parallel Functional Language. LCPC 2005: 274-288
skipping 8 more matches
loading more results
failed to load more results, please try again later
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
retrieved on 2024-09-26 01:48 CEST from data curated by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint