Остановите войну!
for scientists:
default search action
Search dblp
Full-text search
- > Home
Please enter a search query
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal" - exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics" - boolean and: separate words by space
e.g., codd model - boolean or: connect words by pipe symbol (|)
e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search result, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
Author search results
no matches
Venue search results
no matches
Refine list
refine by author
- no options
- temporarily not available
refine by venue
- no options
- temporarily not available
refine by type
- no options
- temporarily not available
refine by access
- no options
- temporarily not available
refine by year
- no options
- temporarily not available
Publication search results
found 117 matches
- 2024
- Iria de-Dios-Flores, Silvia Paniagua Suárez, Cristina Carbajal-Pérez, Daniel Bardanca Outeiriño, Marcos García, Pablo Gamallo:
CorpusNÓS: A massive Galician corpus for training large language models. PROPOR 2024: 593-599 - Jiafeng Guo, Changjiang Zhou, Ruqing Zhang, Jiangui Chen, Maarten de Rijke, Yixing Fan, Xueqi Cheng:
CorpusBrain++: A Continual Generative Pre-Training Framework for Knowledge-Intensive Language Tasks. CoRR abs/2402.16767 (2024) - 2023
- Yolanda Blanco-Fernández, Alberto Gil-Solla, José Juan Pazos-Arias, Diego Quisi-Peralta:
Automatically Assembling a Custom-Built Training Corpus for Improving the Learning of In-Domain Word/Document Embeddings. Informatica 34(3): 491-527 (2023) - Guoxin Yu, Lemao Liu, Haiyun Jiang, Shuming Shi, Xiang Ao:
Making Better Use of Training Corpus: Retrieval-based Aspect Sentiment Triplet Extraction via Label Interpolation. ACL (Findings) 2023: 4914-4927 - Antoni Oliver González, Sergi Alvarez:
Filtering and rescoring the CCMatrix corpus for Neural Machine Translation training. EAMT 2023: 39-45 - Zilin Wang, Peng Liu, Jun Chen, Sipan Li, Jinfeng Bai, Gang He, Zhiyong Wu, Helen Meng:
A Synthetic Corpus Generation Method for Neural Vocoder Training. ICASSP 2023: 1-5 - Han Xie, Da Zheng, Jun Ma, Houyu Zhang, Vassilis N. Ioannidis, Xiang Song, Qing Ping, Sheng Wang, Carl Yang, Yi Xu, Belinda Zeng, Trishul Chilimbi:
Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications. KDD 2023: 5270-5281 - Alek Keersmaekers, Wouter Mercelis, Toon Van Hal:
Word Sense Disambiguation for Ancient Greek: Sourcing a training corpus through translation alignment. ALP@RANLP 2023: 148-159 - Richard Kimera, Daniela N. Rim, Heeyoul Choi:
Building a Parallel Corpus and Training Translation Models Between Luganda and English. CoRR abs/2301.02773 (2023) - Jiaxin Wen, Hao Zhou, Minlie Huang:
Re3Dial: Retrieve, Reorganize and Rescale Dialogue Corpus for Long-Turn Open-Domain Dialogue Pre-training. CoRR abs/2305.02606 (2023) - Han Xie, Da Zheng, Jun Ma, Houyu Zhang, Vassilis N. Ioannidis, Xiang Song, Qing Ping, Sheng Wang, Carl Yang, Yi Xu, Belinda Zeng, Trishul Chilimbi:
Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications. CoRR abs/2306.02592 (2023) - Khushi Bhardwaj, Raj Sanjay Shah, Sashank Varma:
Pre-training LLMs using human-like development data corpus. CoRR abs/2311.04666 (2023) - 2022
- Nontakan Nuntachit, Prompong Sugunnasil:
Do We Need a Specific Corpus and Multiple High-Performance GPUs for Training the BERT Model? An Experiment on COVID-19 Dataset. Mach. Learn. Knowl. Extr. 4(3): 641-664 (2022) - Luyu Gao, Jamie Callan:
Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval. ACL (1) 2022: 2843-2853 - Adam Nik, Ge Zhang, Xingran Chen, Mingyu Li, Jie Fu:
1Cademy @ Causal News Corpus 2022: Leveraging Self-Training in Causality Classification of Socio-Political Event Data. CASE@EMNLP 2022: 91-99 - Jun Liu, Yihaoran Ning, Yuanyu Fang, Luxuan Zhuang, Zhuohan Yu, Tingkun Wu:
A Corpus-Based Sampling to Build Training Data Set for Extracting Japanese Sentence Pattern. ICEIT 2022: 123-128 - Yen-Ting Lee, Cheng-Te Li, Shou-De Lin:
Conditional Sentence Rephrasing without Parallel Training Corpus. ICME Workshops 2022: 1 - Yang Li, Tong Nie, Yucheng Shao, Zesheng Zhu, Tian Yang, Shengnan Zhang, Ze Zhu, Fang Zhang:
A Real-Time Voice Conversion Method Based on A Non-Parallel Corpus for Training. ICNCC 2022: 313-320 - Tomasz Rutowski, Amir Harati, Elizabeth Shriberg, Yang Lu, Piotr Chlebek, Ricardo Oliveira:
Toward Corpus Size Requirements for Training and Evaluating Depression Risk Models Using Spoken Language. INTERSPEECH 2022: 3343-3347 - Ziyao Zhang, Alessio Falai, Ariadna Sánchez, Orazio Angelini, Kayoko Yanagisawa:
Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS). INTERSPEECH 2022: 2353-2357 - Per Egil Kummervold, Freddy Wetjen, Javier de la Rosa:
The Norwegian Colossal Corpus: A Text Corpus for Training Large Norwegian Language Models. LREC 2022: 3852-3860 - Kurt Micallef, Albert Gatt, Marc Tanti, Lonneke van der Plas, Claudia Borg:
Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese. CoRR abs/2205.10517 (2022) - Ziyao Zhang, Alessio Falai, Ariadna Sánchez, Orazio Angelini, Kayoko Yanagisawa:
Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS). CoRR abs/2207.01507 (2022) - Adam Nik, Ge Zhang, Xingran Chen, Mingyu Li, Jie Fu:
1Cademy @ Causal News Corpus 2022: Leveraging Self-Training in Causality Classification of Socio-Political Event Data. CoRR abs/2211.02729 (2022) - Chris Sanchez, Zheyuan Zhang:
The Effects of In-domain Corpus Size on pre-training BERT. CoRR abs/2212.07914 (2022) - 2021
- Linqing Chen, Junhui Li, Zhengxian Gong, Boxing Chen, Weihua Luo, Min Zhang, Guodong Zhou:
Breaking the Corpus Bottleneck for Context-Aware Neural Machine Translation with Cross-Task Pre-training. ACL/IJCNLP (1) 2021: 2851-2861 - Yun Hu, Yeshuang Zhu, Jinchao Zhang, Changwen Zheng, Jie Zhou:
Toward Fully Exploiting Heterogeneous Corpus: A Decoupled Named Entity Recognition Model with Two-stage Training. ACL/IJCNLP (Findings) 2021: 1641-1652 - Walter Gerych, Harrison Kim, Joshua DeOliveira, MaryClare Martin, Luke Buquicchio, Kavin Chandrasekaran, Abdulaziz Alajaji, Hamid Mansoor, Elke A. Rundensteiner, Emmanuel Agu:
GAN for Generating User-Specific Human Activity Data From An Incomplete Training Corpus. IEEE BigData 2021: 4705-4714 - Costanza Conforti, Jakob Berndt, Marco Basaldella, Mohammad Taher Pilehvar, Chryssi Giannitsarou, Flavio Toxvaerd, Nigel Collier:
Adversarial Training for News Stance Detection: Leveraging Signals from a Multi-Genre Corpus. EACL (Hackashop) 2021: 1-7 - Oshin Agarwal, Heming Ge, Siamak Shakeri, Rami Al-Rfou:
Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training. NAACL-HLT 2021: 3554-3565
skipping 87 more matches
loading more results
failed to load more results, please try again later
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
retrieved on 2024-03-29 09:20 CET from data curated by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint