- 2018-07-13: ORCID state of dblp
Since more than a year now, dblp has intensified its efforts to link dblp bibliographies to ORCIDs used by that author. ORCID information are now added regularly to the dblp data set. The primary sources for ORCIDs are: first, the annual ORCID open data dump and, second, metadata directly provided by publishers who have started to increasingly label author signatures with ORCID information. Neither of those data sources are free of errors and data hick-ups, so we are still manually cleaning the ORCID data prior to adding them to the corpus. But overall, ORCIDs have helped us to correct numerous cases of homonymous and synonymous bibliographies in dblp, so it is absolutely worth our time.
Since the past big ORCID update from last week (as of July 11, 2018), you can now find in dblp:
- 806,744 ORCID author signatures (i.e., author-publication pairs), that is 6.5% of all author signatures in the dataset
- 165,752 bibliographies that are linked to ORCIDs (either manually or implicit)
- 24,357 bibliographies with ORCIDs manually verified by the dblp team
If you look at computer science publications in dblp that have been published in the recent years, you will find that the fraction of author signatures with ORCID in dblp has climbed to now about 10.9% off all signatures in 2018. Of course, year 2018 is not done yet, and we are continuously working on improving the coverage further among all publications, regardless of the year of publication. The number of signatures and coverage for publication in recent years is:
- 2014: 892,240 signatures (8.2% coverage)
- 2015: 930,746 signatures (7.7% coverage)
- 2016: 984,559 signatures (6.7% coverage)
- 2017: 1,042,103 signatures (5.6% coverage)
- 2018: 439,350 signatures (10.9% coverage)
The oldest publication with an ORCID is from 1961. Because of the small number of publications in dblp from that time, the ORCID coverage for 1961 is a remarkable 0.12%.
- 2018-04-09: Historical dblp file is now available.
A historical version of dblp (called hdblp) is now available at zenodo. hdblp contains historical revisions of all metadata records in dblp. The file can be used to reconstruct dblp for each day between June 1999 and March 2018.
- 2018-02-27: dblp via HTTPS
Finally, all dblp servers now support HTTPS. We will gradually move towards exclusively using HTTPS in the future, so please feel free to update your bookmarks.
- 2017-12-22: 4 million publications!
We are happy to celebrate more than 4,000,000 million computer science publications in the dblp computer science bibliography!
- 2017-09-15: The first 700k: dblp starts integrating ORCIDs
ORCID (http://orcid.org) is a widely used persistent identifier scheme for researchers. As of September 2017, there are more than 3.8 million registered ORCIDs of which about 800,000 have at least one publication listed in the public ORCID corpus. Many publishers will now ask their authors to provide an ORCID when submitting or publishing a paper. We encourage all researchers to register and map their ORCID with their publications.
A few weeks ago, we started first experiments with integrating ORCIDs into dblp. At the moment, there are about 700,000 signatures (i.e., author-publication pairs) in dblp for which we know an ORCID. This is about 6% of all signatures. We expected ORCIDs to help us identify authors and create clear, unambiguous author bibliographies (see How does dblp handle homonyms and synonyms). We were not disappointed. For the initial import, we found in our corpus
- 600 cases where an author profile was related to more than one ORCID. This indicates that the profile actually lists publications from different authors.
- 5000 cases where the same ORCID appears in more than one author profile.
We are currently processing these cases. While we found several cases where ORCID information is wrong (e.g., authors accidentally claiming publications that where written by someone else), the data is very reliable in general. However, according to our philosophy, manual confirmation is needed. This will take some time.
A detailed description on ORCIDs on the dblp web interface and in the data dump can be found at in our F.A.Q.s.
We will keep you posted on development.
- 2017-09-05: Further modifications to dblp's XML format
With the latest release of the dblp XML data dump, we made some extensions to the document type definition (DTD) of the XML format. The goal of these changes is to lay the groundwork for a number of planned extensions of the dblp data model that will be realized at a later date.
These changes will only affect you if you use:
- the daily XML data dump, available at http://dblp.org/xml/,
- any future snapshot releases of the XML dump, available at http://dblp.org/xml/release/, or
- the live XML web API, e.g. http://dblp.org/rec/xml/journals/pvldb/Ley09.xml
Regular users of the web site or other export formats – like the BibTeX export – are not affected.
As always, the latest version of the DTD can be found at:
All upcoming as well as earlier versions of the DTD file will be preserved along with the persistent snapshot releases at:
A brief summary of the recent changes is available at:
Detailed Information on the newly added attributes can be found in the F.A.Q.:
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal"
- exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics"
- boolean and: separate words by space
e.g., codd model
- boolean or: connect words by pipe symbol (|)
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search result, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
The full-text search results are provided via , courtesy of AI2.
found ?? matches
- service temporarily not available