default search action
David M. Mimno
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2021
- [j12]Maria Antoniak, Melanie Walsh, David Mimno:
Tags, Borders, and Catalogs: Social Re-Working of Genre on LibraryThing. Proc. ACM Hum. Comput. Interact. 5(CSCW1): 29:1-29:29 (2021) - 2020
- [j11]Dong Nguyen, Maria Liakata, Simon DeDeo, Jacob Eisenstein, David Mimno, Rebekah Tromble, Jane Winters:
How We Do Things With Words: Analyzing Text as Social and Cultural Data. Frontiers Artif. Intell. 3: 62 (2020) - 2019
- [j10]Maria Antoniak, David Mimno, Karen Levy:
Narrative Paths and Negotiation of Power in Birth Stories. Proc. ACM Hum. Comput. Interact. 3(CSCW): 88:1-88:27 (2019) - [j9]Taygun Kekeç, David Mimno, David M. J. Tax:
Boosted negative sampling by quadratically constrained entropy maximization. Pattern Recognit. Lett. 125: 310-317 (2019) - 2018
- [j8]Sanjeev Arora, Rong Ge, Yoni Halpern, David M. Mimno, Ankur Moitra, David A. Sontag, Yichen Wu, Michael Zhu:
Learning topic models - provably and efficiently. Commun. ACM 61(4): 85-93 (2018) - [j7]Maria Antoniak, David Mimno:
Evaluating the Stability of Embedding-based Word Similarities. Trans. Assoc. Comput. Linguistics 6: 107-119 (2018) - 2017
- [j6]Jordan L. Boyd-Graber, Yuening Hu, David M. Mimno:
Applications of Topic Models. Found. Trends Inf. Retr. 11(2-3): 143-296 (2017) - [j5]Eric P. S. Baumer, David M. Mimno, Shion Guha, Emily Quan, Geri K. Gay:
Comparing grounded theory and topic modeling: Extreme divergence or unlikely convergence? J. Assoc. Inf. Sci. Technol. 68(6): 1397-1410 (2017) - 2016
- [j4]Alexandra Schofield, David M. Mimno:
Comparing Apples to Apple: The Effects of Stemmers on Topic Models. Trans. Assoc. Comput. Linguistics 4: 287-300 (2016) - 2012
- [j3]David M. Mimno:
Computational historiography: Data mining in a century of classics journals. ACM Journal on Computing and Cultural Heritage 5(1): 3:1-3:19 (2012) - 2009
- [j2]Gregory R. Crane, Alison Babeu, David Bamman, Thomas M. Breuel, Lisa Cerrato, Daniel Deckers, Anke Lüdeling, David M. Mimno, Rashmi Singhal, David A. Smith, Amir Zeldes:
Classics in the Million Book Library. Digit. Humanit. Q. 3(1) (2009) - 2005
- [j1]David M. Mimno, Gregory R. Crane, Alison Jones:
Hierarchical Catalog Records: Implementing a FRBR Catalog. D Lib Mag. 11(10) (2005)
Conference and Workshop Papers
- 2024
- [c54]Marianne Aubin Le Quéré, Hope Schroeder, Casey Randazzo, Jie Gao, Ziv Epstein, Simon Tangi Perrault, David Mimno, Louise Barkhuus, Hanlin Li:
LLMs as Research Tools: Applications and Evaluations in HCI Data Work. CHI Extended Abstracts 2024: 479:1-479:7 - [c53]Hamed Rahimi, David Mimno, Jacob Louis Hoover, Hubert Naacke, Camélia Constantin, Bernd Amann:
Contextualized Topic Coherence Metrics. EACL (Findings) 2024: 1760-1773 - [c52]LeAnn McDowall, Maria Antoniak, David Mimno:
Sensemaking about Contraceptive Methods across Online Platforms. ICWSM 2024: 1041-1053 - [c51]Shayne Longpre, Gregory Yauney, Emily Reif, Katherine Lee, Adam Roberts, Barret Zoph, Denny Zhou, Jason Wei, Kevin Robinson, David Mimno, Daphne Ippolito:
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity. NAACL-HLT 2024: 3245-3276 - 2023
- [c50]Rebecca M. M. Hicke, David Mimno:
T5 meets Tybalt: Author Attribution in Early Modern English Drama Using Large Language Models. CHR 2023: 274-302 - [c49]Lyra D'Souza, David Mimno:
The Chatbot and the Canon: Poetry Memorization in LLMs. CHR 2023: 475-489 - [c48]Rosamond Elizabeth Thalken, Matthew Wilkens, David Mimno:
Large Language Models and NER: better results with less work. DH 2023 - [c47]Andrea W. Wen-Yi, David Mimno:
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings. EMNLP 2023: 1124-1131 - [c46]Rosamond Elizabeth Thalken, Edward H. Stiglitz, David Mimno, Matthew Wilkens:
Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement. EMNLP 2023: 9252-9265 - [c45]Gregory Yauney, Emily Reif, David Mimno:
Data Similarity is Not Enough to Explain Language Model Performance. EMNLP 2023: 11295-11304 - 2021
- [c44]Maria Antoniak, David Mimno:
Bad Seeds: Evaluating Lexical Methods for Bias Measurement. ACL/IJCNLP (1) 2021: 1889-1904 - [c43]Gregory Yauney, David Mimno:
Comparing Text Representations: A Theory-Driven Approach. EMNLP (1) 2021: 5527-5539 - [c42]Moontae Lee, Sungjun Cho, Kun Dong, David Mimno, David Bindel:
On-the-fly Rectification for Robust Large-Vocabulary Topic Inference. ICML 2021: 6087-6097 - 2020
- [c41]Moontae Lee, David Bindel, David Mimno:
Prior-aware Composition Inference for Spectral Topic Models. AISTATS 2020: 4258-4268 - [c40]Christof Schöch, Karina van Dalen-Oskam, Maria Antoniak, Fotis Jannidis, David Mimno:
Replication and Computational Literary Studies. DH 2020 - [c39]Laure Thompson, David Mimno:
Constructing and Analyzing Short Science Fiction at Scale. DH 2020 - [c38]Gregory Yauney, David Mimno:
Network Analysis Finds Shifts in the History of Modern Architecture. DH 2020 - [c37]Gregory Yauney, Jack Hessel, David Mimno:
Domain-Specific Lexical Grounding in Noisy Visual-Textual Documents. EMNLP (1) 2020: 2039-2045 - 2019
- [c36]Jack Hessel, Lillian Lee, David Mimno:
Unsupervised Discovery of Multimodal Links in Multi-image, Multi-sentence Documents. EMNLP/IJCNLP (1) 2019: 2034-2045 - [c35]Moontae Lee, Sungjun Cho, David Bindel, David Mimno:
Practical Correlated Topic Modeling and Analysis via the Rectified Anchor Word Algorithm. EMNLP/IJCNLP (1) 2019: 4990-5000 - 2018
- [c34]Laure Thompson, David Mimno:
Authorless Topic Models: Biasing Models Away from Known Structure. COLING 2018: 3903-3914 - [c33]Jack Hessel, David M. Mimno, Lillian Lee:
Quantifying the Visual Concreteness of Words and Topics in Multimodal Datasets. NAACL-HLT 2018: 2194-2205 - 2017
- [c32]Alexandra Schofield, Måns Magnusson, David M. Mimno:
Pulling Out the Stops: Rethinking Stopword Removal for Topic Models. EACL (2) 2017: 432-436 - [c31]Alexandra Schofield, Laure Thompson, David M. Mimno:
Quantifying the Effects of Text Duplication on Semantic Models. EMNLP 2017: 2737-2747 - [c30]David M. Mimno, Laure Thompson:
The strange geometry of skip-gram with negative sampling. EMNLP 2017: 2873-2878 - [c29]Jack Hessel, Lillian Lee, David M. Mimno:
Cats and Captions vs. Creators and the Clock: Comparing Multimodal Content to Context in Predicting Relative Popularity. WWW 2017: 927-936 - 2016
- [c28]Michael J. Muller, Shion Guha, Eric P. S. Baumer, David M. Mimno, N. Sadat Shami:
Machine Learning and Grounded Theory Method: Convergence, Divergence, and Combination. GROUP 2016: 3-8 - [c27]Moontae Lee, Seok Hyun Jin, David M. Mimno:
Beyond Exchangeability: The Chinese Voting Process. NIPS 2016: 4934-4942 - 2015
- [c26]Tobias Schnabel, Igor Labutov, David M. Mimno, Thorsten Joachims:
Evaluation methods for unsupervised word embeddings. EMNLP 2015: 298-307 - [c25]Moontae Lee, David Bindel, David M. Mimno:
Robust Spectral Inference for Joint Stochastic Matrix Factorization. NIPS 2015: 2710-2718 - 2014
- [c24]David Mimno, Peter M. Broadwell, Timothy R. Tangherlini:
The Telltale Hat: LDA and Classification Problems in a Large Folklore Corpus. DH 2014 - [c23]David M. Mimno, Moontae Lee:
Low-dimensional Embeddings for Interpretable Anchor-based Topic Inference. EMNLP 2014: 1319-1328 - 2013
- [c22]Sanjeev Arora, Rong Ge, Yonatan Halpern, David M. Mimno, Ankur Moitra, David A. Sontag, Yichen Wu, Michael Zhu:
A Practical Algorithm for Topic Modeling with Provable Guarantees. ICML (2) 2013: 280-288 - 2012
- [c21]Robert K. Nelson, David Mimno, Travis Brown:
Topic Modeling the Past. DH 2012: 64-69 - [c20]David M. Mimno, Matthew D. Hoffman, David M. Blei:
Sparse stochastic inference for latent Dirichlet allocation. ICML 2012 - [c19]Anton Bakalov, Andrew McCallum, Hanna M. Wallach, David M. Mimno:
Topic models for taxonomies. JCDL 2012: 237-240 - [c18]Prem Gopalan, David M. Mimno, Sean Gerrish, Michael J. Freedman, David M. Blei:
Scalable Inference of Overlapping Communities. NIPS 2012: 2258-2266 - 2011
- [c17]David M. Mimno, David M. Blei:
Bayesian Checking for Topic Models. EMNLP 2011: 227-237 - [c16]David M. Mimno, Hanna M. Wallach, Edmund M. Talley, Miriam Leenders, Andrew McCallum:
Optimizing Semantic Coherence in Topic Models. EMNLP 2011: 262-272 - [c15]David M. Mimno:
Reconstructing Pompeian Households. UAI 2011: 506-513 - 2009
- [c14]David M. Mimno, Hanna M. Wallach, Jason Naradowsky, David A. Smith, Andrew McCallum:
Polylingual Topic Models. EMNLP 2009: 880-889 - [c13]Hanna M. Wallach, Iain Murray, Ruslan Salakhutdinov, David M. Mimno:
Evaluation methods for topic models. ICML 2009: 1105-1112 - [c12]Limin Yao, David M. Mimno, Andrew McCallum:
Efficient methods for topic model inference on streaming document collections. KDD 2009: 937-946 - [c11]Hanna M. Wallach, David M. Mimno, Andrew McCallum:
Rethinking LDA: Why Priors Matter. NIPS 2009: 1973-1981 - 2008
- [c10]Rebecca Reznik-Zellen, Bob Stevens, Michael Thorn, Jeff Morse, Mark D. Smucker, James Allan, David M. Mimno, Andrew McCallum, Mark Tuominen:
InterNano: e-Science for the Nanomanufacturing Community. eScience 2008: 382-383 - [c9]David M. Mimno, Andrew McCallum:
Topic Models Conditioned on Arbitrary Features with Dirichlet-multinomial Regression. UAI 2008: 411-418 - 2007
- [c8]David M. Mimno, Wei Li, Andrew McCallum:
Mixtures of hierarchical topics with Pachinko allocation. ICML 2007: 633-640 - [c7]David M. Mimno, Andrew McCallum:
Mining a digital library for influential authors. JCDL 2007: 105-106 - [c6]David M. Mimno, Andrew McCallum:
Organizing the OCA: learning faceted subjects from a library of digital books. JCDL 2007: 376-385 - [c5]David M. Mimno, Andrew McCallum:
Expertise modeling for matching papers with reviewers. KDD 2007: 500-509 - 2006
- [c4]Gregory R. Crane, David Bamman, Lisa Cerrato, Alison Jones, David M. Mimno, Adrian Packel, David Sculley, Gabriel Weaver:
Beyond Digital Incunabula: Modeling the Next Generation of Digital Libraries. ECDL 2006: 353-366 - [c3]Gideon S. Mann, David M. Mimno, Andrew McCallum:
Bibliometric impact measures leveraging topic analysis. JCDL 2006: 65-74 - 2005
- [c2]David M. Mimno, Alison Jones, Gregory R. Crane:
Finding a catalog: generating analytical catalog records from well-structured digital texts. JCDL 2005: 271-280 - 2004
- [c1]M. S. Patton, David M. Mimno:
Services for a customizable authority linking environment. JCDL 2004: 420
Reference Works
- 2014
- [r1]Jordan L. Boyd-Graber, David M. Mimno, David Newman:
Care and Feeding of Topic Models. Handbook of Mixed Membership Models and Their Applications 2014: 225-254
Informal and Other Publications
- 2024
- [i32]Maria Antoniak, David Mimno, Rosamond Elizabeth Thalken, Melanie Walsh, Matthew Wilkens, Gregory Yauney:
The Afterlives of Shakespeare and Company in Online Social Readership. CoRR abs/2401.07340 (2024) - [i31]Rebecca M. M. Hicke, David Mimno:
[Lions: 1] and [Tigers: 2] and [Bears: 3], Oh My! Literary Coreference Annotation with LLMs. CoRR abs/2401.17922 (2024) - [i30]Gregory Yauney, David Mimno:
Stronger Random Baselines for In-Context Learning. CoRR abs/2404.13020 (2024) - [i29]Andrea W. Wen-Yi, Unso Eun Seo Jo, Lu Jia Lin, David Mimno:
How Chinese are Chinese Language Models? The Puzzling Lack of Language Policy in China's LLMs. CoRR abs/2407.09652 (2024) - [i28]Andrea W. Wen-Yi, Kathryn Adamson, Nathalie Greenfield, Rachel Goldberg, Sandra Babcock, David Mimno, Allison Koenecke:
Automate or Assist? The Role of Computational Models in Identifying Gendered Discourse in US Capital Trial Transcripts. CoRR abs/2407.12500 (2024) - 2023
- [i27]LeAnn McDowall, Maria Antoniak, David Mimno:
Sensemaking About Contraceptive Methods Across Online Platforms. CoRR abs/2301.09295 (2023) - [i26]Shayne Longpre, Gregory Yauney, Emily Reif, Katherine Lee, Adam Roberts, Barret Zoph, Denny Zhou, Jason Wei, Kevin Robinson, David Mimno, Daphne Ippolito:
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity. CoRR abs/2305.13169 (2023) - [i25]Hamed Rahimi, Jacob Louis Hoover, David Mimno, Hubert Naacke, Camélia Constantin, Bernd Amann:
Contextualized Topic Coherence Metrics. CoRR abs/2305.14587 (2023) - [i24]Rosamond Elizabeth Thalken, Edward H. Stiglitz, David Mimno, Matthew Wilkens:
Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement. CoRR abs/2310.18440 (2023) - [i23]Rebecca M. M. Hicke, David Mimno:
T5 meets Tybalt: Author Attribution in Early Modern English Drama Using Large Language Models. CoRR abs/2310.18454 (2023) - [i22]A. Feder Cooper, Katherine Lee, James Grimmelmann, Daphne Ippolito, Christopher Callison-Burch, Christopher A. Choquette-Choo, Niloofar Mireshghallah, Miles Brundage, David Mimno, Madiha Zahrah Choksi, Jack M. Balkin, Nicholas Carlini, Christopher De Sa, Jonathan Frankle, Deep Ganguli, Bryant Gipson, Andres Guadamuz, Swee Leng Harris, Abigail Z. Jacobs, Elizabeth Joh, Gautam Kamath, Mark Lemley, Cass Matthews, Christine McLeavey, Corynne McSherry, Milad Nasr, Paul Ohm, Adam Roberts, Tom Rubin, Pamela Samuelson, Ludwig Schubert, Kristen Vaccaro, Luis Villa, Felix Wu, Elana Zeide:
Report of the 1st Workshop on Generative AI and Law. CoRR abs/2311.06477 (2023) - [i21]Gregory Yauney, Emily Reif, David Mimno:
Data Similarity is Not Enough to Explain Language Model Performance. CoRR abs/2311.09006 (2023) - [i20]Andrea W. Wen-Yi, David Mimno:
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings. CoRR abs/2311.18034 (2023) - 2022
- [i19]Jacob Eisenstein, Daniel Andor, Bernd Bohnet, Michael Collins, David Mimno:
Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model. CoRR abs/2210.02498 (2022) - [i18]Siddhartha Brahma, Polina Zablotskaia, David Mimno:
Breaking BERT: Evaluating and Optimizing Sparsified Attention. CoRR abs/2210.03841 (2022) - 2021
- [i17]Gregory Yauney, David Mimno:
Comparing Text Representations: A Theory-Driven Approach. CoRR abs/2109.07458 (2021) - [i16]A. Feder Cooper, Maria Antoniak, Christopher De Sa, Marilyn Migiel, David Mimno:
Tecnologica cosa: Modeling Storyteller Personalities in Boccaccio's Decameron. CoRR abs/2109.10506 (2021) - [i15]Moontae Lee, Sungjun Cho, Kun Dong, David Mimno, David Bindel:
On-the-Fly Rectification for Robust Large-Vocabulary Topic Inference. CoRR abs/2111.06580 (2021) - 2020
- [i14]Laure Thompson, David Mimno:
Topic Modeling with Contextualized Word Representation Clusters. CoRR abs/2010.12626 (2020) - [i13]Gregory Yauney, Jack Hessel, David Mimno:
Domain-Specific Lexical Grounding in Noisy Visual-Textual Documents. CoRR abs/2010.16363 (2020) - 2019
- [i12]Jack Hessel, Lillian Lee, David Mimno:
Unsupervised Discovery of Multimodal Links in Multi-Image, Multi-Sentence Documents. CoRR abs/1904.07826 (2019) - [i11]Dong Nguyen, Maria Liakata, Simon DeDeo, Jacob Eisenstein, David Mimno, Rebekah Tromble, Jane Winters:
How we do things with words: Analyzing text as social and cultural data. CoRR abs/1907.01468 (2019) - 2018
- [i10]Jack Hessel, David M. Mimno, Lillian Lee:
Quantifying the visual concreteness of words and topics in multimodal datasets. CoRR abs/1804.06786 (2018) - 2017
- [i9]Jack Hessel, Lillian Lee, David M. Mimno:
Cats and Captions vs. Creators and the Clock: Comparing Multimodal Content to Context in Predicting Relative Popularity. CoRR abs/1703.01725 (2017) - [i8]Moontae Lee, David M. Mimno:
Low-dimensional Embeddings for Interpretable Anchor-based Topic Inference. CoRR abs/1711.06826 (2017) - [i7]Moontae Lee, David Bindel, David M. Mimno:
Prior-aware Dual Decomposition: Document-specific Topic Inference for Spectral Topic Models. CoRR abs/1711.07065 (2017) - 2016
- [i6]Moontae Lee, Seok Hyun Jin, David M. Mimno:
Beyond Exchangeability: The Chinese Voting Process. CoRR abs/1610.09428 (2016) - [i5]Moontae Lee, David Bindel, David M. Mimno:
Robust Spectral Inference for Joint Stochastic Matrix Factorization. CoRR abs/1611.00175 (2016) - 2015
- [i4]Jack Hessel, Alexandra Schofield, Lillian Lee, David M. Mimno:
What do Vegans do in their Spare Time? Latent Interest Detection in Multi-Community Networks. CoRR abs/1511.03371 (2015) - 2012
- [i3]David M. Mimno:
Reconstructing Pompeian Households. CoRR abs/1202.3747 (2012) - [i2]David M. Mimno, Andrew McCallum:
Topic Models Conditioned on Arbitrary Features with Dirichlet-multinomial Regression. CoRR abs/1206.3278 (2012) - [i1]Sanjeev Arora, Rong Ge, Yoni Halpern, David M. Mimno, Ankur Moitra, David A. Sontag, Yichen Wu, Michael Zhu:
A Practical Algorithm for Topic Modeling with Provable Guarantees. CoRR abs/1212.4777 (2012)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:09 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint