{"id":245,"date":"2026-03-25T13:35:37","date_gmt":"2026-03-25T13:35:37","guid":{"rendered":"https:\/\/yom-tov.info\/blog\/?p=245"},"modified":"2026-03-25T13:39:27","modified_gmt":"2026-03-25T13:39:27","slug":"prolific-authors-and-why-theyre-a-problem","status":"publish","type":"post","link":"https:\/\/yom-tov.info\/blog\/2026\/03\/25\/prolific-authors-and-why-theyre-a-problem\/","title":{"rendered":"Prolific authors and why they\u2019re a problem"},"content":{"rendered":"\n<p><em>\u201cIt is a sobering thought that when Mozart was my age, he had been dead for two years.\u201d Tom Lehrer<\/em><\/p>\n\n\n\n<p>If you look back at the papers you\u2019ve published over the past year and you think you had a good year, think again. According to DBLP, 5 authors published <strong>more than one paper every day of last year<\/strong> (that\u2019s more than 365 papers), and 361 authors published over 100 papers a year. I know many scientists who read fewer than 100 papers per year.<\/p>\n\n\n\n<p>The figure below shows the distribution of the number of papers per author for all of DBLP, while the second figure shows the number of papers in AI venues, broadly defined (ICML, NeurIPS, KDD, WWW, etc.).<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"617\" src=\"https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_hist-1024x617.jpg\" alt=\"\" class=\"wp-image-246\" srcset=\"https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_hist-1024x617.jpg 1024w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_hist-300x181.jpg 300w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_hist-768x462.jpg 768w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_hist-1536x925.jpg 1536w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_hist.jpg 1679w\" sizes=\"auto, (max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><figcaption class=\"wp-element-caption\">Figure 1: A histogram of the number of papers per author in DBLP. Note the axes are logarithmic.<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"617\" src=\"https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_AI_hist-1024x617.jpg\" alt=\"\" class=\"wp-image-247\" srcset=\"https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_AI_hist-1024x617.jpg 1024w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_AI_hist-300x181.jpg 300w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_AI_hist-768x462.jpg 768w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_AI_hist-1536x925.jpg 1536w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/authors_per_paper_AI_hist.jpg 1679w\" sizes=\"auto, (max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><figcaption class=\"wp-element-caption\">Figure 2: A histogram of the number of papers per author in AI venues (broadly defined) according to DBLP. Note the axes are logarithmic.<\/figcaption><\/figure>\n\n\n\n<p>Who are these prolific authors? Among the top 50, there are 20 from China and another 6 from Hong Kong, 13 from the USA, 7 Singaporeans, two Australians and 2 from the EU.<\/p>\n\n\n\n<p>If we define prolific authors as those who have published over 500 papers during their career, we can plot the percentage of their papers over the years, compared to the same for all other (less \u201cproductive\u201d) authors. As the figure below shows, for most of the time since 1970 the two groups have looked similar. Then, since around 2020, they\u2019ve began to diverge, and the prolific authors begin to publish much more than the rest of us.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"617\" src=\"https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/publications_over_time-1-1024x617.jpg\" alt=\"\" class=\"wp-image-252\" srcset=\"https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/publications_over_time-1-1024x617.jpg 1024w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/publications_over_time-1-300x181.jpg 300w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/publications_over_time-1-768x462.jpg 768w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/publications_over_time-1-1536x925.jpg 1536w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/publications_over_time-1.jpg 1679w\" sizes=\"auto, (max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><figcaption class=\"wp-element-caption\">Figure 3: Publications over the years, normalized separately for prolific authors and for all other authors<\/figcaption><\/figure>\n\n\n\n<p>Another hint about these authors comes from looking at those who publish most each year, and their tenure at the time. Tenure, in this context, is the number of years since they\u2019ve started publishing. If the population of scientists was constant, we\u2019d expect this tenure to be the same, but my data is finite and there are more and more scientists, so the slope of the tenure is lower. The figure below shows the data for the 100 most prolific authors each year. Over most years the slope of the tenure is around 0.3 (as you can see from the dotted line, which was fit to data up to 2015). Then, around 2018 it diverges, and the tenure rapidly increases.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"617\" src=\"https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/author_tenure_over_years-1024x617.jpg\" alt=\"\" class=\"wp-image-249\" srcset=\"https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/author_tenure_over_years-1024x617.jpg 1024w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/author_tenure_over_years-300x181.jpg 300w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/author_tenure_over_years-768x462.jpg 768w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/author_tenure_over_years-1536x925.jpg 1536w, https:\/\/yom-tov.info\/blog\/wp-content\/uploads\/2026\/03\/author_tenure_over_years.jpg 1679w\" sizes=\"auto, (max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><figcaption class=\"wp-element-caption\">Figure 4: Average tenure of prolific authors over the years<\/figcaption><\/figure>\n\n\n\n<p>My understanding, therefore, is that in the past 8 years or so, senior people have began adding their names to publications in ways that they wouldn\u2019t have before then. This is especially true in countries that reward publications (either with academic accolades or financial rewards).<\/p>\n\n\n\n<p>But this brings me to an interesting fact. According to the widely-adopted <a href=\"https:\/\/www.icmje.org\/recommendations\/browse\/roles-and-responsibilities\/defining-the-role-of-authors-and-contributors.html\">recommendations <\/a>of the International Committee of Medical Journal Editors (ICMJE), \u201cauthorship be based on the following 4 criteria:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Substantial contributions to the conception or design of the work; or the acquisition, analysis, or interpretation of data for the work; AND<\/li>\n\n\n\n<li>Drafting the work or reviewing it critically for important intellectual content; AND<\/li>\n\n\n\n<li>Final approval of the version to be published; AND<\/li>\n\n\n\n<li>Agreement to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.\u201c<\/li>\n<\/ul>\n\n\n\n<p><strong>And all 4 criteria must be met<\/strong>. That means, for example, that being the head of a lab who brought the money to conduct the work is not sufficient to claim authorship. She must also be able to vouch for the accuracy and integrity of the work and to approve its final version.\u00a0<\/p>\n\n\n\n<p>I strongly doubt that someone who has published more than one paper every day of the year can, in good conscience, apply all four criteria to each of the papers.<\/p>\n\n\n\n<p><strong>Why does it matter?<\/strong><\/p>\n\n\n\n<p>First, like it or not, publication counts are used as a signal to decide on academic merit and in some cases on financial reward. Otherwise, why would these people put their names on so many papers? Thus, ignoring the assumption that having one\u2019s name on a paper means that they\u2019ve been involved in the work causes that signal to be meaningless.<\/p>\n\n\n\n<p>Second, as noted above, it can be considered unethical or fraudulent, at least according to ICMJE.<\/p>\n\n\n\n<p><strong>How can we solve this problem? <\/strong><\/p>\n\n\n\n<p>I don\u2019t think there\u2019s one way to do it. Some conferences (e.g., <a href=\"https:\/\/2026.ijcai.org\/ijcai-ecai-2026-call-for-papers-main-track\/\">IJCAI 2026<\/a>) have begun asking authors to pay per submission if the authors submit more than one paper to the conference. This is good but it isn\u2019t going to change much for those who are in rich universities or labs. <\/p>\n\n\n\n<p>Another way that\u2019s widely used is authorship statements, though these are easily abused. <\/p>\n\n\n\n<p>Perhaps large publishers should cap the number of papers that an author can submit to them. If the ACM, for example, would not allow more than, e.g., 24 papers per author per year (2 papers a month sounds like a good number to me), then authors will have to choose which papers they really worked on. However, perhaps they\u2019ll just put their names on the papers they think will have the most impact.<\/p>\n\n\n\n<p><strong>What do you think? Do you have good solutions?<\/strong><\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u201cIt is a sobering thought that when Mozart was my age, he had been dead for two years.\u201d Tom Lehrer If you look back at the papers you\u2019ve published over the past year and you think you had a good year, think again. According to DBLP, 5 authors published more than one paper every day &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/yom-tov.info\/blog\/2026\/03\/25\/prolific-authors-and-why-theyre-a-problem\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Prolific authors and why they\u2019re a problem&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-245","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/yom-tov.info\/blog\/wp-json\/wp\/v2\/posts\/245","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/yom-tov.info\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/yom-tov.info\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/yom-tov.info\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/yom-tov.info\/blog\/wp-json\/wp\/v2\/comments?post=245"}],"version-history":[{"count":0,"href":"https:\/\/yom-tov.info\/blog\/wp-json\/wp\/v2\/posts\/245\/revisions"}],"wp:attachment":[{"href":"https:\/\/yom-tov.info\/blog\/wp-json\/wp\/v2\/media?parent=245"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/yom-tov.info\/blog\/wp-json\/wp\/v2\/categories?post=245"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/yom-tov.info\/blog\/wp-json\/wp\/v2\/tags?post=245"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}