This article contains affiliate links. Bold Arc may earn a commission at no cost to you.
In 1900, roughly 9,000 scientific papers were published globally. The number is not a typo. The entire world's organized scientific output for a year — biology, chemistry, physics, medicine, mathematics, geology, astronomy — fit in a stack of journals small enough to shelve in a single room. The scientific community was a village. Researchers in London corresponded by letter with researchers in Berlin and Boston over periods of weeks and months, exchanging findings at a pace that would be unrecognizable to anyone working in science today.
In 2023, the global scientific publishing database Dimensions.ai recorded over 5 million peer-reviewed papers published in a single year. That is more than 500 times the 1900 output. The global scientific community has grown from a village to a civilization. And the implications of that knowledge explosion — for medicine, for technology, for agriculture, for energy, for the arc of human progress — have no historical precedent.
This is not just a story about more papers. It is a story about compound interest in knowledge. Each paper builds on prior work. Each finding enables the next question. The acceleration is not linear — it has features of exponential growth, and the rate of that growth is itself increasing.
How We Got Here: The Market for Knowledge
The growth in scientific output is not accidental. It is the product of specific, identifiable market mechanisms that drove the expansion of scientific capacity over the past century.
The most important mechanism is the university system. In 1900, universities were elite institutions serving tiny fractions of national populations. The United States had roughly 238,000 college students. By 2023, US college enrollment exceeded 19 million — an 80-fold increase while the population grew only 3-fold. The market for credentials — the economic value of a degree as a signal of skill and trainability — drove massive investment in higher education institutions. Those institutions, to attract students, needed research reputations, which required faculty doing research, which required laboratories, which required funding, which created the demand for research grants that governments and private foundations increasingly supplied.
The research university model — where faculty are expected to both teach and produce original research — was itself a market response to the demand for trained specialists. Germany developed it first in the 19th century; the United States copied and expanded it aggressively after World War II, when the federal government's investment in university research through the NSF, NIH, DARPA, and related agencies created the world's most productive scientific enterprise. The model spread globally over the 20th century. Today, China alone publishes more scientific papers annually than the United States — a consequence of its massive investment in research universities over the past 30 years.
Open Access: The Distribution Revolution
The expansion of scientific output was paralleled by a transformation in how scientific knowledge is distributed. For most of the 20th century, scientific knowledge was locked behind journal subscription paywalls accessible only to researchers with institutional affiliations. A paper published in the New England Journal of Medicine was accessible to a cardiologist at Harvard; it was not accessible to a physician in rural Kenya, a medical student in Vietnam, or a curious layperson in Ohio.
The open access movement — accelerated by the internet and driven by a combination of researcher frustration with publisher profit margins and government policy requirements — has fundamentally changed this. Preprint servers like arXiv (physics, mathematics, computer science) and bioRxiv/medRxiv (biology and medicine) now publish papers before or simultaneously with peer-reviewed journal publication, making findings instantly accessible to anyone worldwide. The European Research Council and NIH now require open access publication for publicly funded research. The number of papers freely accessible online has grown from near-zero in 1990 to over half of all newly published papers today.
The consequence: a researcher in Lagos or Hanoi or Nairobi now has access to the same published scientific literature as a researcher at MIT — in real time, for free. The democratization of scientific knowledge is one of the most underappreciated developments in the history of scientific progress. The barriers to scientific participation are no longer primarily informational. They are increasingly about local research infrastructure — laboratories, instruments, funding — and that gap is also closing.
International Collaboration: Comparative Advantage Applied to Science
One of the most striking features of modern science is its internationalization. In 1960, the vast majority of scientific papers had authors from a single country. International co-authorship was rare. Today, it is the norm: studies using bibliometric data consistently find that papers with international co-authors receive significantly more citations than single-country papers — a proxy for scientific impact. The most cited papers in most fields are now typically multi-country collaborations.
This is comparative advantage applied to the production of knowledge. Different countries have different concentrations of scientific talent, different equipment, different patient populations (relevant for medical research), and different natural environments (relevant for ecology, geology, and climate science). A paper studying the genetics of a disease prevalent in East Africa benefits from researchers with access to East African patients; it also benefits from computational biologists with access to high-performance computing clusters in the United States or Europe. The collaboration that combines these capabilities produces better science than either party could produce alone.
The data on international collaboration confirms what economic theory predicts. China's rapid integration into global science — driven by massive domestic investment in universities and research, combined with the return migration of Chinese researchers trained abroad — has not merely added to global scientific output. It has enabled collaborations that cross disciplinary and geographic lines in ways that are accelerating progress in high-priority fields: materials science, quantum computing, battery technology, genomics.
"Scientific progress is compound interest. Each finding enables the next question. At 5 million papers per year, the compound rate is unprecedented — and the downstream products of that knowledge, from medicine to materials, are only beginning to emerge."
AI-Assisted Research: The Next Acceleration
The most recent and least fully understood acceleration in scientific output is the application of artificial intelligence to research itself. The impact is already measurable in specific fields and will expand across science broadly over the next decade.
The most dramatic single example is AlphaFold. Protein structure prediction — determining the three-dimensional shape of a protein from its amino acid sequence — had been described as one of biology's most important unsolved problems since the 1960s. Knowing a protein's structure is essential for understanding its function and for designing drugs that target it. The manual experimental methods (X-ray crystallography, cryo-electron microscopy) required years of work to determine a single protein's structure. The backlog of uncharacterized proteins ran into the hundreds of millions.
In 2020, DeepMind's AlphaFold 2 solved the protein folding problem with accuracy competitive with experimental methods. In 2022, AlphaFold released predicted structures for over 200 million proteins — essentially the known protein universe — freely available to researchers worldwide. What had been a decades-long bottleneck in drug discovery, enzyme design, and cell biology was effectively removed in a single year. The downstream effects on biomedical research will compound for decades.
AlphaFold is the most visible example of a broader pattern. AI is being applied to materials discovery (identifying candidate battery materials, superconductors, and catalysts from among billions of potential combinations), to drug molecule design, to climate modeling, to telescope data analysis, to genomic sequence interpretation. In each case, AI does not replace human scientific judgment — it eliminates the computational bottleneck that limited how many candidates human judgment had to evaluate. The effect is a dramatic acceleration in the pace at which hypotheses can be tested and findings produced.
Patents: The Commercialization Signal
Scientific papers measure the production of knowledge. Patent filings measure the conversion of knowledge into practical application. Both show the same acceleration.
Global patent filings crossed 3.5 million in 2022, according to the World Intellectual Property Organization (WIPO) — up from under 1 million in 1985. The growth is concentrated in technology, biomedical, and clean energy fields. China became the world's largest patent filer by volume in 2011 and has maintained that position. The US, Japan, South Korea, and Germany remain the leading sources of high-quality (citation-weighted) patents.
The patent data tells a specific story about knowledge conversion: scientific findings from universities and research institutions are being translated into commercial technologies at an accelerating rate. The lag between scientific discovery and commercial application — which was measured in decades for most of the 20th century — has compressed dramatically in fields like genomics, AI, and materials science. mRNA vaccine technology, which went from a research idea in the 1990s to a deployed pandemic vaccine in under a year, is the most dramatic recent example of this compression.
What the Knowledge Explosion Means for the Arc
The connection between scientific output and human progress is not abstract. The specific products of the scientific knowledge explosion are already visible in arc metrics across health, longevity, food security, energy, and safety.
Consider medicine. The sequencing of the human genome — completed in 2003 after a $3 billion, 13-year international effort — made possible the precision oncology that is now delivering survival rates for cancers that were death sentences a decade ago. mRNA vaccine technology, developed through decades of academic research with no commercial application visible until COVID-19, produced vaccines in under a year that have saved millions of lives. CRISPR gene editing, developed from bacterial immune system research with no initial practical application in mind, is now in clinical trials for sickle cell disease, beta-thalassemia, and several forms of cancer. Each of these is a direct product of the scientific knowledge base — of specific papers, published in specific journals, building on prior work in specific fields.
The same dynamic applies to clean energy, where decades of academic research in semiconductor physics, electrochemistry, and materials science produced the solar panels and lithium-ion batteries that are now making fossil fuel energy economically obsolete. It applies to agriculture, where decades of plant genetics research produced the drought-resistant and high-yield crop varieties that have sustained food production for a growing global population. It applies to information technology, where the academic computer science of the 1960s through 1990s produced the internet, GPS, and the AI systems reshaping every economic sector today.
The Self-Reinforcing System
What makes the knowledge explosion genuinely unprecedented is its self-reinforcing character. More scientists produce more papers, which produce more findings, which enable more specialized sub-fields, which train more specialized scientists, which produce more papers. The AI tools developed using scientific knowledge accelerate the production of further scientific knowledge. The economic value of scientific discovery funds the universities and research institutions that produce more scientists.
There is no obvious ceiling to this dynamic. The number of questions that scientific methods can address — in biology, materials, computation, physics, ecology, social science — is effectively unlimited. The tools available to address those questions grow more powerful each year. The global community of researchers continues to expand, as education expands and as remote collaboration tools make geographic constraints less binding.
In 1900, the world's scientific knowledge fit in a few rooms. In 2023, it fit — barely — in the combined servers of the world's research databases. In 2043, the knowledge base will be larger by a factor we cannot precisely forecast but can directionally predict: substantially larger, far more efficiently organized by AI tools, and still compounding. The downstream products of that knowledge — in health, in energy, in materials, in food production — will flow to billions of people whose lives will be better for work they will never know was done.
The knowledge arc has no historical precedent. And it is still accelerating.
Explore the full innovation arc: The Arc Index — and the broader framework for measuring human progress beyond GDP: GDP Is Not Enough.
-
The Beginning of Infinity by David Deutsch
Deutsch's profound argument that human knowledge is the fundamental driver of progress — and that the knowledge explosion has no endpoint. Required reading for anyone thinking about the arc of science. -
The Knowledge Machine by Michael Strevens
A rigorous account of how science's 'iron rule' of experiment and evidence became the most powerful knowledge-generating mechanism in human history.