An interactive semantic map of the latest 10 million published papers [P]
Explore 10 million papers through a spatial, AI-powered interface.
A Reddit user (u/icannotchangethename) has launched The Global Research Space, an interactive semantic map that visualizes the latest 10 million published papers sourced from the OpenAlex database. The map uses SPECTER 2, a transformer-based model designed for scientific document embeddings, to generate vector representations from titles and abstracts. These embeddings are then reduced to two dimensions using UMAP (Uniform Manifold Approximation and Projection), a dimensionality reduction technique that preserves local and global structure. The resulting points are partitioned into distinct semantic neighborhoods using Voronoi partitioning on density peaks, creating clusters that represent related research topics. Floating topic labels are generated via custom algorithms, though the creator notes they are still a work in progress.
The tool supports both keyword and semantic queries, allowing users to search for papers by exact terms or conceptual similarity. An analytics layer enables ranking of institutions, authors, and topics, providing insights into research impact and trends. The interface is designed for spatial exploration, letting users navigate the scientific landscape intuitively. The map is free to use at theglobalresearchspace.com, and the creator welcomes feedback for improvements. This approach offers a novel way to discover interdisciplinary connections and track emerging research areas, potentially aiding researchers, librarians, and science policymakers in navigating the vast and growing body of published work.
- Sourced 10 million papers from OpenAlex and embedded them using SPECTER 2 on titles and abstracts.
- Reduced dimensionality with UMAP and applied Voronoi partitioning on density peaks for semantic neighborhoods.
- Supports keyword and semantic queries, plus analytics for ranking institutions, authors, and topics.
Why It Matters
Transforms overwhelming research volumes into an explorable map, enabling discovery of cross-disciplinary connections and trends.