open-index/hacker-news
Key Points
- This dataset provides the complete and continuously updated archive of Hacker News content, including stories, comments, polls, and job postings, spanning from 2006 to the present.
- Organized into monthly Parquet files with real-time 5-minute updates for current activity, it offers a comprehensive, live mirror of the site, currently totaling over 47 million items.
- The dataset's standard Parquet structure allows efficient querying and analysis directly from Hugging Face via tools like DuckDB, the `datasets` library, pandas, and `huggingface_hub`.
This dataset card describes the "Hacker News - Complete Archive," a continuously updated collection of every item submitted to Hacker News since the site's launch in October 2006. The archive currently contains over 47.4 million items: stories, comments, Ask HN posts, Show HN posts, job postings, and polls. Hacker News, operated by Y Combinator, is a key platform for discussion within the technology community.
The core methodology for maintaining this dataset emphasizes real-time updates and data integrity. New Hacker News items are fetched from the source every 5 minutes and committed as individual Parquet files under a `today/` directory, organized by `YYYY/MM/DD/HH/MM.parquet` paths that each represent a 5-minute block of activity. To ensure long-term consistency and authoritative completeness, at midnight UTC each day the entire current month's data is refetched from the Hacker News API and consolidated into a single Parquet file named `YYYY/MM.parquet`. Following this consolidation, the individual 5-minute block files for the preceding day are removed from the `today/` directory. This dual-layer approach provides immediate access to recent activity while maintaining a clean, robust, and complete historical archive.

Complementing the data, `stats.csv` and `stats_today.csv` track metadata (item counts, ID ranges, file sizes, fetch durations, and commit timestamps) for each monthly file and each 5-minute block, respectively, allowing verification of pipeline progress and data completeness.
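The dual-layer layout above can be sketched as two path helpers. This is an illustrative sketch only: the path formats come from the card, but the helper functions are not part of the dataset's pipeline.

```python
from datetime import datetime, timezone

def block_path(ts: datetime) -> str:
    """Path of the 5-minute block file covering timestamp ts (illustrative helper)."""
    minute = ts.minute - ts.minute % 5  # floor to the 5-minute boundary
    return f"today/{ts.year:04d}/{ts.month:02d}/{ts.day:02d}/{ts.hour:02d}/{minute:02d}.parquet"

def month_path(ts: datetime) -> str:
    """Path of the consolidated monthly file (illustrative helper)."""
    return f"{ts.year:04d}/{ts.month:02d}.parquet"

ts = datetime(2024, 3, 7, 14, 23, tzinfo=timezone.utc)
print(block_path(ts))  # today/2024/03/07/14/20.parquet
print(month_path(ts))  # 2024/03.parquet
```

An item created at 14:23 UTC thus lands in the `14/20.parquet` block until the nightly consolidation folds it into `2024/03.parquet`.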
The dataset is structured with the following fields:
- `id` (uint32): a unique identifier for each item.
- `deleted` (uint8): whether the item has been deleted (0 = not deleted, 1 = deleted).
- `type` (int8): the item's category (1 = story, 2 = comment, 3 = poll, 4 = poll option, 5 = job).
- `by` (string): the username of the item's author.
- `time`: the UTC timestamp of the item's creation.
- `text` (string): the content of the item (e.g., comment body, story text).
- `dead` (uint8): whether the item is "dead" (0 = not dead, 1 = dead).
- `parent` (uint32): the ID of the parent item for comments or poll options.
- `poll` (uint32): the ID of the poll to which a poll option belongs.
- `kids` (list of uint32): IDs of direct child items (e.g., comments on a story).
- `url` (string): the URL for story items.
- `score` (int32): the score (number of upvotes) of the item.
- `title` (string): the title of the item (for stories or polls).
- `parts` (list of uint32): IDs of the poll option items belonging to a poll.
- `descendants` (int32): the total number of comments for a story item.
- `words` (list of string): words, likely extracted from the item's `text` or `title` for analytical purposes.
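A short sketch of how rows under this schema might be interpreted, using hand-written synthetic items (the field names and type codes come from the schema above; the item values are made up for illustration):

```python
# Mapping of the integer `type` codes documented in the schema.
TYPE_CODES = {1: "story", 2: "comment", 3: "poll", 4: "pollopt", 5: "job"}

# Synthetic rows shaped like dataset items (values are illustrative, not real data).
items = [
    {"id": 100, "type": 1, "by": "alice", "score": 57,
     "title": "Example story", "kids": [101, 102]},
    {"id": 101, "type": 2, "by": "bob", "parent": 100, "text": "A reply."},
]

# Resolve each item's kind and count its direct children via `kids`.
for item in items:
    kind = TYPE_CODES[item["type"]]
    n_kids = len(item.get("kids", []))
    print(f'{item["id"]}: {kind} with {n_kids} direct children')
```

Note that `kids` lists only direct children, while `descendants` (when present on a story) counts the whole comment subtree.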
The dataset's organization as standard Parquet files facilitates access and analysis with a range of tools: DuckDB for querying directly from Hugging Face, the Hugging Face `datasets` library for loading specific years or streaming the full history, `huggingface_hub` for selective downloads, and pandas or DuckDB for in-memory analysis.
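Because monthly files follow the `YYYY/MM.parquet` naming scheme, selective downloads reduce to glob patterns over the repository's file listing, in the style of the `allow_patterns` filtering that `huggingface_hub`'s `snapshot_download` supports. The sketch below applies such patterns to an illustrative (not fetched) file list:

```python
from fnmatch import fnmatch

# Illustrative repository listing following the layout described above
# (not fetched from the Hub).
repo_files = [
    "2006/10.parquet",
    "2023/01.parquet",
    "2023/12.parquet",
    "today/2024/03/07/14/20.parquet",
    "stats.csv",
]

def select(files, pattern):
    """Keep files matching a shell-style glob pattern."""
    return [f for f in files if fnmatch(f, pattern)]

print(select(repo_files, "2023/*.parquet"))            # monthly archives for 2023
print(select(repo_files, "today/*/*/*/*/*.parquet"))   # live 5-minute blocks
```

In practice one would pass such patterns to `snapshot_download(repo_id="open-index/hacker-news", repo_type="dataset", allow_patterns=[...])` to fetch only the months of interest.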