Index card datasets for training and evaluating models that convert index cards to structured data/metadata
AI & ML interests
🤗 Hugging Face x 🌸 BigScience initiative to create open-source community resources for libraries, archives and museums (LAMs).
This collection contains models, datasets and spaces related to historic language models
- dbmdz/bert-base-historic-multilingual-cased
  Fill-Mask • 0.1B • Updated • 371 • 8
- dbmdz/bert-base-historic-multilingual-64k-td-cased
  Fill-Mask • 0.1B • Updated • 43 • 2
- Riksarkivet/bert-base-cased-swe-historical
  Fill-Mask • 0.1B • Updated • 3 • 4
- dell-research-harvard/AmericanStories
  Updated • 4.41k • 169
Datasets which can help train or evaluate various approaches to automatic metadata generation and extraction.
- biglam/doab-metadata-extraction
  Viewer • Updated • 8.09k • 223 • 14
- biglam/rubenstein-manuscript-catalog
  Viewer • Updated • 49.7k • 175 • 3
- biglam/bpl-card-catalog
  Viewer • Updated • 838k • 343 • 5
- biglam/harvard-library-bibliographic-dataset
  Viewer • Updated • 11.1M • 211 • 2
Historic Newspaper Datasets on the Hub
- The Newspaper Navigator Dataset: Extracting And Analyzing Visual Content from 16 Million Historic Newspaper Pages in Chronicling America
  Paper • 2005.01583 • Published • 3
- bigscience-historical-texts/hipe2020
  Updated • 43 • 3
- bigscience-historical-texts/HIPE2020_sent-split
  Updated • 37
- biglam/bnl_newspapers1841-1879
  Viewer • Updated • 631k • 29 • 2