-
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and related entity recognition tasks. These annotated datasets cover a variety of languages, domains, and entity types.
There is, of course, the list at https://github.com/juand-r/entity-recognition-datasets, but all of the recent English datasets there cover other domains, such as music NER and space NER. These are all interesting, but none of them is 2020s English newswire.
Related posts
-
Towards a Tagalog NLP pipeline: Building a spaCy model from scratch
-
Any large manually annotated NER datasets?
-
Streamlining AI/ML Deployment with ModelKits: Innovations and Future Directions
-
Introducing the New GitHub Action for using Kit CLI on MLOps pipelines
-
Say hello to Kit–An open source solution to MLOps complexity