name-dataset vs Stanza

name-dataset

The Python library for names. (by philipperemy)

Stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages (by stanfordnlp)

Natural Language Processing, General, Python, NLP, Machine Learning, Deep Learning, Artificial intelligence, Pytorch, universal-dependencies, named-entity-recognition, Corenlp

Project	name-dataset	Stanza
Mentions	2	8
Stars	784	7,060
Growth	-	0.7%
Activity	2.9	9.8
Latest Commit	6 months ago	7 days ago
Language	Python	Python
License	Apache License 2.0	Apache License 2.0

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub.
Growth - month-over-month growth in stars.
Activity - a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones. For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.
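
The Growth figure can be read as ordinary percentage change. The sketch below assumes the conventional month-over-month formula; the site does not state its exact calculation, and the function name and star counts are hypothetical, chosen only to illustrate the arithmetic.

```python
def month_over_month_growth(stars_now: int, stars_prev: int) -> float:
    """Return the percentage change in stars from last month to now.

    Assumption: conventional percentage change, not a formula
    confirmed by the site.
    """
    if stars_prev <= 0:
        raise ValueError("stars_prev must be positive")
    return (stars_now - stars_prev) / stars_prev * 100.0


# Hypothetical star counts, not taken from either project.
growth = month_over_month_growth(stars_now=1010, stars_prev=1000)
print(f"{growth:.1f}%")
```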

name-dataset

Posts with mentions or reviews of name-dataset. We have used some of these posts to build our list of alternatives and similar projects.

Requesting surname data with frequency
1 project | /r/datasets | 3 Jan 2023

Yep! I actually know of something that is exactly what you’re looking for. Note that you will need to know a bit of Python to use it. Here’s the link: https://github.com/philipperemy/name-dataset
Searching for an API that I can search a name
1 project | /r/CodingHelp | 24 Dec 2022

I don't think anything like this exists publicly, but you could use the various available datasets and build your own API based on that data. Here's a +2GB dataset: https://github.com/philipperemy/name-dataset

Stanza

Posts with mentions or reviews of Stanza. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-01-06.

Down and Out in the Magic Kingdom
1 project | news.ycombinator.com | 23 Jul 2023
Parts of speech tagged for German
3 projects | /r/German | 6 Jan 2023

I use Python's spaCy library (https://spacy.io/models/de) or Stanza (https://stanfordnlp.github.io/stanza/), each with its respective language models.
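
As a rough illustration of the Stanza route, the sketch below tags German text with universal POS tags. It assumes `pip install stanza` and network access for the one-time model download; the helper names `tag_german` and `word_tags` are hypothetical, not part of Stanza.

```python
def tag_german(text):
    """Run Stanza's German pipeline and return the annotated document.

    Assumes the stanza package is installed and the German models
    can be downloaded on first use.
    """
    import stanza  # imported lazily so word_tags() works without it

    stanza.download("de")  # fetches the German models if missing
    nlp = stanza.Pipeline("de", processors="tokenize,mwt,pos")
    return nlp(text)


def word_tags(doc):
    """Collect (word, universal POS tag) pairs from an annotated document."""
    return [
        (word.text, word.upos)
        for sentence in doc.sentences
        for word in sentence.words
    ]
```

Calling `word_tags(tag_german("Der Hund schläft."))` would then return one `(text, upos)` pair per word.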
Off the shelf sentence parsers?
2 projects | /r/LanguageTechnology | 26 Aug 2022

Stanza has a constituency parser. There's a model compatible with the dev branch with an accuracy of 95.8 on PTB, using RoBERTa as a bottom layer, so it's pretty decent at this point. (The currently released model is not as accurate, but it's easy to get the better model to you.) There's also Tregex, a Java add-on, which can very easily search for the noun phrase highest up in the tree: the pattern NP !>> NP matches a noun phrase that is not dominated by any higher noun phrase.
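
To make the meaning of NP !>> NP concrete, here is a plain-Python sketch of the same search over a hand-written parse tree. It does not use Tregex or Stanza; the nested-tuple tree format and the function names are assumptions made for illustration.

```python
def topmost(tree, label, dominated=False):
    """Yield subtrees with `label` that have no ancestor carrying `label`."""
    if isinstance(tree, str):  # a leaf token has no label
        return
    node_label, *children = tree
    if node_label == label and not dominated:
        yield tree
    # Everything below a matching node counts as dominated by it.
    below = dominated or node_label == label
    for child in children:
        yield from topmost(child, label, below)


def leaves(tree):
    """Return the tokens under a subtree, left to right."""
    if isinstance(tree, str):
        return [tree]
    return [token for child in tree[1:] for token in leaves(child)]


# Hand-written example parse: (label, child, child, ...)
tree = ("S",
        ("NP",
         ("NP", ("DT", "the"), ("NN", "dog")),
         ("PP", ("IN", "in"), ("NP", ("DT", "the"), ("NN", "park")))),
        ("VP", ("VBD", "barked")))

matches = [" ".join(leaves(t)) for t in topmost(tree, "NP")]
print(matches)  # ['the dog in the park']
```

Only the outermost noun phrase is returned; the two nested NPs are skipped because an NP sits above them.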
The Spacy NER model for Spanish is terrible
2 projects | /r/LanguageTechnology | 20 Dec 2021
Spacy vs NLTK for Spanish Language Statistical Tasks
1 project | /r/LanguageTechnology | 12 Nov 2021
Stanza not tokenising sentences as expected
1 project | /r/learnpython | 3 Nov 2021

I am using Stanza to tokenise the sentences:
Stanza – A Python NLP Package for Many Human Languages
1 project | /r/programming | 29 Oct 2021

1 project | news.ycombinator.com | 27 Oct 2021
