-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
FAISS works well https://github.com/facebookresearch/faiss
You can look for usage here: https://github.com/LIAAD/yake and there is also a reference section with publications for more details of how this works. From what I remember, each keyphrase candidate is assigned an aggregated score based on various features: position in the text, casing, frequency, surrounding text frequency...
Other suggestions: Take a look at Scattertext. Compare keywords to the problem of aspect extraction. I think an underutilized way to look at textual data when you have a single group of interest is the word-frequency-based odds ratio.
Related posts
-
[P] Building model to extract keywords from legal documents
-
Rust Keyword Extraction: Creating the YAKE! algorithm from scratch
-
I want to extract important keywords from large documents...
-
Show HN: Whisper.cpp and YAKE to Analyse Voice Reflections [iOS]
-
[P] what is the most efficient way to pattern matching word-to-word?