The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 23 feature-extraction Open-Source Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
interviews.ai
It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced researchers will find it fascinating as well.
-
towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
-
metarank
A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine
-
OpenMLDB
OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
Deep_Learning_Machine_Learning_Stock
Deep Learning and Machine Learning stocks represent promising opportunities for both long-term and short-term investors and traders.
-
speechpy
:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
-
mistql
A query / expression language for performing computations on JSON-like structures. Tuned for clientside ML feature extraction.
-
desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
-
MiniAI-Face-Recognition-AndroidSDK
NIST FRVT Top Ranked Face Recognition, iBeta 2 Certified Liveness Detection Engine on Mobile
-
upgini
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: For deep learning practitioners in industry, is the workflow always this annoying? [D] | /r/MachineLearning | 2023-07-10This is definitely a good thing to try for time-series; you can automate your feature extraction too (eg using https://github.com/blue-yonder/tsfresh ).
Project mention: Comparative Analysis of Memory Consumption: OpenMLDB vs Redis Test Report | dev.to | 2024-04-03b. Pull the testing code
Project mention: Deep_Learning_Machine_Learning_Stock: NEW Deep Learning And Reinforcement Learning - star count:1017.0 | /r/algoprojects | 2023-12-10
Project mention: The fastest way to improve quality of ML model on tabular data | /r/learnmachinelearning | 2023-06-18web: https://upgini.com
While there are abundant researches about evaluating ChatGPT on natural language understanding and generation tasks, few studies have investigated how ChatGPT's behavior changes over time. In this paper, we collect a coarse-to-fine temporal dataset called ChatLog, consisting of two parts that update monthly and daily: ChatLog-Monthly is a dataset of 38,730 question-answer pairs collected every month including questions from both the reasoning and classification tasks. ChatLog-Daily, on the other hand, consists of ChatGPT's responses to 1000 identical questions for long-form generation every day. We conduct comprehensive automatic and human evaluation to provide the evidence for the existence of ChatGPT evolving patterns. We further analyze the unchanged characteristics of ChatGPT over time by extracting its knowledge and linguistic features. We find some stable features to improve the robustness of a RoBERTa-based detector on new versions of ChatGPT. We will continuously maintain our project at https://github.com/THU-KEG/ChatLog.
feature-extraction related posts
- For deep learning practitioners in industry, is the workflow always this annoying? [D]
- dna_parser : A Python package written in Rust to encode DNA sequences for machine learning.
- [D] Incorporating external data in LSTM models for sales forecasting in e-commerce
- [R] Approach to identify clusters on a time series
- GitHub - meyda/meyda: Audio feature extraction for JavaScript.
- Meyda: A JavaScript audio feature extraction library
- The outputs of my jupyter notebooks inside of Github repos only show half of what they used to. Why did this happen and how to fix? I am certain that the outputs used to show everything when viewed in Github, and I have not reuploaded the notebooks to the repo's since then.
-
A note from our sponsor - WorkOS
workos.com | 25 Apr 2024
Index
What are some of the best open-source feature-extraction projects? This list will help you:
Project | Stars | |
---|---|---|
1 | tsfresh | 8,076 |
2 | EfficientNet-PyTorch | 7,715 |
3 | interviews.ai | 4,437 |
4 | towhee | 2,989 |
5 | metarank | 1,985 |
6 | OpenMLDB | 1,550 |
7 | meyda | 1,388 |
8 | Deep_Learning_Machine_Learning_Stock | 1,142 |
9 | pykaldi | 978 |
10 | speechpy | 880 |
11 | tsfel | 852 |
12 | machinelearnjs | 536 |
13 | deltapy | 527 |
14 | tsflex | 360 |
15 | mistql | 345 |
16 | desbordante-core | 321 |
17 | PyTorch-Model-Compare | 308 |
18 | MiniAI-Face-Recognition-AndroidSDK | 307 |
19 | upgini | 290 |
20 | opensmile-python | 217 |
21 | textfeatures | 167 |
22 | torchextractor | 99 |
23 | ChatLog | 93 |
Sponsored