pyod
pygod
Our great sponsors
pyod | pygod | |
---|---|---|
7 | 3 | |
7,941 | 1,208 | |
- | 5.0% | |
7.7 | 8.6 | |
7 days ago | 4 days ago | |
Python | Python | |
BSD 2-clause "Simplified" License | BSD 2-clause "Simplified" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pyod
-
A Comprehensive Guide for Building Rag-Based LLM Applications
This is a feature in many commercial products already, as well as open source libraries like PyOD. https://github.com/yzhao062/pyod
-
Analyze defects and errors in the created images
PyOD
-
Multivariate Outlier Detection in Python
Check out the algorithms and documentation in this toolkit. It’ll give you a list of methods to read up on to understand their mechanisms. https://github.com/yzhao062/pyod
- Pyod – A Comprehensive and Scalable Python Library for Outlier Detection
- Predictive Maintenance and Anomaly Detection Resources
-
[D] Unsupervised Outlier Detection - Advise Requested
The source code and documentaion of PyOD is the best survey about OOD. Besides, the normalized flow and VQVAE are also feasible.
- PyOD: ~50 anomaly detection algorithms in one framework.
pygod
-
RAG Using Structured Data: Overview and Important Questions
Ok, using ChatGPT and Bard (the irony lol) I learned a bit more about GNNs:
GNNs are probabilistic and can be trained to learn representations in graph-structured data and handling complex relationships, while classical graph algorithms are specialized for specific graph analysis tasks and operate based on predefined rules/steps.
* Why is PyG it called "Geometric" and not "Topologic" ?
Properties like connectivity, neighborhoods, and even geodesic distances can all be considered topological features of a graph. These features remain unchanged under continuous deformations like stretching or bending, which is the defining characteristic of topological equivalence. In this sense, "PyTorch Topologic" might be a more accurate reflection of the library's focus on analyzing the intrinsic structure and connections within graphs.
However, the term "geometric" still has some merit in the context of PyG. While most GNN operations rely on topological principles, some do incorporate notions of Euclidean geometry, such as:
- Node embeddings: Many GNNs learn low-dimensional vectors for each node, which can be interpreted as points in a vector space, allowing geometric operations like distances and angles to be applied.
- Spectral GNNs: These models leverage the eigenvalues and eigenvectors of the graph Laplacian, which encodes information about the geometric structure and distances between nodes.
- Manifold learning: Certain types of graphs can be seen as low-dimensional representations of high-dimensional manifolds. Applying GNNs in this context involves learning geometric properties on the manifold itself.
Therefore, although topology plays a primary role in understanding and analyzing graphs, geometry can still be relevant in certain contexts and GNN operations.
* Real world applications:
- HuggingFace has a few models [0] around things like computational chemistry [1] or weather forecasting.
- PyGod [2] can be used for Outlier Detection (Anomaly Detection).
- Apparently ULTRA [3] can "infer" (in the knowledge graph sense), that Michael Jackson released some disco music :-p (see the paper).
- RGCN [4] can be used for knowledge graph link prediction (recovery of missing facts, i.e. subject-predicate-object triples) and entity classification (recovery of missing entity attributes).
- GreatX [5] tackles removing inherent noise, "Distribution Shift" and "Adversarial Attacks" (ex: noise purposely introduced to hide a node presence) from networks. Apparently this is a thing and the field is called "Graph Reliability" or "Reliable Deep Graph Learning". The author even has a bunch of "awesome" style lists of links! [6]
- Finally this repo has a nice explanation of how/why to run machine learning algorithms "outside of the DB":
"Pytorch Geometric (PyG) has a whole arsenal of neural network layers and techniques to approach machine learning on graphs (aka graph representation learning, graph machine learning, deep graph learning) and has been used in this repo [7] to learn link patterns, also known as link or edge predictions."
--
0: https://huggingface.co/models?pipeline_tag=graph-ml&sort=tre...
1: https://github.com/Microsoft/Graphormer
2: https://github.com/pygod-team/pygod
3: https://github.com/DeepGraphLearning/ULTRA
4: https://huggingface.co/riship-nv/RGCN
5: https://github.com/EdisonLeeeee/GreatX
6: https://edisonleeeee.github.io/projects.html
7: https://github.com/Orbifold/pyg-link-prediction
- GitHub - pygod-team/pygod: A Python Library for Graph Outlier Detection (Anomaly Detection)
- PyGOD: Library for graph outlier detection (anomaly detection)
What are some alternatives?
tods - TODS: An Automated Time-series Outlier Detection System
anomaly-detection-resources - Anomaly detection related books, papers, videos, and toolboxes
isolation-forest - A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
PyNeuraLogic - PyNeuraLogic lets you use Python to create Differentiable Logic Programs
alibi-detect - Algorithms for outlier, adversarial and drift detection
kafkaml-anomaly-detection - Project for real-time anomaly detection using Kafka and python
pycaret - An open-source, low-code machine learning library in Python
ADBench - Official Implement of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.
anomalib - An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
DGFraud - A Deep Graph-based Toolbox for Fraud Detection
stumpy - STUMPY is a powerful and scalable Python library for modern time series analysis
TabFormer - Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)