TabFormer vs pygod

TabFormer

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021) (by IBM)

Source code: https://github.com/IBM/TabFormer

arxiv.org

pygod

A Python Library for Graph Outlier Detection (Anomaly Detection) (by pygod-team)

Tags: outlier-detection, anomaly-detection, graph-anomaly-detection, Machine Learning, security-tools, Opensource, Deeplearning, Python, graphmining, Pytorch, graph-neural-networks, fraud-detection, Toolkit

Source code: https://github.com/pygod-team/pygod

pygod.org

Metric	TabFormer	pygod
Mentions	10	3
Stars	297	1,217
Growth	2.7%	2.2%
Activity	0.0	8.6
Latest Commit	9 months ago	16 days ago
Language	Python	Python
License	Apache License 2.0	BSD 2-clause "Simplified" License

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub.
Growth - month-over-month growth in stars.
Activity - a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones. For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.

TabFormer

Posts with mentions or reviews of TabFormer. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-20.

Time-based splitting performing significantly worse than random splitting
2 projects | /r/learnmachinelearning | 20 May 2023

Hi, I am currently working on a basic binary classifier for a transaction dataset, to predict which transaction is fraudulent (Dataset: https://github.com/IBM/TabFormer). The following is a quick summary of the dataset:
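
The dataset summary is not part of this excerpt. As a rough illustration of the two strategies named in the post title, the sketch below contrasts a random split with a time-based split. It uses only the standard library; the field names (timestamp, amount, is_fraud) and the toy rows are assumptions made for illustration and are not the TabFormer dataset's actual schema.

```python
import random
from datetime import datetime, timedelta


def random_split(rows, test_fraction=0.2, seed=0):
    """Shuffle the rows, then hold out `test_fraction` of them as the test set."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) - round(len(shuffled) * test_fraction)
    return shuffled[:cut], shuffled[cut:]


def time_based_split(rows, test_fraction=0.2, time_key="timestamp"):
    """Sort chronologically; the most recent `test_fraction` becomes the test set."""
    ordered = sorted(rows, key=lambda r: r[time_key])
    cut = len(ordered) - round(len(ordered) * test_fraction)
    return ordered[:cut], ordered[cut:]


# Hypothetical toy transactions (illustrative field names, not the real schema).
start = datetime(2020, 1, 1)
transactions = [
    {"timestamp": start + timedelta(days=i), "amount": 10.0 + i, "is_fraud": int(i % 10 == 0)}
    for i in range(100)
]

train_r, test_r = random_split(transactions)
train_t, test_t = time_based_split(transactions)

# In the time-based split, every training row precedes every test row.
latest_train = max(r["timestamp"] for r in train_t)
earliest_test = min(r["timestamp"] for r in test_t)
```

A random split lets the model train on transactions that occur after some of the test transactions, which a time-based split rules out. That difference is one plausible reason the two strategies can score differently, though the excerpt does not say what caused the gap in the poster's case.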

Question regarding Relational Graph Convolutional Network for a Fraud Detection problem
1 project | /r/learnmachinelearning | 4 May 2023

I am currently working on a transaction dataset (https://github.com/IBM/TabFormer/tree/main/data/credit_card) and I intend to build a fraud detection engine, but with tabular data transformed into a graph. I have used this article as my main outline for this approach: https://developer.nvidia.com/blog/optimizing-fraud-detection-in-financial-services-with-graph-neural-networks-and-nvidia-gpus/.
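
To make "tabular data transformed into a graph" concrete, here is a minimal sketch of one common construction: cards and merchants become nodes, and each transaction becomes an edge between them. This is not necessarily the construction used in the linked article or by the poster. The column names (card, merchant, amount, is_fraud) and the helper build_bipartite_graph are hypothetical.

```python
from collections import defaultdict

# Hypothetical transaction rows; column names are illustrative only.
transactions = [
    {"card": "c1", "merchant": "m1", "amount": 25.0, "is_fraud": 0},
    {"card": "c1", "merchant": "m2", "amount": 310.0, "is_fraud": 1},
    {"card": "c2", "merchant": "m1", "amount": 12.5, "is_fraud": 0},
]


def build_bipartite_graph(rows):
    """Assign integer ids to cards and merchants; each transaction becomes one edge."""
    node_ids = {}    # (kind, name) -> integer node id
    edges = []       # (card_node_id, merchant_node_id), one per transaction
    edge_attrs = []  # per-edge features and label, aligned with `edges`

    def node_id(kind, name):
        key = (kind, name)
        if key not in node_ids:
            node_ids[key] = len(node_ids)
        return node_ids[key]

    for row in rows:
        src = node_id("card", row["card"])
        dst = node_id("merchant", row["merchant"])
        edges.append((src, dst))
        edge_attrs.append({"amount": row["amount"], "is_fraud": row["is_fraud"]})
    return node_ids, edges, edge_attrs


node_ids, edges, edge_attrs = build_bipartite_graph(transactions)

# Undirected adjacency list, e.g. for inspecting which cards share a merchant.
adjacency = defaultdict(list)
for src, dst in edges:
    adjacency[src].append(dst)
    adjacency[dst].append(src)
```

The edge list and aligned attribute list map directly onto the edge-index and edge-feature inputs that graph libraries typically expect, with fraud labels attached to edges (transactions) rather than to nodes.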

TabFormer: NEW Data - star count:231.0
1 project | /r/algoprojects | 25 Mar 2023

1 project | /r/algoprojects | 24 Mar 2023

1 project | /r/algoprojects | 23 Mar 2023

1 project | /r/algoprojects | 22 Mar 2023

1 project | /r/algoprojects | 21 Mar 2023

1 project | /r/algoprojects | 20 Mar 2023

1 project | /r/algoprojects | 19 Mar 2023

[D] Neural Networks are not the only universal approximators, so why are they so uniquely effective?
1 project | /r/MachineLearning | 29 Mar 2022

When people talk about tabular data they mean something with fewer than 100 columns, where the classification might depend strongly on a handful of specific ones. There is of course a regime where data is "somewhat" tabular (some NLP problems), so it's not entirely well-defined. And there are NN architectures for tabular data, like TabFormer.

pygod

Posts with mentions or reviews of pygod. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-10.

RAG Using Structured Data: Overview and Important Questions
5 projects | news.ycombinator.com | 10 Jan 2024

Ok, using ChatGPT and Bard (the irony lol) I learned a bit more about GNNs:
GNNs are probabilistic and can be trained to learn representations of graph-structured data and to handle complex relationships, while classical graph algorithms are specialized for specific graph analysis tasks and operate based on predefined rules/steps.
* Why is PyG called "Geometric" and not "Topologic"?
Properties like connectivity, neighborhoods, and even geodesic distances can all be considered topological features of a graph. These features remain unchanged under continuous deformations like stretching or bending, which is the defining characteristic of topological equivalence. In this sense, "PyTorch Topologic" might be a more accurate reflection of the library's focus on analyzing the intrinsic structure and connections within graphs.
However, the term "geometric" still has some merit in the context of PyG. While most GNN operations rely on topological principles, some do incorporate notions of Euclidean geometry, such as:
- Node embeddings: Many GNNs learn low-dimensional vectors for each node, which can be interpreted as points in a vector space, allowing geometric operations like distances and angles to be applied.
- Spectral GNNs: These models leverage the eigenvalues and eigenvectors of the graph Laplacian, which encodes information about the geometric structure and distances between nodes.
- Manifold learning: Certain types of graphs can be seen as low-dimensional representations of high-dimensional manifolds. Applying GNNs in this context involves learning geometric properties on the manifold itself.
Therefore, although topology plays a primary role in understanding and analyzing graphs, geometry can still be relevant in certain contexts and GNN operations.
* Real world applications:
- HuggingFace has a few models [0] around things like computational chemistry [1] or weather forecasting.
- PyGod [2] can be used for Outlier Detection (Anomaly Detection).
- Apparently ULTRA [3] can "infer" (in the knowledge graph sense), that Michael Jackson released some disco music :-p (see the paper).
- RGCN [4] can be used for knowledge graph link prediction (recovery of missing facts, i.e. subject-predicate-object triples) and entity classification (recovery of missing entity attributes).
- GreatX [5] tackles removing inherent noise, "Distribution Shift" and "Adversarial Attacks" (ex: noise purposely introduced to hide a node presence) from networks. Apparently this is a thing and the field is called "Graph Reliability" or "Reliable Deep Graph Learning". The author even has a bunch of "awesome" style lists of links! [6]
- Finally this repo has a nice explanation of how/why to run machine learning algorithms "outside of the DB":
"Pytorch Geometric (PyG) has a whole arsenal of neural network layers and techniques to approach machine learning on graphs (aka graph representation learning, graph machine learning, deep graph learning) and has been used in this repo [7] to learn link patterns, also known as link or edge predictions."
--
0: https://huggingface.co/models?pipeline_tag=graph-ml&sort=tre...
1: https://github.com/Microsoft/Graphormer
2: https://github.com/pygod-team/pygod
3: https://github.com/DeepGraphLearning/ULTRA
4: https://huggingface.co/riship-nv/RGCN
5: https://github.com/EdisonLeeeee/GreatX
6: https://edisonleeeee.github.io/projects.html
7: https://github.com/Orbifold/pyg-link-prediction

GitHub - pygod-team/pygod: A Python Library for Graph Outlier Detection (Anomaly Detection)
1 project | /r/programming | 9 Apr 2022

PyGOD: Library for graph outlier detection (anomaly detection)
1 project | news.ycombinator.com | 7 Apr 2022

What are some alternatives?

When comparing TabFormer and pygod you can also consider the following projects:

Transformers4Rec - Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

pyod - A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
