Surprise vs MLflow

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

Surprise		MLflow
	Project
8	Mentions	56
6,194	Stars	17,335
-	Growth	1.5%
2.7	Activity	9.9
7 days ago	Latest Commit	5 days ago
Python	Language	Python
GNU General Public License v3.0 or later	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Surprise

Posts with mentions or reviews of Surprise. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-10-16.

Recommender Systems: Surprise library installation on m1 mac
1 project | /r/learnpython | 12 Jan 2023

Something is wrong with the repo. The compiler fails with this error clang: error: no such file or directory: 'surprise/similarities.c' If you go to the repo, you'll see the file is indeed missing: https://github.com/NicolasHug/Surprise/tree/master/surprise
Recommender systems question
1 project | /r/MLQuestions | 12 Nov 2022

Scikit-surprise is a useful package and has pretty good documentation to help make the leap from conceptual understanding to code. If you want to understand the various implementations, the package is open source and available on GitHub. I can’t speak for optimal computational efficiency but I think that it’s premature to worry about that while you’re still making the transition from concept to functionality.
Surprise – a simple recommender system library for Python
1 project | /r/Python | 1 Mar 2022

1 project | /r/recommendersystems | 1 Mar 2022

1 project | /r/programming | 1 Mar 2022

1 project | news.ycombinator.com | 1 Mar 2022
Dislike button would improve Spotify's recommendations
4 projects | news.ycombinator.com | 16 Oct 2021

I spent the latter half of 2019 trying to build this as a startup. Ultimately I pivoted (now I do newsletter recommendations instead), but if I hadn't made some mistakes I think it could've gotten more traction. Mostly I should've simplified the idea to make it easier to build. If anyone's interested in working on this, here's what I would do:
(But first some background: The way I saw it, you can split music recommendation into two tasks: (1) picking a song you already know that should be played right now, and (2) picking a new song you've never heard of before. (Music recommendation is unique in this way since in most other domains there isn't much value in re-recommending items). I think #1 is more important, and if you nail that, you can do a so-so job of #2 and still have a good system.)
Make a website that imports your Last.fm history. Organize the history into sessions (say, groups of listen events with a >= 30 minute gap in between). Feed those sessions into a collaborative filtering library like Surprise[1], as a CSV of `, , 1` (1 being a rating--in this case we only have positive ratings). Then make some UI that lets people create and export playlists. e.g. I pick a couple seed songs from my listening history, then the app uses Surprise to suggest more songs. Present a list of 10 songs at a time. Click a song to add it, and have a "skip all" button that gets a new list of songs. Save these interactions as ratings--e.g. if I skip a song, that's a -1 rating for this playlist. For some percentage of the suggestions (20% by default? Make it configurable), use Last.fm's or Spotify's API to pick a new song not in your history, based on the songs in the current playlist. Also sometimes include songs that were added to the playlist previously--if you skip them, they get removed from the playlist. Then you can spend a couple minutes every week refreshing your playlists. Export the playlists to Spotify/Apple Music/whatever.
As you get more users, you can do "regular" collaborative filtering (i.e. with different users) to recommend new songs instead of relying on external APIs. There are probably lots of other things you could do too--e.g. scrape wikipedia to figure out what artists have done collaborations or something. In general I think the right approach is to build a model for artist similarity rather than individual song similarity. At recommendation time, you pick an artist and then suggest their top songs (and sometimes pick an artist already in the user's history, and suggest songs they haven't heard yet--that's even easier).
This is the simplest thing I can think of that would solve my "I love music but I listen to the same old songs everyday because I'm busy and don't want to futz around with curating my music library" problem. You wouldn't have to waste time building a crappy custom music app, and users won't have to use said crappy custom music app (speaking from personal experience...). You wouldn't have to deal with music rights or integrating with Spotify/Apple Music since you're not actually playing any music.
If you want to go further with it, you could get traction first and then launch your own streaming service or something. (Reminds me a bit of Readwise starting with just highlights and then launching their own reader recently). I think it'd be neat to make an indie streaming service--kind of like Bandcamp but with an algorithm to help you find the good stuff. Let users upload and listen to their own MP3s so it can still work with popular music. Of course it'd be nicer for users in the short term if you just made deals with the big record labels, however this would help you not end up in Spotify's position of pivoting to podcasts so you can get out of paying record labels. And then maybe in a few decades all the good music won't be on the big labels anyway :).
Anyway if anyone is remotely interested in building something like this, I'll be your first user. I really need it. Otherwise I'll probably build it myself at some point in the next year or two as a side project.
[1] http://surpriselib.com/
Show HN: The Sample – newsletters curated for you with machine learning
1 project | news.ycombinator.com | 28 Jun 2021

I'm planning to build a business on this, so probably won't open-source it--but I'm always looking for interesting things to write about! I write a weekly newsletter called Future of Discovery[1]; I might write up some more implementation details there in a week or two. In the mean time, most of the heavy lifting is done by the Surprise python lib[2]. It's pretty easy to play around with, just give it a csv of , , and then you can start making rating predictions. Also fastText[3] is easy to mess around with too. Most of the code I've written just layers things on top of that, e.g. to handle exploration-vs-exploitation as discussed in another thread here.
Recently I've been factoring out the ML code into a separate recommendation service so it can different kinds of apps (I just barely made this essay recommender system[4] start using it for example).
I'm happy to chat about recommender systems also if you like, email's in my profile.
[1] https://findka.com
[2] http://surpriselib.com/
[3] https://fasttext.cc/
[4] https://essays.findka.com

MLflow

Posts with mentions or reviews of MLflow. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-23.

Observations on MLOps–A Fragmented Mosaic of Mismatched Expectations
1 project | dev.to | 26 Apr 2024

How can this be? The current state of practice in AI/ML work requires adaptivity, which is uncommon in classical computational fields. There are myriad tools that capture the work across the many instances of the AI/ML lifecycle. The idea that any one tool could sufficiently capture the dynamic work is unrealistic. Take, for example, an experiment tracking tool like W&B or MLFlow; some form of experiment tracking is necessary in typical model training lifecycles. Such a tool requires some notion of a dataset. However, a tool focusing on experiment tracking is orthogonal to the needs of analyzing model performance at the data sample level, which is critical to understanding the failure modes of models. The way one does this depends on the type of data and the AI/ML task at hand. In other words, MLOps is inherently an intricate mosaic, as the capabilities and best practices of AI/ML work evolve.
My Favorite DevTools to Build AI/ML Applications!
9 projects | dev.to | 23 Apr 2024

MLflow is an open-source platform for managing the end-to-end machine learning lifecycle. It includes features for experiment tracking, model versioning, and deployment, enabling developers to track and compare experiments, package models into reproducible runs, and manage model deployment across multiple environments.
Exploring Open-Source Alternatives to Landing AI for Robust MLOps
18 projects | dev.to | 13 Dec 2023

Platforms such as MLflow monitor the development stages of machine learning models. In parallel, Data Version Control (DVC) brings version control system-like functions to the realm of data sets and models.
cascade alternatives - clearml and MLflow
3 projects | 1 Nov 2023
EL5: Difference between OpenLLM, LangChain, MLFlow
2 projects | /r/LLMDevs | 19 Jun 2023

MLFlow - http://mlflow.org
Explain me how websites like Dall-E, chatgpt, thispersondoesntexit process the user data so quickly
1 project | /r/dataengineering | 17 Jun 2023
[D] What licensed software do you use for machine learning experimentation tracking?
1 project | /r/MachineLearning | 11 Jun 2023
Exploring MLOps Tools and Frameworks: Enhancing Machine Learning Operations
3 projects | dev.to | 6 Jun 2023

MLflow:
Options for configuration of python libraries - Stack Overflow
2 projects | /r/learnpython | 14 May 2023

In search for a tool that needs comparable configuration I looked into mlflow and found this. https://github.com/mlflow/mlflow/blob/master/mlflow/environment_variables.py There they define a class _EnvironmentVariable and create many objects out of it, for any variable they need. The get method of this class is in principle a decorated os.getenv. Maybe that is something I can take as orientation.
[D] Is there a tool to keep track of my ML experiments?
2 projects | /r/MachineLearning | 13 May 2023

I have been using DVC and MLflow since then DVC had only data tracking and MLflow only model tracking. I can say both are awesome now and maybe the only factor I would like to mention is that IMO, MLflow is a bit harder to learn while DVC is just a git practically.

What are some alternatives?

When comparing Surprise and MLflow you can also consider the following projects:

LightFM - A Python implementation of LightFM, a hybrid recommendation algorithm.

clearml - ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution

scikit-learn - scikit-learn: machine learning in Python

Sacred - Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

tensorflow - An Open Source Machine Learning Framework for Everyone

zenml - ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.

python-recsys - A python library for implementing a recommender system

guildai - Experiment tracking, ML developer tools

Crab - Crab is a ﬂexible, fast recommender engine for Python that integrates classic information ﬁltering recommendation algorithms in the world of scientiﬁc Python packages (numpy, scipy, matplotlib).

dvc - 🦉 ML Experiments and Data Management with Git

Keras - Deep Learning for humans

Surprise vs LightFM MLflow vs clearml Surprise vs scikit-learn MLflow vs Sacred Surprise vs tensorflow MLflow vs zenml Surprise vs python-recsys MLflow vs guildai Surprise vs Crab MLflow vs dvc Surprise vs Keras MLflow vs tensorflow

Compare Surprise vs MLflow and see what are their differences.

Surprise

MLflow

Surprise

MLflow

What are some alternatives?