the-algorithm-ml
jdupes
DISCONTINUED
Our great sponsors
the-algorithm-ml | jdupes | |
---|---|---|
36 | 44 | |
9,863 | 1,681 | |
0.4% | - | |
10.0 | 0.0 | |
6 months ago | 6 months ago | |
Python | C | |
GNU Affero General Public License v3.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
the-algorithm-ml
-
AOC said Elon Musk put his 'finger on the scale' during Turkey's presidential election and is 'concerned' it will set a precedent for the 2024 US election
Blog summarising the change: https://blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm
-
Twitter's For You Recommendation Algorithm
Twitter's announcement | Main GitHub Repo | ML GitHub Repo | Engineering Blog Post
- FLaNK Stack Weekly 3 April 2023
-
Analysis of Twitter algorithm code reveals social medium down-ranks tweets about Ukraine
They have made a major part of the code source available recently: the algorithm. However there are at least three issues with calling this an "open source Twitter":
-
Something tells me Twitter isn’t going to get anything useful from their GitHub issues
link
-
Twitter released the source code for the algorithm that recommends tweets
Yeah, that's a basic problem, plus the neural network in the middle is just a recipe for a neural network, without any of the training weights or anything, because it is the data, people's personal data, that this would embody, that is the source of wealth for the network.
- Twitter's Recommendation Algorithm
-
[News] Twitter algorithm now open source
Repo for their recommendation-engine: https://github.com/twitter/the-algorithm-ml
jdupes
-
fdupes: Identify or Delete Duplicate Files
200 lines of Nim [1] seems to run about 9X faster than the 8000 lines of C in fdupes on a little test dir I have. If you need C, I think jdupes [2] is faster as @TacticalCoder points out a couple of times here. In my testing, `dups` is usually faster than `jdupes`, though.
-
I'm amazed how I find anything & why I have so many dupes!
There's always the well-respected tool, Czkawka. Or, of the CLI is your thing, jdupes is a good option.
- Anyone know of any good file deduplication tools?
-
Johnny Decimal
My research into this many years ago turned out that jdupes was the right / best solution I could find for my usecase.
https://github.com/jbruchon/jdupes
Though that works fine from a script perspective I'd like some more interactive way of sorting directories etc. Identifying is just the first step, jdupes helps with linking the files (both soft and hard links comes with caveats though!) but that is mostly to save space, not to help in reorganisation.
-
Any good duplicate file finder for windows?
jdupes is a tuned fork of the well-known fdupes, and has Win32 releases.
- FLaNK Stack Weekly 3 April 2023
- Backing Up Data: Tips/Advice for Tons of Unorganized Data and Duplicate Files from Multiple Sources
-
Anyone running Bees? Or deduping data some other way?
If not bees, do you run other programs for deduping? I see jdupes has support for BTRFS, https://github.com/jbruchon/jdupes, and also duperemove, https://github.com/markfasheh/duperemove.
- Ask HN: Tool to find identical file subtrees scattered over disks
What are some alternatives?
fdupes - FDUPES is a program for identifying or deleting duplicate files residing within specified directories.
dupeguru - Find duplicate files
rmlint - Extremely fast tool to remove duplicates and other lint from your filesystem
rdfind - find duplicate files utility
duperemove - Tools for deduping file systems
czkawka - Multi functional app to find duplicates, empty folders, similar images etc.
fclones - Efficient Duplicate File Finder
phockup - Media sorting tool to organize photos and videos from your camera in folders by year, month and day.
btrfs-progs - Development of userspace BTRFS tools
cdecrypt - Decrypt Wii U NUS content — Forked from: https://code.google.com/archive/p/cdecrypt/
dduper - Fast block-level out-of-band BTRFS deduplication tool.
xxHash - Extremely fast non-cryptographic hash algorithm