-
- Markup to generate HTML I used in courses that I taught. The markup generated links according to an outline (next/prev page, next/prev section) and a few other navigation related things.
- Python-based pipe-objects-not-strings shell with cluster and database support deeply integrated. Twenty years later, it has turned into Marcel (https://marceltheshell.org), and I’m still the only one using it AFAIK.
-
Scout Monitoring
Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
-
I maintain a fork of tt-rss[0] that I use to follow blogs, podcasts, and YouTube. I wrote a podcatcher that used the back-end database, too.
I forked it back in 2005 because the maintainer wasn't interested in the direction my patches were going. My version has diverged dramatically from the current version.
I have no idea how many hours I've put into it over 19 years. It has needed surprisingly little care and feeding (which I'd attribute to it being a simple PHP app).
I've used it nearly daily in the last 19 years.
[0] https://tt-rss.org/
-
I wrote myself a debug tool at 2007. It's a piece of GUI "to borrow". A Windows app that catches Windows messages addressed to it and shows things in its own windows. It can show numbers you give it, give numbers back, count calls, and measure time. So it's a kind of debugger/profiler/GUI.
I open sourced it 8 years ago but it was not the original intention. I wrote it for myself. https://github.com/akalenuk/16counters
I wrote it in MASM32. It's therefore a tiny .exe file of about 7 KB. I spent maybe a few hours initially, but I've been adding features one-by-one for several years. I use it a few times a month.
-
I’ve got two great examples of this. Both I use to help control DNS on my local network. They’re open source, but I doubt anyone else is using it, and I’m fine with that.
unifi-dns-scraper[0]: a simple tool that logs into my Unifi console to get all the hosts and then creates a hosts file that my local DNS servers can use.
unifi-doh-blocker[1]: as part of my efforts to better control my network, I don’t want random devices ignoring my local DNS by using DoH. This gets various lists of public DNS over HTTPS servers and updates a blocklist on my Unifi Dream Machine Pro. With a few other firewall rules this essentially forces all my DNS through local servers which then do encrypted DNS queries to a third party DNS service.
These tools make me happy and were fun to write.
[0] https://github.com/pridkett/unifi-dns-scraper
-
-
I made https://github.com/notAI-tech/fastDeploy for ease of use of deploying Deep learning models.
Except me and the team at my company nobody uses it.
After I made it lot of solutions came in this space, but IMO this is the easiest one to use still.
-
I’ve spent five years and 100KLoC building https://github.com/kyleu/projectforge which helps generate and update other Golang projects I maintain. I use it daily.
-
InfluxDB
Purpose built for real-time analytics at any scale. InfluxDB Platform is powered by columnar analytics, optimized for cost-efficient storage, and built with open data standards.
-
The static site generator for my website: http://beza1e1.tuxen.de/gen.py
A news bot: https://github.com/qznc/mrktws-news (the output is public, does it count?)
A TiddlyWiki server: https://github.com/qznc/tiddlywiki-py
Such stuff usually costs me a few frantic evenings to build the first version and then minor maintenance.
-
The static site generator for my website: http://beza1e1.tuxen.de/gen.py
A news bot: https://github.com/qznc/mrktws-news (the output is public, does it count?)
A TiddlyWiki server: https://github.com/qznc/tiddlywiki-py
Such stuff usually costs me a few frantic evenings to build the first version and then minor maintenance.
-
I made a CLI tool called kilojoule that is similar to jq. I addition to the normal suite of JSON manipulations, it also has support for a couple of other file formats and can call other shell commands.
https://github.com/stevenlandis/kilojoule
I’ve found it to be a pleasant multi tool for interacting with the shell when I need something a little more than bash.
It took a couple of weekends and evenings to get working but was a really fun way to learn about parsers, interpreters and Rust.
-
Over the past two years I've been working on tooling that allows me to delink programs back into object files. What started out as a bunch of Jython scripts is nowadays a full-blown Ghidra extension that can export working object files from a program selection in two mouse clicks. I'm using it as part of a video game decompilation project, but it also enables a whole bunch of other use cases I've documented on my blog.
It's not that it is meant for my use only (any capable reverse-engineer familiar with Ghidra should be able to pick it up and use it) nor that it will never see the light of the day (it's open-source). However, it is such an esoteric capability and outright heresy according to computer sciences that I'm having a hard time just finding people who can wrap their heads around the concept, let alone people who would actually want to use this. Simply put, it's so far out there that I don't even know where or how to advertise it in the first place.
A couple of people did end up reaching out to me in the last couple of weeks about it, so it might be on the cusp of sprouting a very tiny user base. Still, I've made it for my own use and until very recently I've never expected anybody else would use it.
If someone wants to check out the dark magic: https://github.com/boricj/ghidra-delinker-extension (disclaimer: might trigger nightmares to linker developers).
-
I pretty much rebuilt my bioinformatics stack from scratch in Go, and I’m pretty proud of it! It does things that no other library, even python libraries, do for synthetic biology applications.
I’m pretty sure nobody else uses it, but I use it a lot for DNA design work for my company https://github.com/koeng101/dnadesign
-
I built a hydroponic garden as a covid hobby. I wrote software to maintain the garden, water it on schedule, apply ph changes to the water, turn lights on / off, humidify, as well as monitor statistics (temperature, humidity, water temperature, water ph, water conductivity).
Rough guess would be that I spent 50 hours actually working on the software.
There's a handful of raspberry pis involved. I wrote everything in elixir / used https://nerves-project.org for a bit of. The dashboard is written with phoenix live view. One of the raspberry pis is the "brain" and basically runs the dashboard and controls devices. The devices are all in an elixir cluster. I also run timescale db for some basic history of metrics.
Once I start a grow I don't use it that much actively, but it passively runs all the time. I check in every few days or week to make sure nutrients are looking good.
I've grown strawberries, lettuce, jalapenos, and cayenne peppers.
-
dash-mpd-rs
Rust library for parsing, serializing and downloading media content from a DASH MPD manifest.
I have a smartphone app that scrapes replay TV listings for a few shows that I like to watch at the gym and allows me to download the low-quality media stream to the phone to view offline ad-free.
I released the Rust library that downloads and reassembles media segments from a DASH stream (https://github.com/emarsden/dash-mpd-rs). Won't release the web scraping bits because they are against website terms and conditions, and because annoying countermeasures will be implemented if too many people use them.
-
-
I have a 3-letter domain that I use as my personal playground. It features:
- One-click upload of text, clipboard, and files. So from any computer, I can type xyz.com + ctrl+v to upload the clipboard, or drag and drop a file, etc. It's the only feature available to unauthenticated sessions, but I'm not too worried because you don't get a URL back.
- A list of uploaded files, with buttons to copy link, download, delete, or generate QR code.
- Notifications (for authenticated sessions) when an upload is made.
- Simplified RSS feed reader (also custom[1]). It scans my feeds every 10 minutes, and displays the URLs of new items, along with a button to mark them as read.
- A Server-sent-events channel[2], and a UI that displays visitors in real time. The channel allows me to send redirects/file downloads to any visitor, so I'm constantly asking people to visit xyz.com when I need to hand them something digitally.
- Youtube video downloader (powered by yt-dlp), along with audio extractor and pitch+speed correction for songs.
Once in a while I add or remove widgets according to my needs. The whole thing runs in custom pair of Go and Python servers, developed slowly over the course of years, and I use it multiple times a day.
[1]: https://github.com/boppreh/feeder
[2]: https://github.com/boppreh/server-sent-events
-
I have a 3-letter domain that I use as my personal playground. It features:
- One-click upload of text, clipboard, and files. So from any computer, I can type xyz.com + ctrl+v to upload the clipboard, or drag and drop a file, etc. It's the only feature available to unauthenticated sessions, but I'm not too worried because you don't get a URL back.
- A list of uploaded files, with buttons to copy link, download, delete, or generate QR code.
- Notifications (for authenticated sessions) when an upload is made.
- Simplified RSS feed reader (also custom[1]). It scans my feeds every 10 minutes, and displays the URLs of new items, along with a button to mark them as read.
- A Server-sent-events channel[2], and a UI that displays visitors in real time. The channel allows me to send redirects/file downloads to any visitor, so I'm constantly asking people to visit xyz.com when I need to hand them something digitally.
- Youtube video downloader (powered by yt-dlp), along with audio extractor and pitch+speed correction for songs.
Once in a while I add or remove widgets according to my needs. The whole thing runs in custom pair of Go and Python servers, developed slowly over the course of years, and I use it multiple times a day.
[1]: https://github.com/boppreh/feeder
[2]: https://github.com/boppreh/server-sent-events
-
Does not strictly match the "will never see the light of the day for anyone else."
But forked an ActivityPub server (GoToSocial) to proxy Rss feeds (https://github.com/Timshel/RssTooter).
Spent a couple of days on it initially (but had made something similar for Nitter before).
-
I run a small Dwarf Fortress podcast, and I didn't like the transcription options when we started a few years ago, so I wrote some python glue to do diarization (separate out speakers) and transcription using a torchaudio project, and either whisper or openai depending on how I'm feeling that day. Works surprisingly well, with timestamps and clean-up:
https://github.com/drewbuschhorn/a_strange_mood_podcast_tran...
vs
https://astrangemoodpodcast.com/2023/01/17/episode-1-is-dwar...
-
I run a small Dwarf Fortress podcast, and I didn't like the transcription options when we started a few years ago, so I wrote some python glue to do diarization (separate out speakers) and transcription using a torchaudio project, and either whisper or openai depending on how I'm feeling that day. Works surprisingly well, with timestamps and clean-up:
https://github.com/drewbuschhorn/a_strange_mood_podcast_tran...
vs
https://astrangemoodpodcast.com/2023/01/17/episode-1-is-dwar...
-
I would add videos I liked to the "Liked Videos" or other playlist areas on my YouTube account. I'd return to find the video months or years later, only to see that it may have been removed (usually due to copyright or other channel closure). This video removal issue also extended to other platforms. I found this annoying.
To avoid this, I made an extremely simple video platform. I save all my videos there instead. It's still my hobby project that I use the most.
The code is at https://github.com/AlbinoDrought/creamy-videos
There's an optional youtube-dl importer UI at https://github.com/AlbinoDrought/creamy-videos-importer (separated for easier updates)
The importer repo also contains a Firefox extension so any target can be right-clicked -> Import to Creamy Videos -> Select a set of tags -> sent to the importer UI, youtube-dl, and then eventually video storage: https://github.com/AlbinoDrought/creamy-videos-importer/tree...
-
I would add videos I liked to the "Liked Videos" or other playlist areas on my YouTube account. I'd return to find the video months or years later, only to see that it may have been removed (usually due to copyright or other channel closure). This video removal issue also extended to other platforms. I found this annoying.
To avoid this, I made an extremely simple video platform. I save all my videos there instead. It's still my hobby project that I use the most.
The code is at https://github.com/AlbinoDrought/creamy-videos
There's an optional youtube-dl importer UI at https://github.com/AlbinoDrought/creamy-videos-importer (separated for easier updates)
The importer repo also contains a Firefox extension so any target can be right-clicked -> Import to Creamy Videos -> Select a set of tags -> sent to the importer UI, youtube-dl, and then eventually video storage: https://github.com/AlbinoDrought/creamy-videos-importer/tree...
-
It's more "nobody else is interested" than "it's not out in the open", but I've made my own structured data format implemented as a Rust serializer https://github.com/rsaarelm/idm and am using it for a growing collection of command-line tools for managing personal notes written as outline files https://github.com/rsaarelm/idm-tools and to run a static site generator https://github.com/rsaarelm/blog-engine . I'm also writing a game that uses IDM as the data serialization format.
Idea for the format was that you can write structured data with a really minimal syntax if you have an external type schema running the parsing, and the syntax emerged from the line-and-indentation based outline note files I'd started writing for myself. It took some months of work and planning and a couple rewrites to get the core IDM library working right. The tools and site generator were simple and straightforward in comparison.
-
It's more "nobody else is interested" than "it's not out in the open", but I've made my own structured data format implemented as a Rust serializer https://github.com/rsaarelm/idm and am using it for a growing collection of command-line tools for managing personal notes written as outline files https://github.com/rsaarelm/idm-tools and to run a static site generator https://github.com/rsaarelm/blog-engine . I'm also writing a game that uses IDM as the data serialization format.
Idea for the format was that you can write structured data with a really minimal syntax if you have an external type schema running the parsing, and the syntax emerged from the line-and-indentation based outline note files I'd started writing for myself. It took some months of work and planning and a couple rewrites to get the core IDM library working right. The tools and site generator were simple and straightforward in comparison.
-
It's more "nobody else is interested" than "it's not out in the open", but I've made my own structured data format implemented as a Rust serializer https://github.com/rsaarelm/idm and am using it for a growing collection of command-line tools for managing personal notes written as outline files https://github.com/rsaarelm/idm-tools and to run a static site generator https://github.com/rsaarelm/blog-engine . I'm also writing a game that uses IDM as the data serialization format.
Idea for the format was that you can write structured data with a really minimal syntax if you have an external type schema running the parsing, and the syntax emerged from the line-and-indentation based outline note files I'd started writing for myself. It took some months of work and planning and a couple rewrites to get the core IDM library working right. The tools and site generator were simple and straightforward in comparison.
-
- Artwork Framer[2]: Script using Blender that converts .png into a framed 3D model. You can even adjust the frame if you want and then export it as a model to be used in VR (works even in Apple Vision)
[1] https://github.com/bartaxyz/denmark-swimming
-
-
gbfs
Documentation for the General Bikeshare Feed Specification, a standardized data feed for shared mobility system availability. Maintained by MobilityData (by MobilityData)
I wrote a tool in Haskell that continually polls my city's (Toronto) bike share API and dumps it into a database, along with a web server component that does a bunch of on-the-fly analysis of the data and serves that up as various visualizations and stats.
I got so frustrated with my city's bike share program one night and I wanted to figure out whether the problems I had been having were widespread or not.
Surprisingly enough there's a standard for bike share APIs [0] that my city adheres to. I'd been looking for an excuse to learn Haskell and figured the best way to stay motivated was to make useful tool for myself.
I've been working on it for the better part of a year at this point (probably >200 hours) and I use it roughly daily. After gathering a few months of station-level data it became clear enough that the system (specifically, how the e-bikes are handled) has some pretty serious issues.
I ended up sending the city a couple of freedom of information requests for some data I was missing and for the contract between them and the company that operates the system. The contract says that the operator gets paid a fee for every dead e-bike they move to a charging station - regardless of whether or not that bike actually manages to charge.
The city owns around 2000 e-bikes and there are frequently times where a whopping 20 are available - the rest are out of battery, sitting in docking stations for multiple days. Occasionally the operator shuffle the bikes over to the handful electrified charging stations, and about half the time the bikes don't charge (they just completely fill the stations, meaning nobody can use them - one of those stations happens to be right outside my partner's place). Then 4-7 days later they come around in a truck and get rid of the bikes, and the whole cycle repeats again in a couple days.
Basically, we're getting fleeced by the system operator. It isn't even clear to me if it's intentional or not, or just the usual combination of lax municipal oversight and an apathetic operator.
I've been meaning to send my data and the contract over to a few reporters who care about that sort of thing, since (a) the bike share system is genuinely a great way to get around (when it works) and (b) financially it's really good value for the city [1], and I'd like to see the problems fixed (especially given that the city is planning on massively expanding the e-bike fleet).
[0]: General Bikeshare Feed Specification: https://github.com/MobilityData/gbfs
[1]: The per-trip subsidy is $0.67, versus $4.08 for a trip on our transit system (!)
-
Weave: https://github.com/rberenguel/weave
It's hard to explain what it is. Think personal knowledge manager text editor inspired by Acme. The readme and video there are very outdated, I add features faster than I update that (as the only user this is what I get). I post videos of new features on twitter though, as a reminder to myself of where I passed through
-
All the time!
My most recent were tools to improve the AWS commandline experience for s3 and Athena
https://github.com/heuermh/cooper
https://github.com/heuermh/sea-eagle
Both are available via Homebrew
https://github.com/heuermh/homebrew-parquet-tools
I would next like to improve the TUI experience for tabular data, e.g. using the Charm_ Bubbles/Gum table component, but I have yet to investigate how the JVM <--> Go interaction might work.
-
All the time!
My most recent were tools to improve the AWS commandline experience for s3 and Athena
https://github.com/heuermh/cooper
https://github.com/heuermh/sea-eagle
Both are available via Homebrew
https://github.com/heuermh/homebrew-parquet-tools
I would next like to improve the TUI experience for tabular data, e.g. using the Charm_ Bubbles/Gum table component, but I have yet to investigate how the JVM <--> Go interaction might work.
-
All the time!
My most recent were tools to improve the AWS commandline experience for s3 and Athena
https://github.com/heuermh/cooper
https://github.com/heuermh/sea-eagle
Both are available via Homebrew
https://github.com/heuermh/homebrew-parquet-tools
I would next like to improve the TUI experience for tabular data, e.g. using the Charm_ Bubbles/Gum table component, but I have yet to investigate how the JVM <--> Go interaction might work.
-
https://github.com/kitallis/WAP
hisaab – expense tracking and categorizing because my bank doesn't do apis and sends credit card reports in PDFs
-
-
It's a glorified local setup, running in a cloud free tier.
- Oracle Cloud Free Tier[1] for a Ubuntu VPS (4 ARM cores, 24 GB RAM). Surprisingly pleasant and reliable, given who's offering and for how much ($0). It used to be a FreeBSD VPS on DigitalOcean, until they kept screwing up their FreeBSD support and bricking my machines.
- Caddy[2] web server with Let's Encrypt certificates, working as reverse proxy.
- A Go server for HTTP (static files, uploads, maintaining server-sent-event channels).
- A Python server for the widgets, communicating with the Go server via events exchanged in a socket.
- Source code edited manually in-place (SSH or SSHFS[3], with Git) and restarted as needed. I know, I know, awful practice. But as I'm the only user, uptime during development is not a concern.
- Startup is handled by a @reboot cronjob and a bash one liner.
- Text files for "structured storage" (RSS feed items, authenticated sessions, mapping of uploaded file names).
As horrible as it might all sound, it has survived ten years and two cloud vendors. Nowadays I might containerize it, or rewrite as one Rust server, but I think I made the right choices at the time.
8/10 given the unusual requirements.
[1] https://www.oracle.com/cloud/free/
[2] https://caddyserver.com/
[3] https://github.com/winfsp/sshfs-win
-
Sure, it's public: https://github.com/AdrianVollmer/Congruence
-
gcodepreview
OpenSCAD library for moving a tool in lines and arcs so as to model how a part would be cut using G-Code or described as a DXF.
Couldn't find any CAM software which works as I want, so I've been working on implementing G-code and DXF export (so as to match tool movement) from OpenSCAD for a while now --- got a big boost when a Python-enabled version was made:
https://pythonscad.org/
and now have a pretty much workable tool:
https://github.com/WillAdams/gcodepreview
(Re)wrote it using Literate Programming techniques, probably hundreds of hours (I've been at this for years and am not a particularly good programmer) and I use it on pretty much every project I make which isn't just drawing stuff in a Bézier curve drawing program.
-
Assistive technology, I made a head-tracking mouse replacement. I’ve now used it almost every day at work for a decade. It’s open source but AFAIK no one else is using it:
https://github.com/aranchelk/headmouse
-
Sandstorm
Sandstorm is a self-hostable web productivity suite. It's implemented as a security-hardened web app package manager.
This hits close to home. I have something similar, except it's not as featureful and it's ridiculously purpose-built for me. E.g. I have one page in this app for generating PDFs of a very specific type that I need. I have another page that parses CSVs from my bank. These aren't tools I use regularly, I just need them occasionally, but they're all in one repo in one project.
There are of course apps that do each of these, but when you start integrating them together, with their different ways of doing auth, different kinds of APIs, etc. things get so complicated. I worked in a company that integrated with some third parties and those integrations were easily 50% of the workload for the dev team and maybe 90% of the workload for support. There is so much effort being wasted on having different systems for different things.
I feel like there's an idea out there that will solve all this for open source--maybe involving stitched GraphQL APIs, OIDC+JWT for auth, etc. Something kind of like Sandstorm[1] except with Sandstorm the different apps weren't necessarily built with Sandstorm in mind. In this system there would be a centralized identity management system like Keycloak that manages concepts like users, roles and apps, and everything else delegates auth to that centralized system.
[1] https://sandstorm.io/
-
https://github.com/jonquark/InkyCal
-
syft
CLI tool and library for generating a Software Bill of Materials from container images and filesystems
I can wholeheartedly recommend Syft.[0]
Decoupling SBOM data collection from vulnerability tracking (with your tool of choice) is a nice capability.
0: https://github.com/anchore/syft
-
I made Highscore (https://github.com/jsdw/highscore) just for the purpose of keeping track of high scores with somebody on a few karaoke games that didn't track individual peoples scores.. never been used for anything else :)
-
I have a lot of things like this. Mostly one-offs for bizarre stuff, like a FUSE plugin that saves data to FAT-formatted image files. Some things, though, I consider useful enough to post on Github for people who might need it. My favorite (and most-used, personally) is an app I made to copy files off of my Panasonic cameras. They still work perfectly well, but the official app no longer works to send pics via wifi. I had to reverse-engineer the entire protocol stack from network traffic, it's a doozy. I ended up implementing SSDP from scratch and faking DLNA server support, along with the miserable SOAP protocol. It's a beautiful mess, but it works. Got me back into Android development too.
https://github.com/kerbalno15/shutterlink
-
WDoc
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable, under developpement
I'm a medical student with FOMO and who loves AI stuff. I have about 70 public github repos that nobody uses even though I take pride in at least trying to use good code and documenting everything.
Here's a highlight (edit: more like an ego dump)
I couldn't keep up with my news so I made the perfect summarizer that goes through the thought process of the author : https://github.com/thiswillbeyourgithub/DocToolsLLM/
I needed an AI based system that go through my anki cards, but might as well make it able to read dozens of file formats. Now I can put entire medical youtube playlists, conferences, anki databases, hundreds of PDFs and ask a single question across all of them at once : also https://github.com/thiswillbeyourgithub/DocToolsLLM/
I couldn't keep up with my anki review so I made an embedding based review reranker (one of my first ever project!) : https://github.com/thiswillbeyourgithub/AnnA_Anki_neuronal_A...
I needed to make my sleep more efficient : micropython to the rescue : https://github.com/thiswillbeyourgithub/sleep_tracker_pineti...
I wanted to store everything and anything in Logseq ttps://github.com/thiswillbeyourgithub/LogseqMarkdownParser and https://github.com/thiswillbeyourgithub/LogseqPDFImporter
I was so late for my medical exams I needed to learn FAST : I made an incredible app that turns hours long recordings of you talking aloud while reading your PDF into hundreds of flashcards that learns to mimic your style by using previous validated answers etc. Using stt and LLM. This one is not out yet but will be.
Also I made scripts that automatically reformulate old bad anki cards into more appropriate ones.
As well as creating complex mnemonics stories and images automatically added to your anki cards. Not yet public.
I needed help to sort through my many todos so I made an ELO based reranker made to be used in the CLI (careful, wet paint) : https://github.com/thiswillbeyourgithub/mini_LiTOY
I wanted my anki OCR to be more legible so I made tesseracts try to preserve spacing : https://github.com/thiswillbeyourgithub/OCR_with_format
I lost my voice for some time but needed to be heard so I made speech.sh : https://github.com/thiswillbeyourgithub/speech.sh
And quite a few others.
-
AnnA_Anki_neuronal_Appendix
Using machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity
I'm a medical student with FOMO and who loves AI stuff. I have about 70 public github repos that nobody uses even though I take pride in at least trying to use good code and documenting everything.
Here's a highlight (edit: more like an ego dump)
I couldn't keep up with my news so I made the perfect summarizer that goes through the thought process of the author : https://github.com/thiswillbeyourgithub/DocToolsLLM/
I needed an AI based system that go through my anki cards, but might as well make it able to read dozens of file formats. Now I can put entire medical youtube playlists, conferences, anki databases, hundreds of PDFs and ask a single question across all of them at once : also https://github.com/thiswillbeyourgithub/DocToolsLLM/
I couldn't keep up with my anki review so I made an embedding based review reranker (one of my first ever project!) : https://github.com/thiswillbeyourgithub/AnnA_Anki_neuronal_A...
I needed to make my sleep more efficient : micropython to the rescue : https://github.com/thiswillbeyourgithub/sleep_tracker_pineti...
I wanted to store everything and anything in Logseq ttps://github.com/thiswillbeyourgithub/LogseqMarkdownParser and https://github.com/thiswillbeyourgithub/LogseqPDFImporter
I was so late for my medical exams I needed to learn FAST : I made an incredible app that turns hours long recordings of you talking aloud while reading your PDF into hundreds of flashcards that learns to mimic your style by using previous validated answers etc. Using stt and LLM. This one is not out yet but will be.
Also I made scripts that automatically reformulate old bad anki cards into more appropriate ones.
As well as creating complex mnemonics stories and images automatically added to your anki cards. Not yet public.
I needed help to sort through my many todos so I made an ELO based reranker made to be used in the CLI (careful, wet paint) : https://github.com/thiswillbeyourgithub/mini_LiTOY
I wanted my anki OCR to be more legible so I made tesseracts try to preserve spacing : https://github.com/thiswillbeyourgithub/OCR_with_format
I lost my voice for some time but needed to be heard so I made speech.sh : https://github.com/thiswillbeyourgithub/speech.sh
And quite a few others.
-
I'm a medical student with FOMO and who loves AI stuff. I have about 70 public github repos that nobody uses even though I take pride in at least trying to use good code and documenting everything.
Here's a highlight (edit: more like an ego dump)
I couldn't keep up with my news so I made the perfect summarizer that goes through the thought process of the author : https://github.com/thiswillbeyourgithub/DocToolsLLM/
I needed an AI based system that go through my anki cards, but might as well make it able to read dozens of file formats. Now I can put entire medical youtube playlists, conferences, anki databases, hundreds of PDFs and ask a single question across all of them at once : also https://github.com/thiswillbeyourgithub/DocToolsLLM/
I couldn't keep up with my anki review so I made an embedding based review reranker (one of my first ever project!) : https://github.com/thiswillbeyourgithub/AnnA_Anki_neuronal_A...
I needed to make my sleep more efficient : micropython to the rescue : https://github.com/thiswillbeyourgithub/sleep_tracker_pineti...
I wanted to store everything and anything in Logseq ttps://github.com/thiswillbeyourgithub/LogseqMarkdownParser and https://github.com/thiswillbeyourgithub/LogseqPDFImporter
I was so late for my medical exams I needed to learn FAST : I made an incredible app that turns hours long recordings of you talking aloud while reading your PDF into hundreds of flashcards that learns to mimic your style by using previous validated answers etc. Using stt and LLM. This one is not out yet but will be.
Also I made scripts that automatically reformulate old bad anki cards into more appropriate ones.
As well as creating complex mnemonics stories and images automatically added to your anki cards. Not yet public.
I needed help to sort through my many todos so I made an ELO based reranker made to be used in the CLI (careful, wet paint) : https://github.com/thiswillbeyourgithub/mini_LiTOY
I wanted my anki OCR to be more legible so I made tesseracts try to preserve spacing : https://github.com/thiswillbeyourgithub/OCR_with_format
I lost my voice for some time but needed to be heard so I made speech.sh : https://github.com/thiswillbeyourgithub/speech.sh
And quite a few others.
-
I'm a medical student with FOMO and who loves AI stuff. I have about 70 public github repos that nobody uses even though I take pride in at least trying to use good code and documenting everything.
Here's a highlight (edit: more like an ego dump)
I couldn't keep up with my news so I made the perfect summarizer that goes through the thought process of the author : https://github.com/thiswillbeyourgithub/DocToolsLLM/
I needed an AI based system that go through my anki cards, but might as well make it able to read dozens of file formats. Now I can put entire medical youtube playlists, conferences, anki databases, hundreds of PDFs and ask a single question across all of them at once : also https://github.com/thiswillbeyourgithub/DocToolsLLM/
I couldn't keep up with my anki review so I made an embedding based review reranker (one of my first ever project!) : https://github.com/thiswillbeyourgithub/AnnA_Anki_neuronal_A...
I needed to make my sleep more efficient : micropython to the rescue : https://github.com/thiswillbeyourgithub/sleep_tracker_pineti...
I wanted to store everything and anything in Logseq ttps://github.com/thiswillbeyourgithub/LogseqMarkdownParser and https://github.com/thiswillbeyourgithub/LogseqPDFImporter
I was so late for my medical exams I needed to learn FAST : I made an incredible app that turns hours long recordings of you talking aloud while reading your PDF into hundreds of flashcards that learns to mimic your style by using previous validated answers etc. Using stt and LLM. This one is not out yet but will be.
Also I made scripts that automatically reformulate old bad anki cards into more appropriate ones.
As well as creating complex mnemonics stories and images automatically added to your anki cards. Not yet public.
I needed help to sort through my many todos so I made an ELO based reranker made to be used in the CLI (careful, wet paint) : https://github.com/thiswillbeyourgithub/mini_LiTOY
I wanted my anki OCR to be more legible so I made tesseracts try to preserve spacing : https://github.com/thiswillbeyourgithub/OCR_with_format
I lost my voice for some time but needed to be heard so I made speech.sh : https://github.com/thiswillbeyourgithub/speech.sh
And quite a few others.
-
I'm a medical student with FOMO and who loves AI stuff. I have about 70 public github repos that nobody uses even though I take pride in at least trying to use good code and documenting everything.
Here's a highlight (edit: more like an ego dump)
I couldn't keep up with my news so I made the perfect summarizer that goes through the thought process of the author : https://github.com/thiswillbeyourgithub/DocToolsLLM/
I needed an AI based system that go through my anki cards, but might as well make it able to read dozens of file formats. Now I can put entire medical youtube playlists, conferences, anki databases, hundreds of PDFs and ask a single question across all of them at once : also https://github.com/thiswillbeyourgithub/DocToolsLLM/
I couldn't keep up with my anki review so I made an embedding based review reranker (one of my first ever project!) : https://github.com/thiswillbeyourgithub/AnnA_Anki_neuronal_A...
I needed to make my sleep more efficient : micropython to the rescue : https://github.com/thiswillbeyourgithub/sleep_tracker_pineti...
I wanted to store everything and anything in Logseq ttps://github.com/thiswillbeyourgithub/LogseqMarkdownParser and https://github.com/thiswillbeyourgithub/LogseqPDFImporter
I was so late for my medical exams I needed to learn FAST : I made an incredible app that turns hours long recordings of you talking aloud while reading your PDF into hundreds of flashcards that learns to mimic your style by using previous validated answers etc. Using stt and LLM. This one is not out yet but will be.
Also I made scripts that automatically reformulate old bad anki cards into more appropriate ones.
As well as creating complex mnemonics stories and images automatically added to your anki cards. Not yet public.
I needed help to sort through my many todos so I made an ELO based reranker made to be used in the CLI (careful, wet paint) : https://github.com/thiswillbeyourgithub/mini_LiTOY
I wanted my anki OCR to be more legible so I made tesseracts try to preserve spacing : https://github.com/thiswillbeyourgithub/OCR_with_format
I lost my voice for some time but needed to be heard so I made speech.sh : https://github.com/thiswillbeyourgithub/speech.sh
And quite a few others.
-
I'm a medical student with FOMO and who loves AI stuff. I have about 70 public github repos that nobody uses even though I take pride in at least trying to use good code and documenting everything.
Here's a highlight (edit: more like an ego dump)
I couldn't keep up with my news so I made the perfect summarizer that goes through the thought process of the author : https://github.com/thiswillbeyourgithub/DocToolsLLM/
I needed an AI based system that go through my anki cards, but might as well make it able to read dozens of file formats. Now I can put entire medical youtube playlists, conferences, anki databases, hundreds of PDFs and ask a single question across all of them at once : also https://github.com/thiswillbeyourgithub/DocToolsLLM/
I couldn't keep up with my anki review so I made an embedding based review reranker (one of my first ever project!) : https://github.com/thiswillbeyourgithub/AnnA_Anki_neuronal_A...
I needed to make my sleep more efficient : micropython to the rescue : https://github.com/thiswillbeyourgithub/sleep_tracker_pineti...
I wanted to store everything and anything in Logseq ttps://github.com/thiswillbeyourgithub/LogseqMarkdownParser and https://github.com/thiswillbeyourgithub/LogseqPDFImporter
I was so late for my medical exams I needed to learn FAST : I made an incredible app that turns hours long recordings of you talking aloud while reading your PDF into hundreds of flashcards that learns to mimic your style by using previous validated answers etc. Using stt and LLM. This one is not out yet but will be.
Also I made scripts that automatically reformulate old bad anki cards into more appropriate ones.
As well as creating complex mnemonics stories and images automatically added to your anki cards. Not yet public.
I needed help to sort through my many todos so I made an ELO based reranker made to be used in the CLI (careful, wet paint) : https://github.com/thiswillbeyourgithub/mini_LiTOY
I wanted my anki OCR to be more legible so I made tesseracts try to preserve spacing : https://github.com/thiswillbeyourgithub/OCR_with_format
I lost my voice for some time but needed to be heard so I made speech.sh : https://github.com/thiswillbeyourgithub/speech.sh
And quite a few others.
-
I'm a medical student with FOMO and who loves AI stuff. I have about 70 public github repos that nobody uses even though I take pride in at least trying to use good code and documenting everything.
Here's a highlight (edit: more like an ego dump)
I couldn't keep up with my news so I made the perfect summarizer that goes through the thought process of the author : https://github.com/thiswillbeyourgithub/DocToolsLLM/
I needed an AI based system that go through my anki cards, but might as well make it able to read dozens of file formats. Now I can put entire medical youtube playlists, conferences, anki databases, hundreds of PDFs and ask a single question across all of them at once : also https://github.com/thiswillbeyourgithub/DocToolsLLM/
I couldn't keep up with my anki review so I made an embedding based review reranker (one of my first ever project!) : https://github.com/thiswillbeyourgithub/AnnA_Anki_neuronal_A...
I needed to make my sleep more efficient : micropython to the rescue : https://github.com/thiswillbeyourgithub/sleep_tracker_pineti...
I wanted to store everything and anything in Logseq ttps://github.com/thiswillbeyourgithub/LogseqMarkdownParser and https://github.com/thiswillbeyourgithub/LogseqPDFImporter
I was so late for my medical exams I needed to learn FAST : I made an incredible app that turns hours long recordings of you talking aloud while reading your PDF into hundreds of flashcards that learns to mimic your style by using previous validated answers etc. Using stt and LLM. This one is not out yet but will be.
Also I made scripts that automatically reformulate old bad anki cards into more appropriate ones.
As well as creating complex mnemonics stories and images automatically added to your anki cards. Not yet public.
I needed help to sort through my many todos so I made an ELO based reranker made to be used in the CLI (careful, wet paint) : https://github.com/thiswillbeyourgithub/mini_LiTOY
I wanted my anki OCR to be more legible so I made tesseracts try to preserve spacing : https://github.com/thiswillbeyourgithub/OCR_with_format
I lost my voice for some time but needed to be heard so I made speech.sh : https://github.com/thiswillbeyourgithub/speech.sh
And quite a few others.
-
OCR_with_format
Wrapper around pytesseract to postprocess in a way that preserves spacing and formattings.
I'm a medical student with FOMO and who loves AI stuff. I have about 70 public github repos that nobody uses even though I take pride in at least trying to use good code and documenting everything.
Here's a highlight (edit: more like an ego dump)
I couldn't keep up with my news so I made the perfect summarizer that goes through the thought process of the author : https://github.com/thiswillbeyourgithub/DocToolsLLM/
I needed an AI based system that go through my anki cards, but might as well make it able to read dozens of file formats. Now I can put entire medical youtube playlists, conferences, anki databases, hundreds of PDFs and ask a single question across all of them at once : also https://github.com/thiswillbeyourgithub/DocToolsLLM/
I couldn't keep up with my anki review so I made an embedding based review reranker (one of my first ever project!) : https://github.com/thiswillbeyourgithub/AnnA_Anki_neuronal_A...
I needed to make my sleep more efficient : micropython to the rescue : https://github.com/thiswillbeyourgithub/sleep_tracker_pineti...
I wanted to store everything and anything in Logseq ttps://github.com/thiswillbeyourgithub/LogseqMarkdownParser and https://github.com/thiswillbeyourgithub/LogseqPDFImporter
I was so late for my medical exams I needed to learn FAST : I made an incredible app that turns hours long recordings of you talking aloud while reading your PDF into hundreds of flashcards that learns to mimic your style by using previous validated answers etc. Using stt and LLM. This one is not out yet but will be.
Also I made scripts that automatically reformulate old bad anki cards into more appropriate ones.
As well as creating complex mnemonics stories and images automatically added to your anki cards. Not yet public.
I needed help to sort through my many todos so I made an ELO based reranker made to be used in the CLI (careful, wet paint) : https://github.com/thiswillbeyourgithub/mini_LiTOY
I wanted my anki OCR to be more legible so I made tesseracts try to preserve spacing : https://github.com/thiswillbeyourgithub/OCR_with_format
I lost my voice for some time but needed to be heard so I made speech.sh : https://github.com/thiswillbeyourgithub/speech.sh
And quite a few others.
-
I'm a medical student with FOMO and who loves AI stuff. I have about 70 public github repos that nobody uses even though I take pride in at least trying to use good code and documenting everything.
Here's a highlight (edit: more like an ego dump)
I couldn't keep up with my news so I made the perfect summarizer that goes through the thought process of the author : https://github.com/thiswillbeyourgithub/DocToolsLLM/
I needed an AI based system that go through my anki cards, but might as well make it able to read dozens of file formats. Now I can put entire medical youtube playlists, conferences, anki databases, hundreds of PDFs and ask a single question across all of them at once : also https://github.com/thiswillbeyourgithub/DocToolsLLM/
I couldn't keep up with my anki review so I made an embedding based review reranker (one of my first ever project!) : https://github.com/thiswillbeyourgithub/AnnA_Anki_neuronal_A...
I needed to make my sleep more efficient : micropython to the rescue : https://github.com/thiswillbeyourgithub/sleep_tracker_pineti...
I wanted to store everything and anything in Logseq ttps://github.com/thiswillbeyourgithub/LogseqMarkdownParser and https://github.com/thiswillbeyourgithub/LogseqPDFImporter
I was so late for my medical exams I needed to learn FAST : I made an incredible app that turns hours long recordings of you talking aloud while reading your PDF into hundreds of flashcards that learns to mimic your style by using previous validated answers etc. Using stt and LLM. This one is not out yet but will be.
Also I made scripts that automatically reformulate old bad anki cards into more appropriate ones.
As well as creating complex mnemonics stories and images automatically added to your anki cards. Not yet public.
I needed help to sort through my many todos so I made an ELO based reranker made to be used in the CLI (careful, wet paint) : https://github.com/thiswillbeyourgithub/mini_LiTOY
I wanted my anki OCR to be more legible so I made tesseracts try to preserve spacing : https://github.com/thiswillbeyourgithub/OCR_with_format
I lost my voice for some time but needed to be heard so I made speech.sh : https://github.com/thiswillbeyourgithub/speech.sh
And quite a few others.
-
Ah, I understand your confusion, now. I didn't link to the actual project because it's not active, anymore. I guess I could link to the repo: https://github.com/Leftium/PhotoDrop
Finding a service that integrated with Dropbox the way I wanted probably took more time than the actual development.
-
https://github.com/martindemello/anthill
does fairly basic word search stuff, and there are lots of more sophisticated programs out there, but none of them run in a linux terminal and that's what i wanted. i have it constantly open in a tab, for various word game related things.
-
pacman-backup
:floppy_disk: Pacman Backup tool for off-the-grid updates via portable USB sticks or (mesh) LAN networks.
My github is littered with software I built as tools in the moment for myself.
The coolest thing in my opinion was [1] pacman-backup, which is a small (hacky) nodejs script to be able to download packages on one machine and be able to use pacman on another airgapped machine with the downloaded packages.
A couple months after I took down the repository I received a letter from the US where someone was asking me whether the repo is still maintained and whether or not it still works. As it turns out, the people of Cuba use Archlinux, too, and they used my tool to use flash drives to share updates.
I still think this is the coolest little tool I ever built, and I probably have to rewrite it in a more sane language (maybe go?) at some point.
[1] https://github.com/cookiengineer/pacman-backup
-
FalkorDB
A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).
That is cool! Would love to see how it works with FalkorDB as the Graph Database for the knowledge Graph https://github.com/FalkorDB/FalkorDB
-
I have good experience with
https://github.com/pivotal/LicenseFinder
This produces BOM with versions but rather than out of date it focuses on licenses which comes handy during acquisitions due diligence. Supports many languages
-
A git switching utility that switches git profiles and ssh keys between two accounts, written in powershell. Its a one click solution by clicking of two batch file wrappers one for original and one for override configs, I love it saving me faffing manually w git configs and ssh agents.
Built custom (atom) extensions for the Hydra vj environment (https://hydra.ojack.xyz/) that add in midi control (based on the midi spec for web which I am still amazed exists and is at least somewhat supported https://www.w3.org/TR/webmidi/) , some lazy eval helpers and custom css styling for the programming text for more crazy feedback visuals.
A custom build script that runs a client and server architecture and their related logs all split in the same Window terminal window.
Several highly idiosyncratic single purpose visualisation / vj apps in processing and openframeworks, some doing camera feedback stuff, some wiggling points and particulars around on music or blob detection.
-
I wrote something to bulk download my library from Bandcamp and actually packaged it as an excuse to mess with github actions: https://github.com/ReK42/bcamp-dl
-
-
Web scraping different platforms and sending results to my favorite messenger: https://github.com/magicxor/ExtensibleMessageBroker
-
Transformation of copied text using regular expressions: https://github.com/magicxor/ClipboardTransform
It usually takes 3-10 days to implement and polish the core functionality. Some apps I use on a daily basis (e.g., bots in my favorite messenger), and some 1-2 times per month.
-
-
I made a data takeout tool for the SaaS kindergarten communication platform used by my child's daycare. On it's own the software is really simple and could have been a few hundred lines of Python, but at some point I changed my mind and wanted to share this with other parents in case they find it, the things that make a project fit for publishing took 95% of actual work since you want it general and robust enough someone non-technical can use it.
https://github.com/karolistamutis/kidsnoter
-
I build https://maggick.fr/Spotify_RAS/ specifically for my need.
The tool select albums on my Spotify collection and add them to my playing queue. I am using the tool daily (I put 6-7 albums in the morning and I am setup for the day with a few hours of music).
It is just some Spotify API manipulation in Typescript and it was not that long to code (except some UI editing and tweaking, I am not good with front end :D). I am trying to open the application to everyone but the Spotify process is quit long.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives