transcript.fish
Filestash
transcript.fish | Filestash | |
---|---|---|
3 | 109 | |
16 | 9,474 | |
- | - | |
8.8 | 9.3 | |
9 days ago | 7 days ago | |
TypeScript | JavaScript | |
- | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
transcript.fish
-
Ask HN: Tell us about your project that's not done yet but you want feedback on
I have been working on this podcast transcription project for a couple months and it's been super rewarding.
I listen to a podcast called No Such Thing As A Fish, where some researchers talk about their favorite facts they learned that week. Then they riff on it and are generally smart and funny. I listened to the series so many times that I decided I wanted to listen to the show on shuffle, not at the episode level, but at the fact level.
Since I have been playing around with whisper.cpp in python this seemed like a perfect way to combine some technologies I've been wanting to play with.
I ran whisper over the entire podcast and transcribed all the episodes. I had to do this multiple times because I kept messing up. It eventually took like 7 straight days of my M1 processing to get through ~490 episodes.
4 million words, and an 800Mb SQLite database later, I got the transcriptions done and have put up a nice site for searching through the data.
https://transcript.fish
Now I just need to figure out the rest. Breaking it up into facts. Getting the audio working. Highlighting and linking to words, phrases, etc.
Some cool info about the process so far:
1. The SQLite database is chunked up and stored as static files, and the frontend queries the static files directly using HTTP range requests, so it only downloads a couple hundreds kbs when querying.
2. I've been proper using ChatGPT 3.5 free version to help me write python and SQL. It's been pretty game changing as I feel basically no pain from not knowing what I'm doing.
The code is here: https://github.com/noman-land/transcript.fish
Please help if you know how to get whisper speaker diarization working!! I would really appreciate the help.
-
Cloud Backed SQLite
Interesting, thanks for the tip! I've never heard of lockdown mode. Unfortunately I don't own an iPhone. Any chance you could post an issue with steps to repro?
https://github.com/noman-land/transcript.fish/issues
Thank you!
Filestash
- Ask HN: What Underrated Open Source Project Deserves More Recognition?
-
A list of SaaS, PaaS and IaaS offerings that have free tiers of interest to devops and infradev
Filestash — A Dropbox-like file manager that connects to a range of protocols and platforms: S3, FTP, SFTP, Minio, Git, WebDAV, Backblaze, LDAP and more.
-
Ask HN: What apps have you created for your own use?
I made https://github.com/mickael-kerjean/filestash out of the need to collaborate on org mode documents with non emacs users. Once the first release was done, I got to reflect on the infamous top comment of the Dropbox HN to make an attempt at abstracting the storage aspect of Dropbox so those org document could be made stored on a FTP server, SFTP, S3, ....
-
Ask HN: Experience using your user's Google Drive instead of a database?
> we need an abstraction for just this. "Bring your own storage"
I made exactly this: https://github.com/mickael-kerjean/filestash and there's an API from which you can abstract any kind of storage: S3, SFTP, FTP, GIT, WebDav, Samba, Local FS, NFS, Backblaze, Storj, Artifactory, .... There's even some funky ones like Mysql from which you have an abstraction where first level folders are databases, second level folders are tables and files are the actual rows
-
Let's learn how modern JavaScript frameworks work by building one
Yes, I rewrote my react app onto vanilla JS using nothing else than rxjs, didn't have the time to document it all yet but it looks like this: https://github.com/mickael-kerjean/filestash/blob/master/pub...
-
Found the ultimate Nextcloud / Owncloud replacement!
I'm not familiar with Cloudreve, but FileStash is a similar application often recommended on this subreddit.
-
HTML Web Components
I do use them on my OSS work (https://github.com/mickael-kerjean/filestash/tree/master/pub...) which is used by many thousands of people
-
UI frameworks are stuck in the last decade
- [2] current state of the rewrite where you can see this pattern in action https://github.com/mickael-kerjean/filestash-rewrite/tree/ma...
-
Ask HN: Tell us about your project that's not done yet but you want feedback on
https://github.com/mickael-kerjean/filestash
This is what I wish Dropbox was, a simple layer that make interacting with your FTP server easy so nobody has to own your data. The end game is both to be feature complete with Dropbox and be able to change every aspect of the application through plugin so everyone can get out what they want from it.
-
Meta pledges Three-Year sponsorship for Python if GIL removal is accepted
> but I don't think its the companies responsibility to give back to open source just because they use it
As someone who does quite a bit of OSS, the reality is most people are asking for things but aren't willing to pay for it. Take Microsoft, I had one of their employee asking me to support their azure stuff: https://github.com/mickael-kerjean/filestash/issues/180. When I found out the dude was actually employed by Microsoft, he started to talk some nonsense and ended up running away.
What are some alternatives?
EmailFlare - Send emails from your domain through Cloudflare for free. Self host on your account.
filemanager - 📂 Web File Browser
dashflare - An unofficial Cloudflare dashboard built on top of Cloudflare API.
SFTPGo - Full-featured and highly configurable SFTP, HTTP/S, FTP/S and WebDAV server - S3, Google Cloud Storage, Azure Blob
YourVision - AI-powered image editor
filegator - Powerful Multi-User File Manager
audioflare - An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and translate any audio file.
minio - The Object Store for AI Data Infrastructure
web-tables-demo
h5ai - HTTP web server index for Apache httpd, lighttpd and nginx.
cloudflare-for-speed-and-security - Scale faster and save big with the global edge cloud platform - https://kerkour.com/cloudflare-for-speed-and-security
Apaxy - a simple, customisable theme for your apache directory listing