telekinesis
pyxet
telekinesis | pyxet | |
---|---|---|
12 | 4 | |
16 | 33 | |
- | - | |
5.6 | 8.1 | |
29 days ago | 10 days ago | |
Python | Python | |
MIT License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
telekinesis
-
Show HN: Sort and Filter Ask HN Who's Hiring by LLM-Embedding Proximity
https://payperrun.com/%3E/search?displayParams={%22q%22:%22S...
(There are quite a few, you might want to filter by date!)
-
Ask HN: Who is hiring? (November 2023)
Hey everyone, I just made this thread easier to search through here:
https://payperrun.com/%3E/search?displayParams={%22q%22:%22D...
It uses LLM embeddings to sort postsby semantic proximity, but you can also filter out posts with comma separated values like this:
-
Ask HN: What do you regret doing or not doing in your 30s?
https://news.ycombinator.com/item?id=33118584
[Shameless plug: I found all these on my llm-embedding based search engine I launched today: https://payperrun.com/%3E/search?displayParams={%22q%22:%22A...
It's much better than HN's default search: https://hn.algolia.com/?q=Ask+HN%3A+What+do+you+regret+doing... ]
-
My thoughts on starting an online business as someone who's never done it before
https://payperrun.com/%3E/search?displayParams={%22q%22:%22A...
-
We should promote more personal indexing, rather than algorhythmic indexing
There have been a few attempts at a crowdsourced-rank search engine (which is similar to what you're suggesting - people indexing the content), but it seems to be a hard cookie, most of the examples of similar ideas I could find on ProductHunt or ShowHN seem dead:
https://payperrun.com/%3E/search?displayParams={%22q%22:%22c...
(btw, I just launched this llm-embedding based search service that lets you check if a startup idea has already been tried/failed).
I don't know if this idea has a higher death rate than the baseline, but my guess is Google/PageRank is good enough for most use-cases, and then if you want quality sources, you can just follow them on YouTube, Twitter, Instagram, etc. Wait, maybe I shouldn't try to compete with Google?
-
Show HN: An Embedding-Based Search Service over ShowHN, AskHN, GitHub, More
I like the section on how it works: https://payperrun.com/%3E/search?display=How%20this%20servic...
The vector search is using https://lancedb.com/ and OpenAI embeddings.
-
Embeddings: What they are and why they matter
Behaves as I expected now!
I went here looking for more info about payperrun https://payperrun.com/%3E/welcome and clicked on the "Spotlight" section and saw 4 popups blocked - I never see popups anywhere these days and have to admit that sends me away pretty quickly.
- Show HN: Payperrun.com – A New Way to Monetize Your Code
- telekinesis: Just-in-time SDKs
- Show HN: Just-in-Time SDKs
pyxet
-
Backing up datasets locally is crucial.
Have you looked into Xethub? It's like Github with ML-extensions. Their philosophy is you commit everything: code, assets, embeddings, the source docs, models, etc.
-
LLM for Church Fathers texts
There are plenty of variations on this idea but one I found most enlightening is a workshop example for a commercial product called Xethub.
-
On storing trained models (with sklearn and other libraries) - how do you do it?
I've been playing with Xethub lately. It looks and feels like git but deals with large models and embeddings pretty seamlessly.
-
[P] pyxet: a Python library for ML teams to work with data like S3, while having the memory of Git.
Code: https://github.com/xetdata/pyxet
What are some alternatives?
chasr-server - End-To-End Encrypted GPS Tracking Service
dropbox-sdk-python - The Official Dropbox API V2 SDK for Python
terra.py - Python SDK for Terra
pycloudinary - Python package for cloudinary
DBoW2 - Enhanced hierarchical bag-of-word library for C++
bert - TensorFlow code and pre-trained models for BERT
filestack-python - Official Python SDK for Filestack - API and content management system that makes it easy to add powerful file uploading and transformation capabilities to any web or mobile application.
marqo - Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
vectordb - A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.
llm-cluster - LLM plugin for clustering embeddings
interview - Interview for OneUptime