pagefind
orange
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pagefind
-
πUnderrated Open Source Projects You Should Know About π§
Pagefind is a static search library that aims to perform well on small or large sites, while using as little bandwidth as possible, and you don't have to host any infrastructure.
- Pagefind β Static low-bandwidth search at scale
- Ask HN: What Underrated Open Source Project Deserves More Recognition?
- Pagefind is a low bandwidth static search library
-
Lightweight, portable and secure Wasm runtimes and their use cases.
In theory, if we ran lower level code, we would be using less resources. That's more than a theory. Go to this video where I demonstrate Pagefind, written in Rust and compiled to Wasm as target, as a static app that ingests and indexes HTML documents and runs super efficient search queries, all client-side.
- Pagefind v1.0.0 β Stable static search at scale
-
Free Open-Source Blog Template for Developers βοΈπ
β Pagefind static search library integration
- Pagefind is a fully static search library
-
How to Start Your Blog in 2023
I use Astro SSG and Cloudflare Pages. I use https://github.com/cloudcannon/pagefind for search on my Astro setup. You can test the search functionality here https://tinyrocket.pages.dev/.
From its repo: "Pagefind runs after any static site generator and automatically indexes the built static files. Pagefind then outputs a static search bundle to your website, and exposes a JavaScript search API that can be used anywhere on your site."
Pagefind is cool!
-
Weβre the Meilisearch team! To celebrate v1.0 of our open-source search engine, Ask us Anything!
An option there is https://pagefind.app/ β not as fast as a persistent server but solves some of the deployment and bandwidth issues.
orange
-
Hierarchical Clustering
I know I've tooted its horn before, but Orange3 is a pretty neat Python-based GUI platform that makes this and a metric buttload of other statistical/ML techniques available to non-programmer types.
Just watch out for null character `x00` in the corpus. That always seems to kill it stone dead.
https://orangedatamining.com/
https://orange3.readthedocs.io/projects/orange-visual-progra...
- Orange Data Mining
-
The Graph of Wikipedia [video]
For all you folks who aren't ace programmer types, the Orange3[1] platform gives you a very miniaturized[2] ability to turn out these sorts of visualizations very rapidly. It's not the most stable thing in the world, but the node-based ML workflow designer is worth the price of admission all by itself.
[1] https://orangedatamining.com/
[2] The Wikipedia extension in Text limits each search result to 25 articles, so sucking all of Wikipedia is . . well, Orange text analytics crashes when I look at it sideways with a null character, so let's not think about what would happen.
- Ask HN: What Underrated Open Source Project Deserves More Recognition?
-
Taxonomy Management?
First is identifying the "similar" things in a corpus. Best way I know to do that, for non-programmer audiences, is the Orange Data Mining tool, which gives you a node-based text mining interface to perform statistical analysis on text. Hierarchical Clustering shows - very rapidly - how similar your "modules" are, which ones are most similar. There's many other techniques (semantic viewer, similarity hash, etc) as well - the right one will depend on how your content is laying about.
- Orange: Open-source machine learning and data visualization
-
What exactly is AutoGPT?
Both tools are ripoffs of a data mining framework named Orange 3
-
Why don't more people use Altair for python Visualizations instead of Plotly?
You should also check out Orange Data Mining, it allows to create a lot of charts, filter data from a chart to another, build ML models, predictions and a lot more. And you can do it with zero code.
-
Advice on Transitioning to Data Science/ML/AI without Coding Experience
You can start with a free GUI based tool Orange. It is a component based data science workflow tool, which you can use to handle 60-75% of the traditional data science tasks from classification, regression, to basic neural networks.
- Has anybody used Orange?
What are some alternatives?
pagebreak - π Open-source CLI tool for implementing pagination on any static website.
glue - Linked Data Visualizations Across Multiple Files
charabia - Library used by Meilisearch to tokenize queries and documents
Pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
db-benchmarks - Fair database benchmarks framework and datasets
RDKit - The official sources for the RDKit library
rosey - :rose: Open-source CLI tool for managing translations on static websites.
Airflow - Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
vespa - AI + Data, online. https://vespa.ai
Interactive Parallel Computing with IPython - IPython Parallel: Interactive Parallel Computing in Python
bookshop - π A component development workflow for static websites.
NumPy - The fundamental package for scientific computing with Python.