flyscrape
nifi
flyscrape | nifi | |
---|---|---|
7 | 35 | |
980 | 4,449 | |
- | 2.6% | |
8.6 | 9.9 | |
2 months ago | 3 days ago | |
Go | Java | |
Mozilla Public License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
flyscrape
- Show HN: Flyscrape – A command-line web scraper for non-expert programmers
-
Web Scraping in Python – The Complete Guide
Shameless plug:
Flyscrape[0] lets you eliminate a lot of boilerplate code that is otherwise necessary when building a scraper from scratch, while still giving you the flexibility to extract data that perfectly fit your needs.
It comes as a single binary executable and runs small JavaScript files without having to deal with npm or node.
You can have a collection of small and isolated scraping scripts, rather than full on node projects.
[0]: https://github.com/philippta/flyscrape
- FLaNK Stack Weekly for 20 Nov 2023
- FLaNK Stack Weekly for 13 November 2023
-
Show HN: Flyscrape – A standalone and scriptable web scraper in Go
Thanks for sharing! Just a small nit: the links at the bottom of this page are broken [1].
[1]: https://github.com/philippta/flyscrape/blob/master/docs/read...
- Show HN: flyscrape – An expressive and elegant web scraper
nifi
- FLaNK Stack Weekly 19 Feb 2024
- Ask HN: What are some unpopular technologies you wish people knew more about?
- FLaNK Stack Weekly for 13 November 2023
-
Ask HN: What low code platforms are worth using?
Apache NIFI (https://nifi.apache.org/).
It uses the concept of Flow-based programming. Also its so underacknolged but this tool is very flexible. I have used as an Event Bus all the 3rd-Party Integrations.
- Apache Nifi: easy to use, powerful, reliable system to process, distribute data
- Tool decision - What architecture would you choose and why?
-
Help with choosing techstack for a new DE team
Presently setting up Apache Nifi + Apache MiNiFi for the ETL portion of my work. NiFi was easy enough to figure out; but the docs for MiNiFi have been a pain due to differences between the Java and C++ versions. I then entirely configured it with the Java version so that it was easier to search for answers for the MiNiFi yaml syntax.
-
MS SQL Change Data Capture
Found it
-
Is there something like airflow but written in Scala/Java?
Apache Camel Apache Nifi Spring Cloud
-
Json splitting and Rerouting (new to nifi)
NIFI, like most Apache projects does most of its discussion on its mailing lists, but also has a slack.
What are some alternatives?
cucim - cuCIM - RAPIDS GPU-accelerated image processing library
Logstash - Logstash - transport and process your logs, events, or other data
awesome-emulators - An awesome list of emulators!
superset - Apache Superset is a Data Visualization and Data Exploration Platform
engblogs - learn from your favorite tech companies
meltano
vimGPT - Browse the web with GPT-4V and Vimium
meltano - Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
CML_AMP_Intelligent-QA-Chatbot-with-NiFi-Pinecone-and-Llama2 - The prototype deploys an Application in CML using a Llama2 model from Hugging Face to answer questions augmented with knowledge extracted from the website. This prototype introduces Pinecone as a database for storing vectors for semantic search.
Pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
clipea - 📎🟢 Like Clippy but for the CLI. A blazing fast AI helper for your command line
Metabase - The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum: