slay | Kiba | |
---|---|---|
2 | 7 | |
- | 1,722 | |
- | - | |
- | 0.0 | |
- | over 1 year ago | |
Ruby | ||
- | GNU Lesser General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
slay
Kiba
-
Ask HN: What side projects landed you a job?
I started https://github.com/thbar/kiba#kiba-etl to scratch my own itch & be able to write properly structured ETL jobs in Ruby. It was a blank-slate rewrite of something larger (activewarehouse-etl) which I could not maintain anymore.
This landed me not strictly a job, but long term consulting gigs with a number of companies in EU, UK & US.
The job was directly related to the project: companies wanted the expertise of data engineering & ETL, often with Kiba directly, but also in general.
This "side project" was totally worth it :-)
-
Ruby's Hash Is a Swiss-Army Knife
Definitely! As a matter of fact, this is the default data structure I use when writing Ruby ETL code (e.g. https://github.com/thbar/kiba/wiki).
Methods like "except" (https://docs.ruby-lang.org/en/3.2/Hash.html#method-i-except) or "fetch" (raising an error on missing key) are very convenient to write defensive data processing code!
Similarly, in Elixir, I use Maps a lot for the same type of jobs (https://hexdocs.pm/elixir/1.15.4/Map.html), with similar properties.
-
Thinking in learn Ruby
Ruby has a very cool ETL library named Kiba that fits wonderfully with Ruby's strengths.
- What ETL tool do you use?
-
Massive SQL import from csv file, nulls, best practices.
Though it might be overkill for your problem, but have you had a look at [kiba-etl](https://github.com/thbar/kiba/blob/master/README.md)?
-
My favorite Ruby gems
Kiba
-
Ruby ETL Strategies: Organizing block-based Kiba Pipelines
If you don’t use Kiba, but work with data, check it out.
What are some alternatives?
pakhi-bhasha - Dynamically typed bangla programming language written in rust
Nokogiri - Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby.
goby - Goby - Yet another programming language written in Go
data-science-with-ruby - Practical Data Science with Ruby based tools.
fontsource - Self-host Open Source fonts in neatly bundled NPM packages.
Roo - Roo provides an interface to spreadsheets of several sorts.
normandy - Channels for CSP style Ruby
chronicle-etl - 📜 A CLI toolkit for extracting and working with your digital history
polytt - A type theory with native support for Polynomial Functors.
ferrum - Headless Chrome Ruby API
wintermoon - Wintermoon Framework
AW Datapipe - Unofficial ruby wrapper for the AWS SDK Data Pipeline API.