| | Kiba | AW Datapipe |
|---|---|---|
| Mentions | 7 | - |
| Stars | 1,722 | 1 |
| Growth | - | - |
| Activity | 0.0 | 0.0 |
| Last commit | over 1 year ago | almost 4 years ago |
| Language | Ruby | Ruby |
| License | GNU Lesser General Public License v3.0 only | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Kiba
-
Ask HN: What side projects landed you a job?
I started https://github.com/thbar/kiba#kiba-etl to scratch my own itch & be able to write properly structured ETL jobs in Ruby. It was a blank-slate rewrite of something larger (activewarehouse-etl) which I could not maintain anymore.
This landed me not strictly a job, but long-term consulting gigs with a number of companies in the EU, UK & US.
The job was directly related to the project: companies wanted the expertise of data engineering & ETL, often with Kiba directly, but also in general.
This "side project" was totally worth it :-)
-
Ruby's Hash Is a Swiss-Army Knife
Definitely! As a matter of fact, this is the default data structure I use when writing Ruby ETL code (e.g. https://github.com/thbar/kiba/wiki).
Methods like "except" (https://docs.ruby-lang.org/en/3.2/Hash.html#method-i-except) or "fetch" (raising an error on missing key) are very convenient to write defensive data processing code!
Similarly, in Elixir, I use Maps a lot for the same type of jobs (https://hexdocs.pm/elixir/1.15.4/Map.html), with similar properties.
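The defensive style described in this comment can be sketched in plain Ruby. This is a minimal illustration, not code from Kiba or from the quoted author: the row keys and the `strip_sensitive` and `normalize` helper names are hypothetical. `Hash#except` is built into Ruby from 3.0 onward.

```ruby
# Hypothetical helpers illustrating defensive row handling with Hash.

# Hash#except returns a new hash without the listed keys (Ruby 3.0+);
# the original hash is left untouched.
def strip_sensitive(row)
  row.except(:password)
end

# Hash#fetch raises KeyError when a key is missing, so a malformed row
# fails loudly instead of silently propagating nil.
def normalize(row)
  {
    id: Integer(row.fetch(:id)),
    email: row.fetch(:email).downcase
  }
end

# Hypothetical input row, as an ETL source might yield it.
row = { id: "42", email: "Ada@Example.com", password: "secret" }
clean = normalize(strip_sensitive(row))

begin
  normalize({ id: "7" }) # no :email key
rescue KeyError => e
  warn "rejected row: #{e.message}"
end
```

Using `fetch` rather than `[]` is the design choice here: a missing column surfaces at the row where it occurs, rather than as a `nil` further down the pipeline.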
-
Thinking in learn Ruby
Ruby has a very cool ETL library named Kiba that fits wonderfully with Ruby's strengths.
- What ETL tool do you use?
-
Massive SQL import from csv file, nulls, best practices.
It might be overkill for your problem, but have you had a look at [kiba-etl](https://github.com/thbar/kiba/blob/master/README.md)?
-
My favorite Ruby gems
Kiba
-
Ruby ETL Strategies: Organizing block-based Kiba Pipelines
If you don’t use Kiba, but work with data, check it out.
AW Datapipe
We haven't tracked posts mentioning AW Datapipe yet.
Tracking mentions began in Dec 2020.
What are some alternatives?
Nokogiri - Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby.
twitter - A Ruby interface to the Twitter API.
Roo - Roo provides an interface to spreadsheets of several sorts.
telegram-bot - Ruby gem for building Telegram Bot with optional Rails integration
data-science-with-ruby - Practical Data Science with Ruby based tools.
Yt - The reliable YouTube API Ruby client
chronicle-etl - 📜 A CLI toolkit for extracting and working with your digital history
fb_graph - This gem doesn't support FB Graph API v2.0+. Please use fb_graph2 gem instead.
slay
fb_graph2
ferrum - Headless Chrome Ruby API
ruby-trello - Implementation of the Trello API for Ruby