Kiba
Roo
Kiba | Roo | |
---|---|---|
7 | 2 | |
1,722 | 2,763 | |
- | 0.1% | |
0.0 | 3.8 | |
over 1 year ago | about 1 month ago | |
Ruby | Ruby | |
GNU Lesser General Public License v3.0 only | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Kiba
-
Ask HN: What side projects landed you a job?
I started https://github.com/thbar/kiba#kiba-etl to scratch my own itch & be able to write properly structured ETL jobs in Ruby. It was a blank-slate rewrite of something larger (activewarehouse-etl) which I could not maintain anymore.
This landed me not strictly a job, but long term consulting gigs with a number of companies in EU, UK & US.
The job was directly related to the project: companies wanted the expertise of data engineering & ETL, often with Kiba directly, but also in general.
This "side project" was totally worth it :-)
-
Ruby's Hash Is a Swiss-Army Knife
Definitely! As a matter of fact, this is the default data structure I use when writing Ruby ETL code (e.g. https://github.com/thbar/kiba/wiki).
Methods like "except" (https://docs.ruby-lang.org/en/3.2/Hash.html#method-i-except) or "fetch" (raising an error on missing key) are very convenient to write defensive data processing code!
Similarly, in Elixir, I use Maps a lot for the same type of jobs (https://hexdocs.pm/elixir/1.15.4/Map.html), with similar properties.
-
Thinking in learn Ruby
Ruby has a very cool ETL library named Kiba that fits wonderfully with Ruby's strengths.
- What ETL tool do you use?
-
Massive SQL import from csv file, nulls, best practices.
Though it might be overkill for your problem, but have you had a look at [kiba-etl](https://github.com/thbar/kiba/blob/master/README.md)?
-
My favorite Ruby gems
Kiba
-
Ruby ETL Strategies: Organizing block-based Kiba Pipelines
If you don’t use Kiba, but work with data, check it out.
Roo
-
Question for WPS
For reading Excel files, we use roo. Our needs are more focused on simply extracting data from the files, so we don't need anything that can modify the files directly.
-
Massive SQL import from csv file, nulls, best practices.
Have you tried the Roo gem?
What are some alternatives?
Nokogiri - Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby.
AXLSX - xlsx generation with charts, images, automated column width, customizable styles and full schema validation. Axlsx excels at helping you generate beautiful Office Open XML Spreadsheet documents without having to understand the entire ECMA specification. Check out the README for some examples of how easy it is. Best of all, you can validate your xlsx file before serialization so you know for sure that anything generated is going to load on your client's machine.
data-science-with-ruby - Practical Data Science with Ruby based tools.
rubyXL - Ruby lib for reading/writing/modifying .xlsx and .xlsm files
chronicle-etl - 📜 A CLI toolkit for extracting and working with your digital history
Spreadsheet - The Ruby Spreadsheet by ywesee GmbH
slay
Creek - Ruby library for parsing large Excel files.
ferrum - Headless Chrome Ruby API
Spreadsheet Architect - Spreadsheet Architect is a library that allows you to create XLSX, ODS, or CSV spreadsheets super easily from ActiveRecord relations, plain Ruby objects, or tabular data.
AW Datapipe - Unofficial ruby wrapper for the AWS SDK Data Pipeline API.
Yomu - Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf)