everypolitician-data
Kiba
everypolitician-data | Kiba | |
---|---|---|
1 | 7 | |
227 | 1,722 | |
0.0% | - | |
0.0 | 0.0 | |
7 months ago | over 1 year ago | |
Ruby | Ruby | |
- | GNU Lesser General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
everypolitician-data
-
Ask HN: Is There a Crunchbase for Policitians?
https://github.com/everypolitician/everypolitician-data
I haven't used it, so I don't know about the quality of the data
Kiba
-
Ask HN: What side projects landed you a job?
I started https://github.com/thbar/kiba#kiba-etl to scratch my own itch & be able to write properly structured ETL jobs in Ruby. It was a blank-slate rewrite of something larger (activewarehouse-etl) which I could not maintain anymore.
This landed me not strictly a job, but long term consulting gigs with a number of companies in EU, UK & US.
The job was directly related to the project: companies wanted the expertise of data engineering & ETL, often with Kiba directly, but also in general.
This "side project" was totally worth it :-)
-
Ruby's Hash Is a Swiss-Army Knife
Definitely! As a matter of fact, this is the default data structure I use when writing Ruby ETL code (e.g. https://github.com/thbar/kiba/wiki).
Methods like "except" (https://docs.ruby-lang.org/en/3.2/Hash.html#method-i-except) or "fetch" (raising an error on missing key) are very convenient to write defensive data processing code!
Similarly, in Elixir, I use Maps a lot for the same type of jobs (https://hexdocs.pm/elixir/1.15.4/Map.html), with similar properties.
-
Thinking in learn Ruby
Ruby has a very cool ETL library named Kiba that fits wonderfully with Ruby's strengths.
- What ETL tool do you use?
-
Massive SQL import from csv file, nulls, best practices.
Though it might be overkill for your problem, but have you had a look at [kiba-etl](https://github.com/thbar/kiba/blob/master/README.md)?
-
My favorite Ruby gems
Kiba
-
Ruby ETL Strategies: Organizing block-based Kiba Pipelines
If you don’t use Kiba, but work with data, check it out.
What are some alternatives?
maintenance_tasks - A Rails engine for queueing and managing data migrations.
Nokogiri - Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby.