meltano
-
Personal Project Guidance
I would use something like Meltano or Airbyte. But if you really want to use Lambda for extraction, there is no point spinning up a Redshift cluster just for that; Athena would be the way to go. You can use dbt pretty nicely with it, and it would keep costs down.
-
Open source contributions for a Data Engineer?
Airbyte and Singer/Meltano if you want to learn more about ingestion pipelines. The Airbyte and Meltano teams are very welcoming. SQLFluff is a shiny SQL linter: a beautiful project with awesome maintainers.
-
Looking for open source projects that use data pipelines and big data flows
I'm not really sure if this is what you are looking for, but take a look at Meltano.
-
Meltano ELT: Open-Source DataOps for the DevOps Era
I'm not aware of any. I did just open this issue[0] in the Meltano project to open discussion with the team/community. It could be an interesting iteration on the Singer Spec[1] if we find that users are interested in it and it helps solve some bottleneck challenges.
[0] https://gitlab.com/meltano/meltano/-/issues/2616
-
Meltano: ELT for the DevOps era — Open source, self-hosted, CLI-first, debuggable, and extensible
Good point! As expected, there's an issue about adding it already: https://gitlab.com/meltano/meltano/-/issues/1175
-
Launch HN: Airbyte (YC W20) – Open-Source ELT (Fivetran/Stitch Alternative)
At GitLab, we're not ready to give up on the Singer spec, community, and ecosystem yet, which is why I've been working on Meltano for the past year: https://meltano.com/
We think that the biggest things holding back Singer are the lack of documentation and tooling around taking existing taps and targets to production, and around building, debugging, maintaining, and testing new or existing high-quality taps and targets.
Meltano itself addresses the first problem, and provides a robust and reliable platform for building, running & orchestrating Singer- and dbt-based ELT pipelines.
At the same time, we have been working with some members of the community on a new framework for building taps and targets: https://gitlab.com/meltano/meltano/-/issues/2401, which we have decided to call the Singer SDK: https://gitlab.com/meltano/singer-sdk
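To make the "taps and targets" vocabulary above concrete, here is a minimal sketch of the message flow the Singer spec describes: a tap writes SCHEMA, RECORD, and STATE messages as one JSON object per line, and a target reads them. It uses only the Python standard library, not Meltano or the Singer SDK. The `users` stream, its fields, the `last_id` bookmark, and the function names are all hypothetical, chosen for illustration.

```python
import io
import json
import sys


def write_message(message, out=sys.stdout):
    """Serialize one Singer message as a single line of JSON."""
    out.write(json.dumps(message) + "\n")


def run_tap(rows, out=sys.stdout):
    """Emit SCHEMA, RECORD, and STATE messages for a hypothetical 'users' stream."""
    write_message(
        {
            "type": "SCHEMA",
            "stream": "users",
            "schema": {
                "type": "object",
                "properties": {
                    "id": {"type": "integer"},
                    "name": {"type": "string"},
                },
            },
            "key_properties": ["id"],
        },
        out,
    )
    last_id = None
    for row in rows:
        write_message({"type": "RECORD", "stream": "users", "record": row}, out)
        last_id = row["id"]
    # STATE carries a bookmark so a later run can resume where this one stopped.
    # The bookmark shape here is an assumption; the spec leaves it to the tap.
    write_message({"type": "STATE", "value": {"users": {"last_id": last_id}}}, out)


def run_target(lines):
    """Toy target: collect records per stream and remember the latest state."""
    records, state = {}, None
    for line in lines:
        message = json.loads(line)
        if message["type"] == "RECORD":
            records.setdefault(message["stream"], []).append(message["record"])
        elif message["type"] == "STATE":
            state = message["value"]
    return records, state


if __name__ == "__main__":
    # Pipe the tap's output into the target through an in-memory buffer.
    buffer = io.StringIO()
    run_tap([{"id": 1, "name": "Ada"}, {"id": 2, "name": "Grace"}], buffer)
    records, state = run_target(buffer.getvalue().splitlines())
    print(records, state)
```

In a real pipeline the tap and target are separate processes connected by a pipe, and a runner such as Meltano handles configuration and state storage between runs.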
supabase
-
How to get free Postgres
Sign up for Supabase: Head over to Supabase and sign up. Create a new workspace and project with your preferred names.
-
Creating a Pokémon guessing game using Supabase, Drizzle, and Next.js in just 2 hours!
Setting up Supabase: Create a new Supabase project, and get the connection string for the database from Settings > Database.
-
How To Make An Insanely Fast AI App (Supabase, LLAMA 3 and Groq)
Supabase (start for free)
-
Building a self-creating website with Supabase and AI
Built with Supabase, Astro, Unreal Speech, Stable Diffusion, Replicate, Metropolitan Museum of Art
-
How I built a Markdown Rendered Blog using Supabase and Chakra UI
Supabase will be used for storing article data in the database and the cover image of the article in storage. Chakra UI will be used to provide style to the elements. By using both, we can build the blog with ease.
-
I got #1 Product of the Day on Product Hunt without Spending a Dollar
For AutoRepurpose, I opted for Supabase as the backbone of the backend. It has reliably supported Penelope AI, which garnered over 15k users in 2022 without any issues.
-
AI Inference now available in Supabase Edge Functions
Semantic search demo
-
Creating an OG image using React and Netlify Edge Functions
1. Create a new Supabase project: Visit Supabase and create a new project.
-
11 Planetscale alternatives with free tiers
Supabase positions itself as the "open source Firebase alternative." It was founded in 2020 and is a developer-friendly serverless database platform that supports over 20 frameworks, including popular tools like Next.js, React, Nuxt, Svelte, Flutter, and Vue.
-
Implementing semantic image search with Amazon Titan and Supabase Vector
You can find the full application code as a Python Poetry project on GitHub.
What are some alternatives?
airbyte - The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Appwrite - Build like a team of hundreds_
dagster - An orchestration platform for the development, production, and observation of data assets.
pocketbase - Open Source realtime backend in 1 file
pipelinewise - Data Pipeline Framework using the singer.io spec
nhost - The Open Source Firebase Alternative with GraphQL.
nifi - Apache NiFi
neon - Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, branching, and bottomless storage.
Prefect - The easiest way to build, run, and monitor data pipelines at scale.
next-auth - Authentication for the Web.
pipelinewise-tap-mssql - Pipelinewise tap for Microsoft SQL Server
Hasura - Blazing fast, instant realtime GraphQL APIs on your DB with fine grained access control, also trigger webhooks on database events.