databricks-cli
The missing command line client for Databricks SQL (by aloneguid)
spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers. (by dotnet)
databricks-cli | spark | |
---|---|---|
1 | 3 | |
2 | 1,999 | |
- | 0.2% | |
1.8 | 0.0 | |
almost 2 years ago | 19 days ago | |
C# | C# | |
Apache License 2.0 | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
databricks-cli
Posts with mentions or reviews of databricks-cli.
We have used some of these posts to build our list of alternatives
and similar projects.
spark
Posts with mentions or reviews of spark.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-06-21.
- .NET for Apache Spark appears to be abandoned
-
Does anyone actually use ML.NET?
Re: DataFrames, that's good to know. There is the DataFrame API which is part of the Microsoft.Data.Analysis NuGet package and that's the API that the issue is tracking and shown in the sample notebook I shared. That API has no dependencies on other systems. The DataFrame you're referring to is part of the .NET for Apache Spark library which has the dependency on Apache Spark which rqeuires some initial setup.
-
What does the .NET ecosystem offer in terms of distributed data processing frameworks?
the data engineering ecosystem is new to me but my first impressions are that everything is catered toward JVM. The only somewhat promising option I've found for building a data pipeline in .NET is github.com/dotnet/spark.
What are some alternatives?
When comparing databricks-cli and spark you can also consider the following projects:
dbx - 🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
ParquetSharp.DataFrame - ParquetSharp.DataFrame is a .NET library for reading and writing Apache Parquet files into/from .NET DataFrames, using ParquetSharp