cape-dataframes
koalas
Our great sponsors
cape-dataframes | koalas | |
---|---|---|
4 | 2 | |
174 | 3,319 | |
0.0% | 0.2% | |
0.0 | 4.6 | |
9 months ago | about 1 month ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
cape-dataframes
-
Show HN: Cape API – Keep your sensitive data private while using GPT-4
- How can we mitigate hallucinations and bias so that we have higher trust in AI generated text?
The features of the Cape API are designed to help solve these problems for developers, and we have a number of early customers using the API in production already.
To get started, checkout our docs: https://docs.capeprivacy.com/
View the API reference: https://api.capeprivacy.com/redoc
Join the discussion on our Discord: https://discord.gg/nQW7YxUYjh
And of course try the CapeChat playground at https://chat.capeprivacy.com/
-
Secure Sentiment Analysis with Enclaves
There are three essential components that enable this: cape encrypt, cape deploy, and cape run. The command cape encrypt encrypts inputs that can be sent into the Cape enclave for processing, cape deploy performs all needed actions for deploying a function into the enclave, and finally cape run invokes the deployed function with an input that was previously encrypted with cape encrypt. Learn more on the Cape docs.
-
Anonymize your Data with a single line!
Well, many of the features in this project are simply wrappers around other libraries like this one. Therefore, the value proposition of this project would either have to be the automation aspect or the idea that you can shield the user from the details of how the implemented techniques work. I think both approaches are risky in this setting.
-
Data Anonymization Libraries
I was wondering what other helpful and easy of use libraries are there for data anonymization like faker and cape-python ?
koalas
-
My new company uses Pyspark. I want to learn it before my starting date. Any advice?
If they're using databricks and you're familiar with pandas, koalas should be right up your alley .
-
Spark vs Pandas
If you like excessive use of square brackets.. I mean pandas, you might wanna check out Koalas. Koalas suppose to provide pandas datafrafe API implementation atop of Spark.
What are some alternatives?
popmon - Monitor the stability of a Pandas or Spark dataframe ⚙︎
Dask - Parallel computing with task scheduling
prosto - Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
datacompy - Pandas and Spark DataFrame comparison for humans and more!
exodus - Platform to audit trackers used by Android application
PandasGUI - A GUI for Pandas DataFrames
Activeloop Hub - Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai [Moved to: https://github.com/activeloopai/deeplake]
private-ai - Repo for Udacity's Secure & Private AI course
fastdbfs - fastdbfs - An interactive command line client for Databricks DBFS.
orange - 🍊 :bar_chart: :bulb: Orange: Interactive data analysis
data-science-ipython-notebooks - Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.