scanpy
Ray
Metric | scanpy | Ray
---|---|---
Mentions | 5 | 43
Stars | 1,763 | 31,179
Growth | 2.0% | 1.8%
Activity | 9.3 | 10.0
Last commit | 2 days ago | 1 day ago
Language | Python | Python
License | BSD 3-clause "New" or "Revised" License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
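As a rough illustration of the Growth figure, the sketch below estimates how many stars each project gained in the last month. It assumes growth is computed as (stars now - stars a month ago) / stars a month ago; the page does not state the exact formula, and the published percentages are rounded, so the results are approximate. The function name `implied_monthly_gain` is made up for this example.

```python
def implied_monthly_gain(stars_now: int, growth_pct: float) -> float:
    """Estimate stars gained over the last month.

    Assumption (not confirmed by the page):
    growth_pct = 100 * (stars_now - stars_prev) / stars_prev
    """
    stars_prev = stars_now / (1 + growth_pct / 100)
    return stars_now - stars_prev


# Star counts and growth rates taken from the table above.
for name, stars, growth in [("scanpy", 1763, 2.0), ("Ray", 31179, 1.8)]:
    gain = implied_monthly_gain(stars, growth)
    print(f"{name}: roughly {gain:.0f} stars gained in the last month")
```

Under that assumption, similar growth percentages translate into very different absolute numbers because Ray starts from a much larger base.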
scanpy
- Renaming Genes for Scanpy Plot
- Useful Python Decorators for Data Scientists
- [Scanpy] Installation issues related to pytables
  Upon searching stack, I still do not understand the exact issue, as the file "hdf5extension.cp38-win_amd64" is present within C:\Users\username\anaconda3\lib\site-packages\tables\. Would anyone be able to explain the problem and any potential circumventions?
- standardize/normalize seq data
  I would suggest you explore with SCANPY and verify if your batch labels generate a strong separation in your samples (PCA, tSNE, UMAP). If you then need to correct for batches, according to how simple/complex they are, you can choose a tool from this benchmark.
- Flipping one histogram below the axis.
  I am plotting 2 1-D histograms on top of one another using a hold on command. Is there a way to have one histogram be upside down, and then to flip the entire plot 90˚? I am looking to create a violin plot https://github.com/theislab/scanpy/issues/1448 on my own, having one histogram on the left, and one on the right.
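The mirrored-histogram idea from the last post above can be sketched in Python. The original question appears to concern MATLAB (`hold on`), so this is a matplotlib analogue rather than an answer in the asker's environment. The trick is to bin both samples on shared edges, negate one set of counts, and draw horizontal bars so the bins run along the vertical axis, as in a violin plot. The helper names `mirrored_counts` and `plot_mirrored` and the output file name are made up for this example, and equal-width bins are assumed.

```python
def mirrored_counts(a, b, edges):
    """Bin two samples on shared edges; negate the first sample's counts."""
    def counts(values):
        out = [0] * (len(edges) - 1)
        for v in values:
            for i in range(len(out)):
                last = i == len(out) - 1
                # Bins are half-open, except the last one, which includes its right edge.
                if edges[i] <= v < edges[i + 1] or (last and v == edges[-1]):
                    out[i] += 1
                    break
        return out

    centers = [(lo + hi) / 2 for lo, hi in zip(edges[:-1], edges[1:])]
    left = [-c for c in counts(a)]  # negative counts extend to the left of zero
    right = counts(b)
    return centers, left, right


def plot_mirrored(a, b, edges, path="mirrored_hist.png"):
    """Draw the two histograms back to back and save the figure to `path`."""
    import matplotlib
    matplotlib.use("Agg")  # non-interactive backend, so this also runs headless
    import matplotlib.pyplot as plt

    centers, left, right = mirrored_counts(a, b, edges)
    height = edges[1] - edges[0]  # assumes equal-width bins
    fig, ax = plt.subplots()
    ax.barh(centers, left, height=height, label="sample A")
    ax.barh(centers, right, height=height, label="sample B")
    ax.axvline(0, color="black", linewidth=0.8)
    ax.set_xlabel("count (sample A drawn to the left)")
    ax.set_ylabel("value")
    ax.legend()
    fig.savefig(path)
    plt.close(fig)
```

Calling `plot_mirrored(sample_a, sample_b, edges)` writes the figure to disk. The binning is kept in plain Python so it can be checked without matplotlib installed.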
Ray
- Ray: Unified framework for scaling AI and Python applications
- Open Source Advent Fun Wraps Up!
  22. Ray | Github | tutorial
- Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models
  Training times for GSM8k are mentioned here: https://github.com/ray-project/ray/tree/master/doc/source/te...
- Ray – an open source project for scaling AI workloads
- Methods to keep agents inside grid world.
  Here's a reference from RLlib that points to docs and an example, and here's one from one of my projects that includes all my own implementations
- TransformerXL + PPO Baseline + MemoryGym
  RLlib
- Is dynamic action masking possible in Rllib?
- AWS re:Invent 2022 Recap | Data & Analytics services
  ⦿ AWS Glue Data Quality - Automatic data quality rule recommendations based on your data
  ⦿ AWS Glue for Ray - Data integration with Ray (ray.io), a popular new open-source compute framework that helps you scale Python workloads
- Think about it for a second
  https://ray.io (just dropping the link)
- Elixir Livebook now as a desktop app
  I've wondered whether it's easier to add data analyst stuff to Elixir that Python seems to have, or add features to Python that Erlang (and by extension Elixir) provides out of the box.
  By what I can see, if you want multiprocessing on Python in an easier way (let's say running async), you have to use something like ray core[0], then if you want multiple machines you need redis(?). Elixir/Erlang supports this out of the box.
  Explorer[1] is an interesting approach, where it uses Rust via Rustler (Elixir library to call Rust code) and uses Polars as its dataframe library. I think Rustler needs to be reworked for this usecase, as it can be slow to return data. I made initial improvements which drastically improves encoding (https://github.com/elixir-nx/explorer/pull/282 and https://github.com/elixir-nx/explorer/pull/286, tldr 20+ seconds down to 3).
  [0] https://github.com/ray-project/ray
What are some alternatives?
scikit-learn - scikit-learn: machine learning in Python
optuna - A hyperparameter optimization framework
dash - Data Apps & Dashboards for Python. No JavaScript Required.
stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
data-science-ipython-notebooks - Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Faust - Python Stream Processing
deepvariant - DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
gevent - Coroutine-based concurrency library for Python
dash-cytoscape - Interactive network visualization in Python and Dash, powered by Cytoscape.js
stable-baselines - A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
pagoda2 - R package for analyzing and interactively exploring large-scale single-cell RNA-seq datasets
SCOOP (Scalable COncurrent Operations in Python) - SCOOP (Scalable COncurrent Operations in Python)