-
RL-Adventure
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
-
RL-Adventure-2
Discontinued Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters [Moved to: https://github.com/higgsfield-ai/higgsfield]
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Lecture 7: Function approximation. (slides, code, video)
Lecture 8: Policy gradient methods. (slides, code, theory, video)
Thanks, I used a slight modification of https://github.com/pvanberg/flux-beamer
Related posts
-
Simple GitHub Issue Handled(?) By Copilot Workspace
-
Show HN: Hamilton's UI – observability, lineage, and catalog for data pipelines
-
Quantum Computing Collection of Resources
-
2024 Verizon Data Breach Investigation Report [pdf]
-
Impact of Input Length on the Reasoning Performance of Large Language Models