data_jd
tinygrad
Our great sponsors
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
data_jd
-
Random walk in 2 lines of J
I suspect the J package Jd is probably the most non-trivial public codebase. I don’t love the coding style (functions are long and scripted) and it doesn’t make use of newer lambda functions (“direct definitions”) which are easier to read. https://github.com/jsoftware/data_jd
-
Jd
An example J file because this link doesn't say much:
https://github.com/jsoftware/data_jd/blob/master/csv/csv.ijs
Here's one of the more central files that ties into how a Jd database is laid out:
https://github.com/jsoftware/data_jd/blob/master/base/common...
Not that I claim anyone in particular can read it of course. Jd uses a hierarchy of folder, database, table, column that's handled with an object system to share code between them. A folder is just a place to put databases and hardly needs to add anything, while the other levels have a lot of extra functionality. As an inverted database, Jd stores each column in a file, and accesses it using memory mapping.
https://github.com/jsoftware/data_jd/blob/master/base/folder...
https://github.com/jsoftware/data_jd/blob/master/base/table....
(I designed this system when I did some of the early work to turn JDB into Jd as a summer intern)
I found this license for jd itself. It is free only for non-commercial use:
https://github.com/jsoftware/data_jd/blob/master/doc/License...
The link you mentioned only applies to the jsource folder: the jengine code.
-
Denigma is an AI that explains code in understandable English. Test it for yourself... Today!
A longer piece from GitHub: csvreportsummary=: 3 : 0 t=. <;.2 fread PATHLOGLOGFILE b=. (<,LF)=t b=. b+.(<'!')={.each t b=. b+.(<'src: ')=5{.each t b=. b+.(<'snk: ')=5{.each t b=. b+.(<'elapsed: ')=9{.each t b=. b+.(<'rows: ')=6{.each t b=. b+.(<'error: ')=7{.each t ;b#t )
tinygrad
- How 'Open' Is OpenAI, Really?
-
LLaMA-7B in Pure C++ with full Apple Silicon support
Also in that realm is tinygrad by geohot, which has an open PR for integrating LLaMA support.
George Hotz already implemented LLaMA 7B and 15B on Twitch yesterday on GPU in Tunygrad llama branch:
https://github.com/geohot/tinygrad/tree/llama
The only problem is that it's swapping on 16GB Macbook, so you need at least 24GB in practice.
- Nvidia reported Q4 23 results today - Why is the stock up $18 (9%) AH?
-
Why TensorFlow for Python is dying a slow death
While PyTorch is obviously the future in the short term, it will be interesting to see how this space evolves.
Before Tensorflow, people (myself included) were largely coding all of this stuff pretty manually, or with the zoo of incredibly clucky homemade libs.
Tensorflow and PyTorch made the whole situation far more accessible and sane. You can get a basic neural network working in a few lines of code. Magical.
But it's still early days. George Hotz, author of tinygrad[0], a PyTorch "competitor", made a really insightful comment -- we will look back on PyTorch & friends like we look back on FORTRAN and COBOL. Yes, they were far better than assembly. But they are really clunky compared to what we have today.
What will we have in 20 years?
[0] https://github.com/geohot/tinygrad, https://tinygrad.org
- Ask HN: Strategies for working with engineers that are too smart?
-
Ask HN: How to get back into AI?
Read all the leading papers, many times, to get a deep understanding, the writing quality is usually pretty low, but the information density can be very high, you'll probably miss the important details the first time.
Most medium and low-quality papers are full of errors and noise, but you can still learn from them.
Get your hands dirty with real code.
I would take a look at those:
- PyTorch 2.0
-
From "This this is my mission an I will accomplish it" to "Anyone can do my work for free please ?" the journey of an Elon intern at twitter
George Hotz is the founder of comma.ai, and an open source god: https://github.com/geohot/tinygrad
-
10 Trending Github repositories / November, 10 2022
git clone https://github.com/geohot/tinygrad.git
What are some alternatives?
Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
openpilot - openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for over 200 supported car makes and models.
minikeyvalue - A distributed key value store in under 1000 lines. Used in production at comma.ai
tensorflow_macos - TensorFlow for macOS 11.0+ accelerated using Apple's ML Compute framework.
flameshot - Powerful yet simple to use screenshot software :desktop_computer: :camera_flash:
docs - Hardware and software docs / wiki
LevelDB - LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
jsource - J engine source mirror
mxnet - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
text-generation-webui - A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.
BQNprop - Toy backpropagation implementation written in BQN.
mac-precision-touchpad - Windows Precision Touchpad Driver Implementation for Apple MacBook / Magic Trackpad