[D] What is a good way to maintain code readability and code quality while scaling up complexity in libraries like Hugging Face?

Scout Monitoring - Free Django app performance insights with Scout Monitoring

Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

www.scoutapm.com

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

llama

184 53,908 8.0 Python

Inference code for Llama models

In transformers, they tried really hard to have a single function or method to deal with both self and cross attention mechanisms, masking, positional and relative encodings, interpolation etc. While it allows a user to use the same function/method for any model, it has led to severe parameter bloat. Just compare the original implementation of llama by FAIR with the implementation by HF to get an idea.

Scout Monitoring

www.scoutapm.com featured

Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
transformers

181 127,531 10.0 Python

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

In transformers, they tried really hard to have a single function or method to deal with both self and cross attention mechanisms, masking, positional and relative encodings, interpolation etc. While it allows a user to use the same function/method for any model, it has led to severe parameter bloat. Just compare the original implementation of llama by FAIR with the implementation by HF to get an idea.

tinygrad

25 24,614 10.0 Python

You like pytorch? You like micrograd? You love tinygrad! ❤️

what do you think about tinygrad? I think its a good example of growing and well written, (partially) well documented library with many close to reference implementations

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Reading list to join AI field from Hugging Face cofounder

1 project | news.ycombinator.com | 18 May 2024
XLSTM: Extended Long Short-Term Memory

2 projects | news.ycombinator.com | 8 May 2024
Schedule-Free Learning – A New Way to Train

3 projects | news.ycombinator.com | 6 Apr 2024
HuggingFace Transformers: Qwen2

1 project | news.ycombinator.com | 11 Jan 2024
HuggingFace Transformers Release v4.36: Mixtral, Llava/BakLlava, SeamlessM4T v2

1 project | news.ycombinator.com | 13 Dec 2023

[D] What is a good way to maintain code readability and code quality while scaling up complexity in libraries like Hugging Face?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning
hardware-buttons scrape-images linkedin-bot
Post date: 10 Dec 2023

llama

Scout Monitoring

transformers

tinygrad

Related posts

Reading list to join AI field from Hugging Face cofounder

XLSTM: Extended Long Short-Term Memory

Schedule-Free Learning – A New Way to Train

HuggingFace Transformers: Qwen2

HuggingFace Transformers Release v4.36: Mixtral, Llava/BakLlava, SeamlessM4T v2

[D] What is a good way to maintain code readability and code quality while scaling up complexity in libraries like Hugging Face?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning hardware-buttons scrape-images linkedin-bot Post date: 10 Dec 2023

llama

Scout Monitoring

transformers

tinygrad

Related posts

Reading list to join AI field from Hugging Face cofounder

XLSTM: Extended Long Short-Term Memory

Schedule-Free Learning – A New Way to Train

HuggingFace Transformers: Qwen2

HuggingFace Transformers Release v4.36: Mixtral, Llava/BakLlava, SeamlessM4T v2

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning
hardware-buttons scrape-images linkedin-bot
Post date: 10 Dec 2023