base2048 | DumbIdeas | |
---|---|---|
18 | 6 | |
821 | 67 | |
- | - | |
4.0 | 4.1 | |
3 months ago | 8 months ago | |
JavaScript | Python | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
base2048
- How does Base32 (or any Base2^n) work exactly?
- Show HN: Host a Website in the URL
-
What digit bases do you like?
qntm did a fun project of using larger bases, constrained to subsets of unicode instead of ASCII like base64. It's specifically for social channels where you're constrained by the number of code points, but not bytes, so you want to maximize data per code point. base2048 is pretty impressive, and base32768 is just absurd.
-
Android 15's dessert name is "Vanilla Ice Cream"
Numbers don't have to loop until 32k or 2048 (or even up to 1,112,064, until such point the Unicode standard allows for more)
-
Twitter's anti-Mastodon filter evasion
On a semi-related note, they mention base64 encoding messages to evade filters. There were actually other base{n} methods [1] created specifically for Twitter to be more space optimized though not as readily available to operating systems. I guess this is less useful if they are really expanding the text limit to 4k soon but figured I would add it in the event they add a parser for base64.
[1] - https://github.com/qntm/base2048
-
A rust crate that lets you compress ASCII text to a single Unicode "character"
Actually, in the case of twitter they do some weird counting. It is mostly based on codepoints, true, but some codepoints are considered "heavy" and are counted twice, see https://github.com/qntm/base2048
-
New Twitter TOS
I know of one case of Twitter doing client side validation [1]. Maybe there are more?
1 - https://github.com/qntm/base2048#note
- Base 2048
-
Hacker News top posts: May 7, 2022
Base 2048\ (15 comments)
DumbIdeas
-
[2022 Days 1-10] [Python] A graphical representation of The Beast, a single line of code that solves every day's challenge so far!
Decided to try running it through this program, which compresses python into an extremely cursed line that can be run, the creator is able to better explain it than I ever could. It turned out as
-
b='Ę͌͌ͅ͏̰̀͒̏͒͏͇͍͍͈͕͍͒́͒ͅ͏͒́'.encode();print(''.join(chr(((h<<6&64|c&63)+22)%133+10)for h,c in zip(b[1::2],b[2::2])))
Original Python Implementation (Includes a gui, thanks to u/MR_sticky_piston)
-
Day 21. Debug time! You can edit as many lines as you want, or add new lines without changing its intended behavior. The changes from the most upvoted comment get applied.
If you add this on the line before all the beeps, that'll make it linux friendly.
-
A rust crate that lets you compress ASCII text to a single Unicode "character"
This is my port of the incredible python code by /u/_DaCoolOne_. It lets you convert ascii text, e.g. python code, to a single Unicode character in a reversible way. This also lets you refactor your code into a single line! Imagine the readability!
-
T̶h̶e̶ ̶m̶o̶s̶t̶ ̶u̶p̶v̶o̶t̶e̶d̶ ̶c̶o̶m̶m̶e̶n̶t̶ ̶p̶i̶c̶k̶s̶ ̶t̶h̶e̶ ̶n̶e̶x̶t̶ ̶l̶i̶n̶e̶ ̶o̶f̶ ̶c̶o̶d̶e̶:̶ ̶D̶a̶y̶ ̶1̶3̶. Refactor time! Rewrite lines 10-12, so their behavior doesn't change. Whoever gets it with the least amount of symbols, wins. You can use multiple lines.
(Backup link as Reddit formatting will probably bastardize this code: https://github.com/DaCoolOne/DumbIdeas/blob/main/reddit\_ph\_compressor/out.py)
What are some alternatives?
ecoji - Encodes (and decodes) data as emojis
aco2022
Base256 - Encode and decode data in base 256 easily typed words
totally-safe-transmute
hatetris - Tetris which always gives you the worst piece
advent-of-code - My (mostly) python solutions to Advent of Code
Assemblies-of-putative-SARS-CoV2-spike-encoding-mRNA-sequences-for-vaccines-BNT-162b2-and-mRNA-1273 - RNA vaccines have become a key tool in moving forward through the challenges raised both in the current pandemic and in numerous other public health and medical challenges. With the rollout of vaccines for COVID-19, these synthetic mRNAs have become broadly distributed RNA species in numerous human populations. Despite their ubiquity, sequences are not always available for such RNAs. Standard methods facilitate such sequencing. In this note, we provide experimental sequence information for the RNA components of the initial Moderna (https://pubmed.ncbi.nlm.nih.gov/32756549/) and Pfizer/BioNTech (https://pubmed.ncbi.nlm.nih.gov/33301246/) COVID-19 vaccines, allowing a working assembly of the former and a confirmation of previously reported sequence information for the latter RNA. Sharing of sequence information for broadly used therapeutics has the benefit of allowing any researchers or clinicians using sequencing approaches to rapidly identify such sequences as therapeutic-derived
RedditWritesCode - The mirror of the series "The most upvoted comment picks the next line of code" on Reddit by https://www.reddit.com/user/AggravatingCorner133/
TatSu - 竜 TatSu generates Python parsers from grammars in a variation of EBNF
nomicon - The Dark Arts of Advanced and Unsafe Rust Programming
klaxon - Klaxon enables reporters and editors to monitor scores of sites on the web for newsworthy changes.
Twitter Text Obj - Twitter Text Libraries. This code is used at Twitter to tokenize and parse text to meet the expectations for what can be used on the platform.