TatSu
base2048
Our great sponsors
TatSu | base2048 | |
---|---|---|
4 | 18 | |
391 | 821 | |
1.8% | - | |
9.0 | 4.0 | |
about 1 month ago | 3 months ago | |
Python | JavaScript | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
TatSu
- GitHub - neogeny/TatSu: TatSu generates Python parsers from grammars in a variation of EBNF
-
Hacker News top posts: May 7, 2022
TatSu takes grammars in variation of EBNF, outputs memoizing Python PEG parsers\ (3 comments)
- TatSu takes grammars in variation of EBNF, outputs memoizing Python PEG parsers
base2048
- How does Base32 (or any Base2^n) work exactly?
- Show HN: Host a Website in the URL
-
What digit bases do you like?
qntm did a fun project of using larger bases, constrained to subsets of unicode instead of ASCII like base64. It's specifically for social channels where you're constrained by the number of code points, but not bytes, so you want to maximize data per code point. base2048 is pretty impressive, and base32768 is just absurd.
-
Android 15's dessert name is "Vanilla Ice Cream"
Numbers don't have to loop until 32k or 2048 (or even up to 1,112,064, until such point the Unicode standard allows for more)
-
Twitter's anti-Mastodon filter evasion
On a semi-related note, they mention base64 encoding messages to evade filters. There were actually other base{n} methods [1] created specifically for Twitter to be more space optimized though not as readily available to operating systems. I guess this is less useful if they are really expanding the text limit to 4k soon but figured I would add it in the event they add a parser for base64.
[1] - https://github.com/qntm/base2048
-
A rust crate that lets you compress ASCII text to a single Unicode "character"
Actually, in the case of twitter they do some weird counting. It is mostly based on codepoints, true, but some codepoints are considered "heavy" and are counted twice, see https://github.com/qntm/base2048
-
New Twitter TOS
I know of one case of Twitter doing client side validation [1]. Maybe there are more?
1 - https://github.com/qntm/base2048#note
- Base 2048
-
Hacker News top posts: May 7, 2022
Base 2048\ (15 comments)
What are some alternatives?
Lark - Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
ecoji - Encodes (and decodes) data as emojis
Covfefe - A parser for nondeterministic context free languages
Base256 - Encode and decode data in base 256 easily typed words
pyparsing - Python library for creating PEG parsers [Moved to: https://github.com/pyparsing/pyparsing]
hatetris - Tetris which always gives you the worst piece
participle - A parser library for Go
Assemblies-of-putative-SARS-CoV2-spike-encoding-mRNA-sequences-for-vaccines-BNT-162b2-and-mRNA-1273 - RNA vaccines have become a key tool in moving forward through the challenges raised both in the current pandemic and in numerous other public health and medical challenges. With the rollout of vaccines for COVID-19, these synthetic mRNAs have become broadly distributed RNA species in numerous human populations. Despite their ubiquity, sequences are not always available for such RNAs. Standard methods facilitate such sequencing. In this note, we provide experimental sequence information for the RNA components of the initial Moderna (https://pubmed.ncbi.nlm.nih.gov/32756549/) and Pfizer/BioNTech (https://pubmed.ncbi.nlm.nih.gov/33301246/) COVID-19 vaccines, allowing a working assembly of the former and a confirmation of previously reported sequence information for the latter RNA. Sharing of sequence information for broadly used therapeutics has the benefit of allowing any researchers or clinicians using sequencing approaches to rapidly identify such sequences as therapeutic-derived
merlin - Context sensitive completion for OCaml in Vim and Emacs
DumbIdeas
chpeg - Parsing Expression Grammar (PEG) bytecode parser/compiler library
klaxon - Klaxon enables reporters and editors to monitor scores of sites on the web for newsworthy changes.