unicode-eth vs UNIC
| | unicode-eth | UNIC |
|---|---|---|
| Mentions | 18 | 4 |
| Stars | 25 | 234 |
| Growth | - | 1.3% |
| Activity | 0.0 | 0.0 |
| Last commit | about 1 year ago | 6 days ago |
| Language | Solidity | Rust |
| License | MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
unicode-eth
- Unicode Ethereum Project
- An initiative to make the Unicode Character Database (UCD) available within Smart Contracts
-
I'm 15 ETH Away from Making the Unicode Character Database (UCD) Available on Rinkeby Testnet
I started the Unicode Ethereum Project to build better Unicode primitives and utilities for Solidity developers. Part of this endeavor requires making the Unicode Character Database (UCD) available to Solidity contracts, which takes a significant number of transactions (roughly 1.2k) to populate the data.
- using Unicode for string;
-
Show HN: I Made a Unicode Solidity Library
I built a smart contract that took user text input, validated it, and triggered an action. While building it, I was frustrated by the lack of string utilities and introspection tools available to Solidity, so I built some of my own. I open sourced the libraries and started the Unicode Ethereum Project to port the Unicode Character Database (UCD) to Ethereum:
https://github.com/devstein/unicode-eth
Check it out and let me know what you think!
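To illustrate the kind of UCD-backed validation the post describes, here is a minimal off-chain sketch in Python using the standard `unicodedata` module. It is not the unicode-eth API; the `is_valid_input` helper and the "letters, numbers, and spaces only" policy are assumptions chosen for illustration.

```python
import unicodedata

# Hypothetical policy: accept letters, numbers, and plain spaces only.
ALLOWED_MAJOR_CLASSES = {"L", "N"}  # Letter and Number general categories


def is_valid_input(text: str) -> bool:
    """Return True if text is non-empty and every character is a letter,
    a number, or U+0020 SPACE."""
    if not text:
        return False
    for ch in text:
        if ch == " ":
            continue
        # unicodedata.category returns the two-letter General_Category
        # from the UCD, e.g. "Lu", "Nd", "Po".
        if unicodedata.category(ch)[0] not in ALLOWED_MAJOR_CLASSES:
            return False
    return True
```

A contract would need the same General_Category data stored on chain, which is presumably why populating the UCD takes so many transactions.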
- Unicode Libraries and Contracts for Solidity
UNIC
-
I'm 15 ETH Away from Making the Unicode Character Database (UCD) Available on Rinkeby Testnet
For reference, here is an equivalent library in Rust: https://github.com/open-i18n/rust-unic/
-
icu vs rust_icu
There is also rust-unic, which provides both normalization and access to the character database. I have also used it for its text segmentation support, and I would probably recommend rust-unic in general. I hope to see more progress on that front.
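For readers unfamiliar with what normalization means here, the following is a small sketch using Python's standard `unicodedata` module rather than rust-unic itself. The `canonically_equal` helper is a hypothetical name introduced for this example.

```python
import unicodedata

composed = "\u00e9"     # "é" as a single code point
decomposed = "e\u0301"  # "e" followed by COMBINING ACUTE ACCENT


def canonically_equal(a: str, b: str) -> bool:
    """Compare two strings after normalizing both to NFC."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)
```

The two strings render identically but compare unequal as raw code points, which is the problem normalization addresses.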
-
Ć Programming Language
I try to be mindful of making my software as accessible as possible, but the following
> creating a lookup table for all the unicode material out there might've been considered impractical or performance-hitting for the developers.
just doesn't ring true to me in any way for current software. I understand that people can be using older software, which is why I strive to restrict myself to ASCII as much as possible for the widest possible support for my users, but my software also supports Unicode identifiers, up to and including a whole Unicode table to talk about confusables[1]. And not all TTS software "ignores" characters, which is why people advise against using 𝑓𝑎𝑛𝑐𝑦 Unicode because it doesn't get read as text but instead each character is described individually. (This is also something that TTS software should support for their users' sake, but I digress.)
[1]: this is thanks to the crate unic-ucd containing this information: https://github.com/open-i18n/rust-unic
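The point about styled letters being described character by character can be seen from their UCD names. This is a rough sketch with Python's standard `unicodedata` module, not the unic-ucd crate; `describe` and `fold_styled` are hypothetical helpers, and NFKC compatibility folding is a related technique, not the same thing as confusable detection.

```python
import unicodedata

# "fancy" written with Mathematical Italic Small letters (escaped for clarity).
styled = "\U0001D453\U0001D44E\U0001D45B\U0001D450\U0001D466"


def describe(text: str) -> list[str]:
    """Return the UCD character name for each code point in text."""
    return [unicodedata.name(ch, "<unnamed>") for ch in text]


def fold_styled(text: str) -> str:
    """Map compatibility variants (such as math alphabets) to plain letters."""
    return unicodedata.normalize("NFKC", text)
```

Here `describe(styled)[0]` is `"MATHEMATICAL ITALIC SMALL F"`, which is roughly what a screen reader may announce, while `fold_styled(styled)` recovers the plain ASCII word.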
-
Unicode sorting is hard & why browsers added special emoji matching to regexp
Regarding https://github.com/open-i18n/rust-unic, could it be that the project was superseded by https://github.com/unicode-org/icu4x?
What are some alternatives?
Fluent - Rust implementation of Project Fluent
regex - An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
textwrap - An efficient and powerful Rust library for word wrapping text.
whatlang-rs - Natural language detection library for Rust. Try demo online: https://whatlang.org/
cpc - Text calculator with support for units and conversion
datamatrix-fu - Data Matrix barcodes in the Fusion programming language
fut - Fusion programming language. Transpiling to C, C++, C#, D, Java, JavaScript, Python, Swift, TypeScript and OpenCL C.
JHipster - JHipster, much like Spring initializr, is a generator to create a boilerplate backend application, but also with an integrated front end implementation in React, Vue or Angular. In their own words, it "Is a development platform to quickly generate, develop, & deploy modern web applications & microservice architectures."
suffix - Fast suffix arrays for Rust (with Unicode support).
ngrams - (Read-only) Generate n-grams
cldr - The home of the Unicode Common Locale Data Repository
tabwriter - Elastic tabstops for Rust.