hfst
kbnf
hfst | kbnf | |
---|---|---|
3 | 6 | |
116 | 0 | |
0.9% | - | |
4.0 | 0.0 | |
about 1 month ago | about 1 year ago | |
C++ | ||
GNU General Public License v3.0 only | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
hfst
- A portable, modern regular expression language
-
Search-and-replace with correct grammatical case - does it exist?
So you want to go from Spende to Spende+genitive to Beitrag+genitive to Beitrags. In addition to Spacy etc. you might look at Finite State Transducers, which I believe are bidirectional, i.e. for both analysis and generation. XFST and SFST and OpenFST are a few of the FST toolkits. See https://github.com/hfst/hfst for the Helsinki FST; there's a German transducer for it at https://sourceforge.net/projects/hfst/files/resources/morphological-transducers/hfst-german-installable.tar.gz/download. I don't think there is much of a learning curve, and there should be plenty of documentation.
-
Foldable Words
The regex syntax is a bit quirky due to backwards compatibility with lexicons written in XFST, see https://github.com/hfst/hfst/wiki/Regular-Expression-Operato...
kbnf
- Ask HN: Please Review My Metalanguage
- Please Review My Metalanguage
-
A portable, modern regular expression language
The fundamental problem comes from assigning meaning to whitespace (in this case, concatenation). I had the same issues when developing KBNF ( https://github.com/kstenerud/kbnf/blob/master/kbnf.md ) which operates in a closely related space.
In early development, I took a number of cues from regex that turned out to be bad ideas, in particular using whitespace for concatenation (which all BNF dialects seem to do).
Switching to '&' for concatenation fixed it and made things a lot clearer, as it would also do for Pomsky:
'Hello' & ' '+ & ('world' | 'pomsky')
-
Ask HN: Please help me by reviewing KBNF
Hi everyone! Merry Christmas and happy new year!
I'm making a modernized BNF-style metalanguage that supports grammars for text and binary formats, but unless there's some kind of outside review, I won't find any blind spots I have until long after release.
If you have experience in grammars or protocols (text or binary), could you take a quick look?
Note: I'm not aiming for perfect expressivity; just 80-90% of use cases without making things overcomplicated so that it can be used descriptively in documentation.
https://github.com/kstenerud/kbnf/blob/master/kbnf.md
-
Social media may prevent users from reaping creative rewards of profound boredom
Boredom is what led to pretty much every project I've ever done. I usually have a list in my head of things that could be made better, but they never come to anything until I have a lengthy period of nothing to do.
Latest example: I had to take all my vacation time this year or else I'd lose it. I got so bored that I built this over the past weeks: https://github.com/kstenerud/kbnf
What are some alternatives?
HFSM2 - High-Performance Hierarchical Finite State Machine Framework
ReadingQuarkXpress - Some code to read certain Quark Xpress files
lttoolbox - Finite state compiler, processor and helper tools used by apertium
rx - Standalone version of Emacs' rx macro
apertium - Core tools (driver script, transfer, tagger, formatters) for the FOSS RBMT system Apertium
common-regex - Most common regex
simplenlg - Java API for Natural Language Generation. Originally developed by Ehud Reiter at the University of Aberdeen’s Department of Computing Science and co-founder of Arria NLG. This git repo is the official SimpleNLG version.
ReadableRegex.jl - regexes for people who don't really want to learn or read regexes
apertium-lex-tools - Module for compiling lexical selection rules and processing them in the pipeline.
logstash-patterns - Grok patterns for parsing and structuring log messages with logstash
fluent-plugin-grok-parser - Fluentd's Grok parser