pg_cjk_parser
Postgres CJK Parser pg_cjk_parser is a fts (full text search) parser derived from the default parser in PostgreSQL 11. When a postgres database uses utf-8 encoding, this parser supports all the features of the default parser while splitting CJK (Chinese, Japanese, Korean) characters into 2-gram tokens. If the database's encoding is not utf-8, the parser behaves just like the default parser. (by freewizard)
simonwillisonblog
The source code behind my blog (by simonw)
Our great sponsors
pg_cjk_parser | simonwillisonblog | |
---|---|---|
1 | 5 | |
4 | 46 | |
- | - | |
0.3 | 7.4 | |
over 1 year ago | 13 days ago | |
C | JavaScript | |
- | Apache License 2.0 |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pg_cjk_parser
Posts with mentions or reviews of pg_cjk_parser.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-07-27.
simonwillisonblog
Posts with mentions or reviews of simonwillisonblog.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-05-12.
-
Modernizing AWK, a 45-year old language, by adding CSV support
For anything down and dirty, what's wrong with -F'"'? For anything fancy there are plenty of things like the below.
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
includes csv to tsv: https://github.com/eBay/tsv-utils
- Ask HN: Share Your Personal Site
- Ask HN: Great Blogs by Programmers
-
Postgres Full-Text Search: A Search Engine in a Database
Exact phrase searching works in PostgreSQL full-text search - here's an example: https://simonwillison.net/search/?q=%22nosql+database%22
I'm using search_type=websearch https://github.com/simonw/simonwillisonblog/blob/a5b53a24b00...
That's using websearch_to_tsquery() which was added in PostgreSQL 11: https://www.postgresql.org/docs/11/textsearch-controls.html#...
What are some alternatives?
When comparing pg_cjk_parser and simonwillisonblog you can also consider the following projects:
hn-search - Hacker News Search
pg_search - pg_search builds ActiveRecord named scopes that take advantage of PostgreSQL’s full text search
zombodb - Making Postgres and Elasticsearch work together like it's 2022
pgvector - Open-source vector similarity search for Postgres