synth | tantivy | |
---|---|---|
14 | 18 | |
901 | 5,829 | Stars |
- | - | Growth |
8.1 | 9.3 | Activity |
over 1 year ago | over 2 years ago | |
Rust | Rust | Language |
Apache License 2.0 | MIT License | License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
synth
- Synth: A tool for generating realistic data using a declarative data model
-
Ask HN: Freelancer? Seeking freelancer? (October 2021)
SEEKING FREELANCER | London | Remote
Synth (YC S20) [1] is an open source declarative data generator written 100% in Rust.
We are looking for someone with prior experience writing Rust in production for a 1-to-3 month contract to work with us on our core open-source project.
- Proven experience writing production Rust code, preferably in a large code base
- Knowledge of PostgreSQL at a level sufficient to design and build reliable integrations
- Strong knowledge of data structures and algorithms
- Track record of contribution to open-source projects, preferably on GitHub
- Ability to work quickly and rigorously in a fully remote setting
If that sounds interesting, we want to talk to you! Shoot me an email at damien [at] getsynth.com!
[1]: https://github.com/getsynth/synth
-
Ask HN: Who is hiring? (October 2021)
Synth | Rust Software Engineer | Full Time or Part Time | London | Onsite(London)/Remote
About us: Synth is an open source declarative data generator (https://github.com/getsynth/synth). We are building Synth with the intention of solving, once and for all, the problem of generating realistic data for testing - helping big companies and small developers avoid the use of production data in testing.
Our mission is to build amazing developer tools that solve data privacy without forcing users to compromise on productivity. We have a few exciting products in our pipeline and we're backed by Y Combinator and other great investors. We're based in London and building a remote-friendly culture.
We work exclusively on open source software. This is great because our community is not confined to just our core team and the users, but also includes our contributors - we believe it is way more fun this way.
We're using Rust for our main line of products - and what we would like to see ideally is:
* You have some experience with Rust that has connected you with at least one of: asynchronous I/O, meta-programming or common patterns for concurrency. Having been involved in an open-source Rust project is a bonus!
-
Creating students dataset random data
Take a look at this Rust library (which works very well with Python modules that generate data in certain formats): https://github.com/getsynth/synth
-
What's everyone working on this week (29/2021)?
Putting the finishing touches on a procedural macro to bind Rust code to koto we want to use in synth. Also a blog post about it is on the way.
-
What's everyone working on this week (28/2021)?
I'm working on synth (https://github.com/getsynth/synth). Also working on a personal project, implementing TCP in Rust for the fun of it.
-
Are you using Rust at work? If yes, for what?
We use Rust to build synth, the open source declarative data generator.
-
Tired of creating test data by hand, we've built an open source data generator
Hey HN! We're Synth - a bunch of engineers out of Europe building tooling for developers. We're very excited about what we're working on and wanted to share it with the community.
We've been quite frustrated with the status quo of test data generation - after speaking to tons of other devs we've realised that many people are struggling when it comes to generating realistic looking test data.
Also, where people don’t want to copy sensitive production data to testing environments, data obfuscation can be a huge time-sink.
Enter Synth: a declarative data generator (see our website: https://getsynth.com/, github: https://github.com/getsynth/synth)
Synth enables devs and dev teams to have their application data models as code (basically a hierarchy of files) in their repos. These files can then be used to generate data for a local dev environment, automated testing in CI or even for sharing across organisations. The parameters of generation can also be tweaked to push the data model to its limits for QA, and even scaled for load testing / performance testing.
We're now working on taking the next step, and building a DSL around Synth. The Synth DSL will enable users to concisely define what data should look like and get going.
We're open source and written 100% in Rust. We believe that by making test data as easy to use as production data, we can improve security and privacy for all of us. We'd love to get more early users, as the initial feedback is positive but limited.
Thank you and looking forward to any feedback / ideas about how we can build a better tool for you!
P.S. Synth [launched on HN a while back](https://news.ycombinator.com/item?id=24198114) as an ML solution to create realistic (and safe) copies of your sensitive production data as a service. That approach quickly hit several limitations and couldn't address the use cases we are trying to solve. Happy to go into more detail on this if anyone is interested.
-
What's everyone working on this week (23/2021)?
I'm currently trying to improve the vtable dispatch in koto (because I want to use it in synth).
-
Are you happy after changing to a Rust job?
Luckily, not all Rust jobs are crypto jobs. I'm in my third Rust job working on synth right now and am 100% happy with it.
tantivy
-
Hey y'all back again w/ the personal, self-hosted search engine
Backend uses tantivy to index the web pages, sqlite3 to hold metadata / crawl queue
- Ask HN: What are some good rust code to read to learn the language?
-
Looking for recommendations of well maintained open source rust codebases that I can look through/contribute to
Tantivy is a very well-made library and also follows a lot of best practices. If you like search, you'll like this: https://github.com/quickwit-inc/tantivy
-
self hosted elasticsearch alternative
tantivy - More of a search engine library than out of the box solution
-
Whats your favourite open source Rust project that needs more recognition?
Tantivy search engine.
-
Is there a library for instant arbitrary text searching?
You could try the Tantivy crate, with an n-gram tokenizer, which would split and index your text in sliding groups of n characters.
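To make the "sliding groups of n characters" idea concrete, here is a minimal standalone sketch in plain Rust. It does not use Tantivy's own tokenizer or index API; `char_ngrams` and `build_index` are hypothetical helper names chosen for illustration, and the in-memory map is only a toy stand-in for a real inverted index.

```rust
use std::collections::{BTreeSet, HashMap};

/// Split `text` into overlapping character n-grams (a sliding window of `n` chars).
/// Returns an empty vector when `n` is zero or the text has fewer than `n` chars.
/// Hypothetical helper for illustration, not part of Tantivy.
fn char_ngrams(text: &str, n: usize) -> Vec<String> {
    // Work on chars rather than bytes so multi-byte UTF-8 text is never split mid-character.
    let chars: Vec<char> = text.chars().collect();
    if n == 0 || chars.len() < n {
        return Vec::new();
    }
    chars
        .windows(n)
        .map(|window| window.iter().collect::<String>())
        .collect()
}

/// Build a toy inverted index: each n-gram maps to the set of document ids containing it.
/// Hypothetical helper; a real search library stores far more than this.
fn build_index(docs: &[&str], n: usize) -> HashMap<String, BTreeSet<usize>> {
    let mut index: HashMap<String, BTreeSet<usize>> = HashMap::new();
    for (doc_id, doc) in docs.iter().enumerate() {
        for gram in char_ngrams(doc, n) {
            index.entry(gram).or_default().insert(doc_id);
        }
    }
    index
}
```

With this sketch, `char_ngrams("rust", 2)` yields the grams `ru`, `us`, and `st`, and a query fragment can be matched against any position in the text by looking up its own n-grams in the map returned by `build_index`. That positional freedom is what makes n-gram indexing a reasonable fit for arbitrary substring search, at the cost of a larger index.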
-
Zest: a CLI tool for zettelkasten-like note management
I had to look up the "tantivy" that README mentions. https://github.com/tantivy-search/tantivy. Might want to add a link to the project in your README.
-
Are you using Rust at work? If yes, for what?
We're using Rust for a domain-specific search engine. When I first learned Rust some years ago my first thought was that this language is perfect for heavy text processing. IMO, &str is that single killer feature that got me sold :) The search engine that we're building is based on https://github.com/tantivy-search/tantivy.
- Tantivy, a full-text search engine library in Rust inspired by Apache Lucene
-
Tantivy v0.15 released! Now backed by Quickwit Inc.!
Well spotted. Like IPFS, there's a comment about that here: https://github.com/tantivy-search/tantivy/pull/1067#issuecomment-853139923 that points to the distributed wikipedia mirror project https://github.com/ipfs/distributed-wikipedia-mirror/issues/76
What are some alternatives?
faker - Faker is a Python package that generates fake data for you.
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
content - The content behind MDN Web Docs
tantivy-wasm
aboba - Yet another audio book player (mobile friendly)
pueue - :stars: Manage your shell commands.
gdbstub - An ergonomic, featureful, and easy-to-integrate implementation of the GDB Remote Serial Protocol in Rust (with no-compromises #![no_std] support)
neon - Rust bindings for writing safe and fast native Node.js modules.
rouille - Rust programming, in French.
neuron - Future-proof note-taking and publishing based on Zettelkasten (superseded by Emanote: https://github.com/srid/emanote)
n8n - Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.
zk - A plain text note-taking assistant