clerk

LLM based file organizer (by blankenshipz)

Clerk Alternatives

Similar projects and alternatives to clerk based on common topics and language

  • smartgpt

    A program that provides LLMs with the ability to complete complex tasks using plugins.

  • hoard

    cli command organizer written in rust (by Hyde46)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • TheReader

    An organizer for you books (hopefully audio too) written in Rust Lang.

  • anansi

    open source tooling for AI search and understanding (by infrawhispers)

  • houston

    Simple GPT-based command and script generator for the terminal (by npgrosser)

  • rustgpt

    Yet another command-line chat GPT frontend written in Rust.

  • auto-rust

    auto-rust is an experimental project that aims to automatically generate Rust code with LLM (Large Language Models) during compilation, utilizing procedural macros.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better clerk alternative or higher similarity.

clerk reviews and mentions

Posts with mentions or reviews of clerk. We have used some of these posts to build our list of alternatives and similar projects.
  • Show HN: Out-of-the-box text classification models
    1 project | news.ycombinator.com | 18 Mar 2024
    This is fantastic, but as you note on your launch page, people are going to need custom topic taxonomies. We use several custom ones, maintained as YAML that non-technical users can edit.

    I'm guessing from having been looking for a project like yours for a decade now, that it's that custom taxonomy problem that means most OOTB don't work for people, so they make their own which they don't open source because they ended up ... tailoring ... a topic text classifier for themselves.

    The only thing I've found close to this "OOTB" is:

    https://cloud.google.com/natural-language/docs/classifying-t...

    https://cloud.google.com/natural-language/docs/categories#ca...

    And, to be frank, I can't see why I'd send my confidential information to you when I can send it to Google. (Ahem!)

    But the problem with theirs and yours is the OOTB categories are for a global topic set, something like Yahoo directory, rather than for a given discipline.

    I've found the general lists, like LCM[^1] (what you really want is LCSH subject headings, not LCM[^2]), too broad for my business or personal content, while something like ACM[^3] is more what's needed for, say, computing related content.

    For a firmwide knowledge base at a {field}-tech firm, you have a mix of the firm's focus field, and computing, and a broad scope fallback like you're starting with. Even libraries have their own topic hierarchy! [^4]. Plenty fields have controlled vocabularies[^6], and if you can't find one for a field, you can usually generate one by finding someone who is already classifying that field, and looking at their TOC. All of which is to say, to be generally useful, you have to let people BYOT (bring your own topics) for this.

    For instance, we built our topic list based on combining a reference taxonomy for our field, a reference taxonomy for computing, a reference taxonomy for business books, and the Google NLP tool mentioned above.

    There are occasional tools that try to match arbitrary documents to arbitrary hierarchies such as clerk [^5] but they are challenging for various reasons.

    You have a note to contact you for different topics, but raising this here since so far (6 hours) you had no feedback, and I'm a big fan of what you're doing and the niche is underserved.

    A couple other thoughts:

    Aside from topics taxonomy or hierarchy, we've recently found that something like properties based classification proves needed when we're 10K+ to 100K+ short and long form content documents in the knowledge base. For instance, https://en.wikipedia.org/wiki/Colon_classification, that adds "facets" like time dimension. This is incredibly helpful for relevance while still being able to drill in and just browse a topics/branch/leaf.

    I really like your "intent" classification, far more interesting than sentiment, since it could help separate blog posts from new articles, self-guided tutorials from reviews, and so on: Problem Solving, News, Informational, maybe?. Sifting these to focus a robust KB can be tremendously valuable.

    Your privacy policy is by-and-large useless, since the information being classified is unlikely personal (PII) class, and more likely confidential or non-public (NPI) class.

    You are, effectively, saying "let us have a copy of all info you're classifying", yet nowhere on your main site nor docs site do you explain how you actively prevent yourselves from seeing an API user's information.

    Ideally your "architecture" would explain how you built it to be able to do the work without you being able to see the content, not just a "pinky swear we won't look" sort of promise. Many businesses have their own confidentiality and privacy policies. Those require looping in subprocessors, which is you, and right now you can't be used.

    [^1]: https://en.wikipedia.org/wiki/Library_of_Congress_Classifica...

    [^2]: https://id.loc.gov/authorities/subjects.html

    [^3]: https://en.wikipedia.org/wiki/ACM_Computing_Classification_S...

    [^4]: https://www.ala.org/tools/topics/atoz

    [^5]: https://github.com/blankenshipz/clerk/tree/main

    [^6]: https://pitt.libguides.com/metadatadiscovery/controlledvocab...

Stats

Basic clerk repo stats
1
10
0.0
about 1 year ago

The primary programming language of clerk is Rust.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com