a2jauthor VS open-australian-legal-corpus-creator

Compare a2jauthor vs open-australian-legal-corpus-creator and see how they differ.

open-australian-legal-corpus-creator

The code used to create and update the Open Australian Legal Corpus, the first and only multijurisdictional open corpus of Australian legislative and judicial documents. (by umarbutler)
              a2jauthor                                   open-australian-legal-corpus-creator
Mentions      2                                           3
Stars         4                                           57
Growth        -                                           -
Activity      5.5                                         8.3
Last commit   about 1 month ago                           3 months ago
Language      JavaScript                                  Python
License       GNU General Public License v3.0 or later    MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

a2jauthor

Posts with mentions or reviews of a2jauthor. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-01-28.

open-australian-legal-corpus-creator

Posts with mentions or reviews of open-australian-legal-corpus-creator. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-26.
  • Show HN: Mapping almost every law, regulation and case in Australia
    1 project | news.ycombinator.com | 22 Mar 2024
    Hey HN,

    After months of hard work, I am excited to share the first ever semantic map of Australian law.

    My map represents the first attempt to map Australian laws, cases and regulations across the Commonwealth, States and Territories semantically, that is, by their underlying meaning.

    Each point on the map is a unique document in the [Open Australian Legal Corpus](https://huggingface.co/datasets/umarbutler/open-australian-l...), the largest open database of Australian law (which, full disclosure, I [created](https://umarbutler.com/how-i-built-the-largest-open-database...)). The closer any two points are on the map, the more similar they are in underlying meaning.

    As I cover in my article, there’s a lot you can learn by mapping Australian law. Some of the most interesting insights to come out of this initiative are that:

    ⦁ Migration, family and substantive criminal law are the most isolated branches of case law on the map;

    ⦁ Migration, family and substantive criminal law are the most distant branches of case law from legislation on the map;

    ⦁ Development law is the closest branch of case law to legislation on the map;

    ⦁ Case law is more of a continuum than a rigidly defined structure and the borders between branches of case law can often be quite porous; and

    ⦁ The map does not reveal any noticeable distinctions between Australian state and federal law, whether it be in style, principles of interpretation or general jurisprudence.

    If you’re interested in learning more about what the map has to teach us about Australian law or if you’d like to find out how you can create semantic maps of your own, check out the full article on my blog, which provides a detailed analysis of my map and also covers the finer details of how I built it, with code examples offered along the way.
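The idea in the post above, that documents closer together are more similar in meaning, can be illustrated with a toy sketch. The post as quoted here does not say how the map was built, so everything below is an assumption for illustration only: the document names and three-dimensional vectors are made up, and cosine similarity is just one common way to compare such vectors.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical embeddings: invented numbers, not taken from the corpus.
embeddings = {
    "migration_case": [0.9, 0.1, 0.0],
    "family_case": [0.1, 0.9, 0.1],
    "development_case": [0.2, 0.3, 0.9],
    "planning_act": [0.1, 0.3, 0.95],
}


def nearest(name):
    """Return the other document whose vector is most similar to `name`."""
    others = (key for key in embeddings if key != name)
    return max(
        others,
        key=lambda key: cosine_similarity(embeddings[name], embeddings[key]),
    )


print(nearest("development_case"))
```

In this made-up data the development case sits closest to the planning statute, which mirrors the kind of observation the post reports, but the numbers carry no evidential weight.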

  • I built the largest open database of Australian law
    1 project | news.ycombinator.com | 29 Oct 2023
    > Just one note - the link in your Github readme to https://umarbutler.com/open-australian-legal-corpus doesn't seem to go anywhere.

    Thanks for the heads up! I've fixed that now.

    > For someone interested in using the data (and help out with bugs/issues), where would you suggest starting?

    I think the best place to start is by downloading the Corpus (visit https://huggingface.co/datasets/umarbutler/open-australian-l... , and then click "Files and versions" and then "corpus.jsonl"). You can then use my Python library orjsonl to parse the dataset (you'd run, `corpus = orjsonl.load('corpus.jsonl')`). At that point, there's any number of applications you could use the dataset for. You could pretrain a model like BERT, ELECTRA, etc... and share it on HuggingFace. You could connect the dataset to GPT and do RAG over it. Etc...
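The loading step described in the post above can be sketched with the standard library alone. This is a minimal sketch, not the author's code: it assumes `corpus.jsonl` is an ordinary JSON Lines file (one JSON object per line), and the `type` and `text` field names and the sample records are hypothetical stand-ins, since the post does not list the corpus's fields.

```python
import json
import tempfile
from pathlib import Path


def load_jsonl(path):
    """Parse a JSON Lines file into a list, one object per non-empty line."""
    records = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                records.append(json.loads(line))
    return records


# Hypothetical sample records; the real corpus may use different field names.
sample = [
    {"type": "primary_legislation", "text": "An example Act."},
    {"type": "decision", "text": "The appeal is dismissed."},
]

with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "sample.jsonl"
    lines = "\n".join(json.dumps(record) for record in sample) + "\n"
    path.write_text(lines, encoding="utf-8")
    corpus = load_jsonl(path)

# Filter by the assumed "type" field once the records are in memory.
decisions = [doc for doc in corpus if doc["type"] == "decision"]
print(len(corpus), len(decisions))
```

For the full corpus, reading line by line rather than building one large list may be gentler on memory; the post's `orjsonl.load('corpus.jsonl')` call is the author's suggested shortcut for the same parsing step.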

  • Show HN: I created a first-of-its-kind open corpus of Australian law
    2 projects | news.ycombinator.com | 26 Jun 2023
    Hey HN, today I'm sharing my latest project, the Open Australian Legal Corpus, a first-of-its-kind multijurisdictional open corpus of Australian legislative and judicial documents. The idea behind this dataset was born a few months ago, when, while attempting to pretrain a BERT model for the Australian legal domain, I discovered that there was no freely accessible, openly licensed text corpus of Australian laws and cases that I could use. This was in contrast to the US, UK and EU which all had multiple large open legal corpora available. Thus, I set out to fill the gap in Australian legal AI research by compiling a dataset of as many in force Australian laws, regulations, bills and decisions as I could find. The end product was a corpus of 97,750 texts totalling over forty million lines and half a billion tokens, and spanning five states, one external territory and the Commonwealth.

    You can view the corpus on [HuggingFace](https://huggingface.co/datasets/umarbutler/open-australian-l...) and the code used to create it on [GitHub](https://github.com/umarbutler/open-australian-legal-corpus-c...).

What are some alternatives?

When comparing a2jauthor and open-australian-legal-corpus-creator you can also consider the following projects:

Actions - do the things https://actionprojects.github.io/Actions/

realestate-com-au-api - 🏠Python wrapper for the realestate.com.au API

docassemble-AssemblyLine - Quickly go from a paper court form to a runnable, guided, step-by-step web application powered by Docassemble. Swap out branding and pre-built questions to meet your needs.

open-australian-legal-corpus-c

licensee - A Ruby Gem to detect under what license a project is distributed.

Chinese-Names-Corpus - 中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。

ua-gec - UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

DragonBreath - "DragonBreath F.10 (American Constitutional Supreme and Mandatory Primary Source Case/Common Law Bulk Parser)"