open-parse vs mitta-community

open-parse

Improved file parsing for LLM’s (by Filimoa)

Suggest topics

Source Code

filimoa.github.io

Suggest alternative

Edit details

mitta-community

Community repository for MittaAI users. (by MittaAI)

Suggest topics

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

open-parse		mitta-community
	Project
3	Mentions	12
1,782	Stars	12
-	Growth	-
9.2	Activity	9.7
8 days ago	Latest Commit	about 12 hours ago
Python	Language	Python
MIT License	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

open-parse

Posts with mentions or reviews of open-parse. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-07.

Show HN: Beyond text splitting – improved file parsing for LLM's
4 projects | news.ycombinator.com | 7 Apr 2024
Running OCR against PDFs and images directly in the browser
7 projects | news.ycombinator.com | 30 Mar 2024

I recently built a similar tool except it’s configured to use some deep learning libraries for the table extraction. I’m excited to integrate unitable which has state of the art performance later this week.
I built this because most of the basic layout detection libraries have terrible performance on anything non trivial. Deep learning is really the long term solution here.
https://github.com/Filimoa/open-parse
Show HN: Open-source, high performance document chunking for LLM's
1 project | news.ycombinator.com | 28 Mar 2024

mitta-community

Posts with mentions or reviews of mitta-community. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-30.

Running OCR against PDFs and images directly in the browser
7 projects | news.ycombinator.com | 30 Mar 2024

Here's an EasyOCR service: https://github.com/MittaAI/mitta-community/tree/main/service.... A PDF to image processor is being built and should be out in a few weeks.
No docs, but happy to help anyone wanting to use it. Email is kord @ the company I'm working on.
LaVague: Open-source Large Action Model to automate Selenium browsing
8 projects | news.ycombinator.com | 13 Mar 2024

I built this with Playwright and OpenAI's function calling stuff (sorry, no time for docs): https://github.com/MittaAI/mitta-community/tree/main/service...
My thought was to put the results of this in a vector store, with any errors that resulted as opposed to wasting time training a model.
AI FFmpeg
1 project | news.ycombinator.com | 29 Jan 2024

Source for the FFmpeg build (no AI): https://github.com/MittaAI/mitta-community/tree/main/service...
An Amount of the Web Is Machine Translated: Insights from Multi-Way Parallelism
1 project | news.ycombinator.com | 16 Jan 2024

I just built a translate pipeline for MittaAI: https://github.com/MittaAI/mitta-community/tree/main/cookboo.... The pipelines crawl and translate any publicly available page. The `translate` pipeline uses Gemini, but could be changed to another model for use.
Show HN: Talk to any ArXiv paper just by changing the URL
5 projects | news.ycombinator.com | 20 Dec 2023

If you want a generalized version, try this: https://github.com/MittaAI/mitta-community/tree/main/cookboo...
The query pipeline isn't that sophisticated, but it could be altered to do page reference and use keyterms first to filter, instead of doing the vector similarity on all data.
Show HN: Hacker News Activity Analysis with GPT-4 Agent
1 project | news.ycombinator.com | 20 Dec 2023

> A service to preprocess the data with custom prompts would be neat.
It is pretty cool to mess about with it. I posted it last week, but didn't get any nibbles: https://github.com/MittaAI/mitta-community/tree/main/cookboo...
Gemini Is Google's Best AI Model Yet, but Who Cares?
1 project | news.ycombinator.com | 18 Dec 2023

It doesn't matter if the demos were good or not. Demos don't always go as planned. I've found using Gemini for image scene description and chat works well, and have it running a portion of an image scene to spoken "thoughts" pipeline using ElevenLabs for the voice: https://github.com/MittaAI/mitta-community/tree/main/cookboo...
Pirate Visualizer Pipeline with Gemini and dalle-3
1 project | news.ycombinator.com | 14 Dec 2023
Hacker News AI Pipeline for Mitta.ai
1 project | news.ycombinator.com | 12 Dec 2023
Vision to Speech Pipeline with OpenAI
1 project | news.ycombinator.com | 7 Dec 2023