vapoursynth
unstructured
Our great sponsors
vapoursynth | unstructured | |
---|---|---|
10 | 12 | |
1,534 | 6,415 | |
2.7% | 23.0% | |
9.2 | 9.8 | |
4 days ago | 3 days ago | |
C++ | HTML | |
GNU Lesser General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
vapoursynth
- FLaNK 15 Jan 2024
- FFmpeg is getting better with multithreaded transcoding pipelines
-
Ffmprovisr – Making FFmpeg Easier
Since ffmpeg CLI still makes me pull ny hair out, I am going to plug vapoursynth:
https://www.vapoursynth.com/
Its Pythonic video filters... But also so much more: https://vsdb.top/
And Staxrip, which makes such good use of ffmpeg, vapoursynth, and dozens of other encoders and tools that I reboot from linux to Windows just to use it: https://github.com/staxrip/staxrip
-
[Guide] Installing av1an on Ubuntu 22.04
git clone https://github.com/vapoursynth/vapoursynth cd vapoursynth ./configure make sudo make install
-
Im making a video editor in Python. Yes, i'm crazy. No, it wont lag
Are you aware of Vapoursynth? https://www.vapoursynth.com/
-
Fast Real Time JavaScript Video Manipulation / Postprocessing
I have a few options here to process the individual frames for example using ImageData, which exposes the data as an array of pixels, so you could easily 'borrow' some VapourSynth filters for this:
-
AV1 encoder for Linux
Does the name AviSynth mean anything to you? If so and you want a similar Linux native tool "inspired" by AviSynth, to quote the website, check out http://www.vapoursynth.com/ Just bear in mind that the scripting language is different.
-
NVIDIA Optical Flow CUDA interface: CUarray vs CUdeviceptr
I'm a total newbie to CUDA. I'm trying to implement NVOF in VapourSynth video processing framework. I got the NVOF context initialized. Next step is the buffers!
-
How to compile Av1an on Windows without breaking your eggs
Download vapoursynth r57 portable from https://github.com/vapoursynth/vapoursynth/releases it's a 7z file so you should have 7-zip installed
-
[Guild] How to compile Av1an on Ubuntu 21.04
Download and compile vapoursynth 1) Go to vapoursynth’s gihub (https://github.com/vapoursynth/vapoursynth/releases) download/wget the tar.gz 2) extract it with tar -xf 3) cd into the folder 4) sudo ./autogen.sh 5) sudo ./configure 6) sudo make install
unstructured
-
LlamaCloud and LlamaParse
Be careful with unstructured:
https://github.com/Unstructured-IO/unstructured/blob/d11c70c...
from: https://github.com/open-webui/open-webui/issues/687
- FLaNK 15 Jan 2024
-
Bash One-Liners for LLMs
I’ve been looking at this
https://freeling-user-manual.readthedocs.io/en/v4.2/modules/...
at the freeling library in general, also spaCy and NLTK. The chunking algorithms being used in the likes of LangChain are remarkably bad surprisingly.
There is also
https://github.com/Unstructured-IO/unstructured
But I don’t like it, can’t explain why yet.
My intuition is that 1st step is clean sentences and paragraphs and titles/labels/headers. Then probably an LLM can handle outlining and table of contents generation using a stripped down list of objects in the text.
BRIO/BERT summarization could also have a role of some type.
Those are my ideas so far.
- Unstructured – OSS libraries and APIs to build custom preprocessing pipelines
-
More intelligent Pdf parsers
Unstructured is the best one I’ve used so far: https://www.unstructured.io
- Help extracting data from multiple PDF's
- Pre-processing text documents such as PDFs, HTML and Word Documents for LLMs
-
Using ChatGPT to read multiple PDFs and create writing using them as sources
https://www.unstructured.io/ can parse PDFs, then you can feed all of them to Claude, which has a 100k context window.
-
How can I convert restaurant’s traditional menu in pdf file to well structured list of menu items with prices in Excel file? Thank you
If the copy & pase method does not work: One approach is to use the functionality of Unstructured to parse the PDF. If need be, it can do OCR on the PDF too if you have Detectron2 installed. After conversion you would still have to save it as an excel file though.
-
PDF GPT allows you to chat with the contents of your PDF file
I would check out https://github.com/Unstructured-IO/unstructured (what lang chain uses) or https://github.com/axa-group/Parsr (probably what unstructured copied to get their startup off the ground lol)
What are some alternatives?
Av1an - Cross-platform command-line AV1 / VP9 / HEVC / H264 encoding framework with per scene quality encoding
llmsherpa - Developer APIs to Accelerate LLM Projects
moviepy - Video editing with Python
Parsr - Transforms PDF, Documents and Images into Enriched Structured Data
ffmpeg-python - Python bindings for FFmpeg - with complex filtering support
ragflow - RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
staxrip - 🎞 Video encoding GUI for Windows.
pdfGPT - PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files in a chatbot!
vidcutter - A modern yet simple multi-platform video cutter and joiner.
awesome-document-understanding - A curated list of resources for Document Understanding (DU) topic
FFMPerative - Chat to Compose Video
vault-ai - OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.