nougat
stability-sdk
nougat | stability-sdk | |
---|---|---|
13 | 116 | |
8,155 | 2,403 | |
3.5% | 0.3% | |
7.5 | 5.5 | |
28 days ago | 6 days ago | |
Python | Jupyter Notebook | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
nougat
-
Show HN: Talk to any ArXiv paper just by changing the URL
https://github.com/facebookresearch/nougat/tree/main
- FLaNK Stack for 04 December 2023
- Detexify LaTeX Handwriting Symbol Recognition
-
Pix2tex: Using a ViT to convert images of equations into LaTeX code
If you're looking for more e2e math / latex aware OCR checkout https://github.com/facebookresearch/nougat
- Nougat: Open-source LaTeX aware OCR for math-heavy books
-
Did anyone manage to get nougat running?
git clone --recurse-submodules https://github.com/facebookresearch/nougat.git PyProject
- Nougat: Facebook Research PDF to .mdd Model
-
Linear Book Scanner – The open-source automatic book scanner
> For the scientific literature, we need a ChatGPT equivalent to reconstruct LaTeX source that can reproduce each page. (We really need a successor to LaTeX that isn't such an arcane language, and can author fixed and flowable text with equal ease.)
Check out Nougat: OCRing scientific papers with a deep net trained end to end. It was released by Meta a few days ago.
“PDF format leads to a loss of semantic information, particularly for mathematical expressions. We propose Nougat (Neural Optical Understanding for Academic Documents), a Visual Transformer model that performs an Optical Character Recognition (OCR) task for processing scientific documents into a markup language, and demonstrate the effectiveness of our model on a new dataset of scientific documents.”
https://facebookresearch.github.io/nougat/
-
Nougat: Neural Optical Understanding for Academic Documents
The paper (and examples) as HTML: https://facebookresearch.github.io/nougat/
Repo with code, including a CLI tool for converting a PDF to Mathpix Markdown: https://github.com/facebookresearch/nougat
stability-sdk
- FLaNK Stack for 04 December 2023
-
dall-e-3 has been removed from the API! What's an alternative API service?
I use stable diffusion via stability.ai.
-
Stability AI launches SDXL 0.9: A Leap Forward in AI Image Generation — Stability AI
SDXL 0.9 is now available on the Clipdrop by Stability AI platform. Stability AI API and DreamStudio customers will be able to access the model this Monday, 26th June as well as other leading image generating tools like NightCafe.
-
Best place to start with creating my own ai program!?
And for your imaging purposes, you might explore things such as the new StableStudio that Stability AI has. Here's an article about StableStudio. Here are imaging things you can do in StableStudio via its API.
-
I'll Rate Your Channels....
I know there are tons are programs out there but I've made some really cool stuff with this free one https://beta.dreamstudio.ai then rotoscoped it onto a green screen then animated it
-
With twenty years of material, this series is a goldmine
Currently Midjourney is the best one, but you need to pay. For low to no cost you could use https://app.leonardo.ai/ or https://beta.dreamstudio.ai/
-
Arto de artefarita inteligenteco (Jen kiel vi povas fari ĝin)
https://beta.dreamstudio.ai/ (belaj bildoj)
-
The Future of AI Relies on a High School Teacher’s Free Database
> Of course new SD models are also on the horizon…
SDXL is available at https://beta.dreamstudio.ai/ though they say they're going to release more variants.
I think ControlNet is a lot more interesting than just "better tuned models"; it means there's no line between creating something yourself and asking an AI to do it anymore.
- Stability AI (Stable Diffusion) Releases DreamStudio Beta
-
“Anime of a Japanese robot signing a letter to delay AI progress”
(this is actually generated using the new sdxl beta model in https://beta.dreamstudio.ai )
What are some alternatives?
LIMoE-pytorch - PyTorch implementation of LIMoE
stable-diffusion-grpcserver - An implementation of a server for the Stability AI Stable Diffusion API
libcolorpicker - Color Picker Library For iOS
stable-diffusion - This version of CompVis/stable-diffusion features an interactive command-line script that combines text2img and img2img functionality in a "dream bot" style interface, a WebGUI, and multiple features and other enhancements. [Moved to: https://github.com/invoke-ai/InvokeAI]
typst - A new markup-based typesetting system that is powerful and easy to learn.
dnd-ai-art-bot - A Discord bot to generate AI art. Compatible with Dreamstudio
advanced-brightness-slider-tweak - iOS Tweak that manipulates the brightness slider in the control center so the display brightness and the white point intensity can be modified
jsoncrack.com - ✨ Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML, CSV and more, into interactive graphs.
NotiBlock - An iOS jailbreak tweak to write custom filters to block notifications
stable-diffusion-webui - Stable Diffusion web UI
LaTeX-OCR - pix2tex: Using a ViT to convert images of equations into LaTeX code.
material_stable_diffusion - Tileable Stable Diffusion - Cog model