nougat
semantic-kernel
nougat | semantic-kernel | |
---|---|---|
13 | 47 | |
8,103 | 18,454 | |
3.5% | 4.2% | |
7.5 | 9.9 | |
27 days ago | about 1 hour ago | |
Python | C# | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
nougat
-
Show HN: Talk to any ArXiv paper just by changing the URL
https://github.com/facebookresearch/nougat/tree/main
- FLaNK Stack for 04 December 2023
- Detexify LaTeX Handwriting Symbol Recognition
-
Pix2tex: Using a ViT to convert images of equations into LaTeX code
If you're looking for more e2e math / latex aware OCR checkout https://github.com/facebookresearch/nougat
- Nougat: Open-source LaTeX aware OCR for math-heavy books
-
Did anyone manage to get nougat running?
git clone --recurse-submodules https://github.com/facebookresearch/nougat.git PyProject
- Nougat: Facebook Research PDF to .mdd Model
-
Linear Book Scanner β The open-source automatic book scanner
> For the scientific literature, we need a ChatGPT equivalent to reconstruct LaTeX source that can reproduce each page. (We really need a successor to LaTeX that isn't such an arcane language, and can author fixed and flowable text with equal ease.)
Check out Nougat: OCRing scientific papers with a deep net trained end to end. It was released by Meta a few days ago.
βPDF format leads to a loss of semantic information, particularly for mathematical expressions. We propose Nougat (Neural Optical Understanding for Academic Documents), a Visual Transformer model that performs an Optical Character Recognition (OCR) task for processing scientific documents into a markup language, and demonstrate the effectiveness of our model on a new dataset of scientific documents.β
https://facebookresearch.github.io/nougat/
-
Nougat: Neural Optical Understanding for Academic Documents
The paper (and examples) as HTML: https://facebookresearch.github.io/nougat/
Repo with code, including a CLI tool for converting a PDF to Mathpix Markdown: https://github.com/facebookresearch/nougat
semantic-kernel
-
#SemanticKernel β πChat Service demo running Phi-2 LLM locally with #LMStudio
There is an amazing sample on how to create your own LLM Service class to be used in Semantic Kernel. You can view the Sample here: https://github.com/microsoft/semantic-kernel/blob/3451a4ebbc9db0d049f48804c12791c681a326cb/dotnet/samples/KernelSyntaxExamples/Example16_CustomLLM.cs
-
Semantic Tests for SemanticKernel Plugins using skUnit
This week, I had the chance to explore the SemanticKernel code base, particularly the core plugins. SemanticKernel comes equipped with these built-in plugins:
- FLaNK Stack for 04 December 2023
- Semantic Kernel
-
Getting Started with Semantic Kernel and C#
In this article we'll look at the high-level capabilities building AI orchestration systems in C# with Semantic Kernel, a rapidly maturing open-source AI orchestration framework.
-
Agency: Pure Go LangChain Alternative
I'm using Semantic Kernel (https://github.com/microsoft/semantic-kernel) and it's really nice. Makes building more complex workflows really simple without sacrificing control.
A bunch of examples (https://github.com/microsoft/semantic-kernel/blob/main/dotne...) for how to handle just about anything you need to do with OAI with a lot less boilerplate.
-
New: LangChain templates β fastest way to build a production-ready LLM app
I haven't tried it but there's Microsoft semantic-kernel.
https://github.com/microsoft/semantic-kernel
-
Overview: AI Assembly Architectures
Semantic Kernel github.com/microsoft/semantic-kernel
-
Automated Routing of Tasks to Optimal Models: A PR for Semantic-Kernel
The need for efficient model routing has been a point of discussion in the community. Addressing this, I've submitted a pull request to Semantic-Kernel that introduces an automated multi-model connector.
What are some alternatives?
LIMoE-pytorch - PyTorch implementation of LIMoE
langchain - β‘ Building applications with LLMs through composability β‘ [Moved to: https://github.com/langchain-ai/langchain]
libcolorpicker - Color Picker Library For iOS
langchain - π¦π Build context-aware reasoning applications
typst - A new markup-based typesetting system that is powerful and easy to learn.
guidance - A guidance language for controlling large language models.
advanced-brightness-slider-tweak - iOS Tweak that manipulates the brightness slider in the control center so the display brightness and the white point intensity can be modified
guidance - A guidance language for controlling large language models. [Moved to: https://github.com/guidance-ai/guidance]
NotiBlock - An iOS jailbreak tweak to write custom filters to block notifications
autogen - A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
LaTeX-OCR - pix2tex: Using a ViT to convert images of equations into LaTeX code.
private-gpt - Interact with your documents using the power of GPT, 100% privately, no data leaks