generative-manim
TheoremQA
generative-manim | TheoremQA | |
---|---|---|
5 | 2 | |
203 | 152 | |
- | - | |
7.3 | 7.6 | |
13 days ago | 16 days ago | |
Jupyter Notebook | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
generative-manim
-
Intuitive Guide to Convolution
https://github.com/360macky/generative-manim :
> Generative Manim is a prototype of a web app that uses GPT-4 to generate videos with Manim. The idea behind this project is taking advantage of the power of GPT-4 in programming, the understanding of human language and the animation capabilities of Manim to generate a tool that could be used by anyone to create videos. Regardless of their programming or video editing skills.
"TheoremQA: A Theorem-driven [STEM] Question Answering dataset" (2023) https://github.com/wenhuchen/TheoremQA#leaderboard
How do you score memory retention and video watching comprehension? The classic educators' optimization challenge
"Khan Academy’s 7-Step Approach to Prompt Engineering for Khanmigo"
- Generative Manim
-
Generative Manim: An experiment to generate Manim code with AI
Experiment App: https://generative-manim.streamlit.app
-
How can I only export animations and not images?
To build this project.
TheoremQA
-
Intuitive Guide to Convolution
https://github.com/360macky/generative-manim :
> Generative Manim is a prototype of a web app that uses GPT-4 to generate videos with Manim. The idea behind this project is taking advantage of the power of GPT-4 in programming, the understanding of human language and the animation capabilities of Manim to generate a tool that could be used by anyone to create videos. Regardless of their programming or video editing skills.
"TheoremQA: A Theorem-driven [STEM] Question Answering dataset" (2023) https://github.com/wenhuchen/TheoremQA#leaderboard
How do you score memory retention and video watching comprehension? The classic educators' optimization challenge
"Khan Academy’s 7-Step Approach to Prompt Engineering for Khanmigo"
-
I asked 60 LLMs a set of 20 questions
Additional benchmarks:
- "TheoremQA: A Theorem-driven Question Answering dataset" (2023) https://github.com/wenhuchen/TheoremQA#leaderboard
- legalbench
What are some alternatives?
streamlit-manim - Seeing if I can put together an interactive version of 3b1b's Manim in Streamlit
GodMode - AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.
manim-on-streamlit - Example Streamlit app that you can fork to test out share.streamlit.io
fiddler-auditor - Fiddler Auditor is a tool to evaluate language models.
streamlit-manim - Seeing if I can put together an interactive version of 3b1b's Manim in Streamlit
LocalAI - :robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.
promptfoo - Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.
ollama - Get up and running with Llama 3, Mistral, Gemma, and other large language models.
bench - A tool for evaluating LLMs