awesome-chatgpt-prompts
bookcorpus
Our great sponsors
awesome-chatgpt-prompts | bookcorpus | |
---|---|---|
157 | 3 | |
103,383 | 778 | |
- | - | |
7.0 | 3.1 | |
25 days ago | 10 months ago | |
HTML | Python | |
Creative Commons Zero v1.0 Universal | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-chatgpt-prompts
- Top ChatGPT prompts I could find with ranking system
- FLaNK Stack Weekly 12 February 2024
-
🌌 5 Open-Source GPT Wrappers to Boost Your AI Experience 🎁
Aside from the built-in prompts powered by awesome-chatgpt-prompts (Are you an ETH dev, a financial analyst, or a personal trainer today?), you can also create, share and debug your chat tools with prompt templates.
- Aprimorando as respostas do ChatGPT com prompts estratégicos
-
Ask HN: Daily practices for building AI/ML skills?
I've found the following resources helpful:
- 15 Rules For Crafting Effective GPT Chat Prompts (https://expandi.io/blog/chat-gpt-rules/)
- Awesome ChatGPT Prompts (https://github.com/f/awesome-chatgpt-prompts)
For more resources of like nature, you can search for "mega prompt".
-
Prompt writing communities
Someone assembled an adhoc page in Github that is amassing quite a large library of prompt ideas [Github]
-
Ask HN: Collection of best GPT-4 prompts?
I like to use PromptLayer for this. But you could easily set up a simple CRUD web app to track prompts/average completion token # length, different variations.
There is also awesome-chatgpt-prompts (https://github.com/f/awesome-chatgpt-prompts) which has some interesting ones. What are you looking for?
- Supercharge your writing with ChatGPT prompts
-
Introducing YourChat: A multi-platform LLM chat client that supports the APIs of text-generation-webui and llama.cpp.
* Built-In Prompts: Channel creativity using integrated prompts sourced from github.com/f/awesome-chatgpt-prompts.
-
Yet another ChatGPT generated workout... but modified.
So, I jumped into the ChatGPT fitness wagon to generate a New And Improved® workout that will have a mix of bodybuilding and calisthenics. I used a pre-made prompt to generate a PPL+FB and specified things like fitness leve, equipment, schedules, etc. in order to make if fit my current status. From there I made it fit some of my needs and chose some exercises that I wanted to do every day: wrist and core.
bookcorpus
- Show HN: New AI Dataset Based on LibGen and Sci-Hub
- Can chat GPT overtake Google if they play their cards right?
-
On the Danger of Stochastic Parrots [pdf]
The GPT-3 paper (section 2.2) mentions using two datasets referred to as "books1" and "books2", which are 12B and 55B byte pair encoded tokens each.
Project Gutenberg has 3B word tokens I believe, so it seems like it could be one of them, assuming the ratio of word tokens to byte-pair tokens is something like 3:12 to 3:55.
Another likely candidate alongside Gutenberg is libgen, apparently, and looks like there have been successful efforts to create a similar dataset called bookcorpus: https://github.com/soskek/bookcorpus/issues/27). The discussion on that github issue suggests bookcorpus is very similar to "books2", which would make gutenberg "books1"?
This might be why the paper is intentionally vague about the books used?
What are some alternatives?
ChatGPT-pdf - A Chrome extension for downloading your ChatGPT history to PNG, PDF or a sharable link
Replicate-Toronto-BookCorpus - This repository contains code to replicate the no-longer publicly available Toronto BookCorpus dataset
gpt-prompts-cli - CLI for selecting or defining prompts to use with the ChatGPT chatbot, which retrieves the prompts from the awesome-chatgpt-prompts repository.
instagram-scraper - scrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot
langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]
trafilatura - Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
gpt_index - LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM's with external data. [Moved to: https://github.com/jerryjliu/llama_index]
open-discourse - Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).
llm-workflow-engine - Power CLI and Workflow manager for LLMs (core package)
stc - Distributed free search engine and AI tools that grant access to knowledge
chatgpt-google-extension - A browser extension that enhance search engines with ChatGPT
korean-word-ipa-dictionary - Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)