llimo
wink-nlp
llimo | wink-nlp | |
---|---|---|
3 | 21 | |
10 | 1,152 | |
- | 1.5% | |
7.6 | 8.1 | |
23 days ago | 8 days ago | |
JavaScript | JavaScript | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
llimo
-
Show HN: Next-token prediction in JavaScript – build fast LLMs from scratch
This system predicts "was" as the next word because it usually is the next word after "dog" (in the source data). This library was built to ultimately provide completions, not have a conversation, so no doubt OpenAI's approach works better for chat.
I am however already making a chat model. Here's my approach if anyone cares: The completer already gives great completions and fast, but some of them make no sense to what was asked. The chat model I'm working on here (https://github.com/bennyschmidt/llimo/pull/1) can just get all completions and use parts-of-speech codes to match a completion to the cursor. I don't have this fully implemented yet, but you can get the idea in this PR. This is like an NLP layer specific to chat - has nothing to do with the next-token prediction in general, and there are no NLP libraries in `next-token-prediction` (the npm). The example I've been using to explain this is:
User: "Where is Paris?"
wink-nlp
-
Show HN: Next-token prediction in JavaScript – build fast LLMs from scratch
This is awesome, thanks. I've been messing with wink's NLP library (https://winkjs.org/wink-nlp/) to transform user queries and format responses so I can make a proper chat bot - will see what I can learn from these!
- Show HN: WinkNLP introduces key sentence extraction
- WinkNLP's recent feature — key sentence extraction delivers a performance of over 450,000 tokens/second or 1500 sentences/second on Apple M1/16GB
-
How to visualize timeline of a Wiki article?
Automatic generation of the timeline — a graphical representation of a time period, on which important events are marked — from a Wikipedia article is a fascinating idea and very useful in quickly grasping the historical perspective. This post outlines the approach to create a well formatted timeline from any Wikipedia article using WinkNLP’s API and Named Entity Recognition (NER) feature:
- WinkNLP delivers 600k tokens/second speed on browsers (MBP M1)
-
Hacker News top posts: Nov 24, 2022
Show HN: WinkNLP delivers 600k tokens/second speed on browsers\ (2 comments)
- Show HN: WinkNLP delivers 600k tokens/second speed on browsers (MBP M1)
What are some alternatives?
nlp.js - An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identify, and so more
ml-classify-text-js - Machine learning based text classification in JavaScript using n-grams and cosine similarity
wink-eng-lite-model - English lite language model for wink-nlp.
Recognizers-Text - Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV). Packages available at: https://www.nuget.org/profiles/Recognizers.Text, https://www.npmjs.com/~recognizers.text
DataTurks - ML data annotations made super easy for teams. Just upload data, add your team and build training/evaluation dataset in hours.
wink-mqtt-rs - MQTT Relay for the Jailbroken Wink Hub v1, with Home Assistant MQTT autodiscovery support
echarts4r - 🐳 ECharts 5 for R
WantWords - An open-source online reverse dictionary.
nodejs-language - Node.js client for Google Cloud Natural Language: Derive insights from unstructured text using Google machine learning.
chatternet-client-http
malaya - Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/
st-chat - Streamlit Component, for a Chatbot UI