Top 23 Python BERT Projects
-
Project mention: The 10 Best Open-Source Artificial Intelligence Tools (Las 10 Mejores Herramientas de Inteligencia Artificial de Código Abierto) | dev.to | 2024-08-21
https://github.com/huggingface/transformers
-
haystack
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Project mention: The open source LLM framework Haystack is trending on GitHub | news.ycombinator.com | 2024-08-26
-
Project mention: Search for anything ==> Immich fails to download textual.onnx | /r/immich | 2023-09-15
-
PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
-
Project mention: StreamingLLM: tiny tweak to KV LRU improves long conversations | news.ycombinator.com | 2024-02-13
This seems to work only because large GPTs have redundant, undercomplex attention patterns. See this issue in BertViz about attention in Llama: https://github.com/jessevig/bertviz/issues/128
-
ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
-
-
-
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
-
Project mention: LLMware.ai 🤖: A revolutionary Python platform that will accelerate your enterprise | dev.to | 2024-08-29
-
Project mention: I want to extract important keywords from large documents... | /r/LangChain | 2023-12-07
Use something else like KeyBERT or BERTopic: https://github.com/MaartenGr/KeyBERT It's much faster.
-
Project mention: [D] Is it better to create a different set of Doc2Vec embeddings for each group in my dataset, rather than generating embeddings for the entire dataset? | /r/MachineLearning | 2023-10-28
I'm using Top2Vec with Doc2Vec embeddings to find topics in a dataset of ~4000 social media posts. This dataset has three groups:
-
FARM
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering. (by deepset-ai)
-
-
beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Project mention: Any* Embedding Model Can Become a Late Interaction Model - If You Give It a Chance! | dev.to | 2024-08-29
The source code for these experiments is open-source and utilizes beir-qdrant, an integration of Qdrant with the BeIR library. While this package is not officially maintained by the Qdrant team, it may prove useful for those interested in experimenting with various Qdrant configurations to see how they impact retrieval quality. All experiments were conducted using Qdrant in exact search mode, ensuring the results are not influenced by approximate search.
-
SparK
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling" (by keyu-tian)
-
-
contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
Python Bert discussion
Python Bert related posts
-
Any* Embedding Model Can Become a Late Interaction Model - If You Give It a Chance!
-
LLMware.ai 🤖: A revolutionary Python platform that will accelerate your enterprise
-
The open source LLM framework Haystack is trending on GitHub
-
Build Search and RAG for Any Website with Firecrawl and Trieve
-
Are we all prompting wrong? Balancing Creativity and Consistency in RAG.
-
Natural Language Queries for SQL using SLIM
-
AI enthusiasm #6 - Finetune any LLM you want💡
-
Index
What are some of the best open-source BERT projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | transformers | 131,636 |
2 | haystack | 16,539 |
3 | clip-as-service | 12,360 |
4 | PaddleNLP | 11,933 |
5 | bertviz | 6,697 |
6 | ERNIE | 6,243 |
7 | BERT-pytorch | 6,153 |
8 | BERTopic | 5,963 |
9 | awesome-pretrained-chinese-nlp-models | 4,694 |
10 | llmware | 4,446 |
11 | KeyBERT | 3,416 |
12 | Top2Vec | 2,913 |
13 | AliceMind | 1,967 |
14 | ABSA-PyTorch | 1,966 |
15 | DeBERTa | 1,959 |
16 | FARM | 1,733 |
17 | jiant | 1,622 |
18 | beir | 1,543 |
19 | scibert | 1,475 |
20 | finetuner | 1,463 |
21 | SparK | 1,418 |
22 | BERT-NER | 1,194 |
23 | contextualized-topic-models | 1,190 |