paperetl
ciscoconfparse
Metric | paperetl | ciscoconfparse |
---|---|---|
Mentions | 12 | 2 |
Stars | 315 | 778 |
Growth | 7.6% | - |
Activity | 6.3 | 9.7 |
Last commit | 5 months ago | 7 days ago |
Language | Python | Python |
License | Apache License 2.0 | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
paperetl
- Show HN: Open-source Rule-based PDF parser for RAG
- Oracle of Zotero: LLM QA of Your Research Library
Nice project!
I've spent quite a lot of time in the medical/scientific literature space. With regard to LLMs, and RAG specifically, how the data is chunked is quite important. With that in mind, I have a couple of projects that might be beneficial additions.
paperetl (https://github.com/neuml/paperetl) - supports parsing arXiv and PubMed, and integrates with GROBID to parse metadata and text from arbitrary papers.
paperai (https://github.com/neuml/paperai) - builds embeddings databases of medical/scientific papers. Supports LLM prompting, semantic workflows and vector search. Built with txtai (https://github.com/neuml/txtai).
While arbitrary chunking/splitting can work, I've found that integrating parsing that has knowledge of medical/scientific paper structure increases the overall accuracy and experience of downstream applications.
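To make the chunking point above concrete, here is a minimal sketch contrasting fixed-size splitting with section-aware splitting. It is not paperetl's or paperai's implementation (paperetl relies on GROBID for parsing); the function names, the heading list and the sample text are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical list of section headings; real papers vary widely.
SECTION_NAMES = ("Abstract", "Introduction", "Methods", "Results", "Discussion", "Conclusion")

# Matches a line that consists only of one of the known headings.
HEADING = re.compile(r"^(%s)[ \t]*$" % "|".join(SECTION_NAMES), re.MULTILINE)


def fixed_chunks(text, size):
    """Split text into consecutive windows of `size` characters, ignoring structure."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def section_chunks(text):
    """Split text at known headings and return (section name, body) pairs.

    Text before the first recognized heading is dropped in this sketch.
    """
    matches = list(HEADING.finditer(text))
    chunks = []
    for i, match in enumerate(matches):
        # A section body runs until the next heading, or to the end of the text.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        chunks.append((match.group(1), text[match.end():end].strip()))
    return chunks


# Made-up sample text, not taken from any real paper.
paper = """Abstract
We study X.
Methods
We measured Y in 40 patients.
Results
Y improved by 12%.
"""

for name, body in section_chunks(paper):
    print(name, "->", body)
```

With section-aware splitting, each chunk carries its section name, so a downstream search can, for example, prefer Results over Methods. Fixed-size windows may cut a sentence in half or merge the end of one section with the start of the next.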
- [P] Parse research papers into structured data
paperai | paperetl
- Parse research papers into a structured dataset
- ETL for medical and scientific papers
- Show HN: ETL for Medical and Scientific Papers
- Seeking Advice: How to extract Abstract from scientific journals (.pdfs) 10k+.
paperai and paperetl are a set of projects to consider for this task.
- paperetl: ETL processes for medical and scientific papers
ciscoconfparse
- Could someone point me in the right direction with a python question? Possibly an example too?
Ciscoconfparse
- Cisco Configuration Parser
How is that different from https://github.com/mpenning/ciscoconfparse ?
What are some alternatives?
SciencePlots - Matplotlib styles for scientific plotting
ConfigParser - [Moved to: https://github.com/arezazadeh/cisco_config_parser]
tika-python - Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
ansible-collection-tp-link-easy-smart-switch - Manage TP-Link Easy Smart Switches with Ansible
paperai - Semantic search and workflows for medical/scientific papers
EdiZon_CheatsConfigsAndScripts - The official EdiZon Editor Config and Editor Script repository.
rdm - Our regulatory documentation manager. Streamlines 62304, 14971, and 510(k) documentation for software projects.
devnetnode - An application for Network engineers to manage Cisco devices (Python Tkinter).
dagster - An orchestration platform for the development, production, and observation of data assets.
DirectFire_Converter - DirectFire Firewall Converter - Network Security, Next-Generation Firewall Configuration Conversion, Firewall Syntax Translation and Firewall Migration Tool - supports Cisco ASA, Fortinet FortiGate (FortiOS), Juniper SRX (JunOS), SSG / Netscreen (ScreenOS) and WatchGuard (support for further devices in development). Similar to FortiConverter, SmartMove, Expedition etc.
science-parse - Science Parse parses scientific papers (in PDF form) and returns them in structured form.
booze-tools - Booze Tools will become the complete programming-language development workbench, all written in Python 3.9 (for now).