Python structured-data

Open-source Python projects categorized as structured-data

Top 8 Python structured-data Projects

  1. autogluon

    Fast and Accurate ML in 3 Lines of Code

    Project mention: AIM Weekly for 04Nov2024 | dev.to | 2024-11-04

    Composed Image Retrieval; Intro to Multimodal LLama 3.2; Multi Agent Concierge; RAG with Langchain Granite, Milvus; Download content; Transformer Replacement?; vLLM for running models; Amphion; Autogluon; Notebook LLama like Google's Notebook LLM; Monocle2ai for tracing GenAI app code LFA&D Project; Bee Agent Framework; LLama RFP Response; GenAI Script; Simular AI Agent S; DrawDB with AI; Ollama with LLama 3.2 Vision!!!! Preview; Powerful RAG Checker; SQL Generator; Role of LLMs; Document Extraction; Open Source Vector DB Reddit; The Practical Guide to Self Hosting LLM; Stagehand Controller; Understanding HNSWLIB; Best practices in RAG; Enigma Agent; Langchain, Ollama, Phi3 for Function Calling; Compass Judger; Princeton NLP SimPO; Princeton NLP ProLong; Princeton NLP HELMET; Ollama Cheatsheet; Princeton NLP CopyCat; Princeton NLP Shp; Can LLM Solve Hard Github Issues; Enabling Large Language Models to Generate Text with Citations; Princeton NLP CharXiv; Awesome AI Agents List; Nomic's Matryoshka text embedding model

  2. llama_cloud_services

    Knowledge Agents and Management in the Cloud

    Project mention: LlamaIndex File Chat Workflow with A2A Protocol | dev.to | 2025-06-02

    LlamaParse Documentation

  3. superpipe

    Superpipe - optimized LLM pipelines for structured data

  4. parsee-core

    Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular data extraction and multimodal queries.

  5. Scrapontologies

    Python library for Entities, relationships and schemas extraction from documents

    Project mention: Exploiting LLMs for entity and schema extraction from unstructured documents | news.ycombinator.com | 2024-09-18

  6. flickypedia

    A tool to copy CC-licensed images from Flickr to Wikimedia Commons

  7. sdk

    Lightfeed SDK to search and filter web data (by lightfeed)

  8. jertl

    A minimum viable Python package for processing structured data

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python structured-data related posts

  • Blazingly fast E-Commerce in Nuxt

    4 projects | dev.to | 24 Mar 2025
  • Guide to SeleniumBase - A Better & Easier Selenium

    2 projects | dev.to | 16 Dec 2024
  • The HTTP Query Method

    2 projects | news.ycombinator.com | 16 Sep 2024

Index

What are some of the best open-source structured-data projects in Python? This list will help you:

#  Project               Stars
1  autogluon             8,962
2  llama_cloud_services  4,015
3  superpipe               110
4  parsee-core              71
5  Scrapontologies          40
6  flickypedia              11
7  sdk                       5
8  jertl                     0

Did you know that Python is the 2nd most popular programming language based on number of references?