extruct
contextualise
| | extruct | contextualise |
|---|---|---|
| Mentions | 3 | 10 |
| Stars | 821 | 1,036 |
| Growth | 1.3% | - |
| Activity | 3.8 | 5.9 |
| Last commit | 10 days ago | about 1 month ago |
| Language | Python | Python |
| License | BSD 3-clause "New" or "Revised" License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
extruct
-
GitHub – GSA/code-gov: An informative repo for all Code.gov repos
https://github.com/rushter/selectolax#simple-benchmark )
(Apache Nutch is a Java-based web crawler which supports e.g. CommonCrawl (which backs various foundational LLMs)) https://en.wikipedia.org/wiki/Apache_Nutch#Search_engines_bu... . But extruct extracts more types of metadata and data than Nutch AFAIU: https://github.com/scrapinghub/extruct )
datasette-graphql adds a GraphQL HTTP API to a SQLite database:
-
Alternative to extruct python library ? (scraping schema.org, jsonld, twitter and fb card)
Is there an alternative to the extruct Python library in Golang?
-
Scraping MMA fighter stats from a list of names
Seems like sherdog.com supports schema.org data markup, which is really easy to scrape! There's a brilliant Python parser for it: https://github.com/scrapinghub/extruct.
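To make the schema.org point above concrete, here is a minimal sketch of pulling JSON-LD blocks out of a page using only the Python standard library. It is not extruct's implementation, and extruct covers more syntaxes than this; the `JsonLdParser` class, the `extract_jsonld` helper and the sample HTML are all hypothetical and exist only for illustration.

```python
import json
from html.parser import HTMLParser


class JsonLdParser(HTMLParser):
    """Collects the parsed contents of <script type="application/ld+json"> tags."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        # Script content may arrive in several chunks, so buffer it.
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.items.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed JSON-LD blocks


def extract_jsonld(html):
    """Return a list of decoded JSON-LD objects found in the HTML string."""
    parser = JsonLdParser()
    parser.feed(html)
    parser.close()
    return parser.items


# Hypothetical sample page; a real page would be fetched over HTTP.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Person", "name": "Example Fighter", "weight": "155 lbs"}
</script>
</head><body></body></html>
"""

people = [
    item for item in extract_jsonld(SAMPLE_HTML)
    if isinstance(item, dict) and item.get("@type") == "Person"
]
print(people[0]["name"])
```

For real scraping, a dedicated library such as extruct is likely the better choice, since pages often mix JSON-LD with microdata, RDFa and Open Graph tags.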
contextualise
-
Ask HN: What software are you dogfooding?
Contextualise, a tool to manage projects and/or activities with lots of unstructured data: a personal knowledge management tool of sorts. The link is here: (https://contextualise.dev/).
It's an MIT-licensed open source project: https://github.com/brettkromkamp/contextualise
-
Ask HN: What's your most starred repo?
That would be Contextualise (https://github.com/brettkromkamp/contextualise) with 980 stars. The project is 3-4 years old. So, it's slow-going. Nevertheless, there are many (underappreciated) projects that should have a lot more stars than they do, so I am not complaining :)
-
Has anyone ever monetized Python outside of a typical job?
Somewhat indirectly, yes. I am the developer behind Contextualise, a topic maps-based knowledge management application written in Python. The application and its GitHub repository generate a lot of interest (in the semantic knowledge management space) and have provided me with many freelance projects over the years.
-
If you were asked to showcase your best projects, which ones will you choose?
That would have to be Contextualise (https://contextualise.dev/) and its accompanying open source project (https://github.com/brettkromkamp/contextualise).
I've been working on knowledge graph-related problems (and accompanying applications) for years and Contextualise is probably the most visible component of that work.
-
The Winamp Skin Museum is powered by a sqlite3 database with 1.2gb of metadata
I have built a graph-based knowledge management system (https://github.com/brettkromkamp/contextualise) on top of SQLite. It runs great. Also, from a management point of view (e.g., deployments, backups) its ease of use is second to none. I migrated the application from PostgreSQL (which is also a great RDBMS) to SQLite and I haven’t looked back.
- Personal Knowledge Management (PKM) open source application: Contextualise
-
Contextualise: Structured Thinking
Contextualise is a simple but effective tool particularly suited for organising information-heavy projects and activities consisting of unstructured and widely diverse data and information resources: https://github.com/brettkromkamp/contextualise. Contextualise's main dependency is TopicDB, an open source topic maps-based graph store implemented in Python. The Contextualise web application is implemented with the Flask framework.
-
[Request] Do you have examples of production-grade open source flask solutions?
Forgot to provide the link to the actual GitHub repo: https://github.com/brettkromkamp/contextualise
- Structure Your Knowledge
-
Flask Examples in Reality
I have a relatively popular Flask application in production: Contextualise (https://contextualise.dev). It’s an open source project, so you can take a look at the code base and hopefully learn something of use: https://github.com/brettkromkamp/contextualise
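One of the mentions above describes a graph-based knowledge management system built on top of SQLite. As a rough illustration of that idea, the sketch below stores topics and their associations in two SQLite tables and looks up a topic's neighbours. The schema, table names and `neighbours` helper are assumptions made up for this example; they are not taken from Contextualise or TopicDB.

```python
import sqlite3

# In-memory database for the example; a real application would use a file path.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE topic (
        id   TEXT PRIMARY KEY,
        name TEXT NOT NULL
    );
    CREATE TABLE association (
        src  TEXT NOT NULL REFERENCES topic(id),
        dst  TEXT NOT NULL REFERENCES topic(id),
        kind TEXT NOT NULL
    );
    """
)

conn.executemany(
    "INSERT INTO topic VALUES (?, ?)",
    [
        ("contextualise", "Contextualise"),
        ("flask", "Flask"),
        ("python", "Python"),
        ("sqlite", "SQLite"),
    ],
)
conn.executemany(
    "INSERT INTO association VALUES (?, ?, ?)",
    [
        ("contextualise", "flask", "uses"),
        ("contextualise", "sqlite", "uses"),
        ("contextualise", "python", "written-in"),
    ],
)
conn.commit()


def neighbours(conn, topic_id):
    """Return (name, kind) pairs for topics directly linked from topic_id."""
    rows = conn.execute(
        """
        SELECT t.name, a.kind
        FROM association AS a
        JOIN topic AS t ON t.id = a.dst
        WHERE a.src = ?
        ORDER BY t.name
        """,
        (topic_id,),
    )
    return rows.fetchall()


for name, kind in neighbours(conn, "contextualise"):
    print(f"{kind}: {name}")
```

Because the whole graph lives in a single database file, backing it up or deploying it amounts to copying that file, which matches the ease of management described in the mention.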
What are some alternatives?
rdflib - RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information.
opensanctions - An open database of international sanctions data, persons of interest and politically exposed persons
PyLD - JSON-LD processor written in Python
wurm - A simple sqlite3-based ORM for Python
code-gov - An informative repo for all Code.gov repos
memoized_coduals - Shows that it is possible to implement reverse mode autodiff using a variation on the dual numbers called the codual numbers
kylo - Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
zsl-kg - Framework for zero-shot learning with knowledge graphs.
metatron - A Python 3.x HTML Meta tag parser, with emphasis on OpenGraph and complex meta tag schemes
Banana-RDF - Banana RDF
PheKnowLator - PheKnowLator: Heterogeneous Biomedical Knowledge Graphs and Benchmarks Constructed Under Alternative Semantic Models
securedrop - GitHub repository for the SecureDrop whistleblower platform. Do not submit tips here!