Contextgem Alternatives
Similar projects and alternatives to contextgem
-
docext
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
-
validex
Simplifies the retrieval, extraction, and training of structured data from various unstructured sources.
-
boilerbot
Official Open-Source Implementation of BoilerBot: A Reliable Task-Oriented Chatbot Enhanced with Large Language Models.
-
pinterest-scrapy-scraper
Production-ready Pinterest scraper built with Python Scrapy. Extract pins, boards, and search data with ScrapeOps proxy integration. Features 3 specialized spiders, JavaScript rendering, and CSV export. Perfect for content marketing, market research, and social media analytics.
-
system-prompt
A collection of system prompts and AI kernels for use with large language models. (by thatoldfarm)
-
agenta
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
contextgem discussion
contextgem reviews and mentions
-
Transform DOCX into LLM-ready data
As part of work on my open-source project ContextGem, I've built a native, zero-dependency DOCX converter that transforms Word documents into LLM-ready data.
This custom-built converter processes the Word XML directly. It extracts content comprehensively and covers what other open-source tools often miss or do not support:
- Rich paragraph and sentence metadata for enhanced context
- Misaligned tables
- Comments, footnotes, and textboxes
- Embedded images
The converted document can then be easily used in ContextGem's LLM extraction workflows.
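To give a rough sense of what "directly processing Word XML" involves, here is a minimal sketch using only the Python standard library. It is not ContextGem's implementation and does not use its API: the function names `extract_paragraphs` and `make_minimal_docx` are hypothetical, and the sketch handles plain paragraph text only (no tables, comments, footnotes, textboxes, or images). It relies on the fact that a DOCX file is a ZIP archive whose main body lives in `word/document.xml`.

```python
import io
import zipfile
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# WordprocessingML main namespace used by word/document.xml
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def extract_paragraphs(docx_bytes: bytes) -> list[str]:
    """Return the text of each non-empty paragraph in the main document part.

    Illustrative only: a real converter also walks tables, footnotes,
    comments, textboxes, and embedded images.
    """
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    paragraphs = []
    for paragraph in root.iter(f"{{{W_NS}}}p"):
        # A paragraph's text is split across runs; join every <w:t> node.
        text = "".join(node.text or "" for node in paragraph.iter(f"{{{W_NS}}}t"))
        if text.strip():
            paragraphs.append(text)
    return paragraphs


def make_minimal_docx(lines: list[str]) -> bytes:
    """Hypothetical helper: build an in-memory archive holding only
    word/document.xml, enough to exercise extract_paragraphs()."""
    body = "".join(
        f"<w:p><w:r><w:t>{escape(line)}</w:t></w:r></w:p>" for line in lines
    )
    xml = (
        '<?xml version="1.0" encoding="UTF-8"?>'
        f'<w:document xmlns:w="{W_NS}"><w:body>{body}</w:body></w:document>'
    )
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("word/document.xml", xml)
    return buffer.getvalue()


if __name__ == "__main__":
    sample = make_minimal_docx(["Clause 1", "", "Clause 2"])
    for line in extract_paragraphs(sample):
        print(line)
```

For real documents, the converter described in this post is the better fit, since it covers the structures this sketch ignores.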
Perfect for developers building contract intelligence applications where precision matters. The converter preserves document structure and relationships, empowering LLMs to better understand and analyze document content.
Try it, or share it with your dev team, and see the difference in your document processing pipeline!
GitHub: https://github.com/shcherbak-ai/contextgem
All DocxConverter features: https://contextgem.dev/converters/docx.html
-
I Built an Open-Source Framework to Make LLM Data Extraction Dead Simple
After getting tired of writing endless boilerplate to extract structured data from documents with LLMs, I built ContextGem - a free, open-source framework that makes this radically easier.
- ContextGem: Easier and faster way to build LLM extraction workflows
Stats
shcherbak-ai/contextgem is an open-source project licensed under the Apache License 2.0, which is an OSI-approved license.
The primary programming language of contextgem is Python.