Converting PDF into HTML: is it possible?

This page summarizes the projects mentioned and recommended in the original post on /r/AskProgramming.

  • pdf2htmlEX

    (Discontinued) Convert PDF to HTML without losing text or format.

  • Things I have tried:
    - pdf2htmlEX: Very elegant for normal conversions for users in the browser, but it is so faithful to the layout that it strips the tags, replaces them with styling (CSS), and converts tables to background images; not something useful for me.
    - pdftohtml: Not the prettiest output; it disregards tables and puts a lot of tags into the HTML.

  • Parsr

    Transforms PDF, Documents and Images into Enriched Structured Data

  • Things I still want to try: - Parsr
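
For readers who want to reproduce the pdftohtml attempt mentioned above, here is a minimal sketch in Python. It assumes the pdftohtml binary from poppler-utils is installed and on PATH; the file names and the helper names build_pdftohtml_command and convert are hypothetical, not taken from the original post. The -noframes and -i flags are standard pdftohtml options, but check `pdftohtml -h` on your system since builds can differ.

```python
import shutil
import subprocess


def build_pdftohtml_command(pdf_path, out_base, ignore_images=True):
    """Return the argument list for a single-file pdftohtml run (hypothetical helper)."""
    cmd = ["pdftohtml", "-noframes"]  # -noframes: write one HTML file, no frameset
    if ignore_images:
        cmd.append("-i")  # -i: do not extract images
    cmd += [str(pdf_path), str(out_base)]
    return cmd


def convert(pdf_path, out_base):
    """Run pdftohtml if it is installed; raise a clear error otherwise."""
    if shutil.which("pdftohtml") is None:
        raise RuntimeError("pdftohtml not found on PATH (it ships with poppler-utils)")
    # check=True raises CalledProcessError if the conversion fails
    subprocess.run(build_pdftohtml_command(pdf_path, out_base), check=True)
```

Calling convert("input.pdf", "output") would run the conversion with those flags. As the post notes, expect the result to ignore table structure, so this is a starting point for inspection rather than a structured extraction.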



Related posts

  • Issue getting Parsr GUI up and running

    1 project | /r/docker | 13 Sep 2023
  • Does anyone know where I can get access to a prebuilt general document understanding model?

    1 project | /r/datascience | 7 Oct 2022
  • Turn your (PDF,Image) documents into structured data

    1 project | news.ycombinator.com | 2 Feb 2022
  • [D] What pdf parser do you use for paragraph parsing for huggingface models

    2 projects | /r/MachineLearning | 13 Jul 2021
  • Show HN: I just open sourced my document/website extractor for Vision-LLMs

    2 projects | news.ycombinator.com | 2 Apr 2024