tika-docker
SheetJS js-xlsx
tika-docker | SheetJS js-xlsx | |
---|---|---|
20 | 61 | |
103 | 34,507 | |
4.9% | 0.4% | |
4.1 | 2.4 | |
about 1 month ago | 17 days ago | |
Shell | JavaScript | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tika-docker
- Text Extraction from Documents
- Apache Tika – Extract text and metadata from doc types (the backbone of RAG)
-
Demystifying Text Data with the Unstructured Python Library
If you accept running Java, the Apache Tika is extremely good at parsing content (https://tika.apache.org/)
- Ajuda com Buscador
-
How do you manage and find large amount of files?
Apache Tika can spit out text from lots of formats. I've used it with grep (or rg) to make a small scale searching of local folders. Tika does a really good job at OCR for finding if text is in a file.
-
40 Containers & Counting...
https://tika.apache.org Meta data from things.
- Hosted app to manage server inventory
- Best FOSS (ideally Docker) that can split PDF files ?
- OK, ElasticSearch works, text files are indexed. How about images? Can images be indexed in NextCloud and fulltextsearched?
-
Document Parsing - an unsolved problem?
At my previous job we had the same problem which we solved by using Tika. We called it on the server along with other stuff, but there is also a Python binding.
SheetJS js-xlsx
-
how to work with .xlsx files?
ExcelJS and XLSX (SheetJS) are great libraries to work with XLSX files. The former I've found a bit easier to work with but less efficient in general.
-
What kind of Programmer / language should I be looking for?
Sure. I manipulate excel files programatically in the browser all the time. I don't really understand your exact workflow, but I use Javascript with xlsx and React.
- Excel To Json ?
-
React App Won't Read xlsx File
Looking at the xlsx documentation, to parse files in the browser, rather than readFile, you use read, which is designed to parse binary data directly, rather than read from disk. There are a bunch of different formats if you go to the XLSX NPM page and scroll down to "Acquiring and Extracting Data". Importantly, it seems the data must already be serialized, so a Blob won't work, but we can work with that.
-
We compete with GitHub. Bing does not show our website
Last year, Bing and Edge erroneously flagged our website https://sheetjs.com/ as "dangerous": https://i.imgur.com/BvA3zrk.png
At the time, there was no "Safety Report" to indicate why Bing thought it was dangerous. The report page linked to https://www.bing.com/toolbox/bing-site-safety?url=https%3a%2... and it said "That web page doesn't exist"
To fix it, we had to register with "Bing Webmaster Tools" (https://www.bing.com/webmasters/about) and raise a support ticket.
Within a few days, the issue "resolved itself". It's possible that raising a ticket forced some automatic refresh of the indexed data for the domain.
-
Product Comparison App (JS Demo Project)
xlsx.
-
Ask HN: Who is hiring? (February 2023)
SheetJS | https://sheetjs.com/ | Software Developer | Full time, Remote (US) | $165K - $240K
We're a bootstrapped company building open source solutions for spreadsheets and structured data. With over 1.5M unique monthly visitors, companies across the business world turn to us for challenging data processing problems. Over the last 10 years, we have pushed the boundaries of JavaScript and the web.
In this role, you will master new and established technologies while working on high-impact projects used by millions of people across the world. Balancing research and engineering, you will design and implement creative solutions that draw from your academic and professional experience.
https://sheetjs.com/careers/ more details
- Help to draw graph in reactjs from data in excel sheet
-
PDF, Excel, Docx generate on React and Node js
For more, you can visit xlsx documentation Link.
-
Active data pull from excel to html charts
There are libraries like https://github.com/SheetJS/sheetjs to parse excel and https://www.chartjs.org/ for all kinds of charts/graphs. Not really much HTML involved here.. the markup gets generated by the chart library.
What are some alternatives?
Paperless-ng - A supercharged version of paperless: scan, index and archive all your physical documents
ExcelJS - Excel Workbook Manager
sist2 - Lightning-fast file system indexer and search tool
HANDSONTABLE - JavaScript data grid with a spreadsheet look & feel. Works with React, Angular, and Vue. Supported by the Handsontable team âš¡
spyglass - A personal search engine: Create a searchable library from your personal documents, interests, and more!
Jspreadsheet CE - Jspreadsheet is a lightweight vanilla javascript plugin to create amazing web-based interactive tables and spreadsheets compatible with other spreadsheet software.
yew - Rust / Wasm framework for creating reliable and efficient web applications
Luckysheet - Luckysheet is an online spreadsheet like excel that is powerful, simple to configure, and completely open source.
spacedrive - Spacedrive is an open source cross-platform file explorer, powered by a virtual distributed filesystem written in Rust.
ag-Grid - The best JavaScript Data Table for building Enterprise Applications. Supports React / Angular / Vue / Plain JavaScript.
self-hosted_docker_setups - A collection of my docker-compose files used to setup self-hosted services on Raspberry Pi 4 running 64-bit Raspberry Pi OS
React Data Grid - Feature-rich and customizable data grid React component