JavaScript Data

Open-source JavaScript projects categorized as Data

Top 23 JavaScript Data Projects

  • SheetJS js-xlsx

    📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs

  • Project mention: how to work with .xlsx files? | /r/node | 2023-06-28

    ExcelJS and XLSX (SheetJS) are great libraries to work with XLSX files. The former I've found a bit easier to work with but less efficient in general.

  • Tabulator

    Interactive Tables and Data Grids for JavaScript

  • Project mention: Tabulator – JavaScript Tables and Data Grids | news.ycombinator.com | 2024-02-09
  • SurveyJS

    Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

    SurveyJS logo
  • Parsr

    Transforms PDF, Documents and Images into Enriched Structured Data

  • Project mention: LlamaCloud and LlamaParse | news.ycombinator.com | 2024-02-20

    I'm part of the team that build LlamaParse. It's net improvement compare to other PDF->Structured Text extractors (I build several in the past, includig https://github.com/axa-group/Parsr).

    For character extraction, LlamaParse use a mixture of OCR / character extraction from the PDF (it's the only parser I'm aware of that address some of the buggy PDF font issues, check the 'text' mode to see raw document before reconstruction), use a mixture of heuristic and Machine learning models to reconstruct the document.

    Once plug with a Recursive retrieval strategy, allow you to get Sota result on question answering over complexe text (see notebook: https://github.com/run-llama/llama_parse/blob/main/examples/...).

    AMA

  • Countly

    Countly is a product analytics platform that helps teams track, analyze and act-on their user actions and behaviour on mobile, web and desktop applications.

  • Project mention: Which analytics tool do you use for your Android/iOS apps? | news.ycombinator.com | 2023-07-18

    * [Countly](https://countly.com/)

  • gray-matter

    Smarter YAML front matter parser, used by metalsmith, Gatsby, Netlify, Assemble, mapbox-gl, phenomic, vuejs vitepress, TinaCMS, Shopify Polaris, Ant Design, Astro, hashicorp, garden, slidev, saber, sourcegraph, and many others. Simple to use, and battle tested. Parses YAML by default but can also parse JSON Front Matter, Coffee Front Matter, TOML Front Matter, and has support for custom parsers. Please follow gray-matter's author: https://github.com/jonschlinkert

  • Project mention: Building a flat-file CMS with Angular | dev.to | 2024-03-05

    Writing in markdown is super convenient, and supported by just about any text editor. To convert these .md files to browser-ready HTML, I wrote a simple little Node.js script using two great npm packages called gray-matter and showdown.

  • react-refetch

    A simple, declarative, and composable way to fetch data for React components

  • awesome-json-datasets

    A curated list of awesome JSON datasets that don't require authentication.

  • Project mention: JSON Datasets | news.ycombinator.com | 2023-05-24
  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • kea

    Batteries Included State Management for React

  • covid19_scenarios

    Models of COVID-19 outbreak trajectories and hospital demand

  • covid19

    JSON time-series of coronavirus cases (confirmed, deaths and recovered) per country - updated daily (by pomber)

  • arquero

    Query processing and transformation of array-backed data tables.

  • Project mention: Show HN: Matrices – explore, visualize, and share large datasets | news.ycombinator.com | 2023-12-07

    Hey HN, I'm excited to share a new side project I've been working on.

    The product is called Matrices. You can check it out here: https://matrices.com/.

    With Matrices, you can *explore*, *visualize*, and *share* large (100k rows) datasets–all without code. Filter data down to just what you want, visualize it with built-in charts, and share your results with one click.

    You can use it today (no login or waitlist or anything). Just copy and paste your data from a google sheet or CSV file.

    It's hard to describe the feeling of "gliding over data" you get with Matrices, so I'd rather *show* you how it works instead. This 75s video will give you a sense of how it works: https://www.youtube.com/watch?v=Rrh9_I3Ux8E.

    Data is stored locally in your browser until you publish it, though small sample does go to the OpenAI APIs for AI-assisted features.

    I started building Matrices because I wanted a tool that made it easy to explore new datasets. When I'm first trying to dig into data, I'll have one question... that leads to another... that will invariably lead to five more questions. It's sort of a fractal process, and I couldn't find many good options that were fast, responsive, and visual.

    I figured this crowd would be interested in tech stack as well, it's using arquero [1] bindings over apache arrow for in-memory analytics, and visx [2] for visualizations. I'd like to add duckdb-wasm support at some point to open up a wider set of databases. Data is serialized as parquet to save a bit on bandwidth + storage.

    Give it a spin, and let me know what you think. This is my first 'serious frontend project' so I appreciate any and all feedback and bug reports. Feel free to comment here (I'll be around most of the day), or shoot me a note: [email protected]

    [1]: https://uwdata.github.io/arquero/

  • kuwala

    Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demograp

  • Project mention: Show HN: GeoSage – A ETL Webtool for Geo and Demographics Data from the Open Web | news.ycombinator.com | 2023-10-05

    --> Google Trends Data for Regions (Coming Soon)

    The tool goes beyond our previously published CLI tool (https://github.com/kuwala-io/kuwala/tree/master/kuwala) by providing a hostable solution with a user-friendly interface. We have not open-sourced it yet but a demo is available here: https://geosage.kuwala.io/.

    Urban planners can utilize movement data to analyze foot traffic in different city zones. Marketers can leverage demographic data to tailor campaigns more effectively. Developers can build their apps on top of it.

    To round it up .... GeoSage brings...

    Unified Data Management: Access data from OSM, Facebook, and soon Google, all in one place.

  • pldb

    PLDB: a Programming Language Database. A computable encyclopedia about programming languages.

  • Project mention: Programming Language Database | news.ycombinator.com | 2023-09-13
  • minecraft-data

    Language independent module providing minecraft data for minecraft clients, servers and libraries.

  • panini

    A super simple flat file generator. (by foundation)

  • react-native-big-list

    This is a high performance list view for React Native with support for complex layouts using a similar FlatList usage to make easy the replacement. This list implementation for big list rendering on React Native works with a recycler focused on performance and memory usage and so it permits processing thousands items on the list.

  • dataframe-js

    A javascript library providing a new data structure for datascientists and developpers

  • PostGUI

    A React web application to query and share any PostgreSQL database.

  • genshin-db

    npm package with searching functions for Genshin Impact data of all in-game languages. Data parsed/organized directly from GenshinData repo.

  • spamscanner

    Spam Scanner is a Node.js anti-spam, email filtering, and phishing prevention tool and service. Built for @ladjs, @forwardemail, @cabinjs, @breejs, and @lassjs.

  • strapi-plugin-config-sync

    :recycle: CLI & GUI for continuous migration of config data across environments

  • topokanji

    Topologically ordered lists of kanji for effective learning

  • data-store

    Easily get, set and persist config data. Fast. Supports dot-notation in keys. No dependencies. (by jonschlinkert)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

JavaScript Data related posts

Index

What are some of the best open-source Data projects in JavaScript? This list will help you:

Project Stars
1 SheetJS js-xlsx 34,479
2 Tabulator 6,183
3 Parsr 5,645
4 Countly 5,455
5 gray-matter 3,775
6 react-refetch 3,432
7 awesome-json-datasets 3,183
8 kea 1,921
9 covid19_scenarios 1,366
10 covid19 1,231
11 arquero 1,186
12 kuwala 755
13 pldb 675
14 minecraft-data 601
15 panini 590
16 react-native-big-list 519
17 dataframe-js 457
18 PostGUI 424
19 genshin-db 354
20 spamscanner 265
21 strapi-plugin-config-sync 235
22 topokanji 168
23 data-store 158

Sponsored
Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com