Civic Auth comes with multiple SSO options, optional embedded wallets, and user management — all implemented with just a few lines of code. Start building today. Learn more →
Top 23 TypeScript Data Science Projects
-
Project mention: RisingWave Turns Four: Our Journey Beyond Democratizing Stream Processing | dev.to | 2025-04-18
By making RisingWave compatible with PostgreSQL, we ensured that any developer familiar with SQL could immediately start writing streaming queries. This wasn't just about syntax; it meant RisingWave could plug seamlessly into existing data workflows and connect easily with a vast ecosystem of familiar tools like DBeaver, Grafana, Apache Superset, dbt, and countless others.
-
Civic Auth
Auth in Less Than 5 Minutes. Civic Auth comes with multiple SSO options, optional embedded wallets, and user management — all implemented with just a few lines of code. Start building today.
-
I first tried to use growthbook. They had only react support. I thought - I could use the js sdk and work around it. Ok fine. It seemed a bit complicated to use in terms of their UI. Okay fine, I try to find an easier one maybe I can self-host. That way I could even put it behind cloudflare CDN and use caching on it and clever cache-busting when I change values could help propagate changes. Okay fine I have a plan. I ended up going with Flagsmith instead. It was even easier. Perfect.
-
OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Project mention: Show HN: OpenMetadata – OSS platform for data discovery observability governance | news.ycombinator.com | 2024-07-17* It seems like DataHub has an async Kafka ingestion approach while OpenMetadata is API
We do not use Kafka by default. If someone needs kafka they can add it. However for Metadata APIs, we do not feel like Kafka is needed. Lot of projects are getting dependent on Kafka and calling it as real-time. Its unnecessary burden on users who are going to operate in production for 99% of use-cases Kafka is not needed, coming from a Kafka committer :)
2. Yes all of our APIs and Entity definitions are generated using JsonSchema. For us, Json Schema has been awesome, all of our backend / ingestion and UI is generated from JsonSchema and its easy to extend and add new models when needed
3. IMO, we have much more coverage , you can look at the types available here https://github.com/open-metadata/OpenMetadata/tree/main/open... and we are support JsonSchema as a type from a long time
-
-
danfojs
Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
Website: Danfo.js
-
-
-
InfluxDB
InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
-
hydrogen
:atom: Run code interactively, inspect data, and plot. All the power of Jupyter kernels, inside your favorite text editor.
-
-
I see, interesting. Hadn't come upon this use-case before but makes sense.
I've made a GitHub issue for this feature: https://github.com/pretzelai/pretzelai/issues/142
If you'd like to be updated when we have this feature in, please leave a comment on the issue. Alternatively, my email is in my bio - feel free to email me so that when we have this feature, we can send you an update!
-
responsible-ai-toolbox
Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.
-
By the way, I'm a part of Latitude's team, and it would mean a lot if you could check out our open-source framework for embedded analytics and give us a star on GitHub.
-
tellery
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
-
-
buckaroo
Buckaroo - the data wrangling assistant for pandas. Quickly explore dataframes, and run pandas commands via a GUI. Works inside the jupyter notebook.
Location: Boston
Remote: Yes
Willing to relocate: Yes
Technologies: talking to users, python, pandas/numpy, jupyter, js/ts
Résumé/CV: https://www.linkedin.com/in/paddymullen/
Email: [email protected]
In my next role, I want a broad mandate to make a meaningful impact within an organization by developing products that address genuine business challenges, with a preference for data related problems.
Recently I built the data table for Jupyter/Pandas/Polars that I have wanted for over a decade. The open source Buckaroo (https://github.com/paddymul/buckaroo) data table combines a performant table, summary statistics, and a low code UI to expedite common data analysis tasks.
-
dsensei
AI-powered key driver analysis tool that pinpoints root cause behind metrics fluctuation in one minute.
-
-
-
-
-
duck-ui
Duck-UI is a web-based interface for interacting with DuckDB, a high-performance analytical database system. It features a SQL editor, data import/export, data explorer, query history, theme toggle, and keyboard shortcuts, all running seamlessly in the browser using DuckDB's WebAssembly (WASM) capabilities.
-
Project mention: Fast and secure ingress to remote AIs with ngrok, Deepseek, and Ollama | dev.to | 2025-02-13
Hosted: Hosted platforms work like a SaaS—they launch your LLM on their infrastructure, and you get a simple web app or API to explore. There are plenty of hosted AI/LLM platforms already, like RunPod, Mystic, BentoML, and others. Major cloud providers also have platforms for training and deploying LLMs, like Google’s Vertex AI or AWS’ AI Services. Hosted platforms win out on simplicity but don’t come with privacy and compliance guarantees, and trend on the expensive side.
-
litlytics
🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis
Project mention: Show HN: LitLytics – simple open source LLM-driven data analytics | news.ycombinator.com | 2024-10-01 -
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
TypeScript Data Science discussion
TypeScript Data Science related posts
-
DVC Extension for Visual Studio Code
-
How to Build an Internal Data Application Using Google Sheets as a Data Source
-
Show HN: Explore variables in Python notebook – JupyterLab extension
-
Using VSCode to track and visualize AI experiments
-
35+ Newly Launched GitHub Projects Every Developer
-
Using VSCode to track and visualize AI experiments
-
Using VS Code to track and visualize ML experiments
-
A note from our sponsor - Civic Auth
www.civic.com | 24 Apr 2025
Index
What are some of the best open-source Data Science projects in TypeScript? This list will help you:
# | Project | Stars |
---|---|---|
1 | superset | 65,806 |
2 | growthbook | 6,509 |
3 | OpenMetadata | 6,481 |
4 | nteract | 6,238 |
5 | danfojs | 4,903 |
6 | dtale | 4,893 |
7 | orchest | 4,112 |
8 | hydrogen | 3,944 |
9 | tad | 3,286 |
10 | pretzelai | 2,110 |
11 | responsible-ai-toolbox | 1,519 |
12 | latitude | 906 |
13 | tellery | 356 |
14 | voici.js | 337 |
15 | buckaroo | 270 |
16 | dsensei | 262 |
17 | markup | 258 |
18 | IDP | 209 |
19 | vscode-dvc | 204 |
20 | nitro | 201 |
21 | duck-ui | 182 |
22 | pipeline | 132 |
23 | litlytics | 98 |