Jupyter Notebook Audio

Open-source Jupyter Notebook projects categorized as Audio

Top 8 Jupyter Notebook Audio Projects

  • awesome-python-applications

    💿 Free software that works great, and also happens to be open-source Python.

  • digital_video_introduction

    A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸

  • Project mention: Breakdown of AV1 Video Codec | news.ycombinator.com | 2023-12-25

    There's a great introduction to video tech, including codecs, at https://github.com/leandromoreira/digital_video_introduction

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • ast

    Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer". (by YuanGongND)

  • SpecVQGAN

    Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

  • Project mention: Text-to-Audio Generation Using Instruction Tuned LLM and Latent Diffusion Model | news.ycombinator.com | 2023-04-28

    Excellent. Some of the theory here goes back to Oct/2021 and beyond [1].

    The riffusion.com [2] guys made this practical. Also, my video of high-level overview and examples [3].

    1. SpecVQGAN: https://github.com/v-iashin/SpecVQGAN

    2. Riffusion: ://www.riffusion.com/

    3. Riffusion high-level overview: https://youtu.be/olkLVGcvib8

  • sudo_rm_rf

    Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

  • BMT

    Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)

  • vid2cleantxt

    Python API & command-line tool to easily transcribe speech-based video files into clean text

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • WOLOF-ASR-Wav2Vec2

    Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook Audio related posts

Index

What are some of the best open-source Audio projects in Jupyter Notebook? This list will help you:

Project Stars
1 awesome-python-applications 16,200
2 digital_video_introduction 15,095
3 ast 995
4 SpecVQGAN 318
5 sudo_rm_rf 298
6 BMT 220
7 vid2cleantxt 156
8 WOLOF-ASR-Wav2Vec2 12

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com