pydub
pythonic-cv
Our great sponsors
pydub | pythonic-cv | |
---|---|---|
25 | 36 | |
8,339 | 38 | |
- | - | |
0.0 | 0.0 | |
21 days ago | about 2 years ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pydub
- Looking for help with a winamp project please.
-
Best language(s) for creating/manipulating sounds
Honestly while, C++ is used for professional audio software, you can get a lot done with python and a library like pydub, or you can even learn to manipulate audio files without any libraries in any language. So if you are not particulary interested in C++ at the moment you can start with Python, which is easier to learn. You can check out other python audio manipulation libraries here
-
ChatGPT and Whisper APIs
I doubt it will matter if you're breaking up mid sentence if you pass in the previous as a prompt and split words. This is how Whisper does it internally.
It's not absolutely perfect, but splitting on the word boundary is one line of code with the same package in their docs: https://github.com/jiaaro/pydub/blob/master/API.markdown#sil...
25MB is also a lot. That's 30 minutes to an hour on MP3 at reasonable compression. A 2 hour movie would have three splits.
-
FFmpeg 6.0
Even given an option it can be difficult to find the corresponding documentation, if only because of the many different submodules and encoders and decoders and filters that have o-so-slightly different options. That said, I've just switched from pydub to ffmpeg-python (due to memory issues of the former[1]) and judging from the Jupiter notebook[2] it seems a much more intuitive method of constructing ffmpeg pipelines.
[1] https://github.com/jiaaro/pydub/issues/135
[2] https://github.com/kkroening/ffmpeg-python/tree/master/examp...
-
Download & Trim MP3 from Youtube with Python
With the file downloaded, we're now going to arbitrarily slice it locally (you might have considered wheter it is possible to simply download a clip from youtube; all reliable methods I've found will essentially boil down to downloading the whole and then editing locally). For that we'll use the pydub library:
-
Playing multiple .wav and/or mp3 files in Python
I guess it's possible in theory, a quick search suggest pydub library.But you may find something better if you do a little research.
-
I made a cross-platform command-line app called maestro to play music!
Uses https://github.com/cheofusi/just_playback to play sound. It's actually surprising how hard it was to find a cross-platform Python module to play sound that doesn't require an external dependency like ffmpeg. Even then, modules like https://github.com/jiaaro/pydub don't support features like seeking/scrubbing, which was a must-have for my project.
-
Batch conversion FLAC to WAV
Once python is installed, you will also need to install the "pydub" package for this script to work. If you're on a Windows computer, you can do this from the command line (run the "cmd") program. If you're on mac, you can do this from the terminal. Basically, the way that you do this is using "pip" -- a "helper" program that comes with python. Once you launch the command line, just run the command python -m pip install pydub --upgrade and you should see a message showing that it successfully installed. If you're struggling with this step, just google how to "pip install python packages" and you can find a lot of beginner guides.
-
How can I modify the pitch of an audio file and save it to disk?
That is kinda what serverless functions are built for. Looks like python has some good libraries for this: https://github.com/jiaaro/pydub.
-
Playing large audio files?
The files are big, so it's not feasible to load one in all at once. They have to be streamed/chunked somehow. (sadly, pydub doesn't support this...)
pythonic-cv
-
Play a Video on Loop, Replace Video with Photo, Then Go Back to Video
I did something similar in this example, where the “input” was someone covering up a webcam to switch to the next display source (in my case switching through a list of videos that would continue from where they left off).
-
[pyautogui] What is faster?
If you’re wanting to continuously process your screen like a live video stream you might be interested in (my library) pythonic-cv, which supports MSS as a video input backend.
-
Python Code Help - OpenCV Project
I’ve previously done something similar here, but the transition was triggered by covering a webcam (e.g. with a finger).
-
Better alternative to pyautogui image recognition?
pythonic-cv (disclaimer: my library) includes MSS as an input stream option.
-
[Discussion] Any other ways to find convergence of pixels by color?
If that’s of interest, OpenCV provides a detailed stitching example, although my revision is likely a fair amount easier to follow and understand (but it’s been optimised for video, so if you’re wanting to use it with minimal modification you’ll need to provide your images in a sequence where each image has overlap with the one before it, not in a random order).
-
Class has a method called release instead of close. I want to use the With keyword. How do I tell python to call release instead of close
As something of a side note, you may be interested in (my library) pythonic-cv.
-
Write efficient async code in computer vision programs
I wrote pythonic-cv because I found that pipelines regularly require pre and post processing that can be done in parallel across frames - you might want to take a look :-)
-
Is FER just this slow or is it just me?
Likely areas for parallelisation depend on the operations that are happening - if there are independent stages then they can often be made to run at the same time in separate threads (or processes). Concurrency is similar, although more about doing something else while waiting for I/O, and can generally be solved with threads or asyncio coroutines. A common improvement for video-focused computer vision pipelines is reading in/capturing the next frame while the previous frame is being processed (e.g. like is done with pythonic-cv - disclaimer: my library), but given the frame rates you specified then that may help a bit but is likely not the main culprit. ML-based algorithms can often benefit from the inherent parallelisation of a GPU-based implementation, although that can be difficult or even impossible to achieve depending on the algorithm being used.
- Image stitching
-
[Bug] After trying to send to my database my video is suddenly lagging
Personally I’d approach this using pythonic-cv - it has a VideoReader class that supports separate preprocess and process functions, and has threading built in. Then again, I wrote the library, so it’s not so surprising that I’d jump to using it.
What are some alternatives?
librosa - Python library for audio and music analysis
ffmpeg-python - Python bindings for FFmpeg - with complex filtering support
SpeechRecognition - Speech recognition module for Python, supporting several engines and APIs, online and offline.
kaleidoscope - Apply a kaleidoscope effect to images and videos
pyAudioAnalysis - Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
python-mss - An ultra fast cross-platform multiple screenshots module in pure Python using ctypes.
interactive-projectivity-open - Interactive projections using computer vision
mutagen - Python module for handling audio metadata
SmoothStream - Webcam, PiCamera streaming over the network with Python made easy.
audioread - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python
OpenCV - Open Source Computer Vision Library