SincNet
stereo-image-generation
SincNet | stereo-image-generation | |
---|---|---|
3 | 2 | |
1,097 | 33 | |
- | - | |
0.0 | 10.0 | |
about 3 years ago | over 1 year ago | |
Python | Python | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
SincNet
- Does this SincNet (neural architecture) contain a discriminator?
-
TypeError: layer_norm(): argument 'input' (position 1) must be Tensor, not SincNet.
the sincnet class is taken from here: https://github.com/mravanelli/SincNet/blob/master/dnn_models.py
-
[R][P] Announcing audax, a audio ML/DL framework in Jax
Code for https://arxiv.org/abs/1808.00158 found: https://github.com/mravanelli/SincNet
stereo-image-generation
-
I have a Rokid Air and am looking for suggestions as to how to include it in a HS classroom.
In context computer science class, for example, you may consider familiarizing students with generating stereo SBS image based on images of their choosing, perhaps using stable-diffusion-webui-depthmap-script (works in A1111 UI), or to keep things more focused https://github.com/m5823779/stereo-image-generation (no UI, but very simple to use in command-line).
-
3D side by side images (cross your eyes slowly until the images superimpose and you will see in 3D)
I started from that repository but it didn't work and had to rework a lot of what was there to make it better and more optimized. I can't share my code ATM because it's in draft-state (meaning : horrible mess) and I'm still working a lot on it but I couldn't resist to share a few of my results!
What are some alternatives?
pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
cnnimageretrieval-pytorch - CNN Image Retrieval in PyTorch: Training and evaluating CNNs for Image Retrieval in PyTorch
speechbrain - A PyTorch-based Speech Toolkit
edge-connect - EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
merged_depth - Monocular Depth Estimation - Weighted-average prediction from multiple pre-trained depth estimation models
Image-Forgery-Detection-CNN - Image forgery detection using convolutional neural networks. Group 10's final project for TU Delft's course CS4180 Deep Learning 2019.
EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
ruptures - ruptures: change point detection in Python
calibrated-backprojection-network - PyTorch Implementation of Unsupervised Depth Completion with Calibrated Backprojection Layers (ORAL, ICCV 2021)
UHV-OTS-Speech - A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
whisper-timestamped - Multilingual Automatic Speech Recognition with word-level timestamps and confidence