a-PyTorch-Tutorial-to-Image-Captioning
pytorch-tutorial
a-PyTorch-Tutorial-to-Image-Captioning | pytorch-tutorial | |
---|---|---|
1 | 3 | |
2,657 | 29,160 | |
- | - | |
0.0 | 0.0 | |
almost 2 years ago | 9 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
a-PyTorch-Tutorial-to-Image-Captioning
-
[R] end-to-end image captioning
I have found this repository: https://github.com/sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning that, seemingly, requires only images and captions, but this is quite old (3 years ago), and is based on LSTMs. I was hoping there are transformers-based implementations that I could use.
pytorch-tutorial
-
PyTorch - What does contiguous() do?
I was going through this example of a LSTM language model on github (link).What it does in general is pretty clear to me. But I'm still struggling to understand what calling contiguous() does, which occurs several times in the code.
-
How to 'practice' pytorch after finishing its basic tutorial?
I tried to move straight to practicing implementing papers and trying to understand other people's codes but failed miserably. I feel like there was too much of a gap between the basic tutorial and being able to implement ideas into code....hence the question: Is there any resource/way to practice pytorch in general? I did find this and this, but I just wanted to hear what others have gone through to become better at PyTorch up to the point they can build stuff from their own ideas
- [P] Probabilistic Machine Learning: An Introduction, Kevin Murphy's 2021 e-textbook is out
What are some alternatives?
meshed-memory-transformer - Meshed-Memory Transformer for Image Captioning. CVPR 2020
mixture-of-experts - PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
BLIP - PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
InceptionTime - InceptionTime: Finding AlexNet for Time Series Classification
image-to-latex - Convert images of LaTex math equations into LaTex code.
Conv-TasNet - A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
catr - Image Captioning Using Transformer
pytorch-grad-cam - Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
clip-glass - Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search"
BigGAN-PyTorch - The author's officially unofficial PyTorch BigGAN implementation.
blip - A tool for seeing your Internet latency. Try it at http://gfblip.appspot.com/
bonito - A PyTorch Basecaller for Oxford Nanopore Reads