-
Towards-A-Deep-and-Unified-Understanding-of-Deep-Neural-Models-in-NLP
Code implementation of paper Towards A Deep and Unified Understanding of Deep Neural Models in NLP
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
To fine-tune a pre-trained model, we could use the run_langauge_modeling.py. All we need are two text files; one containing the training text pieces, and another containing the text pieces for evaluation.
In Towards a Deep and Unified Understanding of Deep Neural Models in NLP, the authors propose a way to answer this. They also provide the code that we could use to analyze the GPT-2 model with.
Thanks to jessevig’s BertViz tool, we can peek at how GPT-2 works by visualizing the attention values.
Related posts
-
AI enthusiasm #6 - Finetune any LLM you want💡
-
How to leverage the state-of-the-art NLP models in Rust
-
Any recommendation for the replacement of the toolkit jiant? [Research] [Discussion]
-
Lack of activation in transformer feedforward layer?
-
Haystack DB – 10x faster than FAISS with binary embeddings by default