diff --git a/README.md b/README.md
index 793691d..41119bd 100644
--- a/README.md
+++ b/README.md
@@ -5,18 +5,17 @@
 	<img src="https://camo.githubusercontent.com/64f8905651212a80869afbecbf0a9c52a5d1e70beab750dea40a994fa9a9f3c6/68747470733a2f2f617765736f6d652e72652f62616467652e737667" alt="Awesome" data-canonical-src="https://awesome.re/badge.svg" style="max-width: 100%;">	     
 </p>
 
-A curated (still actively updated) list of practical guide resources of LLMs. It's based on our survey paper: [Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond](https://arxiv.org/abs/2304.13712). The survey is partially based on the second half of this [Blog](https://jingfengyang.github.io/gpt).
+A curated (still actively updated) list of practical guide resources of LLMs. It's based on our survey paper: [Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond](https://arxiv.org/abs/2304.13712) and efforts from @[xinyadu](https://github.com/xinyadu). The survey is partially based on the second half of this [Blog](https://jingfengyang.github.io/gpt). We also build an evolutionary tree of modern Large Language Models (LLMs) to trace the development of language models in recent years and highlights some of the most well-known models. 
 
-These sources aim to help practitioners navigate the vast landscape of large language models (LLMs) and their applications in natural language processing (NLP) applications. If you find any resources in our repository helpful, please feel free to use them (and don't forget to cite our paper!)
+These sources aim to help practitioners navigate the vast landscape of large language models (LLMs) and their applications in natural language processing (NLP) applications. We also include their usage restrictions based on the model and data licensing information.
+If you find any resources in our repository helpful, please feel free to use them (don't forget to cite our paper! 😃). We welcome pull requests to refine this figure! 
 
-## Latest News💥
-- We used PowerPoint to plot the figure and released the source file [pptx](./source/figure_gif.pptx) for our GIF figure. [4/27/2023]
-- We released the source file for the still version [pptx](./source/figure_still.pptx), and replaced the figure in this repo with the still version. [4/29/2023]
-- Add AlexaTM, UniLM, UniLMv2 to the figure, and correct the logo for Tk. [4/29/2023]
+<p align="center">
+<img width="600" src="./imgs/tree.jpg"/>
+</p>
 
-We welcome pull requests to refine this figure, and if you find the source helpful, please cite our paper.
 
-    ```bibtex
+```bibtex
     @article{yang2023harnessing,
         title={Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond}, 
         author={Jingfeng Yang and Hongye Jin and Ruixiang Tang and Xiaotian Han and Qizhang Feng and Haoming Jiang and Bing Yin and Xia Hu},
@@ -25,31 +24,64 @@ We welcome pull requests to refine this figure, and if you find the source helpf
         archivePrefix={arXiv},
         primaryClass={cs.CL}
     }
-    ```
-
-## Practical Guide for Models
-
-We build an evolutionary tree of modern Large Language Models (LLMs) to trace the development of language models in recent years and highlights some of the most well-known models, in the following figure:
+```
 
-<p align="center">
-<img width="600" src="./imgs/models-colorgrey.jpg"/>
-</p>
+## Latest News💥
+- We added usage and restrictions section.
+- We used PowerPoint to plot the figure and released the source file [pptx](./source/figure_gif.pptx) for our GIF figure. [4/27/2023]
+- We released the source file for the still version [pptx](./source/figure_still.pptx), and replaced the figure in this repo with the still version. [4/29/2023]
+- Add AlexaTM, UniLM, UniLMv2 to the figure, and correct the logo for Tk. [4/29/2023]
+- Add usage and Restrictions (for commercial and research purposes) section. Credits to [Dr. Du](https://github.com/xinyadu).  [5/8/2023]
+
+
+
+
+## Other Practical Guides for LLMs
+
+- **Why did all of the public reproduction of GPT-3 fail? In which tasks should we use GPT-3.5/ChatGPT?** 2023, [Blog](https://jingfengyang.github.io/gpt) 
+- **Building LLM applications for production**, 2023, [Blog](https://huyenchip.com/2023/04/11/llm-engineering.html)
+- **Data-centric Artificial Intelligence**, 2023, [Repo](https://github.com/daochenzha/data-centric-AI)/[Blog](https://towardsdatascience.com/what-are-the-data-centric-ai-concepts-behind-gpt-models-a590071bb727)/[Paper](https://arxiv.org/abs/2303.10158)
+
+
+## Catalog
+* [The Practical Guides for Large Language Models ](#the-practical-guides-for-large-language-models-)
+   * [Practical Guide for Models](#practical-guide-for-models)
+      * [BERT-style Language Models: Encoder-Decoder or Encoder-only](#bert-style-language-models-encoder-decoder-or-encoder-only)
+      * [GPT-style Language Models: Decoder-only](#gpt-style-language-models-decoder-only)
+   * [Practical Guide for Data](#practical-guide-for-data)
+      * [Pretraining data](#pretraining-data)
+      * [Finetuning data](#finetuning-data)
+      * [Test data/user data](#test-datauser-data)
+   * [Practical Guide for NLP Tasks](#practical-guide-for-nlp-tasks)
+      * [Traditional NLU tasks](#traditional-nlu-tasks)
+      * [Generation tasks](#generation-tasks)
+      * [Knowledge-intensive tasks](#knowledge-intensive-tasks)
+      * [Abilities with Scaling](#abilities-with-scaling)
+      * [Specific tasks](#specific-tasks)
+      * [Real-World ''Tasks''](#real-world-tasks)
+      * [Efficiency](#efficiency)
+      * [Trustworthiness](#trustworthiness)
+      * [Benchmark Instruction Tuning](#benchmark-instruction-tuning)
+      * [Alignment](#alignment)
+         * [Safety Alignment (Harmless)](#safety-alignment-harmless)
+         * [Truthfulness Alignment (Honest)](#truthfulness-alignment-honest)
+         * [Practical Guides for Prompting (Helpful)](#practical-guides-for-prompting-helpful)
+         * [Alignment Efforts of Open-source Communtity](#alignment-efforts-of-open-source-communtity)
+   * [Usage and Restractions (Models and Data)](#Usage-and-Restrictions)
 
-### Other Practical Guides for LLMs
-- Why did all of the public reproduction of GPT-3 fail? In which tasks should we use GPT-3.5/ChatGPT? [Blog](https://jingfengyang.github.io/gpt) 
-- Building LLM applications for production, 2023. [Blog](https://huyenchip.com/2023/04/11/llm-engineering.html)
+## Practical Guide for Models
 
 ### BERT-style Language Models: Encoder-Decoder or Encoder-only
 
 - BERT **BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding**, 2018, [Paper](https://aclanthology.org/N19-1423.pdf)
-- RoBERTa **ALBERT: A Lite BERT for Self-supervised Learning of Language Representations**, 2019, [Paper](https://arxiv.org/abs/1909.11942)
+- RoBERTa **RoBERTa: A Robustly Optimized BERT Pretraining Approach**, 2019, [Paper](https://arxiv.org/abs/1907.11692)
 - DistilBERT **DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter**, 2019, [Paper](https://arxiv.org/abs/1910.01108)
 - ALBERT **ALBERT: A Lite BERT for Self-supervised Learning of Language Representations**, 2019, [Paper](https://arxiv.org/abs/1909.11942)
 - UniLM **Unified Language Model Pre-training for Natural Language Understanding and Generation**, 2019 [Paper](https://arxiv.org/abs/1905.03197)
 - ELECTRA **ELECTRA: PRE-TRAINING TEXT ENCODERS AS DISCRIMINATORS RATHER THAN GENERATORS**, 2020, [Paper](https://openreview.net/pdf?id=r1xMH1BtvB)
-- T5 **"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"**. *Colin Raffel et al.* JMLR 2019. [Paper](https://arxiv.org/abs/1910.10683)]
-- GLM **"GLM-130B: An Open Bilingual Pre-trained Model"**. 2022. [Paper](https://arxiv.org/abs/2210.02414)] 
-- AlexaTM **"AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model"**. *Saleh Soltan et al.* arXiv 2022. [Paper](https://arxiv.org/abs/2208.01448)]
+- T5 **"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"**. *Colin Raffel et al.* JMLR 2019. [Paper](https://arxiv.org/abs/1910.10683)
+- GLM **"GLM-130B: An Open Bilingual Pre-trained Model"**. 2022. [Paper](https://arxiv.org/abs/2210.02414)
+- AlexaTM **"AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model"**. *Saleh Soltan et al.* arXiv 2022. [Paper](https://arxiv.org/abs/2208.01448)
 - ST-MoE **ST-MoE: Designing Stable and Transferable Sparse Expert Models**. 2022 [Paper](https://arxiv.org/abs/2202.08906)
 
 
@@ -60,7 +92,7 @@ We build an evolutionary tree of modern Large Language Models (LLMs) to trace th
 - GPT-3 **"Language Models are Few-Shot Learners"**. NeurIPS 2020. [Paper](https://arxiv.org/abs/2005.14165)
 - OPT **"OPT: Open Pre-trained Transformer Language Models"**. 2022. [Paper](https://arxiv.org/abs/2205.01068)
 - PaLM **"PaLM: Scaling Language Modeling with Pathways"**. *Aakanksha Chowdhery et al.* arXiv 2022. [Paper](https://arxiv.org/abs/2204.02311)
-- BLOOM  **"BLOOM: A 176B-Parameter Open-Access Multilingual Language Model"**. 2022. [Paper](https://arxiv.org/abs/2211.05100)]
+- BLOOM  **"BLOOM: A 176B-Parameter Open-Access Multilingual Language Model"**. 2022. [Paper](https://arxiv.org/abs/2211.05100)
 - MT-NLG **"Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model"**. 2021. [Paper](https://arxiv.org/abs/2201.11990)
 - GLaM **"GLaM: Efficient Scaling of Language Models with Mixture-of-Experts"**. ICML 2022. [Paper](https://arxiv.org/abs/2112.06905)
 - Gopher **"Scaling Language Models: Methods, Analysis & Insights from Training Gopher"**. 2021. [Paper](http://arxiv.org/abs/2112.11446v2)
@@ -70,7 +102,9 @@ We build an evolutionary tree of modern Large Language Models (LLMs) to trace th
 - GPT-4 **"GPT-4 Technical Report"**. 2023. [Paper](http://arxiv.org/abs/2303.08774v2)
 - BloombergGPT **BloombergGPT: A Large Language Model for Finance**, 2023, [Paper](https://arxiv.org/abs/2303.17564)
 - GPT-NeoX-20B: **"GPT-NeoX-20B: An Open-Source Autoregressive Language Model"**. 2022. [Paper](https://arxiv.org/abs/2204.06745)
-
+- PaLM 2: **"PaLM 2 Technical Report"**. 2023. [Tech.Report](https://arxiv.org/abs/2305.10403)
+- LLaMA 2: **"Llama 2: Open foundation and fine-tuned chat models"**. 2023. [Paper](https://arxiv.org/pdf/2307.09288)
+- Claude 2: **"Model Card and Evaluations for Claude Models"**. 2023. [Model Card](https://www-files.anthropic.com/production/images/Model-Card-Claude-2.pdf)
 
 
 
@@ -78,6 +112,7 @@ We build an evolutionary tree of modern Large Language Models (LLMs) to trace th
 
 
 ### Pretraining data
+- **RedPajama**, 2023. [Repo](https://github.com/togethercomputer/RedPajama-Data)
 - **The Pile: An 800GB Dataset of Diverse Text for Language Modeling**, Arxiv 2020. [Paper](https://arxiv.org/abs/2101.00027)
 - **How does the pre-training objective affect what large language models learn about linguistic properties?**, ACL 2022. [Paper](https://aclanthology.org/2022.acl-short.16/)
 - **Scaling laws for neural language models**, 2020. [Paper](https://arxiv.org/abs/2001.08361)
@@ -179,6 +214,7 @@ We build a decision flow for choosing LLMs or fine-tuned models~\protect\footnot
 - **SPeC: A Soft Prompt-Based Calibration on Mitigating Performance Variability in Clinical Notes Summarization**, Arxiv 2023. [Paper](https://arxiv.org/abs/2303.13035)
   
 2. Spurious biases
+- **Large Language Models Can be Lazy Learners: Analyze Shortcuts in In-Context Learning**, Findings of ACL 2023 [Paper](https://aclanthology.org/2023.findings-acl.284/)
 - **Shortcut learning of large language models in natural language understanding: A survey**, 2023 [Paper](https://arxiv.org/abs/2208.11857)
 - **Mitigating gender bias in captioning system**, WWW 2020 [Paper](https://dl.acm.org/doi/abs/10.1145/3442381.3449950)
 - **Calibrate Before Use: Improving Few-Shot Performance of Language Models**, ICML 2021 [Paper](https://arxiv.org/abs/2102.09690)
@@ -199,7 +235,7 @@ We build a decision flow for choosing LLMs or fine-tuned models~\protect\footnot
 - **Cross-task generalization via natural language crowdsourcing instructions**, ACL 2022 [Paper](https://aclanthology.org/2022.acl-long.244.pdf)
 - Tk-INSTRUCT: **Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks**, EMNLP 2022 [Paper](https://aclanthology.org/2022.emnlp-main.340/)
 - FLAN-T5/PaLM: **Scaling Instruction-Finetuned Language Models**, Arxiv 2022 [Paper](https://arxiv.org/abs/2210.11416)
-- **The Flan Collection: Designing Data and Methods for Effective Instruction Tuning**, Arxiv 2023 [Paper](https://arxiv.org/abs/2208.03299)
+- **The Flan Collection: Designing Data and Methods for Effective Instruction Tuning**, Arxiv 2023 [Paper](https://arxiv.org/abs/2301.13688)
 - **OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization**, Arxiv 2023 [Paper](https://arxiv.org/abs/2212.12017)
 
 ### Alignment
@@ -227,20 +263,321 @@ We build a decision flow for choosing LLMs or fine-tuned models~\protect\footnot
 
 #### Practical Guides for Prompting (Helpful)
 
-- OpenAI Cookbook. [Blog](https://github.com/openai/openai-cookbook/blob/main/techniques_to_improve_reliability.md)
-- Prompt Engineering. [Blog](https://lilianweng.github.io/posts/2023-03-15-prompt-engineering/)
-- ChatGPT Prompt Engineering for Developers! [Course](https://www.deeplearning.ai/short-courses/chatgpt-prompt-engineering-for-developers/)
+- **OpenAI Cookbook**. [Blog](https://github.com/openai/openai-cookbook/blob/main/techniques_to_improve_reliability.md)
+- **Prompt Engineering**. [Blog](https://lilianweng.github.io/posts/2023-03-15-prompt-engineering/)
+- **ChatGPT Prompt Engineering for Developers!** [Course](https://www.deeplearning.ai/short-courses/chatgpt-prompt-engineering-for-developers/)
 
 #### Alignment Efforts of Open-source Communtity
 
 - **Self-Instruct: Aligning Language Model with Self Generated Instructions**, Arxiv 2022 [Paper](https://arxiv.org/abs/2212.10560)
-- Alpaca. [Repo](https://github.com/tatsu-lab/stanford_alpaca)
-- Vicuna. [Repo](https://github.com/lm-sys/FastChat)
-- Dolly. [Blog](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm)
-- DeepSpeed-Chat. [Blog](https://github.com/microsoft/DeepSpeedExamples/tree/master/applications/DeepSpeed-Chat)
-- GPT4All. [Repo](https://github.com/nomic-ai/gpt4all)
-- OpenAssitant. [Repo](https://github.com/LAION-AI/Open-Assistant)
-- ChatGLM. [Repo](https://github.com/THUDM/ChatGLM-6B)
-- MOSS. [Repo](https://github.com/OpenLMLab/MOSS)
-
+- **Alpaca**. [Repo](https://github.com/tatsu-lab/stanford_alpaca)
+- **Vicuna**. [Repo](https://github.com/lm-sys/FastChat)
+- **Dolly**. [Blog](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm)
+- **DeepSpeed-Chat**. [Blog](https://github.com/microsoft/DeepSpeedExamples/tree/master/applications/DeepSpeed-Chat)
+- **GPT4All**. [Repo](https://github.com/nomic-ai/gpt4all)
+- **OpenAssitant**. [Repo](https://github.com/LAION-AI/Open-Assistant)
+- **ChatGLM**. [Repo](https://github.com/THUDM/ChatGLM-6B)
+- **MOSS**. [Repo](https://github.com/OpenLMLab/MOSS)
+- **Lamini**. [Repo](https://github.com/lamini-ai/lamini/)/[Blog](https://lamini.ai/blog/introducing-lamini)
+
+## Usage and Restrictions
+
+<!-- We build a decision flow for choosing LLMs or fine-tuned models~\protect\footnotemark for user's NLP applications.  -->
+<!-- The decision flow helps users assess whether their downstream NLP applications at hand meet specific conditions and, based on that evaluation, determine whether LLMs or fine-tuned models are the most suitable choice for their applications. -->
+
+We build a table summarizing the LLMs usage restrictions (e.g. for commercial and research purposes). In particular, we provide the information from the models and their pretraining data's perspective.
+We urge the users in the community to refer to the licensing information for public models and data and use them in a responsible manner.
+We urge the developers to pay special attention to licensing, make them transparent and comprehensive, to prevent any unwanted and unforeseen usage.
+
+<table class="table table-bordered table-hover table-condensed">
+    <thead><tr><th title="Field #1">LLMs</th>
+    <th title="Field #2" colspan="3" align="center">Model</th>
+    <!-- <th title="Field #3"></th> -->
+    <!-- <th title="Field #4"></th> -->
+    <th title="Field #5" colspan="2" align="center">Data</th>
+    <!-- <th title="Field #6"></th> -->
+    </tr></thead>
+    <tbody><tr>
+    <td> </td>
+    <td><b>License<b></td>
+    <td><b>Commercial Use<b></td>
+    <td><b>Other noteable restrictions<b></td>
+    <td><b>License<b></td>
+    <td><b>Corpus<b></td>
+    </tr>
+    <tr>
+        <td colspan="6" align="left"><b>Encoder-only</b></td>
+    <tr>
+    <tr>
+    <td>BERT series of models (general domain)</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>BooksCorpus, English Wikipedia</td>
+    </tr>
+    <tr>
+    <td>RoBERTa</td>
+    <td>MIT license</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>BookCorpus, CC-News, OpenWebText, STORIES</td>
+    </tr>
+    <tr>
+    <td>ERNIE</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>English Wikipedia</td>
+    </tr>
+    <tr>
+    <td>SciBERT</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>BERT corpus, <a href="https://aclanthology.org/N18-3011.pdf">1.14M papers from Semantic Scholar</a></td>
+    </tr>
+    <tr>
+    <td>LegalBERT</td>
+    <td>CC BY-SA 4.0</td>
+    <td>❌</td>
+    <td> </td>
+    <td>Public (except data from the <a href="https://case.law/">Case Law Access Project</a>)</td>
+    <td>EU legislation,  US court cases, etc.</td>
+    </tr>
+    <tr>
+    <td>BioBERT</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td><a href="https://www.nlm.nih.gov/databases/download/terms_and_conditions.html">PubMed</a></td>
+    <td>PubMed, PMC</td>
+    </tr>
+    <tr>
+        <td colspan="6" align="left"><b>Encoder-Decoder</b></td>
+    <tr>
+    <tr>
+    <td>T5</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>C4</td>
+    </tr>
+    <tr>
+    <td>Flan-T5</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>C4, Mixture of tasks (Fig 2 in paper)</td>
+    </tr>
+    <tr>
+    <td>BART</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>RoBERTa corpus </td>
+    </tr>
+    <tr>
+    <td>GLM</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>BooksCorpus and English Wikipedia</td>
+    </tr>
+    <tr>
+    <td>ChatGLM</td>
+    <td><a href="https://github.com/THUDM/ChatGLM-6B/blob/main/MODEL_LICENSE">ChatGLM License</a></td>
+    <td>❌</td>
+    <td>No use for illegal purposes or military research, no harm the public interest of society</td>
+    <td>N/A</td>
+    <td>1T tokens of Chinese and English corpus</td>
+    </tr>
+    <tr>
+        <td colspan="6" align="left"><b>Decoder-only</b></td>
+    <tr>
+    <td>GPT2 </td>
+    <td><a href="https://github.com/openai/gpt-2/blob/master/LICENSE">Modified MIT License</a></td>
+    <td>✅</td>
+    <td>Use GPT-2 responsibly and clearly indicate your content was created using GPT-2.</td>
+    <td>Public</td>
+    <td>WebText</td>
+    </tr>
+    <tr>
+    <td>GPT-Neo</td>
+    <td>MIT license</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td><a href="https://pile.eleuther.ai/">Pile</a></td>
+    </tr>
+    <tr>
+    <td>GPT-J</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>Pile</td>
+    </tr>
+    <tr>
+    <td>---&gt; Dolly</td>
+    <td>CC BY NC 4.0</td>
+    <td>❌</td>
+    <td> </td>
+    <td>CC BY NC 4.0, Subject to terms of Use of the data generated by OpenAI</td>
+    <td>Pile, Self-Instruct</td>
+    </tr>
+    <tr>
+    <td>---&gt; GPT4ALL-J</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td><a href="https://huggingface.co/datasets/nomic-ai/gpt4all-j-prompt-generations">GPT4All-J dataset</a></td>
+    </tr>
+    <tr>
+    <td>Pythia</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>Pile</td>
+    </tr>
+    <tr>
+    <td>---&gt; Dolly v2</td>
+    <td>MIT license</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td>Pile, databricks-dolly-15k</td>
+    </tr>
+    <tr>
+    <td>OPT</td>
+    <td><a href="https://github.com/facebookresearch/metaseq/blob/main/projects/OPT/MODEL_LICENSE.md?fbclid=IwAR1BFK5X1XdUpx_QXoiqyfzYWdNAXJPcg8Cf0ddv5T7sa2UrLUvymj1J8G4">OPT-175B LICENSE AGREEMENT</a></td>
+    <td>❌</td>
+    <td>No development relating to surveillance research and military, no harm the public interest of society</td>
+    <td>Public</td>
+    <td>RoBERTa corpus, the Pile, PushShift.io Reddit</td>
+    </tr>
+    <tr>
+    <td>---&gt; OPT-IML</td>
+    <td><a href="https://github.com/facebookresearch/metaseq/blob/main/projects/OPT/MODEL_LICENSE.md?fbclid=IwAR1BFK5X1XdUpx_QXoiqyfzYWdNAXJPcg8Cf0ddv5T7sa2UrLUvymj1J8G4">OPT-175B LICENSE AGREEMENT</a></td>
+    <td>❌</td>
+    <td>same to OPT</td>
+    <td>Public</td>
+    <td>OPT corpus, Extended version of Super-NaturalInstructions</td>
+    </tr>
+    <tr>
+    <td>YaLM</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Unspecified</td>
+    <td>Pile, Teams collected Texts in Russian</td>
+    </tr>
+    <tr>
+    <td>BLOOM</td>
+    <td><a href="https://bigscience.huggingface.co/blog/the-bigscience-rail-license">The BigScience RAIL License</a></td>
+    <td>✅</td>
+    <td>No use of generating verifiably false information with the purpose of harming others; <br/>content without expressly disclaiming that the text is machine generated</td>
+    <td>Public</td>
+    <td>ROOTS corpus (Lauren¸con et al., 2022)</td>
+    </tr>
+    <tr>
+    <td>---&gt; BLOOMZ</td>
+    <td><a href="https://bigscience.huggingface.co/blog/the-bigscience-rail-license">The BigScience RAIL License</a></td>
+    <td>✅</td>
+    <td>same to BLOOM</td>
+    <td>Public</td>
+    <td>ROOTS corpus, xP3</td>
+    </tr>
+    <tr>
+    <td>Galactica</td>
+    <td><a href="https://github.com/paperswithcode/galai/blob/main/LICENSE-MODEL.md">CC BY-NC 4.0</a></td>
+    <td>❌</td>
+    <td> </td>
+    <td>N/A</td>
+    <td>The Galactica Corpus</td>
+    </tr>
+    <tr>
+    <td>LLaMA</td>
+    <td><a href="https://docs.google.com/forms/d/e/1FAIpQLSfqNECQnMkycAp2jP4Z9TFX0cGR4uf7b_fBxjY_OjhJILlKGA/viewform">Non-commercial bespoke license</a></td>
+    <td>❌</td>
+    <td>No development relating to surveillance research and military, no harm the public interest of society</td>
+    <td>Public</td>
+    <td>CommonCrawl, C4, Github, Wikipedia, etc.</td>
+    </tr>
+    <tr>
+    <td>---&gt; Alpaca</td>
+    <td>CC BY NC 4.0</td>
+    <td>❌</td>
+    <td> </td>
+    <td>CC BY NC 4.0, Subject to terms of Use of the data generated by OpenAI</td>
+    <td>LLaMA corpus, Self-Instruct</td>
+    </tr>
+    <tr>
+    <td>---&gt; Vicuna</td>
+    <td>CC BY NC 4.0</td>
+    <td>❌</td>
+    <td> </td>
+    <td>Subject to terms of Use of the data generated by OpenAI; <br/>Privacy Practices of ShareGPT</td>
+    <td>LLaMA corpus, 70K conversations from <a href="http://sharegpt.com/">ShareGPT.com</a></td>
+    </tr>
+    <tr>
+    <td>---&gt; GPT4ALL</td>
+    <td>GPL Licensed LLaMa</td>
+    <td>❌</td>
+    <td> </td>
+    <td>Public</td>
+    <td><a href="https://huggingface.co/datasets/nomic-ai/gpt4all_prompt_generations">GPT4All dataset</a></td>
+    </tr>
+    <tr>
+    <td>OpenLLaMA</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td><a href="https://www.together.xyz/blog/redpajama">RedPajama</a></td>
+    </tr>
+    <tr>
+    <td>CodeGeeX</td>
+    <td><a href="https://github.com/THUDM/CodeGeeX/blob/main/MODEL_LICENSE">The CodeGeeX License</a></td>
+    <td>❌</td>
+    <td>No use for illegal purposes or military research</td>
+    <td>Public</td>
+    <td>Pile, CodeParrot, etc.</td>
+    </tr>
+    <tr>
+    <td>StarCoder</td>
+    <td><a href="https://huggingface.co/spaces/bigcode/bigcode-model-license-agreement">BigCode OpenRAIL-M v1 license</a></td>
+    <td>✅</td>
+    <td>No use of generating verifiably false information with the purpose of harming others; <br/>content without expressly disclaiming that the text is machine generated</td>
+    <td>Public</td>
+    <td><a href="https://arxiv.org/pdf/2211.15533.pdf">The Stack</a></td>
+    </tr>
+    <td>MPT-7B</td>
+    <td>Apache 2.0</td>
+    <td>✅</td>
+    <td> </td>
+    <td>Public</td>
+    <td><a href="https://arxiv.org/abs/2010.11934">mC4 (english)</a>, <a href="https://arxiv.org/pdf/2211.15533.pdf">The Stack</a>, <a href="https://www.together.xyz/blog/redpajama">RedPajama</a>, <a href="https://aclanthology.org/2020.acl-main.447/">S2ORC</a></td>
+    <tr>
+        <td><a href="https://huggingface.co/tiiuae/falcon-40b">falcon</a></td>
+        <td><a href="https://huggingface.co/tiiuae/falcon-40b/blob/main/LICENSE.txt">TII Falcon LLM License</a></td>
+        <td>✅/❌</td>
+        <td>Available under a license allowing commercial use</td>
+        <td>Public</td>
+        <td><a href="https://huggingface.co/datasets/tiiuae/falcon-refinedweb">RefinedWeb</a></td>
+    </tr>
+    </tbody></table>
+
+## Star History
+
+[![Star History Chart](https://api.star-history.com/svg?repos=Mooler0410/LLMsPracticalGuide&type=Date)](https://star-history.com/#Mooler0410/LLMsPracticalGuide&Date)
 
diff --git a/awesome_examples/tableQA.md b/awesome_examples/tableQA.md
index 2a98982..27294ae 100644
--- a/awesome_examples/tableQA.md
+++ b/awesome_examples/tableQA.md
@@ -4,6 +4,8 @@ In this lesson, we ask the model to answer questions based on a table. The table
 
 Comparing the following two examples, ChatGPT is vulnerable to table row order perturbation, while GPT4 is robust to table row order perturbation. Such robustness could probably be due to two reasons. The first reason is larger model size and more pretraining data of GPT4. Secondly, better truthfulness stemming from better RLHF alignment could help GPT4 follow different formats of the same instructions better. 
 
+Note that smaller finetuned models heavily suffer from such non-robustness issue, according to the paper: [TableFormer: Robust Transformer Modeling for Table-Text Encoding](https://arxiv.org/pdf/2203.00274.pdf)
+
 # Example 1 (2022/04/29)
 
 ## ChatGPT
diff --git a/imgs/qr_version.jpg b/imgs/qr_version.jpg
new file mode 100644
index 0000000..01b8da7
Binary files /dev/null and b/imgs/qr_version.jpg differ
diff --git a/imgs/tree.jpg b/imgs/tree.jpg
new file mode 100644
index 0000000..99486bc
Binary files /dev/null and b/imgs/tree.jpg differ
diff --git a/imgs/tree.png b/imgs/tree.png
index 24d20af..681b42c 100644
Binary files a/imgs/tree.png and b/imgs/tree.png differ
diff --git a/source/README.md b/source/README.md
index ed00f88..da034db 100644
--- a/source/README.md
+++ b/source/README.md
@@ -1,4 +1,5 @@
 ### change log
 
 - V1 (04/07/2023): First version of the figure. 
-- V2 (04/29/2023): Second version of the figure. (The gif version is not updated) 
+- V2 (04/29/2023): Second version of the figure. (The gif version is not updated)
+- V3 (08/06/2023): added Claude 2 and LLama-2-Chat
diff --git a/source/figure_gif.pptx b/source/figure_gif.pptx
index 1f5e771..5c3b2ab 100644
Binary files a/source/figure_gif.pptx and b/source/figure_gif.pptx differ
diff --git a/source/figure_still.pptx b/source/figure_still.pptx
index e0b0c78..f693241 100644
Binary files a/source/figure_still.pptx and b/source/figure_still.pptx differ

LLMs	Model			Data
	License	Commercial Use	Other noteable restrictions	License	Corpus
Encoder-only
BERT series of models (general domain)	Apache 2.0	✅		Public	BooksCorpus, English Wikipedia
RoBERTa	MIT license	✅		Public	BookCorpus, CC-News, OpenWebText, STORIES
ERNIE	Apache 2.0	✅		Public	English Wikipedia
SciBERT	Apache 2.0	✅		Public	BERT corpus, 1.14M papers from Semantic Scholar
LegalBERT	CC BY-SA 4.0	❌		Public (except data from the Case Law Access Project)	EU legislation, US court cases, etc.
BioBERT	Apache 2.0	✅		PubMed	PubMed, PMC
Encoder-Decoder
T5	Apache 2.0	✅		Public	C4
Flan-T5	Apache 2.0	✅		Public	C4, Mixture of tasks (Fig 2 in paper)
BART	Apache 2.0	✅		Public	RoBERTa corpus
GLM	Apache 2.0	✅		Public	BooksCorpus and English Wikipedia
ChatGLM	ChatGLM License	❌	No use for illegal purposes or military research, no harm the public interest of society	N/A	1T tokens of Chinese and English corpus
Decoder-only
GPT2	Modified MIT License	✅	Use GPT-2 responsibly and clearly indicate your content was created using GPT-2.	Public	WebText
GPT-Neo	MIT license	✅		Public	Pile
GPT-J	Apache 2.0	✅		Public	Pile
---> Dolly	CC BY NC 4.0	❌		CC BY NC 4.0, Subject to terms of Use of the data generated by OpenAI	Pile, Self-Instruct
---> GPT4ALL-J	Apache 2.0	✅		Public	GPT4All-J dataset
Pythia	Apache 2.0	✅		Public	Pile
---> Dolly v2	MIT license	✅		Public	Pile, databricks-dolly-15k
OPT	OPT-175B LICENSE AGREEMENT	❌	No development relating to surveillance research and military, no harm the public interest of society	Public	RoBERTa corpus, the Pile, PushShift.io Reddit
---> OPT-IML	OPT-175B LICENSE AGREEMENT	❌	same to OPT	Public	OPT corpus, Extended version of Super-NaturalInstructions
YaLM	Apache 2.0	✅		Unspecified	Pile, Teams collected Texts in Russian
BLOOM	The BigScience RAIL License	✅	No use of generating verifiably false information with the purpose of harming others; content without expressly disclaiming that the text is machine generated	Public	ROOTS corpus (Lauren¸con et al., 2022)
---> BLOOMZ	The BigScience RAIL License	✅	same to BLOOM	Public	ROOTS corpus, xP3
Galactica	CC BY-NC 4.0	❌		N/A	The Galactica Corpus
LLaMA	Non-commercial bespoke license	❌	No development relating to surveillance research and military, no harm the public interest of society	Public	CommonCrawl, C4, Github, Wikipedia, etc.
---> Alpaca	CC BY NC 4.0	❌		CC BY NC 4.0, Subject to terms of Use of the data generated by OpenAI	LLaMA corpus, Self-Instruct
---> Vicuna	CC BY NC 4.0	❌		Subject to terms of Use of the data generated by OpenAI; Privacy Practices of ShareGPT	LLaMA corpus, 70K conversations from ShareGPT.com
---> GPT4ALL	GPL Licensed LLaMa	❌		Public	GPT4All dataset
OpenLLaMA	Apache 2.0	✅		Public	RedPajama
CodeGeeX	The CodeGeeX License	❌	No use for illegal purposes or military research	Public	Pile, CodeParrot, etc.
StarCoder	BigCode OpenRAIL-M v1 license	✅	No use of generating verifiably false information with the purpose of harming others; content without expressly disclaiming that the text is machine generated	Public	The Stack
MPT-7B	Apache 2.0	✅		Public	mC4 (english), The Stack, RedPajama, S2ORC
falcon	TII Falcon LLM License	✅/❌	Available under a license allowing commercial use	Public	RefinedWeb