Question-Answer Generation for Data Augmentation
data augmentation, neural network, generative adversarial network, question generation, reading comprehension
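This thesis addresses three problems in natural language processing: natural language understanding, key content extraction, and question generation; our main goal is to build a better question generation model. Most existing question generation studies assume that the answer is known or obtainable, while others treat answer extraction and question generation as a two-stage task. Earlier neural question generation models were mostly implemented with recurrent neural networks (RNNs). We adopt an attention-based question-answer generation model that performs answer extraction and question generation simultaneously, and we use question-answering models to evaluate whether the generated question-answer pairs are effective for data augmentation. Experimental results show that a model pre-trained on a large dataset gains little from data augmentation during transfer, whereas in the other experiments, question-answering models that use samples produced by the question-answer generation model as augmentation data achieve better accuracy. We also experiment with a generative adversarial network (GAN) to produce additional augmentation data; the results show that samples generated with the GAN do not significantly improve the effectiveness of data augmentation.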
Question generation has attracted growing interest with the rise of deep neural networks; however, previous work either assumes that the answers for question generation are known or treats answer extraction and question generation as separate tasks. Here we propose a question-answer generation model based on the attention mechanism. The results show that a fine-tuned question-answer generation model achieves better performance and can serve as an effective data augmentation method for question answering. We also find that a generative adversarial network does not significantly improve the performance of the question-answer generation model on data augmentation. In addition, we test data augmentation under various conditions and find that a model pre-trained on a large corpus does not benefit from it.
Table of Contents
Abstract (Chinese)
Abstract (English)
Chapter 1 Introduction
1.1 Motivation and Problem Description
1.2 Main Contribution
1.3 Thesis Structure
Chapter 2 Related Work
2.1 Question Generation
2.2 Generative Adversarial Network in Text Generative Model
2.3 Data Augmentation in NLP
Chapter 3 Background
3.1 Related Model
3.1.1 Transformer
3.1.2 Bidirectional Encoder Representations from Transformers
3.2 Algorithm
3.2.1 Cross-Entropy
3.2.3 Beam Search
Chapter 4 Method
4.1 Question-Answer Generation Model
4.2 Diversity-Promoting Generative Adversarial Network
4.3 Diverse Beam Search
Chapter 5 Evaluation
5.1 Experimental Setup
5.1.1 Question-Answer Generation Model
5.1.2 Adversarial Training
5.1.3 Question-Answering Model
5.2 Evaluation Metrics
5.2.1 Bilingual Evaluation Understudy
5.2.2 Recall-Oriented Understudy for Gisting Evaluation
5.2.3 Exact Match and F1 Score
5.3 Results and Analysis
5.3.1 Question-Answer Generation Model
5.3.2 Question-Answering Model
5.4 Summary
Chapter 6 Conclusion and Future Work
6.1 Conclusion
6.2 Future Work
Bibliography
