Both the train and test set require a tab-separated format. Each line in the train (or test) file corresponds to an instance, and it should be arranged as
label sentence#1 sentence#2 other_info
For more details about the data format, you can download the Quora Question Pair dataset used in this paper.