GitHub / graykode/xlnet-pytorch issues and pull requests
#17 - Parameter initialized with torch.randn may be not a good choice
Issue -
State: open - Opened by lddsdu almost 4 years ago
#16 - position-wise feedforward only one linear layer?
Issue -
State: open - Opened by yuanenming about 4 years ago
#15 - Runtime Error in colab
Issue -
State: open - Opened by shiningrain almost 5 years ago - 1 comment
#14 - Batch training
Issue -
State: open - Opened by tonyzhao6 about 5 years ago
#13 - Error and general question
Issue -
State: open - Opened by jbmaxwell over 5 years ago - 1 comment
#12 - Confusion about the relative position embedding with attn_type='bi' but bsz=1
Issue -
State: open - Opened by NotANumber124 almost 6 years ago - 1 comment
Labels: help wanted, question
#11 - RuntimeError: Expected object of scalar type Byte but got scalar type Bool for argument #2 'other'
Issue -
State: open - Opened by Petkomat almost 6 years ago - 3 comments
Labels: bug
#10 - how to save the pre-trained language model?
Issue -
State: open - Opened by TE-andrewshin almost 6 years ago - 1 comment
Labels: question
#9 - how to do inference?
Issue -
State: open - Opened by EricBK almost 6 years ago - 1 comment
Labels: question
#8 - Model name 'bert-large-uncased' was not found in model name list (bert-base-uncased, bert-large-uncased, bert-base-cased, bert-large-cased, bert-base-multilingual-uncased, bert-base-multilingual-cased, bert-base-chinese).
Issue -
State: open - Opened by 17764591637 about 6 years ago - 1 comment
Labels: bug
#7 - do we need to create_data for every epoch?
Issue -
State: closed - Opened by mehdimashayekhi about 6 years ago
#6 - TypeError:can't convert np.ndarray of type numpy.bool_
Issue -
State: open - Opened by menggehe about 6 years ago - 2 comments
Labels: bug
#5 - about first men's bug
Issue -
State: open - Opened by sherry-1001 about 6 years ago
Labels: bug
#4 - The permutation function " _local_perm" is confused.
Issue -
State: closed - Opened by chenwq95 about 6 years ago
#3 - Reimplement training time and the performance on each task?
Issue -
State: open - Opened by Zzmonica about 6 years ago - 5 comments
Labels: question
#2 - Low accuracy on sample task
Issue -
State: open - Opened by ndalton12 about 6 years ago - 1 comment
Labels: question
#1 - Re-implementation Performance
Issue -
State: open - Opened by lukemelas about 6 years ago - 1 comment
Labels: question