MixSeq
| PAPER | 
|---|
| mixSeq: A Simple Data Augmentation Method for Neural Machine Translation | 
| SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup | 
mixup
Omni-Scale 1D-CNN
CoST
TS2Vec
Long Sequence Modeling
| PAPER | 
|---|
| Longformer: The Long-Document Transformer | 
| Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context | 
| Poolingformer: Long Document Modeling with Pooling Attention |