MixSeq
| PAPER |
|---|
| mixSeq: A Simple Data Augmentation Method for Neural Machine Translation |
| SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup |

mixup
Omni-Scale 1D-CNN
CoST
TS2Vec
Long Sequence Modeling
| PAPER |
|---|
| Longformer: The Long-Document Transformer |
| Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context |
| Poolingformer: Long Document Modeling with Pooling Attention |