MixSeq
PAPER |
---|
mixSeq: A Simple Data Augmentation Method for Neural Machine Translation |
SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup |
mixup
Omni-Scale 1D-CNN
CoST
TS2Vec
Long Sequence Modeling
PAPER |
---|
Longformer: The Long-Document Transformer |
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context |
Poolingformer: Long Document Modeling with Pooling Attention |