STrXL: Approximating Permutation Invariance/Equivariance to Model Arbitrary Cardinality Sets

Authors

  • Kendra Givens, Middle Tennessee State University
  • David Ludwig, Middle Tennessee State University
  • Joshua Phillips, Middle Tennessee State University

DOI:

https://doi.org/10.32473/flairs.37.1.135568

Abstract

Current deep-learning techniques for processing sets are limited to a fixed cardinality, and their computational complexity rises steeply as sets grow large. To address this, we take techniques used to model long-term dependencies in natural language processing and combine them with the permutation-equivariant Set Transformer (STr) architecture. The result is Set Transformer XL (STrXL), a novel deep-learning model capable of extending to sets of arbitrary cardinality given fixed computing resources. STrXL's extension capability lies in its recurrent architecture: rather than processing the entire set at once, STrXL processes only a portion of the set at a time and uses a memory mechanism to carry information forward from earlier portions. STrXL is particularly applicable to processing sets of high-throughput sequencing (HTS) samples of DNA sequences, as their set sizes can range into the hundreds of thousands. When tasked with classifying HTS prairie soil samples and MNIST digits, results show that STrXL exhibits an expected trade-off between memory size and accuracy that scales proportionally with the complexity of the downstream task, but, unlike STr, it is capable of generalizing to sets of arbitrary cardinality.
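To illustrate the chunked recurrence the abstract describes, below is a minimal sketch, not the authors' implementation: a set is visited in fixed-size chunks while a fixed-size memory attends over each chunk, presumably in the spirit of Transformer-XL's segment-level recurrence applied to Set Transformer-style attention. The names ChunkedSetEncoder, mem_slots, and chunk_size are hypothetical, introduced only for this example.

# A minimal sketch of chunked set processing with a fixed-size memory;
# this is an illustrative assumption, not the paper's STrXL architecture.
import torch
import torch.nn as nn

class ChunkedSetEncoder(nn.Module):
    """Encode a set by visiting it in fixed-size chunks, carrying a
    fixed-size memory between chunks so the compute per step stays
    bounded regardless of the set's cardinality."""

    def __init__(self, dim=64, heads=4, mem_slots=16):
        super().__init__()
        # Learned initial memory slots (hypothetical parameterization).
        self.mem0 = nn.Parameter(torch.randn(mem_slots, dim))
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.ff = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.ReLU(), nn.Linear(4 * dim, dim))

    def forward(self, x, chunk_size=32):
        # x: (batch, n, dim); the cardinality n may be arbitrarily large.
        mem = self.mem0.unsqueeze(0).expand(x.size(0), -1, -1)
        for chunk in x.split(chunk_size, dim=1):
            # Memory queries attend over [memory ; current chunk], so
            # information from earlier chunks persists in the memory state.
            ctx = torch.cat([mem, chunk], dim=1)
            upd, _ = self.attn(mem, ctx, ctx)
            mem = mem + upd
            mem = mem + self.ff(mem)
        # Pool the memory slots into a set-level representation.
        return mem.mean(dim=1)

enc = ChunkedSetEncoder()
small = enc(torch.randn(2, 40, 64))    # set of 40 elements
large = enc(torch.randn(2, 4000, 64))  # set of 4000 elements, same per-chunk cost
print(small.shape, large.shape)        # both torch.Size([2, 64])

Note that in such a scheme the output depends on the order in which chunks are visited, so permutation invariance holds only approximately across chunk boundaries, consistent with the "approximating" in the paper's title.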


Published

2024-05-13

How to Cite

Givens, K., Ludwig, D., & Phillips, J. (2024). STrXL: Approximating Permutation Invariance/Equivariance to Model Arbitrary Cardinality Sets. The International FLAIRS Conference Proceedings, 37(1). https://doi.org/10.32473/flairs.37.1.135568

Issue

Vol. 37 No. 1 (2024)

Section

Main Track Proceedings