Optimize Vision Transformer Architecture via Efficient Attention Modules: A Study on the Monocular Depth Estimation Task Permalink
Schiavella, C., Cirillo, L., Papa, L., Russo, P., & Amerini, I. (2023, September). Optimize Vision Transformer Architecture via Efficient Attention Modules: A Study on the Monocular Depth Estimation Task. In International Conference on Image Analysis and Processing (pp. 383-394). Cham: Springer Nature Switzerland.