Optimize Vision Transformer Architecture via Efficient Attention Modules: A Study on the Monocular Depth Estimation Task Permalink
Published in International Conference on Image Analysis and Processing, 2023
Recommended citation: Schiavella, C., Cirillo, L., Papa, L., Russo, P., & Amerini, I. (2023, September). Optimize Vision Transformer Architecture via Efficient Attention Modules: A Study on the Monocular Depth Estimation Task. In International Conference on Image Analysis and Processing (pp. 383-394). Cham: Springer Nature Switzerland.