跳转到主要内容

标签(标签)

资源精选(342) Go开发(108) Go语言(103) Go(99) angular(82) LLM(78) 大语言模型(63) 人工智能(53) 前端开发(50) LangChain(43) golang(43) 机器学习(39) Go工程师(38) Go程序员(38) Go开发者(36) React(33) Go基础(29) Python(24) Vue(22) Web开发(20) Web技术(19) 精选资源(19) 深度学习(19) Java(18) ChatGTP(17) Cookie(16) android(16) 前端框架(13) JavaScript(13) Next.js(12) 安卓(11) 聊天机器人(10) typescript(10) 资料精选(10) NLP(10) 第三方Cookie(9) Redwoodjs(9) ChatGPT(9) LLMOps(9) Go语言中级开发(9) 自然语言处理(9) PostgreSQL(9) 区块链(9) mlops(9) 安全(9) 全栈开发(8) OpenAI(8) Linux(8) AI(8) GraphQL(8) iOS(8) 软件架构(7) RAG(7) Go语言高级开发(7) AWS(7) C++(7) 数据科学(7) whisper(6) Prisma(6) 隐私保护(6) JSON(6) DevOps(6) 数据可视化(6) wasm(6) 计算机视觉(6) 算法(6) Rust(6) 微服务(6) 隐私沙盒(5) FedCM(5) 智能体(5) 语音识别(5) Angular开发(5) 快速应用开发(5) 提示工程(5) Agent(5) LLaMA(5) 低代码开发(5) Go测试(5) gorm(5) REST API(5) kafka(5) 推荐系统(5) WebAssembly(5) GameDev(5) CMS(5) CSS(5) machine-learning(5) 机器人(5) 游戏开发(5) Blockchain(5) Web安全(5) Kotlin(5) 低代码平台(5) 机器学习资源(5) Go资源(5) Nodejs(5) PHP(5) Swift(5) devin(4) Blitz(4) javascript框架(4) Redwood(4) GDPR(4) 生成式人工智能(4) Angular16(4) Alpaca(4) 编程语言(4) SAML(4) JWT(4) JSON处理(4) Go并发(4) 移动开发(4) 移动应用(4) security(4) 隐私(4) spring-boot(4) 物联网(4) nextjs(4) 网络安全(4) API(4) Ruby(4) 信息安全(4) flutter(4) RAG架构(3) 专家智能体(3) Chrome(3) CHIPS(3) 3PC(3) SSE(3) 人工智能软件工程师(3) LLM Agent(3) Remix(3) Ubuntu(3) GPT4All(3) 软件开发(3) 问答系统(3) 开发工具(3) 最佳实践(3) RxJS(3) SSR(3) Node.js(3) Dolly(3) 移动应用开发(3) 低代码(3) IAM(3) Web框架(3) CORS(3) 基准测试(3) Go语言数据库开发(3) Oauth2(3) 并发(3) 主题(3) Theme(3) earth(3) nginx(3) 软件工程(3) azure(3) keycloak(3) 生产力工具(3) gpt3(3) 工作流(3) C(3) jupyter(3) 认证(3) prometheus(3) GAN(3) Spring(3) 逆向工程(3) 应用安全(3) Docker(3) Django(3) R(3) .NET(3) 大数据(3) Hacking(3) 渗透测试(3) C++资源(3) Mac(3) 微信小程序(3) Python资源(3) JHipster(3) 语言模型(2) 可穿戴设备(2) JDK(2) SQL(2) Apache(2) Hashicorp Vault(2) Spring Cloud Vault(2) Go语言Web开发(2) Go测试工程师(2) WebSocket(2) 容器化(2) AES(2) 加密(2) 输入验证(2) ORM(2) Fiber(2) Postgres(2) Gorilla Mux(2) Go数据库开发(2) 模块(2) 泛型(2) 指针(2) HTTP(2) PostgreSQL开发(2) Vault(2) K8s(2) Spring boot(2) R语言(2) 深度学习资源(2) 半监督学习(2) semi-supervised-learning(2) architecture(2) 普罗米修斯(2) 嵌入模型(2) productivity(2) 编码(2) Qt(2) 前端(2) Rust语言(2) NeRF(2) 神经辐射场(2) 元宇宙(2) CPP(2) 数据分析(2) spark(2) 流处理(2) Ionic(2) 人体姿势估计(2) human-pose-estimation(2) 视频处理(2) deep-learning(2) kotlin语言(2) kotlin开发(2) burp(2) Chatbot(2) npm(2) quantum(2) OCR(2) 游戏(2) game(2) 内容管理系统(2) MySQL(2) python-books(2) pentest(2) opengl(2) IDE(2) 漏洞赏金(2) Web(2) 知识图谱(2) PyTorch(2) 数据库(2) reverse-engineering(2) 数据工程(2) swift开发(2) rest(2) robotics(2) ios-animation(2) 知识蒸馏(2) 安卓开发(2) nestjs(2) solidity(2) 爬虫(2) 面试(2) 容器(2) C++精选(2) 人工智能资源(2) Machine Learning(2) 备忘单(2) 编程书籍(2) angular资源(2) 速查表(2) cheatsheets(2) SecOps(2) mlops资源(2) R资源(2) DDD(2) 架构设计模式(2) 量化(2) Hacking资源(2) 强化学习(2) flask(2) 设计(2) 性能(2) Sysadmin(2) 系统管理员(2) Java资源(2) 机器学习精选(2) android资源(2) android-UI(2) Mac资源(2) iOS资源(2) Vue资源(2) flutter资源(2) JavaScript精选(2) JavaScript资源(2) Rust开发(2) deeplearning(2) RAD(2)

介绍该论文的中文版博客 链接

Citation

If it is helpful for your work, please cite this paper:

@misc{guo2021attention_survey,
      title={Attention Mechanisms in Computer Vision: A Survey}, 
      author={Meng-Hao Guo and Tian-Xing Xu and Jiang-Jiang Liu and Zheng-Ning Liu and Peng-Tao Jiang and Tai-Jiang Mu and Song-Hai Zhang and Ralph R. Martin and Ming-Ming Cheng and Shi-Min Hu},
      year={2021},
      eprint={2111.07624},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

image

  • TODO : Code about different attention mechanisms based on Jittor will be released gradually.
  • TODO : Code link will come soon.
  • TODO : collect more related papers. Contributions are welcome.

🔥 (citations > 200)

Channel attention

  • Squeeze-and-Excitation Networks (CVPR 2018) pdf, (PAMI2019 version) pdf 🔥
  • Image superresolution using very deep residual channel attention networks (ECCV 2018) pdf 🔥
  • Context encoding for semantic segmentation (CVPR 2018) pdf 🔥
  • Spatio-temporal channel correlation networks for action classification (ECCV 2018) pdf
  • Global second-order pooling convolutional networks (CVPR 2019) pdf
  • Srm : A style-based recalibration module for convolutional neural networks (ICCV 2019) pdf
  • You look twice: Gaternet for dynamic filter selection in cnns (CVPR 2019) pdf
  • Second-order attention network for single image super-resolution (CVPR 2019) pdf 🔥
  • DIANet: Dense-and-Implicit Attention Network (AAAI 2020)pdf
  • Spsequencenet: Semantic segmentation network on 4d point clouds (CVPR 2020) pdf
  • Ecanet: Efficient channel attention for deep convolutional neural networks (CVPR 2020) pdf 🔥
  • Gated channel transformation for visual recognition (CVPR2020) pdf
  • Fcanet: Frequency channel attention networks (ICCV 2021) pdf

Spatial attention

  • Recurrent models of visual attention (NeurIPS 2014), pdf 🔥
  • Show, attend and tell: Neural image caption generation with visual attention (PMLR 2015) pdf 🔥
  • Draw: A recurrent neural network for image generation (ICML 2015) pdf 🔥
  • Spatial transformer networks (NeurIPS 2015) pdf 🔥
  • Multiple object recognition with visual attention (ICLR 2015) pdf 🔥
  • Action recognition using visual attention (arXiv 2015) pdf 🔥
  • Videolstm convolves, attends and flows for action recognition (arXiv 2016) pdf 🔥
  • Look closer to see better: Recurrent attention convolutional neural network for fine-grained image recognition (CVPR 2017) pdf 🔥
  • Learning multi-attention convolutional neural network for fine-grained image recognition (ICCV 2017) pdf 🔥
  • Diversified visual attention networks for fine-grained object classification (TMM 2017) pdf 🔥
  • High-Order Attention Models for Visual Question Answering (NeurIPS 2017) pdf
  • Attentional pooling for action recognition (NeurIPS 2017) pdf 🔥
  • Non-local neural networks (CVPR 2018) pdf 🔥
  • Attentional shapecontextnet for point cloud recognition (CVPR 2018) pdf
  • Relation networks for object detection (CVPR 2018) pdf 🔥
  • a2-nets: Double attention networks (NeurIPS 2018) pdf 🔥
  • Attention-aware compositional network for person re-identification (CVPR 2018) pdf 🔥
  • Tell me where to look: Guided attention inference network (CVPR 2018) pdf 🔥
  • Pedestrian alignment network for large-scale person re-identification (TCSVT 2018) pdf 🔥
  • Learn to pay attention (ICLR 2018) pdf 🔥
  • Attention U-Net: Learning Where to Look for the Pancreas (MIDL 2018) pdf 🔥
  • Psanet: Point-wise spatial attention network for scene parsing (ECCV 2018) pdf 🔥
  • Self attention generative adversarial networks (ICML 2019) pdf 🔥
  • Attentional pointnet for 3d-object detection in point clouds (CVPRW 2019) pdf
  • Co-occurrent features in semantic segmentation (CVPR 2019) pdf
  • Factor Graph Attention (CVPR 2019) pdf
  • Attention augmented convolutional networks (ICCV 2019) pdf 🔥
  • Local relation networks for image recognition (ICCV 2019) pdf
  • Latentgnn: Learning efficient nonlocal relations for visual recognition(ICML 2019) pdf
  • Graph-based global reasoning networks (CVPR 2019) pdf 🔥
  • Gcnet: Non-local networks meet squeeze-excitation networks and beyond (ICCVW 2019) pdf 🔥
  • Asymmetric non-local neural networks for semantic segmentation (ICCV 2019) pdf 🔥
  • Looking for the devil in the details: Learning trilinear attention sampling network for fine-grained image recognition (CVPR 2019) pdf
  • Second-order non-local attention networks for person re-identification (ICCV 2019) pdf 🔥
  • End-to-end comparative attention networks for person re-identification (ICCV 2019) pdf 🔥
  • Modeling point clouds with self-attention and gumbel subset sampling (CVPR 2019) pdf
  • Diagnose like a radiologist: Attention guided convolutional neural network for thorax disease classification (arXiv 2019) pdf
  • L2g autoencoder: Understanding point clouds by local-to-global reconstruction with hierarchical self-attention (arXiv 2019) pdf
  • Generative pretraining from pixels (PMLR 2020) pdf
  • Exploring self-attention for image recognition (CVPR 2020) pdf
  • Cf-sis: Semantic-instance segmentation of 3d point clouds by context fusion with self attention (ACM MM 20) pdf
  • Disentangled non-local neural networks (ECCV 2020) pdf
  • Relation-aware global attention for person re-identification (CVPR 2020) pdf
  • Segmentation transformer: Object-contextual representations for semantic segmentation (ECCV 2020) pdf 🔥
  • Spatial pyramid based graph reasoning for semantic segmentation (CVPR 2020) pdf
  • Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation (CVPR 2020) pdf
  • End-to-end object detection with transformers (ECCV 2020) pdf 🔥
  • Pointasnl: Robust point clouds processing using nonlocal neural networks with adaptive sampling (CVPR 2020) pdf
  • Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers (CVPR 2021) pdf
  • An image is worth 16x16 words: Transformers for image recognition at scale (ICLR 2021) pdf 🔥
  • Is Attention Better Than Matrix Decomposition? (ICLR 2021) pdf
  • An empirical study of training selfsupervised vision transformers (CVPR 2021) pdf
  • Ocnet: Object context network for scene parsing (IJCV 2021) pdf 🔥
  • Point transformer (ICCV 2021) pdf
  • PCT: Point Cloud Transformer (CVMJ 2021) pdf
  • Pre-trained image processing transformer (CVPR 2021) pdf
  • An empirical study of training self-supervised vision transformers (ICCV 2021) pdf
  • Segformer: Simple and efficient design for semantic segmentation with transformers (arxiv 2021) pdf
  • Beit: Bert pre-training of image transformers (arxiv 2021) pdf
  • Beyond Self-attention: External attention using two linear layers for visual tasks (arxiv 2021) pdf
  • Query2label: A simple transformer way to multi-label classification (arxiv 2021) pdf
  • Transformer in transformer (arxiv 2021) pdf

Temporal attention

  • Jointly attentive spatial-temporal pooling networks for video-based person re-identification (ICCV 2017) pdf 🔥
  • Video person reidentification with competitive snippet-similarity aggregation and co-attentive snippet embedding (CVPR 2018) pdf
  • Scan: Self-and-collaborative attention network for video person re-identification (TIP 2019) pdf

Branch attention

  • Training very deep networks (NeurIPS 2015) pdf 🔥
  • Selective kernel networks (CVPR 2019) pdf 🔥
  • CondConv: Conditionally Parameterized Convolutions for Efficient Inference (NeurIPS 2019) pdf
  • Dynamic convolution: Attention over convolution kernels (CVPR 2020) pdf
  • ResNest: Split-attention networks (arXiv 2020) pdf 🔥

ChannelSpatial attention

  • Residual attention network for image classification (CVPR 2017) pdf 🔥
  • SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning (CVPR 2017) pdf 🔥
  • CBAM: convolutional block attention module (ECCV 2018) pdf 🔥
  • Harmonious attention network for person re-identification (CVPR 2018) pdf 🔥
  • Recalibrating fully convolutional networks with spatial and channel “squeeze and excitation” blocks (TMI 2018) pdf
  • Mancs: A multi-task attentional network with curriculum sampling for person re-identification (ECCV 2018) pdf 🔥
  • Bam: Bottleneck attention module(BMVC 2018) pdf 🔥
  • Pvnet: A joint convolutional network of point cloud and multi-view for 3d shape recognition (ACM MM 2018) pdf
  • Learning what and where to attend (ICLR 2019) pdf
  • Dual attention network for scene segmentation (CVPR 2019) pdf 🔥
  • Abd-net: Attentive but diverse person re-identification (ICCV 2019) pdf
  • Mixed high-order attention network for person re-identification (ICCV 2019) pdf
  • Mlcvnet: Multi-level context votenet for 3d object detection (CVPR 2020) pdf
  • Improving convolutional networks with self-calibrated convolutions (CVPR 2020) pdf
  • Relation-aware global attention for person re-identification (CVPR 2020) pdf
  • Strip Pooling: Rethinking spatial pooling for scene parsing (CVPR 2020) pdf
  • Rotate to attend: Convolutional triplet attention module, (WACV 2021) pdf
  • Coordinate attention for efficient mobile network design (CVPR 2021) pdf
  • Simam: A simple, parameter-free attention module for convolutional neural networks (ICML 2021) pdf

SpatialTemporal attention

  • An end-to-end spatio-temporal attention model for human action recognition from skeleton data (AAAI 2017) pdf 🔥
  • Diversity regularized spatiotemporal attention for video-based person re-identification (arXiv 2018) 🔥
  • Interpretable spatio-temporal attention for video action recognition (ICCVW 2019) pdf
  • A Simple Baseline for Audio-Visual Scene-Aware Dialog (CVPR 2019) pdf
  • Hierarchical lstms with adaptive attention for visual captioning (TPAMI 2020) pdf
  • Stat: Spatial-temporal attention mechanism for video captioning, (TMM 2020) pdf
  • Gta: Global temporal attention for video action understanding (arXiv 2020) pdf
  • Multi-granularity reference-aided attentive feature aggregation for video-based person re-identification (CVPR 2020) pdf
  • Read: Reciprocal attention discriminator for image-to-video re-identification (ECCV 2020) pdf
  • Decoupled spatial-temporal transformer for video inpainting (arXiv 2021) pdf
  • Towards Coherent Visual Storytelling with Ordered Image Attention (arXiv 2021) pdf

原文:https://github.com/MenghaoGuo/Awesome-Vision-Attentions