1. Object Detection
AAAI-2021 Papers: [Link] to 1000
Attention:
AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing [Paper]
Explicitly Modeled Attention Maps for Image Classification
582: Efficient License Plate Recognition via Holistic Position Attention
Split then Refine: Stacked Attention-Guided ResUNets for Blind Single Image Visible Watermark Removal
MANGO: A Mask Attention Guided One-Stage Scene Text Spotter
Self-Supervised Attention-Aware Reinforcement Learning
TDAF: Top-Down Attention Framework for Vision Tasks
Object Relation Attention for Image Paragraph Captioning
DAST: Unsupervised Domain Adaptation in Semantic Segmentation Based on Discriminator Attention and Self-Training
1951: Cascade Network with Guided Loss and Hybrid Attention for Finding Good Correspondences
1975: Dual Sparse Attention Network for Session-Based Recommendation
2832: Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition
4355: Efficient Folded Attention for Medical Image Reconstruction and Segmentation
5186: Attention-Based Multi-Level Fusion Network for Light Field Depth Estimation
6288: A Supervised Multi-Head Self-Attention Network for Nested Named Entity Recognition
6429: Patch-Wise Attention Network for Monocular Depth Estimation
9885: Context-Aware Attentional Pooling (CAP) for Fine-Grained Visual Classification
HOT-VAE: Learning High-Order Label Correlation for Multi-Label Classification via AttentionBased Variational Autoencoders
2261: Arbitrary Video Style Transfer via Multi-Channel Correlation
Multi-Tasks:
Multi-Domain Multi-Task Rehearsal for Lifelong Learning [Paper]
Multi-task Learning by Leveraging the Semantic Information [Paper]
RevMan: Revenue-Aware Multi-Task Online Insurance Recommendation
Learning Modality-Specific Representations with Self-Supervised Multi-Task Learning for Multimodal Sentiment Analysis
Towards Fully End-to-End Task-Oriented Dialog System with GPT-2
Deep Multi-Task Learning for Diabetic Retinopathy Grading in Fundus Images
Bridging Towers of Multi-Task Learning with a Gating Mechanism for Aspect-Based Sentiment Analysis and Sequential Metaphor Identification
Multi-Task Recurrent Modular Networks
Progressive Multi-Task Learning with Controlled information Flow for Joint Entity and Relation Extraction
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark
Community-Aware Multi-Task Transportation Demand Prediction
7857: A Unified Multi-Task Learning Framework for Joint Extraction of Entities and Relations
8202: Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus
Boosting Multi-task Learning through Combination of Task Labels - with Applications in ECG Phenotyping
Graph-Enhanced Multi-Task Learning of Multi-Level Transition Dynamics for Session-Based Recommendation
9720: Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning
9905: TempLe: Learning Template of Transitions for Sample Efficient Multi-Task RL
MTAAL: Multi-Task Adversarial Active Learning for Medical Named Entity Recognition and Normalization
10119: Maximum Roaming Multi-Task Learning
Patch-based:
1252: A Spatial Regulated Patch-Wise Approach for Cervical Dysplasia Diagnosis
6429: Patch-Wise Attention Network for Monocular Depth Estimation
Detection Applications:
898: RGB-D Salient Object Detection via 3D Convolutional Neural Networks
1322: Pyramidal Feature Shrinking for Salient Object Detection
GradingNet: Towards Providing Reliable Supervisions for Weakly Supervised Object Detection by Grading the Box Candidates
Structure-Consistent Weakly Supervised Salient Object Detection with Local Saliency Coherence
2179: RESA: Recurrent Feature-Shift Aggregator for Lane Detection
2258: KGDet: Keypoint-Guided Fashion Detection
2571: Instance Mining with Class Feature Banks for Weakly Supervised Object Detection
2764: Dynamic Anchor Learning for Arbitrary-Oriented Object Detection
2967: Semi-Supervised Sequence Classification through Change Point Detection
3018: Depth Privileged Object Detection in Indoor Scenes via Deformation Hallucination
3337: Voxel R-CNN: Towards High Performance Voxel-Based 3D Object Detection
3345: Co-Mining: Self-Supervised Learning for Sparsely Annotated Object Detection
3496: DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection
3588: Inference Fusion with Associative Semantics for Unseen Object Detection
3692: PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object Detection
3951: Regularizing Attention Networks for Anomaly Detection in Visual Question Answering
4296: Rethinking Object Detection in Retail Stores
4658: Adaptive Pattern-Parameter Matching for Robust Pedestrian Detection
4979: Semantic Consistency Networks for 3D Object Detection
Automated Model Design and Benchmarking of Deep Learning Models for COVID-19 Detection with Chest CT Scans
7292: A Systematic Evaluation of Object Detection Networks for Scientific Plots
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation CoDesign
7562: Category Dictionary Guided Unsupervised Domain Adaptation for Object Detection
9153: StarNet: Towards Weakly Supervised Few-Shot Object Detection
10152: Few-Shot Learning for Multi-Label Intent Detection
3500: Fooling Thermal Infrared Pedestrian Detector in Real World Using Small Bulbs
351: CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point Cloud
364: R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object
5421: AnchorFace: An Anchor-Based Facial Landmark Detector across Large Poses