Name: See-through-Text Grouping for Referring Image Segmentation
Brand: Future Tech Pavilion, FUTEX
SKU: P0019600003448

Home
About
Lastest
- News
Technologies
Media
- Videos
- Photos
- Download
- Press Releases
Awards
- Future Tech Award
- AI 創新獎
Events
English
- 繁體中文
- English

:::

Home
/
Year
/
2021
/
Electronics & Optoelectronics
/
See-through-Text Grouping for Referring Image Segmentation

recommend

Dual Deep Learning Models for Gastric Premalignant Condition Diagnosis in Precision Health

An Artificial Intelligence Medicine RecognitionVerification System in Hospital Dispensing Room

Human-Robot Co-Dancing: A Computer Vision-Based, No-Code, Intuitive Robot Arm Choreography Interface and Human-Robot Collaborative Creation System

AI-Embedded 5-Axis CNC Controller

Trace

Technical Name	See-through-Text Grouping for Referring Image Segmentation
Project Operator	Institute of Information Science, Academia Sinica
Project Host	劉庭祿
Summary	We propose an iterative learning scheme to tackle the referring image segmentation. In each iteration starting from a given a referring expression, the scheme learns to predict its relevance to each pixel and derives a see-through-text embedding pixel-wise heatmap. Then, a ConvRNN refines the heatmap for altering the referring expression to start the next iteration.
Scientific Breakthrough	The technique iteratively updates the language expression, generates and refines the heatmap to tackle the referring image segmentation. Our model is end-to-end trainable and shows the SOTA performance on four datasets without using an object detector or an attribute predictor as the existing models. This technique is easy to train and provides additional attention-based referring representation.
Industrial Applicability	The multi-modal analysis is one of the main trends in current research. The referring image segmentation addressed by our technique is a cross-modal application which associates computer vision and natural language processing. The multi-modal representation embedding method in this technique can be used as a template for the industry to develop multimedia applications on a combination of visual and language or other different modal.
Keyword	Computer Vision Deep Learning Convolutional Neural Network Convolutional-Recurrent Neural Network Image Segmentation Natural Language Referring Segmentation Embedding Attention Referring Expression

Email
dj_chen_tw@yahoo.com.tw

Matchmaking

other people also saw