Article

Deep learning-based apple detection using a suppression mask R-CNN

Journal

PATTERN RECOGNITION LETTERS
Volume 147, Pages 206-211

Publisher

ELSEVIER
DOI: 10.1016/j.patrec.2021.04.022

Keywords

Vision system; Fruit detection; Deep learning; Robotic harvesting; Image segmentation

Researchers have developed a deep learning-based apple detection framework called Suppression Mask R-CNN, designed for accurate and efficient detection in complex orchard environments. Using an apple orchard dataset collected with a color camera under different lighting conditions, the framework achieves an F1-score of 0.905 and a detection time of 0.25 seconds per frame on a standard desktop computer, outperforming state-of-the-art models.
Robotic apple harvesting has received much research attention in the past few years due to a growing shortage and rising cost of labor. One key enabling technology for automated harvesting is accurate and robust apple detection, which is challenging because the complex orchard environment involves varying lighting conditions and foliage/branch occlusions. This letter reports on the development of a novel deep learning-based apple detection framework named Suppression Mask R-CNN. Specifically, we first collect a comprehensive apple orchard dataset for Gala and Blondee apples, using a color camera, under different lighting conditions (overcast and front lighting vs. back lighting). We then develop a novel suppression Mask R-CNN for apple detection, in which a suppression branch is added to the standard Mask R-CNN to suppress non-apple features generated by the original network. Comprehensive evaluations show that the developed suppression Mask R-CNN network outperforms state-of-the-art models, with a higher F1-score of 0.905 and a detection time of 0.25 seconds per frame on a standard desktop computer. © 2021 Elsevier B.V. All rights reserved.
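For readers unfamiliar with the two figures quoted in the abstract, the following is a minimal sketch of how an F1-score and a frame rate are computed in general. The function names and the detection counts are hypothetical illustrations, not values or code from the paper, and the abstract does not state the criterion (for example, an overlap threshold) used to decide whether a detection counts as correct.

```python
def f1_score(true_pos: int, false_pos: int, false_neg: int) -> float:
    """Harmonic mean of precision and recall, written in terms of counts.

    F1 = 2*TP / (2*TP + FP + FN); returns 0.0 when there are no
    detections and no ground-truth objects at all.
    """
    denom = 2 * true_pos + false_pos + false_neg
    return 2 * true_pos / denom if denom else 0.0


def frames_per_second(seconds_per_frame: float) -> float:
    """Convert a per-frame detection time into a throughput figure."""
    if seconds_per_frame <= 0:
        raise ValueError("seconds_per_frame must be positive")
    return 1.0 / seconds_per_frame


# Hypothetical counts: 90 apples correctly detected, 10 false alarms,
# 10 apples missed (assumed numbers, chosen only for illustration).
print(f1_score(90, 10, 10))     # 0.9
# The detection time reported in the abstract, 0.25 s per frame.
print(frames_per_second(0.25))  # 4.0
```

Under these definitions, the reported 0.25 seconds per frame corresponds to roughly 4 frames per second.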

Recommended

Article Computer Science, Artificial Intelligence

Representation Learning by Rotating Your Faces

Luan Tran, Xi Yin, Xiaoming Liu

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2019)

Article Computer Science, Artificial Intelligence

Shape My Face: Registering 3D Face Scans by Surface-to-Surface Translation

Mehdi Bahri, Eimear O'Sullivan, Shunwang Gong, Feng Liu, Xiaoming Liu, Michael M. Bronstein, Stefanos Zafeiriou

Summary: This paper introduces a new learning-based approach for non-rigid registration of face scans. Compared with standard registration algorithms, it is faster, more robust, has fewer parameters, and generalizes to previously unseen datasets. The model's registration quality is extensively evaluated on diverse data, demonstrating robustness and generalizability across different facial scans.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2021)

Article Automation & Control Systems

System design and control of an apple harvesting robot

Kaixiang Zhang, Kyle Lammers, Pengyu Chu, Zhaojian Li, Renfu Lu

Summary: This study presents a robotic apple harvesting prototype with mechatronic design and motion control. The prototype utilizes deep learning for fruit detection and localization, incorporates a pneumatic/motor actuation mechanism for dexterous movements, and features a vacuum-based end-effector for apple detachment. Additionally, a nonlinear control scheme is developed for accurate and agile motion control, demonstrated through field experiments to showcase the robot's performance in apple harvesting.

MECHATRONICS (2021)

Proceedings Paper Computer Science, Artificial Intelligence

TURNIP: TIME-SERIES U-NET WITH RECURRENCE FOR NIR IMAGING PPG

Armand Comas, Tim K. Marks, Hassan Mansour, Suhas Lohit, Yechi Ma, Xiaoming Liu

Summary: Imaging photoplethysmography (iPPG) is a method for estimating a person's pulse waveform by processing a video of their face. In situations with insufficient visible-spectrum illumination, a modular framework with a novel time-series U-net architecture can be used for heartbeat signal estimation. The proposed method outperforms existing models on challenging datasets containing monochromatic NIR videos taken in different conditions.

2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Full-Velocity Radar Returns by Radar-Camera Fusion

Yunfei Long, Daniel Morris, Xiaoming Liu, Marcos Castro, Punarjay Chakravarty, Praveen Narayanan

Summary: This paper presents a closed-form solution for the full-velocity estimate of Doppler returns using optical flow from camera images, and addresses the association problem between radar returns and camera images with a trained neural network. Experimental results on the nuScenes dataset validate the effectiveness of the method in velocity estimation and accumulation of radar points, showing significant improvements over the state-of-the-art.

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Fully Understanding Generic Objects: Modeling, Segmentation, and Reconstruction

Feng Liu, Luan Tran, Xiaoming Liu

Summary: The study focuses on inferring the 3D structure of a generic object from a 2D image. By utilizing semi-supervised learning and decomposing the image into latent representations, the approach enables modeling and model fitting using real 2D images, resulting in superior 3D reconstruction.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Riggable 3D Face Reconstruction via In-Network Optimization

Ziqian Bai, Zhaopeng Cui, Xiaoming Liu, Ping Tan

Summary: This paper introduces a method for riggable 3D face reconstruction from monocular images. A trainable network estimates a personalized face rig and per-image parameters, going beyond static reconstructions and supporting downstream applications such as video retargeting. The network uses in-network optimization to enforce constraints and data-driven priors to constrain the ill-posed monocular setting, leading to state-of-the-art reconstruction accuracy, robustness, and generalization ability.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Depth Completion with Twin Surface Extrapolation at Occlusion Boundaries

Saif Imran, Xiaoming Liu, Daniel Morris

Summary: The method proposed in the paper models both foreground and background depths in difficult occlusion-boundary regions by using a multi-hypothesis depth representation. It performs twin-surface extrapolation instead of interpolation in these regions. The approach trains a network to simultaneously do surface extrapolation and surface fusion using an asymmetric loss function on a novel twin-surface representation.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Towards High Fidelity Face Relighting with Realistic Shadows

Andrew Hou, Ze Zhang, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu

Summary: This study introduces a novel deep face relighting method that can accurately handle shadows while maintaining local facial details. By predicting the ratio image between source and target images and modifying shadows using shadow masks, the method demonstrates state-of-the-art face relighting performance.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Radar-Camera Pixel Depth Association for Depth Completion

Yunfei Long, Daniel Morris, Xiaoming Liu, Marcos Castro, Punarjay Chakravarty, Praveen Narayanan

Summary: This study proposes a mapping method from radar returns to pixels to achieve image-guided radar and video depth completion. By integrating radar and video data at the pixel level, it demonstrates performance superior to using camera or radar alone.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Article Computer Science, Artificial Intelligence

On Learning 3D Face Morphable Model from In-the-Wild Images

Luan Tran, Xiaoming Liu

Summary: This paper proposes an innovative framework to learn a nonlinear 3DMM model from a large set of in-the-wild face images, significantly enhancing its representation power and making significant contributions to face alignment, 3D reconstruction, and face editing.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Deep Facial Non-Rigid Multi-View Stereo

Ziqian Bai, Zhaopeng Cui, Jamal Ahmed Rahim, Xiaoming Liu, Ping Tan

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2020)

Proceedings Paper Computer Science, Artificial Intelligence

Gotta Adapt 'Em All: Joint Pixel and Feature-Level Domain Adaptation for Recognition in the Wild

Luan Tran, Kihyuk Sohn, Xiang Yu, Xiaoming Liu, Manmohan Chandraker

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)