4.7 Article

SeaShips: A Large-Scale Precisely Annotated Dataset for Ship Detection

期刊

IEEE TRANSACTIONS ON MULTIMEDIA
卷 20, 期 10, 页码 2593-2604

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2018.2865686

关键词

Object detection; ship dataset; neural networks; ship detection

资金

  1. National High-Resolution Earth Observation System Major Projects of China [02-Y30B19-9001-15/17]
  2. National Natural Science Foundation of China [61671332, 41771452, 41771454]
  3. Guangzhou Science and Technology Project [201604020070]
  4. Key Research and Development Program of Hubei Province of China [2016AAA018]

向作者/读者索取更多资源

In this paper, we introduce a new large-scale dataset of ships, called SeaShips, which is designed for training and evaluating ship object detection algorithms. The dataset currently consists of 31 455 images and covers six common ship types (ore carrier, bulk cargo carrier, general cargo ship, container ship, fishing boat, and passenger ship). All of the images are from about 10 080 real-world video segments, which are acquired by the monitoring cameras in a deployed coastline video surveillance system. They are carefully selected to mostly cover all possible imaging variations, for example, different scales, hull parts, illumination, viewpoints, backgrounds, and occlusions. All images are annotated with ship-type labels and high-precision bounding boxes. Based on the SeaShips dataset, we present the performance of three detectors as a baseline to do the following: 1) elementarily summarize the difficulties of the dataset for ship detection; 2) show detection results for researchers using the dataset; and 3) make a comparison to identify the strengths and weaknesses of the baseline algorithms. In practice, the SeaShips dataset would hopefully advance research and applications on ship detection.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Engineering, Electrical & Electronic

Rethinking Lightweight: Multiple Angle Strategy for Efficient Video Action Recognition

Jianyu Chen, Zhongyuan Wang, Kangli Zeng, Zheng He, Zixiang Xiong

Summary: In the task of video action recognition, we propose a lightweight video feature extraction strategy that performs convolution on the video cube from three orthogonal angles to learn appearance and motion features. Our approach achieves better recognition performance with lower computational volume than 3D CNN.

IEEE SIGNAL PROCESSING LETTERS (2022)

Article Computer Science, Artificial Intelligence

Single-channel Multi-speakers Speech Separation Based on Isolated Speech Segments

Shanfa Ke, Zhongyuan Wang, Ruimin Hu, Xiaochen Wang

Summary: This paper proposes a single-channel multi-speaker speech separation method based on the similarity between the speaker feature center and the mixture feature in the deep embedding space. The method achieves separation of speaker's speech by extracting isolated speech segments and creating a speaker feature center. A residual-based deep embedding network is introduced for faster speed and better feature extraction. Experimental results show that the proposed method outperforms competing algorithms in Signal-to-Distortion Ratio.

NEURAL PROCESSING LETTERS (2023)

Article Computer Science, Information Systems

Local Eyebrow Feature Attention Network for Masked Face Recognition

Baojin Huang, Zhongyuan Wang, Guangcheng Wang, Zhen Han, Kui Jiang

Summary: During the COVID-19 pandemic, wearing masks has become ubiquitous, posing a challenge for traditional face recognition algorithms. This article proposes a local eyebrow feature attention network for recognizing masked faces, which utilizes the identification potential of eyebrow features due to the limited visible information on masked faces. The network includes feature extraction, eyebrow region pooling, and feature fusion, achieving superior performance compared to state-of-the-art methods on benchmark datasets.

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2023)

Article Computer Science, Information Systems

DeepAPP: A Deep Reinforcement Learning Framework for Mobile Application Usage Prediction

Zhihao Shen, Kang Yang, Xi Zhao, Jianhua Zou, Wan Du

Summary: This paper aims to predict a set of apps a user will open on her mobile device in the next time slot. Such information is essential for smartphone operations to improve user experience. By using the deep reinforcement learning framework DeepAPP, which learns from historical app usage data and adapts to time-varying behavior, the authors address the challenges of accurately capturing complex contextual environment and predicting a set of apps.

IEEE TRANSACTIONS ON MOBILE COMPUTING (2023)

Article Computer Science, Artificial Intelligence

Interactive reinforced feature selection with traverse strategy

Kunpeng Liu, Dongjie Wang, Wan Du, Dapeng Oliver Wu, Yanjie Fu

Summary: In this paper, a single-agent Monte Carlo-based reinforced feature selection method is proposed, along with two efficiency improvement strategies: early stopping strategy and reward-level interactive strategy. The proposed method aims to find the optimal feature subset for a given machine learning task by traversing the feature set and selecting features one by one. Additionally, the early stopping strategy and reward-level interactive strategy are introduced to enhance the training efficiency.

KNOWLEDGE AND INFORMATION SYSTEMS (2023)

Article Engineering, Electrical & Electronic

FaceFormer: Aggregating Global and Local Representation for Face Hallucination

Yuanzhi Wang, Tao Lu, Yanduo Zhang, Zhongyuan Wang, Junjun Jiang, Zixiang Xiong

Summary: In this paper, the authors propose a method called FaceFormer, which combines global features from Transformers and local features from CNNs to restore high-quality face images.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Computer Science, Artificial Intelligence

Multi-Scale Hybrid Fusion Network for Single Image Deraining

Kui Jiang, Zhongyuan Wang, Peng Yi, Chen Chen, Guangcheng Wang, Zhen Han, Junjun Jiang, Zixiang Xiong

Summary: This study focuses on addressing the problem of generating rain-free images under complex rain conditions using deep learning models. By designing a multi-level pyramid structure, non-local fusion module, attention fusion module, and residual learning branch to handle different challenges, the results demonstrate that our method achieves superior performance in generating rain-free images.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Interpretable Model-Driven Deep Network for Hyperspectral, Multispectral, and Panchromatic Image Fusion

Xin Tian, Kun Li, Wei Zhang, Zhongyuan Wang, Jiayi Ma

Summary: In this study, we propose an interpretable model-driven deep network called HMPNet for hyperspectral, multispectral, and panchromatic image fusion. The model utilizes a deep prior to describe the relationship between high-resolution hyperspectral and panchromatic images. By solving the optimization problem using the proximal gradient descent algorithm and unrolling the iterative steps into network modules, we obtain the HMPNet. The HMPNet simplifies parameter selection, achieves a balance between spatial and spectral qualities, and improves generalization capability with modules that have explainable physical meanings.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Joint Segmentation and Identification Feature Learning for Occlusion Face Recognition

Baojin Huang, Zhongyuan Wang, Kui Jiang, Qin Zou, Xin Tian, Tao Lu, Zhen Han

Summary: This study proposes a joint segmentation and identification feature learning framework for occlusion face recognition and solves the limitations of existing algorithms that focus on visible facial components. The method includes an occlusion prediction module and a channel refinement network to address this issue. The study also builds large-scale synthetic and real-world occlusion face datasets for experiments, and the results show significant improvements in face verification and face identification tasks.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Information Systems

CPU: Cross-Rack-Aware Pipelining Update for Erasure-Coded Storage

Haiqiao Wu, Wan Du, Peng Gong, Dapeng Oliver Wu

Summary: In this article, a parallel Cross-rack Pipelining Update scheme (CPU) is proposed to achieve efficient update in distributed storage systems (DSSs). By dividing the update information into small units and transmitting them in parallel along multiple racks, CPU reduces update time and improves performance. The optimization of update path and slice size in CPU contributes to its effectiveness in reducing average update time.

IEEE TRANSACTIONS ON CLOUD COMPUTING (2022)

Proceedings Paper Computer Science, Information Systems

DRLIC: Deep Reinforcement Learning for Irrigation Control

Xianzhong Ding, Wan Du

Summary: This paper presents a deep reinforcement learning-based irrigation system that improves irrigation efficiency by considering both current and future soil moisture loss. Through experiments, it is found that the system can save up to 9.52% of water.

2022 21ST ACM/IEEE INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS (IPSN 2022) (2022)

Proceedings Paper Computer Science, Information Systems

Poster Abstract: Smart Irrigation Control Using Deep Reinforcement Learning

Xianzhong Ding, Wan Du

Summary: This research proposes an IoT-based irrigation system that utilizes advanced control algorithms and sensing technology to improve irrigation efficiency and conserve water resources.

2022 21ST ACM/IEEE INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS (IPSN 2022) (2022)

Article Computer Science, Artificial Intelligence

Rethinking Prior-Guided Face Super-Resolution: A New Paradigm With Facial Component Prior

Tao Lu, Yuanzhi Wang, Yanduo Zhang, Junjun Jiang, Zhongyuan Wang, Zixiang Xiong

Summary: This article proposes a novel pre-prior guided approach that extracts facial prior information from high-resolution face images and embeds them into low-resolution ones, resulting in high-frequency information-rich low-resolution face images and improved face reconstruction performance.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Engineering, Electrical & Electronic

Few-Shot Semantic Segmentation via Frequency Guided Neural Network

Xiya Rao, Tao Lu, Zhongyuan Wang, Yanduo Zhang

Summary: In this paper, a frequency guided network (FGNet) is proposed for few-shot semantic segmentation. The FGNet explicitly models the semantic information of different frequencies and guides the semantic alignment of the object. Experimental results show that FGNet achieves superior performance compared to state-of-the-art methods.

IEEE SIGNAL PROCESSING LETTERS (2022)

Article Computer Science, Information Systems

Real-Time and Accurate UAV Pedestrian Detection for Social Distancing Monitoring in COVID-19 Pandemic

Zhenfeng Shao, Gui Cheng, Jiayi Ma, Zhongyuan Wang, Jiaming Wang, Deren Li

Summary: This article proposes a method for social distance monitoring using UAV, which utilizes a lightweight pedestrian detection network to detect pedestrians in real-time and calculate the social distance between them. Experimental results show that the proposed method outperforms traditional models on different datasets and enables real-time detection.

IEEE TRANSACTIONS ON MULTIMEDIA (2022)

暂无数据