4.4 Article

Seoul bike trip duration prediction using data mining techniques

Journal

IET INTELLIGENT TRANSPORT SYSTEMS
Volume 14, Issue 11, Pages 1465-1474

Publisher

WILEY
DOI: 10.1049/iet-its.2019.0796

Keywords

data mining; feature extraction; mean square error methods; regression analysis; traffic information systems; intelligent transportation systems; random forests; nearest neighbour methods; Seoul bike trip duration prediction; data mining techniques; trip distance; Seoul bike data; Seoul bike sharing system; intelligent transport systems; traveller information systems; trip-time prediction; rental bikes; feature engineering; statistical models; linear regression; gradient boosting machines; k nearest neighbour; Random Forest; root mean squared error; coefficient of variance; mean absolute error; median absolute error

Funding

  1. National Research Foundation of Korea [5199990214660] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Trip duration is the most fundamental measure in all modes of transportation. Hence, it is crucial to predict trip time precisely for the advancement of Intelligent Transport Systems and traveller information systems. In this study, data mining techniques are employed to predict the trip duration of rental bikes in the Seoul Bike sharing system. The prediction is carried out by combining Seoul Bike data with weather data. The data used include trip duration, trip distance, pickup and dropoff latitude and longitude, temperature, precipitation, wind speed, humidity, solar radiation, snowfall, ground temperature and 1-hour average dust concentration. Feature engineering is done to extract additional features from the data. Four statistical models are used to predict the trip duration: (a) linear regression, (b) gradient boosting machines, (c) k nearest neighbour and (d) Random Forest (RF). Four performance metrics, root mean squared error, coefficient of variance, mean absolute error and median absolute error, are used to determine the efficiency of the models. In comparison with the other models, the best model, RF, explains 93% of the variance (R^2) in the testing set and 98% in the training set. The outcome shows that RF is effective for the prediction of trip duration.
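
For readers unfamiliar with the evaluation metrics named in the abstract, the following is a minimal sketch of how they are commonly computed. It is not the authors' code. The function names and the sample trip durations are hypothetical, and the coefficient of variance is assumed here to mean RMSE divided by the mean observed value, expressed as a percentage; the paper's exact definition may differ.

```python
import math
from statistics import mean, median


def rmse(actual, predicted):
    """Root mean squared error."""
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(mean(squared_errors))


def mae(actual, predicted):
    """Mean absolute error."""
    return mean([abs(a - p) for a, p in zip(actual, predicted)])


def medae(actual, predicted):
    """Median absolute error."""
    return median([abs(a - p) for a, p in zip(actual, predicted)])


def cv_percent(actual, predicted):
    """Coefficient of variance, ASSUMED to be 100 * RMSE / mean(actual)."""
    return 100.0 * rmse(actual, predicted) / mean(actual)


def r_squared(actual, predicted):
    """Share of variance in the observed values explained by the predictions."""
    mean_actual = mean(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


# Hypothetical trip durations in minutes (illustrative values, not from the paper).
actual_minutes = [10.0, 20.0, 30.0, 40.0]
predicted_minutes = [12.0, 18.0, 33.0, 39.0]

print("RMSE :", rmse(actual_minutes, predicted_minutes))
print("MAE  :", mae(actual_minutes, predicted_minutes))
print("MedAE:", medae(actual_minutes, predicted_minutes))
print("CV % :", cv_percent(actual_minutes, predicted_minutes))
print("R^2  :", r_squared(actual_minutes, predicted_minutes))
```

Computing R^2 separately on the training and testing sets, as the abstract reports (98% versus 93%), gives a rough sense of how much a model overfits.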

Recommended

Article Computer Science, Information Systems

Multiple Types of Cancer Classification Using CT/MRI Images Based on Learning Without Forgetting Powered Deep Learning Models

Malliga Subramanian, Jaehyuk Cho, Veerappampalayam Easwaramoorthy Sathishkumar, Obuli Sai Naren

Summary: Cancer is the second leading cause of death globally, with one in six deaths attributed to it. Early detection improves the chances of survival, and the use of Artificial Intelligence (AI) for automated cancer detection can help evaluate more cases in less time.

IEEE ACCESS (2023)

Article Management

Information Retrieval and Optimization in Distribution and Logistics Management Using Deep Reinforcement Learning

Li Yang, V. E. Sathishkumar, Adhiyaman Manickam

Summary: This paper proposes an integrated deep reinforcement learning-based logistics management model (DELLMM) to increase and optimize logistic distribution. An optimization approach can be used in inventory and price control applications. The research methodology gives the fundamentals of information retrieval and the scope of blockchain integration. The experimental results show that DELLMM improves logistics management and optimizes distribution compared to other methods, with the highest operability of 94.35%, latency reduction of 97.12%, efficiency of 98.01%, trust enhancement of 96.37%, and sustainability of 97.80%.

INTERNATIONAL JOURNAL OF INFORMATION SYSTEMS AND SUPPLY CHAIN MANAGEMENT (2023)

Article Computer Science, Artificial Intelligence

A Gradient Boosted Decision Tree-Based Influencer Prediction in Social Network Analysis

Neelakandan Subramani, Sathishkumar Veerappampalayam Easwaramoorthy, Prakash Mohan, Malliga Subramanian, Velmurugan Sambath

Summary: Twitter, Instagram and Facebook are rapidly expanding and reporting daily news, social activities and actual occurrences. Social network analysis (SNA) research faces ethical challenges due to technology advances and increasing ethics regulation. This study investigates how influencer content generates interactions and develops a framework for identifying users with the ability to influence others.

BIG DATA AND COGNITIVE COMPUTING (2023)

Article Multidisciplinary Sciences

Learning without forgetting by leveraging transfer learning for detecting COVID-19 infection from CT images

Malliga Subramanian, Veerappampalayam Easwaramoorthy Sathishkumar, Jaehyuk Cho, Kogilavani Shanmugavadivel

Summary: COVID-19, a global pandemic, has a high mortality rate. CT scans can assist in diagnosing and monitoring the disease, but visual inspection is time-consuming. This study utilizes a Convolution Neural Network (CNN) with transfer learning and integrates Learning without Forgetting (LwF) to enhance the model's generalization capabilities. The wide ResNet model with LwF method achieves superior performance in classifying original and delta-variant datasets.

SCIENTIFIC REPORTS (2023)

Article Ecology

Forest fire and smoke detection using deep learning-based learning without forgetting

Veerappampalayam Easwaramoorthy Sathishkumar, Jaehyuk Cho, Malliga Subramanian, Obuli Sai Naren

Summary: This study investigates fire/smoke detection from images using AI-based computer vision techniques. Transfer learning is implemented on pre-trained models to reduce training time and complexity. The Xception model performs well with LwF and achieves high accuracy on both the new and original datasets.

FIRE ECOLOGY (2023)

Article Engineering, Multidisciplinary

Machine learning algorithms to predict the catalytic reduction performance of eco-toxic nitrophenols and azo dyes contaminants (Invited Article)

V. E. Sathishkumar, A. G. Ramu, Jaehyuk Cho

Summary: This research study utilized machine learning techniques to effectively remove hazardous substances like azo dyes and nitrophenols from drinking water using the catalyst PdO-NiO. The results showed that the XGB algorithm performed best with 4-NP and DNP, the RF algorithm performed best with TNP, MB, and RHB, and the SVM algorithm performed best with MO.

ALEXANDRIA ENGINEERING JOURNAL (2023)

Article Engineering, Multidisciplinary

Phonetic-based Forward Online Transliteration Tool from English to Tamil Language

S. Anbukkarasi, D. Elangovan, Jayalakshmi Periyasamy, V. E. Sathishkumar, S. Sree Dharinya, M. Sandeep Kumar, J. Prabhu

Summary: Transliteration is the process of mapping the characters of one language to those of another language based on phonetics. This process is particularly important in India, where people speak a variety of languages and may struggle to read different scripts. Transliteration plays a crucial role in various Natural Language Processing applications, including information retrieval, machine translation, and speech recognition. While transliteration works have been carried out in languages like Japanese, Chinese, and English, there is limited research on Indian languages, especially Tamil. This paper focuses on the transliteration of Unicode Tamil characters using a phonetics-based forward list processing method, which shows promising results.

INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING (2023)

Review Cell Biology

Advancements in computer-assisted diagnosis of Alzheimer's disease: A comprehensive survey of neuroimaging methods and AI techniques for early detection

Kogilavani Shanmugavadivel, V. E. Sathishkumar, Jaehyuk Cho, Malliga Subramanian

Summary: This article reviews methods and techniques for early detection of Alzheimer's Disease and provides a comprehensive analysis of AD diagnosis datasets. The research findings are important for improving the accuracy of Alzheimer's Disease detection.

AGEING RESEARCH REVIEWS (2023)

Article Green & Sustainable Science & Technology

Enhancing Sustainable Transportation: AI-Driven Bike Demand Forecasting in Smart Cities

Malliga Subramanian, Jaehyuk Cho, Sathishkumar Veerappampalayam Easwaramoorthy, Akash Murugesan, Ramya Chinnasamy

Summary: This study compares the performance of various time series and machine learning algorithms for predicting bike demand and finds that the GRU algorithm performs best. ARIMA and SARIMA models produce less accurate predictions, likely due to their assumptions of linearity and stationarity in the data.

SUSTAINABILITY (2023)

Article Multidisciplinary Sciences

Numerical and Machine Learning Approach for Fe3O4-Au/Blood Hybrid Nanofluid Flow in a Melting/Non-Melting Heat Transfer Surface with Entropy Generation

Shaik Jakeer, Sathishkumar Veerappampalayam Easwaramoorthy, Seethi Reddy Reddisekhar Reddy, Hayath Thameem Basha

Summary: This study presents a novel implementation of an intelligent numerical computing solver using an MLP feed-forward backpropagation ANN and the Levenberg-Marquardt algorithm to interpret the Cattaneo-Christov heat flux model. The effect of entropy production and melting heat transfer on the ferrohydrodynamic flow of the Fe3O4-Au/blood Powell-Eyring hybrid nanofluid is demonstrated. The artificial neural network model is used for data selection, network construction, training, and evaluation, with various physical factors impacting variables such as velocity, temperature, entropy generation, friction coefficient, and heat transfer rate.

SYMMETRY-BASEL (2023)

Article Engineering, Multidisciplinary

Exploring the finite-time dissipativity of Markovian jump delayed neural networks

V. E. Sathishkumar, R. Vadivel, Jaehyuk Cho, Nallappan Gunasekaran

Summary: In this paper, the finite-time dissipativity analysis of Markovian jump-delayed neural networks (MJDNNs) is studied. Less conservative results for extended dissipativity conditions are established for delayed MJDNNs. An appropriate Lyapunov-Krasovskii functional (LKF) with a novel inequality, the composite slack-matrix-based integral inequality (CSMBII), is used to achieve this. Sufficient conditions including CSMBII are employed to derive a delay-dependent finite-time dissipativity condition in terms of linear matrix inequalities (LMIs), which are used to formulate the finite dissipativity condition for the delayed MJDNNs. Numerical examples confirm the utility of the suggested approach, including a real-world application of the benchmark problem associated with the designed MJDNNs.

ALEXANDRIA ENGINEERING JOURNAL (2023)

Article Mathematics

Exploring the Influence of Induced Magnetic Fields and Double-Diffusive Convection on Carreau Nanofluid Flow through Diverse Geometries: A Comparative Study Using Numerical and ANN Approaches

Shaik Jakeer, Seethi Reddy Reddisekhar Reddy, Sathishkumar Veerappampalayam Easwaramoorthy, Hayath Thameem Basha, Jaehyuk Cho

Summary: This study investigates the importance of induced magnetic fields and double-diffusive convection in the radiative flow of Carreau nanofluid through three different geometries. The fluid transport equations were simplified using self-similarity variables and solved using the Runge-Kutta-Fehlberg method. The study demonstrates how various dynamic factors influence the fluid's transport characteristics through graphical representations.

MATHEMATICS (2023)

Article Computer Science, Information Systems

Enhanced Feature Model Based Hybrid Neural Network for Text Detection on Signboard, Billboard and News Tickers

S. Anbukkarasi, Veerappampalayam Easwaramoorthy Sathishkumar, C. R. Dhivyaa, Jaehyuk Cho

Summary: Recognizing text from natural scene images and videos is a challenging task due to their complex features and variations. However, text recognition is highly useful in various applications. This research paper proposes a model that combines Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) to successfully detect and recognize text characters in natural images. The experimental results demonstrate that the proposed model outperforms other methods on multiple datasets.

IEEE ACCESS (2023)

Article Mathematics, Applied

Fuzzy adaptive learning control network (FALCN) for image clustering and content-based image retrieval on noisy dataset

S. Neelakandan, Sathishkumar Veerappampalayam Easwaramoorthy, A. Chinnasamy, Jaehyuk Cho

Summary: It has been shown that fuzzy systems are useful for classification and regression, but they are mostly used in controlled environments. An image clustering technique using color, texture, and shape information is developed for content-based picture retrieval in large image datasets. The challenge of labeling a large number of photos is addressed by using unsupervised learning, specifically the K-means clustering algorithm. In comparison to fuzzy c-means clustering, K-means clustering has better performance in lower-dimensional space resilience and initialization resistance. The dominant triple HSV color space is a perceptual color space composed of saturation (S), hue (H), and value (V), which are closely related to human color perception. A deep learning technique called RBNN is built using Gaussian function, fuzzy adaptive learning control network (FALCN), clustering, and radial basis neural network to achieve image segmentation and feature extraction. The suggested FALCN fuzzy system is excellent at clustering images and extracting image properties. Traditional fuzzy network systems tend to have redundant output neurons when receiving noisy input. Finally, random convolutional weights are used to extract features from unlabeled data.

AIMS MATHEMATICS (2023)

Article Engineering, Multidisciplinary

MRMR-EHO-Based Feature Selection Algorithm for Regression Modelling

V. E. Sathishkumar, Yongyun Cho

Summary: In classical regression theory, fitting a single function model to a data set is a complex and unreliable process in complex and noisy domains. To overcome these difficulties, piecewise regression models and proper feature selection are proposed in this paper. The hybridization of Elephant Herding Optimization (EHO) and minimum Redundancy and Maximum Relevance (mRMR) is used for feature selection to improve regression problems. The results demonstrate the effectiveness of CUBIST and mRMR-EHO feature selection in various datasets and indicate that it can be used as an effective tool for predictive data modeling.

TEHNICKI VJESNIK-TECHNICAL GAZETTE (2023)
