Welcome to MLDA 2024

3rd International Conference on Machine Learning, NLP and Data Mining (MLDA 2024)

July 13 ~ 14, 2024, Virtual Conference



Accepted Papers
AI-Powered Solutions for Missing Data in Pipeline Risk Assessments

Syed JehanzebAdeel Haider, Enbridge Energy Inc.,Houston, USA

ABSTRACT

The use of Artificial Intelligence (AI) and Machine Learning (ML) in the oil and gas pipeline industry has shown significant promise, particularly in addressing challenges posed by incomplete datasets. This paper explores the application of AI in filling missing data for risk assessments, with a focus on safety-critical scenarios. Through a detailed process flow, this paper illustrates the potential pitfalls and risks associated with relying solely on AI-generated data. This paper also suggests strategies to balance AI reliance with real data acquisition, emphasizing the importance of consequence analysis, cost-benefit considerations, and a hybrid approach to ensure the safety and reliability of operations across the pipeline and broader oil and gas industry.

Keywords

Artificial Intelligence (AI), Machine Learning (ML), Risk Assessment, Pipeline Safety, ALARP


Detecting Droplets for Crop Spraying Systems Using Machine Learning

Debmalya Ray, India

ABSTRACT

Agricultural Development combined with technology has made great progress in recent years, making it possible to improve the yield for farmers. This project combines deep learning algorithms with spraying technology to design a machine vision precision real-time targeting spraying system for field scenarios. Highly efficient mechanized nozzles are used to spray and apply nutrients and pesticides to crops so that farmers can increase production and mitigate the gap between supplies and demands. We employ high-speed visualization [8] to quantify the impact and evaporation of a droplet on a solid surface. This will also help us to identify the density/area covered by a single spray at a time and correct the delta part left to be covered at first work. This paper is focused on using image classification techniques with a computer vision algorithm to extract the parameters required from a single image at a time and convert it into structured data so that an unsupervised algorithm can cluster the regions based on density.

Keywords

machine vision, image processing methods, unsupervised learning, droplets impact.


Web-based Automation Testing and Tools Leveraging AI and ML

Narendar Kumar Ale, University Of Cumberlands, United States of America

ABSTRACT

Software testing remains an essential phase of the software development lifecycle particularly for web-based applications. The integration of AI and ML automation testing has reached new heights in efficiency accuracy and coverage. This paper discusses the latest advancements in web automation testing tools that leverage AI and ML providing insights into their benefits and selection criteria.

Keywords

Automation Testing, AI, ML, Web Applications.


Robust Multi-modal Face Anti-spoofing: Handling Missing Modalities and Fusion Techniques

Zain Ul Abideen1, Muhammad Asim2,3, and Mohammed A. ELAffendi3, 1School of Computer Science and Engineering, Central South University, Changsha 410083, China, 2School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China, 3EIAS Data Science Lab, College of Computer and Information Sciences, and Center of Excellence in Quantum and Intelligent Computing, Prince Sultan University, Riyadh 11586, Saudi Arabia

ABSTRACT

Recent advancements in multi-modal learning have significantly enhanced face antispoofing systems. Despite these improvements, real-world applications often face the challenge of missing modalities from different imaging sensors. Previous studies have largely ignored this issue or have increased model complexity without effectively addressing it. This study presents a robust yet straightforward methodology utilizing a multi-modal face anti-spoofing architecture with spatial-temporal encoders and a dedicated fusion unit. The spatial-temporal encoders extract features from each modality using ResNet34 and Transformer architectures, while augmentation and regularization techniques further enhance model performance. Various fusion methods are assessed for their effectiveness in managing missing modalities. Additionally, we present FaceMAE, a modular autoencoder designed to predict and reconstruct missing-modalities. FaceMAE functions via a dual-phase process: encoding detected modalities to produce latent representations and subsequently decoding them to reconstruct missing modalities. Through the incorporation of transformer encoders and a flexible fusion module, FaceMAE enhances the ability to differentiate between live and spoof facial images. Evaluations on datasets such as CASIA-SURF, CASIA-SURF CeFA, and WMCA indicate that our method achieves competitive results.

Keywords

Artificial intelligence,Computer vision, multi-modal learning, face anti-spoofing, missing modality scenarios, face attack detection, Data augmentation, spatial-temporal encoders.


A Combine Cnn-rnn Based Approach for Augmenting the Performance of Speech Emotions Recognition

Afaq Ahmed, Muaaz Bin Kaleem, Department of Computer Science, Central South University, Changsha, China

ABSTRACT

Due to the advancement of neural networks and the increasing demand for accurate and real-time Speech Emotion Recognition (SER) in human-computer interactions, it is necessary to com-pare existing methods and databases in SER in order to arrive at feasible solutions and a complete understanding of this open-ended problem in SER. To detect and recognize the emotions expressed in speech, various techniques have been used in the literature, including well-established speech analysis and classification techniques. These techniques, including speech analysis and classification, have been used to extract emotions from signals. In this study, we propose a novel method for analysing signals called Wavelet-Scaled Spectrogram which com-bines the frequency and scale spectrum of a signal using wavelet transform. This method is effective in analysing signals at different scales and frequency content. In order to train models for speech emotion identification, a large number of handcrafted features and intermediary depictions i.e., frequency-time plot that have traditionally been utilized in data compilation, collection, and analysis. The development of end-to-end models which extract characteristics and learn directly from raw speech signals to improve speech recognition has recently been studied by re-searchers following the emergence of deep learning. After training and evaluation on the famous speech databases EmoDB, RAVDESS and IEMOCAP, the proposed model is evaluated on various speakers in both speaker-independent and speaker-dependent modes and on a variety of different voices. When advanced preprocessing techniques or data augmentation are omitted from the proposed architecture, the results demonstrate that it can produce products comparable to those produced by the current state of the art. Three concurrent CNN pipelines and a series of modified local features learning blocks (LFLBs) achieved the highest classification accuracy attainable.

Keywords

Artificial Intelligence, Network Intrusion Detection Systems, Machine Learning, Ensemble Models, Cybersecurity, Feature Selection.


Enhancing Sentiment Analysis for Low-resource Pashto Language: a Bert-infused Lstm Framework

Abdul Hamid Azizi1, Muhammad Asim2, and Mudasir Ahmad Wani3, 1School of Computer Science and Engineering, Central South University, Changsha 410083 P.R. China, 2EIAS Data Science Lab, College of Computer and Information Sciences, Prince Sultan University, Riyadh 11586, Saudi Arabia, 3School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China

ABSTRACT

In the domain of Natural Language Processing (NLP), sentiment analysis plays a pivotal role, aiding in applications ranging from user feedback interpretation to customer sentiment tracking. However, the focus of sentiment analysis has predominantly centered on major languages such as English, side-lining the requirements of less commonly used languages. In this context, low-resource languages like Pashto encounter a considerable deficiency in sentiment analysis tools and resources, limiting their potential in the field of NLP. To address this critical gap, the proposed work is focused on Pashto sentiment analysis, a low-resource language, and utilizes Romanized Pashto data collected from Twitter using Tweepy, a Python library for accessing the Twitter API. To enhance the data quality, a sequence of pre-processing steps are implemented, encompassing the elimination of extraneous information, stemming, and vectorization. Additionally, in light of the absence of a Pashto language-specific stop words list and stemming dictionary, previous research endeavours have faced limitations. In response, this study also proposed a proactive approach by crafting a comprehensive list of Pashto stop words and constructing a stemming dictionary called PasLex (Pashto Lexicon) with the guidance of domain experts. Using BERT as a tokenizer and combining it with Long Short-Term Memory (LSTM) network leads to improved sentiment detection. We also check the validity of the proposed model by combining BERT with Support Vector Machine (SVM) for sentiment classification. In both cases the proposed approach has outperformed the state-of-the-art methods by achieving highest accuracy, precision, recall, and F1-measure of 92.33%, 92.5%, 92.33%, and 92.32%, respectively.

Keywords

Sentiment Analysis, Low Resource Languages, Pashto, BERT, LSTM, SVM.