NISL Researcher

Hamid Bostani

Postdoctoral Researcher in Machine Learning Security
Security, Reasoning and Validation (SerVal) Research Group
Interdisciplinary Centre for Security, Reliability and Trust (SnT)
University of Luxembourg

I am a well-organized, hard-working, and ambitious person who fights to achieve his dreams. I believe that my strong enthusiasm and problem-solving abilities, together with my academic and technical experience, will enable me to achieve a great deal in computer science.

About me

I am a Postdoctoral Researcher at the University of Luxembourg, where I work with Dr. Maxime Cordy in the SerVal group at SnT. My research lies at the intersection of artificial intelligence and cybersecurity, with a particular focus on trustworthy AI and adversarial machine learning.

I received my PhD in Computer Science from Radboud University, The Netherlands, under the supervision of Prof. Veelasha Moonsamy and Dr. Erik Poll. During my doctoral studies, I was a visiting researcher at King’s College London and University College London in the UK, where I collaborated with Dr. Fabio Pierazzi and Prof. Lorenzo Cavallaro.

My current research explores the security and trustworthiness of machine learning systems, particularly in the context of malware detection.

Honors & Awards

3rd Prize, VERSEN PhD Thesis Awards, SEN, The Netherlands (2026)

Awarded a Fully-Funded Postdoctoral Position, University of Luxembourg, Luxembourg (2025)

Awarded a Fully-Funded PhD Position, Radboud University, The Netherlands (2020)

Best Employee Award, National Organization of Educational Testing, Iran (2019)

Research Funding, Iran National Science Foundation, Iran (2018)

Outstanding Researcher Award, South Tehran Branch, Islamic Azad University, Iran (2017)

Outstanding Thesis Award, The 5th Research, Scientific & Technological National Festival of Islamic Azad University, Iran (2017)

Outstanding Paper Award, ICSPIS'2016, Amirkabir University of Technology, Iran (2016)

Selected Paper, IST'2016, Iran Telecommunication Research Center, Iran (2016)

News

1 May 2026, My PhD thesis was awarded 3rd Prize in the VERSEN PhD Thesis Awards 2026, a national award recognizing it as one of the best PhD theses in Software Engineering in the Netherlands.

9 September 2025, I successfully defended my PhD thesis and was awarded a PhD degree by Radboud University, Nijmegen, The Netherlands. You can refer to my public defense to watch the defense and graduation.

1 August 2025, I will start my new job at the Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, as a Postdoctoral Researcher in Machine Learning Security.

1 July 2025, I have joined the program committee for the 18th ACM Workshop on Artificial Intelligence and Security (AISec 2025).

17 June 2025, My paper titled "Beyond Learning Algorithms: The Crucial Role of Data in Robust Malware Detection," has been published by the top-tier magazine IEEE Security & Privacy.

12 May 2025, My PhD dissertation has been approved by the Manuscript Committee and is scheduled for defense on September 9, 2025, at 2:30 PM in the Auditorium at Radboud University.

1 March 2025, My PhD dissertation, titled "Rethinking the Security of Machine Learning in Malware Detection," has been submitted at Radboud University and approved by my supervisors.

2 December 2024, My paper titled "Level Up with ML Vulnerability Identification: Leveraging Domain Constraints in Feature Space for Robust Android Malware Detection," has been accepted by the top-tier journal ACM Transactions on Privacy and Security.

20 September 2024, I attended the ESORICS International Workshop on Security and Artificial Intelligence 2024 in Bydgoszcz, Poland, to present my paper titled "Improving Adversarial Robustness in Android Malware Detection by Reducing the Impact of Spurious Correlations".

20 July 2024, My paper titled "Improving Adversarial Robustness in Android Malware Detection by Reducing the Impact of Spurious Correlations," has been accepted for presentation at the SECAI 2024 Workshop of the 29th European Symposium on Research in Computer Security (ESORICS 2024).

6 June 2024, I have joined the program committee for the 17th ACM Workshop on Artificial Intelligence and Security (AISec 2024).

20 December 2023, My manuscript titled "EvadeDroid: A practical evasion attack on machine learning for black-box Android malware detection," has been accepted by the top-tier journal Computers & Security.

28 October 2023, The paper I co-authored, titled "Targeted and Troublesome: Tracking and Advertising on Children's Websites," has been accepted at the 45th IEEE Symposium on Security and Privacy (IEEE S&P'24).

2 October 2023, I will visit King's College London and University College London for 6 months and work on Adversarial Training under the supervision of Dr. Fabio Pierazzi and Prof. Lorenzo Cavallaro.

20 April 2023, I attended ICT.OPEN2023 in Utrecht, The Netherlands, to present my poster titled "Improving Robustness of Machine Learning-based Android Malware Detection against Realizable Adversarial Examples".

1 August 2022, I attended the Summer School on Privacy-Preserving Machine Learning (PPML) organized by the Department of Computer Science at ITU Copenhagen, Denmark.

14 January 2022, My book chapter titled "Hybrid and modified OPFs for intrusion detection systems and large-scale problems," has been published as Chapter 5 of Elsevier's book, Optimum-Path Forest: Theory, Algorithms, and Applications.

1 October 2020, I have been employed as a PhD Candidate in the Digital Security Group of the Institute for Computing and Information Sciences, Radboud University, The Netherlands.

23 August 2020, My paper titled "A Strong Coreset Algorithm to Accelerate OPF as a Graph-based Machine Learning in Large-Scale Problems," has been accepted by the top-tier journal Information Sciences with a minor revision.

Projects

Rethinking the Security of Machine Learning in Malware Detection
October 2020 - October 2024

Over the past decade, machine learning has emerged as a powerful tool for detecting malware by leveraging both static and dynamic analysis techniques. However, the robustness of these systems is increasingly threatened by adversarial evasion attacks, where malware manipulates its features to evade detection while maintaining its properties, such as malicious functionality. This poses significant challenges to the reliability of machine learning-based malware detection. This project, which is my PhD research, rethinks the security of such systems by exploring practical solutions through realistic threat models and reliable defense strategies. By addressing the limitations of existing approaches that rely on simplistic or impractical assumptions, this work aims to provide a more practical perspective for building robust machine learning-based malware detection systems.

Members: Hamid Bostani Research Assistant of Computer Science

Veelasha Moonsamy Assistant Professor in Digital Security group at Radboud University

Associate Professor in Digital Security group at Radboud University

Developing a New Generation of Optimum-path Forest (OPF) as a Graph-based Machine Learning in order to Achieve an Efficient Pattern Recognition Tool for Using on Massive Datasets
March 2017 - November 2019

Over the past decades, machine learning, as the most efficient tool for pattern recognition, has been the subject of many studies. Optimum-path forest (OPF) is an outstanding graph-based machine learning method that reduces pattern recognition problems to the partitioning of graphs derived from the input data sets. As a natural multi-classifier, OPF is fast, simple, and parameter-independent, and it supports partial overlapping among the classes. However, it is effective only for input data sets of a reasonable size. Keeping a small sketch (or synopsis) that approximates the original data appears to be a suitable remedy for the high computational complexity of OPF over massive data sets. A coreset is a special data summarization method that can be used to hold such a small sketch. Therefore, to cope with the high computational complexity of OPF over massive data sets, this project applies the idea of coresets in the context of OPF to provide a scalable OPF. This work is supported in part by Iran National Science Foundation grant No. 96010151.

Members: Hamid Bostani Research Assistant of Computer Science

Mansour Sheikhan Full Professor of Communication Engineering

Behrad Mahboobi Assistant Professor of Communication Engineering

Intrusion Detection and Identification of Attacks on the Internet of Things (IoT) Using a Combination of Machine Learning Methods
May 2014 - September 2015

The Internet of Things (IoT) is a worldwide network that includes all identifiable heterogeneous objects around us, such as smartphones, laptops, or smart sensors, that can connect to the Internet by using a wide range of technologies. IoT is able to provide accessibility to the Internet for all physical objects since it is a hybrid network of the Internet and diverse networks with heterogeneous nodes. Generally, due to the insecure nature of the Internet as well as Wireless Sensor Networks, which are the main components of IoT, implementing security mechanisms in IoT seems necessary. To deal with intrusions that may occur in IoT, this thesis proposes a novel multi-faceted intrusion detection system that can detect both cyber-attacks and insider attacks on IoT. This project is my master's thesis, which was completed under the supervision of Professor Mansour Sheikhan.

Members: Hamid Bostani Research Assistant of Computer Science

Mansour Sheikhan Full Professor of Communication Engineering

Publications

Pre-prints

On the Effectiveness of Adversarial Training on Malware Classifiers

H. Bostani, J. Cortellazzi, D. Arp, F. Pierazzi, V. Moonsamy and L. Cavallaro

Under peer review (2024).

https://arxiv.org/abs/2412.18218

Adversarial Training (AT) has been widely applied to harden learning-based classifiers against adversarial evasive attacks. However, its effectiveness in identifying and strengthening vulnerable areas of the model's decision space while maintaining high performance on clean data of malware classifiers remains an under-explored area. In this context, the robustness that AT achieves has often been assessed against unrealistic or weak adversarial attacks, which negatively affect performance on clean data and are arguably no longer threats. Previous work seems to suggest robustness is a task-dependent property of AT. We instead argue it is a more complex problem that requires exploring AT and the intertwined roles played by certain factors within data, feature representations, classifiers, and robust optimization settings, as well as proper evaluation factors, such as the realism of evasion attacks, to gain a true sense of AT's effectiveness. In our paper, we address this gap by systematically exploring the role such factors have in hardening malware classifiers through AT. Contrary to recent prior work, a key observation of our research and extensive experiments confirm the hypotheses that all such factors influence the actual effectiveness of AT, as demonstrated by the varying degrees of success from our empirical analysis. We identify five evaluation pitfalls that affect state-of-the-art studies and summarize our insights in ten takeaways to draw promising research directions toward better understanding the factors' settings under which adversarial training works at best.

2025

Beyond Learning Algorithms: The Crucial Role of Data in Robust Malware Detection

H. Bostani and V. Moonsamy

IEEE Security & Privacy Magazine on 5 March 2025

DOI: 10.1109/MSEC.2025.3550686

In the battle against evolving cyber threats, robust malware detection with machine learning requires more than just advanced algorithms—it demands high-quality data. This article emphasizes data quality's role and how leveraging coresets—constructed from key samples that preserve the datasets’ properties—to enrich training sets can enhance malware classifiers’ resilience.

Level Up with ML Vulnerability Identification: Leveraging Domain Constraints in Feature Space for Robust Android Malware Detection

H. Bostani, Z. Zhao, Z. Liu, and V. Moonsamy

ACM Transactions on Privacy and Security (TOPS), vol. 28, No. 2, pp. 1–32, 2025

DOI: 10.1145/3711899

Machine Learning (ML) promises to enhance the efficacy of Android Malware Detection (AMD); however, ML models are vulnerable to realistic evasion attacks—crafting realizable Adversarial Examples (AEs) that satisfy Android malware domain constraints. To eliminate ML vulnerabilities, defenders aim to identify susceptible regions in the feature space where ML models are prone to deception. The primary approach to identifying vulnerable regions involves investigating realizable AEs, but generating these feasible apps poses a challenge. For instance, previous work has relied on generating either feature-space norm-bounded AEs or problem-space realizable AEs in adversarial hardening. The former is efficient but lacks full coverage of vulnerable regions while the latter can uncover these regions by satisfying domain constraints but is known to be time-consuming. To address these limitations, we propose an approach to facilitate the identification of vulnerable regions. Specifically, we introduce a new interpretation of Android domain constraints in the feature space, followed by a novel technique that learns them. Our empirical evaluations across various evasion attacks indicate effective detection of AEs using learned domain constraints, with an average of 89.6%. Furthermore, extensive experiments on different Android malware detectors demonstrate that utilizing our learned domain constraints in Adversarial Training (AT) outperforms other AT-based defenses that rely on norm-bounded AEs or state-of-the-art non-uniform perturbations. Finally, we show that retraining a malware detector with a wide variety of feature-space realizable AEs results in a 77.9% robustness improvement against realizable AEs generated by unknown problem-space transformations, with up to 70x faster training than using problem-space realizable AEs.

2024

Improving Adversarial Robustness in Android Malware Detection by Reducing the Impact of Spurious Correlations

Hamid Bostani, Zhengyu Zhao, and Veelasha Moonsamy

ESORICS 2024 International Workshops

https://arxiv.org/abs/2408.16025

Machine learning (ML) has demonstrated significant advancements in Android malware detection (AMD); however, the resilience of ML against realistic evasion attacks remains a major obstacle for AMD. One of the primary factors contributing to this challenge is the scarcity of reliable generalizations. Malware classifiers with limited generalizability tend to overfit spurious correlations derived from biased features. Consequently, adversarial examples (AEs), generated by evasion attacks, can modify these features to evade detection. In this study, we propose a domain adaptation technique to improve the generalizability of AMD by aligning the distribution of malware samples and AEs. Specifically, we utilize meaningful feature dependencies, reflecting domain constraints in the feature space, to establish a robust feature space. Training on the proposed robust feature space enables malware classifiers to learn from predefined patterns associated with app functionality rather than from individual features. This approach helps mitigate spurious correlations inherent in the initial feature space. Our experiments conducted on DREBIN, a renowned Android malware detector, demonstrate that our approach surpasses the state-of-the-art defense, Sec-SVM, when facing realistic evasion attacks. In particular, our defense can improve adversarial robustness by up to 55% against realistic evasion attacks compared to Sec-SVM.

EvadeDroid: A practical evasion attack on machine learning for black-box Android malware detection

H. Bostani and V. Moonsamy

Computers & Security, vol. 139, pp. 1–18, 2024

DOI: 10.1016/j.cose.2023.103676

Over the last decade, researchers have extensively explored the vulnerabilities of Android malware detectors to adversarial examples through the development of evasion attacks; however, the practicality of these attacks in real-world scenarios remains arguable. The majority of studies have assumed attackers know the details of the target classifiers used for malware detection, while in reality, malicious actors have limited access to the target classifiers. This paper introduces EvadeDroid, a problem-space adversarial attack designed to effectively evade black-box Android malware detectors in real-world scenarios. EvadeDroid constructs a collection of problem-space transformations derived from benign donors that share opcode-level similarity with malware apps by leveraging an n-gram-based approach. These transformations are then used to morph malware instances into benign ones via an iterative and incremental manipulation strategy. The proposed manipulation technique is a query-efficient optimization algorithm that can find and inject optimal sequences of transformations into malware apps. Our empirical evaluations, carried out on 1K malware apps, demonstrate the effectiveness of our approach in generating real-world adversarial examples in both soft- and hard-label settings. Our findings reveal that EvadeDroid can effectively deceive diverse malware detectors that utilize different features with various feature types. Specifically, EvadeDroid achieves evasion rates of 80%–95% against DREBIN, Sec-SVM, ADE-MA, MaMaDroid, and Opcode-SVM with only 1–9 queries. Furthermore, we show that the proposed problem-space adversarial attack is able to preserve its stealthiness against five popular commercial antiviruses with an average of 79% evasion rate, thus demonstrating its feasibility in the real world.

2023

Targeted and Troublesome: Tracking and Advertising on Children's Websites

Z. Moti, A. Senol, H. Bostani, F. Zuiderveen Borgesius, V. Moonsamy, A. Mathur, and G. Acar

Proceedings of the 45th IEEE Symposium on Security and Privacy (IEEE S&P 2024)

DOI: 10.1109/SP54263.2024.00118

On the modern web, trackers and advertisers frequently construct and monetize users’ detailed behavioral profiles without consent. Despite various studies on web tracking mechanisms and advertisements, there has been no rigorous study focusing on websites targeted at children. To address this gap, we present a measurement of tracking and (targeted) advertising on websites directed at children. Motivated by the lack of a comprehensive list of child-directed (i.e., targeted at children) websites, we first build a multilingual classifier based on web page titles and descriptions. Applying this classifier to over two million pages from the Common Crawl dataset, we compile a list of two thousand child-directed websites. Crawling these sites from five vantage points, we measure the prevalence of trackers, fingerprinting scripts, and advertisements. Our crawler detects ads displayed on child-directed websites and determines if ad targeting is enabled by scraping ad disclosure pages whenever available. Our results show that around 90% of child-directed websites embed one or more trackers, and about 27% contain targeted advertisements—a practice that should require verifiable parental consent. Next, we identify improper ads on child-directed websites by developing an ML pipeline that processes both images and text extracted from ads. The pipeline allows us to run semantic similarity queries for arbitrary search terms, revealing ads that promote services related to dating, weight loss, and mental health, as well as ads for sex toys and flirting chat services. Some of these ads feature repulsive, sexually-explicit and highly-inappropriate imagery. In summary, our findings indicate a trend of non-compliance with privacy regulations and troubling ad safety practices among many advertisers and child-directed websites. To ensure the protection of children and create a safer online environment, regulators and stakeholders must adopt and enforce more stringent measures.

2022

Chapter 5 - Hybrid and modified OPFs for intrusion detection systems and large-scale problems

M. Sheikhan, H. Bostani

In Optimum-Path Forest, pp. 109-136, Academic Press, Elsevier, 2022.

DOI: 10.1016/B978-0-12-822688-9.00013-X

In this chapter, in order to show the efficiency of OPF in the intrusion detection systems (IDSs) and also in the large-scale problems, we introduce five hybrid and modified OPFs as follows: (a) a modified OPF using unsupervised learning and social network concept; (b) a hybrid IDS using unsupervised OPF based on MapReduce approach; (c) a hybrid IDS using a modified OPF (MOPF) and selected input features; (d) a modified OPF using Markov cluster process algorithm; and (e) a modified OPF based on the coreset concept. Furthermore, the MOPF-based IDS is improved in the last section as a contribution by using an outperformed clustering algorithm.

2020

A Strong Coreset Algorithm to Accelerate OPF as a Graph-based Machine Learning in Large-Scale Problems

H. Bostani, M. Sheikhan, B. Mahboobi

Information Sciences, vol. 555, pp. 424–441, 2021

DOI: 10.1016/j.ins.2020.10.009

Optimum-path forest (OPF) is one of the efficient graph-based frameworks that can determine the patterns of input dataset by extracting the optimal partitions of graph obtained through encoding data into a graph. Since OPF was introduced based on simple assumptions without considering the requirements of large-scale problems, this machine learning is an effective algorithm only for a reasonable size of input datasets. To provide a scalable OPF, this study introduces a strong coreset for accelerating OPF algorithm. Applying this approach can expedite OPF procedure, especially when it is working on massive datasets. Accordingly, a novel algebra is developed to represent the problem of OPF as an optimization problem for the proposed coreset definition. A novel coreset construction algorithm that can approximate the OPF solutions is subsequently proposed in order to improve the OPF construction speed. The simulation results of diverse experiments on various benchmark datasets illustrate computation gain and superiority of the proposed algorithm in terms of the construction and classification speeds as compared to the original algorithm while displaying reliably accurate performance. The presented coreset construction algorithm performs the training and testing phases of OPF up to 6.1 and 4.9 times faster than before, respectively.

2017

Developing a Fast Supervised Optimum-path Forest Based on Coreset

H. Bostani, M. Sheikhan, B. Mahboobi

In Proc. 19th International Symposium on Artificial Intelligence and Signal Processing (AISP’2017), pp. 172-177, 2017

DOI: 10.1109/AISP.2017.8324076

Optimum-path forest (OPF) is an effective graph-based machine learning that simplifies the pattern recognition problems into the partitioning the corresponding derived graphs of the input datasets. The amounts of the samples in the input datasets and, consequently the size of the node set of their corresponding derived graphs has a major effect on the speed of OPF. In this study a novel version of OPF is introduced which utilizes coreset approach for reducing the scale of the input dataset. From the aspect of the computational geometry, coreset is a small set of points that includes the best representative points of the original point set with regard to a geometric objective function. Our method finds the most informative vertices (samples) by proposing a novel incremental coreset construction algorithm. The experimental results of the proposed method reduces the input data samples, and the execution times of the construction and the classification phases of OPF by 80%, 60%, and 12%, respectively, in contrast to the traditional OPF.

Hybrid of Anomaly-Based and Specification-Based IDS for Internet of Things Using Unsupervised OPF based on Map-Reduce Approach

H. Bostani, M. Sheikhan

Computer Communications, vol. 98, pp. 52-71, 2017

DOI: 10.1016/j.comcom.2016.12.001

Internet of Things (IoT) is a novel paradigm in computer networks in which resource-constrained objects connect to unreliable Internet by using a wide range of technologies. The insecure nature of the Internet and wireless sensor networks, that are the main components of IoT, make IoT vulnerable to different attacks, especially routing attacks (as insider attacks). A novel real-time hybrid intrusion detection framework is proposed in this study that consists of anomaly-based and specification-based intrusion detection modules for detecting two well-known routing attacks in IoT called sinkhole and selective-forwarding attacks. For this purpose, the specification-based intrusion detection agents, that are located in the router nodes, analyze the behavior of their host nodes and send their local results to the root node through normal data packets. In addition, an anomaly-based intrusion detection agent, that is located in the root node, employs the unsupervised optimum-path forest algorithm for projecting clustering models by using incoming data packets. This agent, which is based on the MapReduce architecture, can work in a distributed platform for projecting clustering models and consequently parallel detecting of anomalies as a global detection approach. The proposed method makes decision about suspicious behavior by using a voting mechanism. Notably, the proposed method is also extended to detect wormhole attack. The deployment of the hybrid proposed model is investigated in a smart-city scenario by an existing platform, as well. The free network's scale and the ability to identify malicious nodes are two key features of the proposed framework that are evaluated through different experiments in this study. The experimental results of simulated scenarios showed that the proposed hybrid method can achieve true positive rate of 76.19% and false positive rate of 5.92% when both sinkhole and selective-forwarding attacks were launched simultaneously. These rates in detecting wormhole attack are 96.02% and 2.08%, respectively.

Modifying Supervised Optimum-Path Forest in Intrusion Detection Systems Using Social Network Approaches and Unsupervised Learning

H. Bostani, M. Sheikhan

Pattern Recognition, vol. 62, pp. 56-72, 2017

DOI: 10.1016/j.patcog.2016.08.027

Optimum-path forest (OPF) is a graph-based machine learning method that can overcome some limitations of the traditional machine learning algorithms that have been used in intrusion detection systems. This paper presents a novel approach for intrusion detection using a modified OPF (MOPF) algorithm for improving the performance of traditional OPF in terms of detection rate (DR), false alarm rate (FAR), and time of execution. To address the problem of scalability in large datasets and also for achieving high attack recognition rates, the proposed framework employs the k-means clustering algorithm, as a partitioning module, for generating different homogeneous training subsets from original heterogeneous training samples. In the proposed MOPF algorithm, the distance between unlabeled samples and the root (prototype) of every sample in OPF is also considered in classifying unlabeled samples with the aim of improving the accuracy rate of traditional OPF algorithm. Moreover, the centrality and the prestige concepts in the social network analysis are employed in a pruning module for determining the most informative samples in training subsets to speed up the traditional OPF algorithm. The experimental results on NSL-KDD dataset show that the proposed method performs better than traditional OPF in terms of accuracy rate, DR, FAR, and cost per example (CPE) evaluation metrics.

A Security Mechanism for Detecting Intrusions in Internet of Things Using Selected Features Based on MI-BGSA

M. Sheikhan, H. Bostani

International Journal of Information & Communication Technology Research, vol. 9, no. 2, pp. 53-62, 2017

Url: journal.itrc.ac.ir/index.php/ijictr/article/view/11

Internet of things (IoT) is a novel emerging approach in computer networks wherein all heterogeneous objects around us, which usually are resource-constrained objects, can connect to each other and also the Internet by using a broad range of technologies. IoT is a hybrid network which includes the Internet and also wireless sensor networks (WSNs) as the main components of IoT; so, implementing security mechanisms in IoT seems necessary. This paper introduces a novel intrusion detection architecture model for IoT that provides the possibility of distributed detection. The proposed hybrid model uses anomaly and misuse intrusion detection agents based on the supervised and unsupervised optimum-path forest models for providing the ability to detect internal and externals attacks, simultaneously. The number of input features to the proposed classifier is reduced by a hybrid feature selection algorithm, as well. The experimental results of simulated scenarios show the superior performance of proposed security mechanism in multi-faceted detection.

2016

Binary Gravitational Search Algorithm (BGSA): Improved Efficiency

M. Sheikhan, H. Bostani

Encyclopedia of Information Assurance, 2016

Url: www.taylorfrancis.com/books/....

Today, detecting anomalous traffic and preventing it in computer networks has become increasingly important for the community of security researchers. An intrusion detection system (IDS) is an effective tool for reaching high security. This is a software tool for analyzing system behavior or network traffic as input data to detect deviations from normal behavior. With the development of computer networks, high-dimensional input data analysis has become a huge problem in IDSs. One solution for overcoming this problem is feature selection, which is a process for selecting an optimal subset of features. Population-based heuristic search algorithms have been widely used for this optimization problem. This entry presents a novel feature selection method based on a binary gravitational search algorithm (BGSA). The proposed method, which is called modified BGSA (MBGSA), uses BGSA for performing the global search to find the best subset of features through the wrapper method. Moreover, for improving the efficiency of BGSA, mutual information (MI) feature selector under the uniform information distribution (MIFS-U) method, which works as a filter method, is integrated into BGSA as the inner optimization layer. In fact, with the computation of the relevance between each selected feature and the target class and the redundancy between the selected features (in the feature subset generated by the wrapper), MIFS-U will find more valuable features that have maximum relevance to the target class and minimum redundancy to each other. The experimental results on NSL-KDD dataset using different classifiers show that the proposed method can find better subset features and achieve higher accuracy and an improved detection rate using fewer features as compared to standard BGSA and binary particle swarm optimization (BPSO) feature selection methods.

Modification of Optimum-Path Forest using Markov Cluster Process Algorithm

H. Bostani, M. Sheikhan

In Proc. 2nd International Conference on Signal Processing and Intelligent Systems (ICSPIS’2016), pp. 1-5, 2016 (Winner of the Outstanding Paper Award)

DOI: 10.1109/ICSPIS.2016.7869874

Optimum-path forest (OPF) is a novel supervised graph-based classifier which reduces the classification problem to partitioning the vertices of a graph derived from the data samples. One of the main processes in OPF is identifying the optimum set of key samples, named prototypes. This process is based on creating a minimum spanning tree on a complete weighted graph derived from the training samples; hence, it is very time-consuming for large-scale problems. In this study, to overcome this limitation, the process of finding the prototypes in traditional OPF is modified by using the Markov cluster (MCL) algorithm. The graph partitioning in MCL is based on finding key samples named attractors, which attract other related samples, so the obtained attractors can be selected as prototypes for generating optimum-path trees. Experiments on public benchmark datasets show that the speed of the proposed modified OPF is improved considerably as compared to the traditional OPF.
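As a rough illustration of the prototype step in traditional OPF mentioned in the abstract above, the sketch below builds a minimum spanning tree over the complete Euclidean graph of a tiny training set and marks candidate prototypes. It assumes the commonly described OPF rule that prototypes are the endpoints of MST edges joining samples of different classes; that rule, the function names `mst_edges` and `find_prototypes`, and the toy data are illustrative assumptions, not the authors' implementation. The quadratic cost of the MST step on a complete graph is what the abstract identifies as the bottleneck.

```python
import math


def mst_edges(points):
    """Prim's algorithm on the complete Euclidean graph.

    Returns the n-1 tree edges as (parent_index, child_index) pairs.
    Runs in O(n^2) time, which is why this step is costly at scale.
    """
    n = len(points)
    in_tree = [False] * n
    best_cost = [math.inf] * n   # cheapest known edge into the tree
    parent = [-1] * n
    best_cost[0] = 0.0           # start the tree from sample 0
    edges = []
    for _ in range(n):
        # Pick the cheapest sample that is not yet in the tree.
        u = min((i for i in range(n) if not in_tree[i]),
                key=lambda i: best_cost[i])
        in_tree[u] = True
        if parent[u] != -1:
            edges.append((parent[u], u))
        # Relax the edges from u to every sample still outside the tree.
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best_cost[v]:
                    best_cost[v] = d
                    parent[v] = u
    return edges


def find_prototypes(points, labels):
    """Assumed rule: endpoints of MST edges that cross a class boundary."""
    prototypes = set()
    for i, j in mst_edges(points):
        if labels[i] != labels[j]:
            prototypes.update((i, j))
    return sorted(prototypes)


# Toy data: two classes laid out on a line.
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (5.0, 0.0), (6.0, 0.0)]
labels = ["a", "a", "a", "b", "b"]
prototypes = find_prototypes(points, labels)
```

On this toy set the only MST edge that crosses the class boundary connects samples 2 and 3, so those two indices are returned as prototypes.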

A Hybrid Intrusion Detection Architecture for Internet of Things

M. Sheikhan, H. Bostani

In Proc. 8th International Symposium on Telecommunication (IST’2016), pp. 601-606, 2016 (Selected as one of the Best Papers)

DOI: 10.1109/ISTEL.2016.7881893

In computer networks, the Internet of things (IoT) is an emerging paradigm wherein smart and resource-constrained objects can connect to the Internet by using a wide range of technologies. Due to the insecure nature of the Internet and of wireless sensor networks (WSNs), which are the main components of IoT, implementing security mechanisms in IoT is necessary. To deal with intrusions which may occur in IoT, a novel intrusion detection architecture model for IoT is proposed in this paper. This model is based on the MapReduce approach with the aim of distributed detection. To provide multi-faceted detection (from the Internet and WSN sides), the proposed model consists of anomaly-based and misuse-based intrusion detection agents that use supervised and unsupervised optimum-path forest models for intrusion detection. The experimental results of simulated scenarios show the superior performance of the proposed method in intrusion detection for IoT.

2015

Hybrid of Binary Gravitational Search Algorithm and Mutual Information for Feature Selection in Intrusion Detection Systems

H. Bostani, M. Sheikhan

Soft Computing, vol. 21, no. 9, pp. 2307-2324, 2017

DOI: 10.1007/s00500-015-1942-8

Intrusion detection systems (IDSs) play an important role in the security of computer networks. One of the main challenges in IDSs is the analysis of high-dimensional input data. Feature selection is a solution for overcoming this problem. This paper presents a hybrid feature selection method using the binary gravitational search algorithm (BGSA) and mutual information (MI) for improving the efficiency of standard BGSA as a feature selection algorithm. The proposed method, called MI-BGSA, used BGSA as a wrapper-based feature selection method for performing global search. Moreover, the MI approach was integrated into the BGSA, as a filter-based method, to compute the feature–feature and the feature–class mutual information with the aim of pruning the subset of features. This strategy found the features with the least redundancy to the selected features and the most relevance to the target class. A two-objective function based on maximizing the detection rate and minimizing the false positive rate was defined as a fitness function to control the search direction of the standard BGSA. The experimental results on the NSL-KDD dataset showed that the proposed method can reduce the feature space dramatically. Moreover, the proposed algorithm found a better subset of features and achieved higher accuracy and detection rate as compared to some standard wrapper-based and filter-based feature selection methods.
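To make the relevance and redundancy idea in the abstract above concrete, here is a minimal sketch of scoring a candidate feature with discrete mutual information. It uses a generic MIFS-style criterion (feature–class MI minus a weighted sum of feature–feature MI); the exact criterion, the weight `beta`, the function names, and the toy data are assumptions for illustration and are not taken from the paper.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """I(X;Y) in bits for two equal-length sequences of discrete values."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    # p(x,y) / (p(x) p(y)) simplifies to c * n / (count_x * count_y).
    return sum(
        (c / n) * math.log2(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def mifs_score(candidate, selected, target, beta=0.5):
    """Assumed MIFS-style score: relevance minus beta times redundancy."""
    relevance = mutual_information(candidate, target)
    redundancy = sum(mutual_information(candidate, s) for s in selected)
    return relevance - beta * redundancy


# Toy data: a four-class target encoded by two binary features.
target = [0, 1, 2, 3]
f_high = [0, 0, 1, 1]   # already selected
f_dup = [0, 0, 1, 1]    # exact duplicate of f_high
f_low = [0, 1, 0, 1]    # carries complementary information

score_dup = mifs_score(f_dup, [f_high], target)
score_low = mifs_score(f_low, [f_high], target)
```

Both candidates are equally relevant to the target here, but the duplicate is penalised for its redundancy with the already selected feature, so `f_low` receives the higher score.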

Skills

Professional National Certifications

Education in a Nutshell, Radboud University, Nijmegen, The Netherlands (2023)

Presentation Skills, Radboud University, Nijmegen, The Netherlands (2023)

Summer School on Privacy-Preserving Machine Learning, ITU Copenhagen and Aarhus University, Copenhagen, Denmark (2022)

Advanced Conversation, Radboud University, Nijmegen, The Netherlands (2021)

SQL Server Query Tuning and Optimization, Faratar As Danesh Institute, Tehran, Iran (2018)

SQL Server 2016 – Design & Implementation, Faratar As Danesh Institute, Tehran, Iran (2018)

Professional SCRUM Master, Faratar As Danesh Institute, Tehran, Iran (2016)

MCSD Web Pack 2012, Kahkeshane Noor Institute, Tehran, Iran (2016)

ETL (SSIS) and Data Mining (SSAS) 2012, Faratar As Danesh Institute, Tehran, Iran (2014)

Data Warehousing & OLAP using SSAS 2012, Faratar As Danesh Institute, Tehran, Iran (2014)

Win Application (C# & intro ADO.NET), South Industrial Management Institute, Shiraz, Iran (2008)

Computer Knowledge

Programming and Scripting: C/C++, C#, Java, HTML & CSS, MATLAB, Python

Tools and IDEs: PyTorch, Scikit-Learn, Hugging Face Transformers, MATLAB Optimization and Neural Net Tools, Microsoft Visual Studio, SQL Server Management Studio, Visual Paradigm, Microsoft Office, Azure Boards

Software Development Technologies: C#.NET Windows Forms, ASP.NET Web Forms, ASP.NET MVC, JavaScript, jQuery & AngularJS, ADO.NET Entity Framework, Database (programming), Java 2 Platform Micro Edition (J2ME), Microsoft BI Technologies (Analysis, Integration, and Reporting Services)

Software Development Methodologies: RUP, EUP, SCRUM

OS: Windows