Credit Card Fraud Detection Using Adaboost and Majority Voting
Credit Card Fraud Detection Using Adaboost and Majority Voting
Credit Card Fraud Detection Using Adaboost and Majority Voting
ISSN: 2278-0181
RTICCT - 2019 Conference Proceedings
Abstract: - Credit card fraud is a serious problem in financial effective fraud detection system to reduce or eliminate fraud cases
services. Billions of dollars are lost due to credit card fraud is important. There have been various studies on credit card fraud
every year. There is a lack of research studies on analyzing detection. Machine learning and related methods are most
real-world credit card data owing to confidentiality issues. In commonly used, which include artificial neural networks, rule-
this paper, machine learning algorithms are used to detect induction techniques, decision trees, logistic regression, and
credit card fraud. Standard models are firstly used. Then, support vector machines [1]. These methods are used either
hybrid methods which use AdaBoost and majority voting standalone or by combining several methods together to form
methods are applied. To evaluate the model efficacy, a hybrid models.
publicly available credit card data set is used. Then, a real-
world credit card data set from a financial institution is In this paper, a total of twelve machine learning
analyzed. In addition, noise is added to the data samples to algorithms are used for detecting credit card fraud. The
further assess the robustness of the algorithms. The algorithms range from standard neural networks to deep learning
experimental results positively indicate that the majority models. They are evaluated using both benchmark and realworld
voting method achieves good accuracy rates in detecting fraud credit card data sets. In addition, the AdaBoost and majority
cases in credit cards. voting methods are applied for forming hybrid
models. To further evaluate the robustness and reliability of the
1. INTRODUCTION models, noise is added to the real-world data set. The key
contribution of this paper is the evaluation of a variety of machine
Fraud is a wrongful or criminal deception aimed to learning models with a real-world credit card data set for fraud
bring financial or personal gain. In avoiding loss from fraud, two detection. While other researchers have used various methods on
mechanisms can be used: fraud prevention and fraud detection. publicly available data sets, the data set used in this paper are
Fraud prevention is a proactive method, where it stops fraud from extracted from actual credit card transaction information over
happening in the first place. On the other hand, fraud detection is three months.
needed when a fraudulent transaction is attempted by a fraudster.
The organization of this paper is as follows. In Section
Credit card fraud is concerned with the illegal use of II, related studies on single and hybrid machine learning
credit card information for purchases. Credit card transactions can algorithms for financial applications is given. The machine
be accomplished either physically or digitally. In physical learning algorithms used in this study are presented in Section III.
transactions, the credit card is involved during the transactions. In The experiments with both benchmark and realworld credit card
digital transactions, this can happen over the telephone or the data sets are presented in Section IV. Concluding remarks and
internet. Cardholders typically provide the card number, expiry recommendations for further work
date, and card verification number are given in Section V.
through telephone or website.
2. RELATED WORK
With the rise of e-commerce in the past decade, the use
of credit cards has increased dramatically. The number of credit In this section, single and hybrid machine learning
card transactions in 2011 in Malaysia were at about 320 million, algorithms for financial applications are reviewed. Various
and increased in 2015 to about 360 million. Along with the rise of financial applications from credit card fraud to financial statement
credit card usage, the number of fraud cases have been constantly fraud are reviewed.
increased. While numerous authorization techniques have been in
place, credit card fraud cases have not hindered effectively. A. SINGLE MODELS
Fraudsters favour the internet as their identity and location are For credit card fraud detection, Random Forest (RF),
hidden. The rise in credit card fraud has a big impact on the Support Vector Machine, (SVM) and Logistic Regression (LOR)
financial industry. The global credit card fraud in 2015 reached to were examined in. The data set consisted of one-year transactions.
a staggering USD $21.84 billion. Data under-sampling was used to examine the algorithm
performances, with RF demonstrating a better performance as
Loss from credit card fraud affects the merchants, compared with SVM and LOR [6]. An Artificial Immune
where they bear all costs, including card issuer fees, charges, and Recognition System (AIRS) for credit card fraud detection was
administrative charges. Since the merchants need to bear the loss, proposed in. AIRS is an improvement over the standard AIS
some goods are priced higher, or discounts and incentives are model, where negative selection was used to achieve higher
reduced. Therefore, it is imperative to reduce the loss, and an
precision. This resulted in an increase of accuracy by 25% and Testing credit card FDSs using real data set is a difficult
reduced system response time by 40%. task.
The fraud has to be deducted in real time and the number of
A credit card fraud detection system was proposed in, false alert.
which consisted of a rule-based filter, Dumpster–Shafer adder, 3.2 PROPOSED SYSTEM
transaction history database, and Bayesian learner. The Total of twelve machine learning algorithms are used for
Dempster–Shafer theory combined various evidential information detecting credit card fraud.
and created an initial belief, which was used to classify a The algorithms range from standard neural networks to deep
transaction as normal, suspicious, or abnormal. If a transaction learning models.
was suspicious, the belief was further evaluated using transaction In addition, the AdaBoost and majority voting methods are
history from Bayesian learning. applied for forming hybrid models.
The key contribution of this paper is the evaluation of a
Simulation results indicated a 98% true positive rate. A variety of machine learning models with a real-world credit card
modified Fisher Discriminant function was used for credit card data set for fraud detection.
fraud detection in. The modification made the traditional
functions to become more sensitive to important instances. A SYSTEM ARCHITECTURE DIAGRAM –
weighted average was utilized to calculate variances, which BASEPAPER
allowed learning of profitable transactions. The results from the
modified function confirm it can eventuate more profit.
B. HYBRID MODELS
Hybrid models are combination of multiple individual
models. A hybrid model consisting of the Multilayer Perceptron
(MLP) neural network, SVM, LOR, and Harmony Search (HS)
optimization was used in to detect corporate tax evasion. HS was
useful for finding the best parameters for the classification
models. Using data from the food and textile sectors in Iran, the
MLP with HS optimization acquired the highest accuracy rates at
90.07%. A hybrid clustering system with outlier detection
capability was used to detect fraud in lottery and online games.
The system aggregated online algorithms with statistical
information from the input data to identify a number of fraud
types. The training data set was compressed into the main
memory while new data samples could be incrementally added
into the stored datacubes. The system achieved a high detection
rate at 98%, with a 0.1% false alarm rate.
To tackle financial distress, clustering and classifier 3.2.1 Advantages Of Proposed System
ensemble methods were used to form hybrid models in. The SOM The system is very fast due to AdaBoost Technique.
and k-means algorithms were used for clustering, while LOR, Effective Majority Voting techniques.
MLP, and DT were used for classification. Based on these
methods, a total of 21 hybrid models with different combinations 4.0 METHODOLOGY
were created and evaluated with the data set. The SOM with the
MLP classifier performed the best, yielding the highest prediction MACHINE LEARNING ALGORITHMS
accuracy. An integration of multiple models, i.e. RF, DR, Roush A total of twelve algorithms are used in this
Set Theory (RST), and back-propagation neural network was used experimental study. They are used in conjunction with the
in to build a fraud detection model for corporate financial AdaBoos
statements. Company financial statements in period of 1998 to
2008 were used as the data set. The results showed that the hybrid ALGORITHMS
model of RF and RST gave the highest classification accuracy. Naïve Bayes (NB) uses the Bayes’ theorem with strong
or naïve independence assumptions for classification. Certain
3. PROBLEM STATEMENT features of a class are assumed to be not correlated to others. It
requires only a small training data set for estimating the means
3.1 EXISTING MODEL and variances is needed for classification.
Three methods to detect fraud are presented.
Firstly, clustering model is used to classify the legal and The presentation of data in form of a tree structure is
fraudulent transaction using data parameter value. useful
Secondly, Gaussian mixture model past behavior and current for ease of interpretation by users. The Decision Tree (DT) is a
behavior can be calculated to detect any abnormalities from the collection of nodes that creates decision on features connected to
past behavior. certain classes. Every node represents a splitting rule for a feature.
Lastly, Bayesian networks are used to describe the statistics New nodes are established until the stopping criterion is met. The
of a particular user and the statistics of different fraud scenarios. class label is determined based on the majority of samples that
belong to a particular leaf. The Random Tree (RT) operates as a
3.1.1 Drawbacks
DT operator, with the exception that in each split, only a random
The high amount of losses due to fraud and the
subset of features is available. It learns from both nominal and
awareness of the relation between loss and the available limit has
numerical data samples. The subset size is defined using a subset
to be reduced.
ratio parameter.
The SVM can tackle both classification and regression A study on credit card fraud detection using machine
data. SVM builds a model by assigning new samples to one learning algorithms has been presented in this paper. A number of
category or another, creating a non-probabilistic binary linear standard models which include NB, SVM, and DL have been
classifier. It represents the data samples as points in the space used in the empirical evaluation. A publicly available credit card
mapped so such that the data samples of different categories can data set has been used for evaluation using individual (standard)
be separated by a margin as wide as possible. A summary of the models and hybrid models using AdaBoost and majority voting
strengths and limitations of the methods discussed earlier is given combination methods. The MCC metric has been adopted as a
in Table I. performance measure, as it takes into account the true and false
TABLE I positive and negative predicted outcomes. The best MCC score is
STRENGTHS AND LIMITATIONS OF MACHINE 0.823, achieved using majority voting. A real credit card data set
LEARNING METHODS from a financial institution has also been used for evaluation. The
same individual and hybrid models have been employed. A
Model Strengths Strengths Limitations perfect MCC score of 1 has been achieved using AdaBoost and
Limitations majority voting methods. To further evaluate the hybrid models,
Bayesian Good for binary Need good noise from 10% to 30% has been added into the data samples. The
classification understanding of majority voting method has yielded the best MCC score of 0.942
problems; efficient typical and for 30% noise added to the data set. This shows that the majority
use of abnormal voting method is stable in performance in the presence of noise.
computational behaviors for
For future work, the methods studied in this paper will
resources; suitable different types of
for real-time fraud cases be extended to online learning models. In addition, other online
operations. learning models will be investigated. The use of online learning
Trees Easy to understand Potential of over- will enable rapid detection of fraud cases, potentially in real-time.
and implement; the fitting if the This in turn will help detect and prevent fraudulent transactions
procedures training set does not before they take place, which will reduce the number of losses
require a low represent the incurred every day in the financial sector.