Towards Detection Of Phishing Websites On Client-side Using Machine Learning Based Approach

Ankit Kumar Jain, Brij B. Gupta
Published 2018 · Computer Science
Existing anti-phishing approaches use either blacklist methods or feature-based machine learning techniques. Blacklist methods fail to detect new phishing attacks and produce a high false positive rate. Moreover, existing machine learning based methods extract features from third-party services, search engines, etc. They are therefore complicated, slow, and unsuited to a real-time environment. To solve this problem, this paper presents a novel machine learning based anti-phishing approach that extracts features on the client side only. We examined the attributes of phishing and legitimate websites in depth and identified nineteen outstanding features that distinguish phishing websites from legitimate ones. These nineteen features are extracted from the URL and source code of the website and do not depend on any third party, which makes the proposed approach fast, reliable, and intelligent. Compared to other methods, the proposed approach has relatively high accuracy in detecting phishing websites: it achieved a 99.39% true positive rate and 99.09% overall detection accuracy.
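To make the idea of client-side feature extraction concrete, the following is a minimal sketch in Python that derives a feature vector from nothing but a page's URL and its HTML source, with no third-party lookups. The abstract does not enumerate the nineteen features, so the seven features below, the function name `extract_features`, and the helper class `_LinkCollector` are illustrative assumptions rather than the authors' feature set.

```python
import ipaddress
from html.parser import HTMLParser
from urllib.parse import urlparse


class _LinkCollector(HTMLParser):
    """Collects href targets of <a> tags and action targets of <form> tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = []
        self.form_actions = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        if tag == "a" and attr_map.get("href") is not None:
            self.hrefs.append(attr_map["href"])
        elif tag == "form":
            # A missing or empty action posts back to the page itself.
            self.form_actions.append(attr_map.get("action") or "")


def _is_ip(host):
    """True when the host is a literal IP address instead of a domain name."""
    try:
        ipaddress.ip_address(host)
        return True
    except ValueError:
        return False


def extract_features(url, html):
    """Return illustrative client-side features (assumed, not the paper's nineteen)."""
    parsed = urlparse(url)
    host = parsed.hostname or ""

    collector = _LinkCollector()
    collector.feed(html)

    def is_external(target):
        # A target is external when it names a host different from the page's.
        target_host = urlparse(target).hostname
        return target_host is not None and target_host != host

    hrefs = collector.hrefs
    external = sum(1 for h in hrefs if is_external(h))

    return {
        "url_length": len(url),
        "num_dots_in_host": host.count("."),
        "has_at_symbol": "@" in url,
        "host_is_ip": _is_ip(host),
        "uses_https": parsed.scheme == "https",
        "external_link_ratio": external / len(hrefs) if hrefs else 0.0,
        "has_external_form_action": any(
            is_external(a) for a in collector.form_actions
        ),
    }
```

A dictionary like this could be turned into a numeric vector and passed to any standard classifier. Every value is computed locally from the URL string and the page markup, which is the property the abstract credits for making detection fast enough for real-time use.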