Cyber Security

Machine learning technique detects phishing sites based on markup visualization

Published

September 8, 2021

Machine learning models trained on the visual representation of website code can help improve the accuracy and speed of detecting phishing websites.

This is according to a paper (PDF) by security researchers at the University of Plymouth and the University of Portsmouth, UK.

The researchers aim to address the shortcomings of existing detection methods, which are either too slow or not accurate enough.

Turning web code into images

The technique developed by the researchers uses “binary visualization” libraries to transform the markup and code of web pages into images.

Using this method, they created a dataset of legitimate and phishing images of websites.

Visual differences between the legitimate PayPal login page and a phishing equivalent

The dataset was then used to train a machine learning model to classify legitimate and phishing websites based on the differences in their binary visualization.

To test a new website, the target webpage’s code is transformed through binary visualization and run through the trained model.

To speed up the model’s performance, the researchers used MobileNet, a neural network that has been optimized to run on resource-constrained devices as opposed to cloud servers.

The system also gradually builds up a database of legitimate and phishing websites to avoid excessive and unnecessary inferences.

Overview of the proposed approach

Accurate detection of phishing websites

According to the researchers’ experiments, the model reached 94% accuracy in detecting phishing websites. And since it uses a very small neural network, it can run on user devices and provide near-real-time results.

“We have tested the technique with actual phishing and legit sites,” Stavros Shiaeles, one of the paper’s co-authors, told The Daily Swig.

This is not the first time that binary visualization and machine learning has been used in cybersecurity. In 2019, Shiaeles, who is a cybersecurity lecturer at the University of Portsmouth, was among the co-authors of another technique that used ML and binary visualization to detect malware with promising results.

After testing the phishing website detection system, the team is now taking the next step to make the technique ready for adoption.

“We are working on a new extended method and we are trying to apply for a patent,” Shiaeles said. “Based on the results we initially have I don’t see the point not to be adopted. The accuracy is 100%.”

Source: https://portswigger.net/daily-swig/machine-learning-technique-detects-phishing-sites-based-on-markup-visualization

Advertisement. Scroll to continue reading.

In this article:Machine Learning, Phishing, Research

Click to comment

Cyber Security

DHS cyber review board to examine China-linked cyberattacks of Microsoft

The Cyber Safety Review Board will assess how a hacking group reportedly linked to China leveraged a vulnerability in Microsoft Exchange Online to access...

NewsNews1August 12, 2023

Cyber Security

Charming Kitten hackers use new ‘NokNok’ malware for macOS

Security researchers observed a new campaign they attribute to the Charming Kitten APT group where hackers used new NokNok malware that targets macOS systems. The...

NewsNews1July 9, 2023

Cyber Security

Deserialized web security roundup: Twitter 2FA backlash, GoDaddy suffers years-long attack campaign, and XSS Hunter adds e2e encryption

Twitter faced further criticism this week when Elon Musk’s social networking platform announced SMS-based 2FA will only be available to paying customers going forward....

NewsNews1February 25, 2023

Cyber Security

Deserialized web security roundup: KeePass dismisses ‘vulnerability’ report, OpenSSL gets patched, and Reddit admits phishing hack

KeePass has become the latest password manager utility obliged to defend its reputation following the discovery of an alleged vulnerability. Security researchers warned that it might be...

NewsNews1February 13, 2023

Newsworthy News Global Political Local New Site

Cyber Security

Machine learning technique detects phishing sites based on markup visualization

Turning web code into images

Accurate detection of phishing websites

Leave a Reply
Cancel reply

Leave a Reply

You May Also Like

Cyber Security

DHS cyber review board to examine China-linked cyberattacks of Microsoft

Cyber Security

Charming Kitten hackers use new ‘NokNok’ malware for macOS

Cyber Security

Deserialized web security roundup: Twitter 2FA backlash, GoDaddy suffers years-long attack campaign, and XSS Hunter adds e2e encryption

Cyber Security

Deserialized web security roundup: KeePass dismisses ‘vulnerability’ report, OpenSSL gets patched, and Reddit admits phishing hack

Turning web code into images

Accurate detection of phishing websites

Leave a Reply Cancel reply

Leave a Reply

You May Also Like

Cyber Security

DHS cyber review board to examine China-linked cyberattacks of Microsoft

Cyber Security

Charming Kitten hackers use new ‘NokNok’ malware for macOS

Cyber Security

Deserialized web security roundup: Twitter 2FA backlash, GoDaddy suffers years-long attack campaign, and XSS Hunter adds e2e encryption

Cyber Security

Deserialized web security roundup: KeePass dismisses ‘vulnerability’ report, OpenSSL gets patched, and Reddit admits phishing hack

Leave a Reply
Cancel reply