Email Security

Google’s RETVec Open Source Text Vectorizer Bolsters Malicious Email Detection

Google shows how RETVec, a new and open source text vectorizer, can improve the detection of phishing attacks, spam and other harmful content.

Eduard Kovacs

November 30, 2023

Google revealed on Wednesday that a new text vectorizer developed by its researchers significantly boosts efficiency in detecting malicious emails in Gmail inboxes.

Google describes the new text vectorizer, called RETVec (Resilient & Efficient Text Vectorizer), as “an efficient, resilient, and multilingual text vectorizer designed for neural-based text processing.”

The internet giant has been leveraging text classification models to identify phishing attacks, scams, inappropriate comments and other harmful content on services such as YouTube and Gmail.

However, threat actors have been coming up with ways to evade these classifiers, using invisible characters, homoglyphs, and keyword stuffing.
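To illustrate why these tricks work, here is a minimal sketch of a naive keyword filter and two of the evasion techniques mentioned above. The blocklist, the function names and the sample messages are hypothetical and are not taken from Google's systems; the sketch only shows the general idea.

```python
import unicodedata

# Hypothetical blocklist for a naive keyword-based filter (illustration only).
BLOCKLIST = {"password", "invoice"}


def naive_flag(text: str) -> bool:
    """Flag text if any whitespace-separated word is on the blocklist."""
    words = (w.strip(".,!?:;") for w in text.lower().split())
    return any(w in BLOCKLIST for w in words)


def strip_invisible(text: str) -> str:
    """Drop Unicode format characters (category Cf), e.g. zero-width spaces."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Cf")


plain = "Reset your password now"
homoglyph = "Reset your p\u0430ssword now"   # Cyrillic U+0430 in place of Latin "a"
invisible = "Reset your pass\u200bword now"  # zero-width space inside the keyword

for label, msg in [("plain", plain), ("homoglyph", homoglyph), ("invisible", invisible)]:
    # Second column: raw filter result; third column: result after stripping
    # invisible characters.
    print(label, naive_flag(msg), naive_flag(strip_invisible(msg)))
```

In this sketch, stripping invisible characters recovers the keyword in the third message, but the homoglyph variant still slips through, since the Cyrillic letter is a different character that merely looks the same. Handcrafted preprocessing of this kind tends to lag behind attackers, which is the gap a resilient vectorizer is meant to close.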

RETVec aims to boost the efficiency of text classifiers while significantly reducing computation costs, and the tests conducted by Google over the past year seem to show that it has achieved its goal.

In its tests, Google replaced the text vectorizer previously used to detect spam in Gmail with RETVec. The company reported a 38% improvement in the spam detection rate over the baseline and a 19.4% reduction in the false positive rate. In addition, Google said RETVec reduced the model's TPU usage by 83%.
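For readers unfamiliar with how such figures are typically derived, the sketch below computes a detection rate, a false positive rate and a relative change from confusion-matrix counts. It assumes the 38% figure is a relative change over the baseline. All counts are hypothetical and chosen only to make the arithmetic easy to follow; they are not Gmail data.

```python
def detection_rate(true_pos: int, false_neg: int) -> float:
    """Share of actual spam that the classifier catches (recall)."""
    return true_pos / (true_pos + false_neg)


def false_positive_rate(false_pos: int, true_neg: int) -> float:
    """Share of legitimate mail that is wrongly flagged as spam."""
    return false_pos / (false_pos + true_neg)


def relative_change(old: float, new: float) -> float:
    """Change of `new` relative to `old`, e.g. 0.38 means +38%."""
    return (new - old) / old


# Hypothetical counts per 1,000 spam messages, before and after a vectorizer swap.
baseline = detection_rate(true_pos=500, false_neg=500)
candidate = detection_rate(true_pos=690, false_neg=310)

print(f"baseline detection rate:  {baseline:.2f}")
print(f"candidate detection rate: {candidate:.2f}")
print(f"relative improvement:     {relative_change(baseline, candidate):.0%}")
```

A relative improvement says nothing about the absolute rates on its own: the same percentage could describe a move from 50% to 69% or between any other pair of rates with the same ratio.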

“RETVec achieves these improvements by combining a novel, highly-compact character encoder, an augmentation-driven training regime, and the use of metric learning,” Google explained.

It added, “Due to its novel architecture, RETVec works out-of-the-box on every language and all UTF-8 characters without the need for text preprocessing, making it the ideal candidate for on-device, web, and large-scale text classification deployments. Models trained with RETVec exhibit faster inference speed due to its compact representation. Having smaller models reduces computational costs and decreases latency, which is critical for large-scale applications and on-device models.”
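The idea of a compact character encoder that needs no vocabulary or preprocessing can be sketched as follows. This is not RETVec's actual implementation: it assumes each character is represented by a fixed-width binary code of its Unicode code point, and the widths used here (24 bits per character, 16 characters per word) and the function names are assumptions for illustration. The trained embedding model and the metric learning step are not shown.

```python
from typing import List

BITS_PER_CHAR = 24   # assumed width; every Unicode code point fits in 21 bits
CHARS_PER_WORD = 16  # assumed fixed word length; longer words are truncated


def encode_char(ch: str, bits: int = BITS_PER_CHAR) -> List[int]:
    """Return the code point of `ch` as a list of bits, least significant first."""
    code_point = ord(ch)
    return [(code_point >> i) & 1 for i in range(bits)]


def encode_word(word: str, max_chars: int = CHARS_PER_WORD,
                bits: int = BITS_PER_CHAR) -> List[List[int]]:
    """Encode a word as a max_chars x bits matrix, zero-padded at the end."""
    rows = [encode_char(ch, bits) for ch in word[:max_chars]]
    rows += [[0] * bits for _ in range(max_chars - len(rows))]
    return rows


def hamming(a: List[List[int]], b: List[List[int]]) -> int:
    """Count the bit positions where two encoded words differ."""
    return sum(x != y for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))


latin = encode_word("password")
spoofed = encode_word("p\u0430ssword")  # Cyrillic U+0430 in place of Latin "a"

print(len(latin), len(latin[0]))        # matrix shape: rows, columns
print(hamming(latin, spoofed))          # number of differing bits
```

Because the encoding is computed directly from code points, any character in any language gets a representation without a lookup table, and a single-character substitution changes only a few bits of the word's matrix instead of producing an out-of-vocabulary token.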


Google has detailed RETVec in a paper and released it as open source. A tutorial is also available for those interested in using the new text vectorizer.

Written By Eduard Kovacs

Eduard Kovacs (@EduardKovacs) is a managing editor at SecurityWeek. He worked as a high school IT teacher for two years before starting a career in journalism as Softpedia’s security news reporter. Eduard holds a bachelor’s degree in industrial informatics and a master’s degree in computer techniques applied in electrical engineering.

Latest News

CIEM Chat: How to Reduce Cloud Identity Risk

March 26, 2024

Join the session as we discuss the challenges and best practices for cybersecurity leaders managing cloud identities.

Virtual Event: Ransomware Resilience & Recovery Summit

April 17, 2024

SecurityWeek’s Ransomware Resilience and Recovery Summit helps businesses to plan, prepare, and recover from a ransomware incident.

Why Using Microsoft Copilot Could Amplify Existing Data Quality and Privacy Issues

Microsoft provides an easy and logical first step into GenAI for many organizations, but beware of the pitfalls. (Alastair Paterson)

Beyond the Buzz: Rethinking Alcohol as a Cybersecurity Bonding Ritual

Jennifer Leggio makes the case for more alcohol-free networking events at conferences, and community-building opportunities for sober individuals working in cybersecurity. (Jennifer Leggio)

Navigating Vendor Speak: A Security Practitioner’s Guide to Seeing Through the Jargon

As a security industry, we need to focus our energies on those professionals among us who know how to walk the walk. (Joshua Goldfarb)

SD-WAN: Don’t Build a Dead End, Prepare for Future-Proof Secure Networking

SD-WAN must be scalable, stable, secure, and fully operational to serve as a strong base for seamless modernization and progression to SASE. (Etay Maor)

You Against the World: The Offenders Dilemma

Foreign attackers have many more toolsets at their disposal, so we need to make sure we’re selective about our modeling, preparation and how we assess and fortify ourselves. (Tom Eston)

Cloud Security

Microsoft Cloud Hack Exposed More Than Exchange, Outlook Emails

Cloud security researcher warns that stolen Microsoft signing key was more powerful and not limited to Outlook.com and Exchange Online.

Ryan Naraine

July 21, 2023

Compliance

DMARC Implemented on Half of U.S. Government Domains

Government agencies in the United States have made progress in the implementation of the DMARC standard in response to a Department of Homeland Security...

Eduard Kovacs

January 3, 2018

Email Security

DMARC Adoption Low in Fortune 500, FTSE 100 Companies

Many Fortune 500, FTSE 100 and ASX 100 companies have failed to properly implement the DMARC standard, exposing their customers and partners to phishing...

Eduard Kovacs

August 23, 2017

Application Security

VMware Patches VM Escape Flaw Exploited at Geekpwn Event

Virtualization technology giant VMware on Tuesday shipped urgent updates to fix a trio of security problems in multiple software products, including a virtual machine...

Ryan Naraine

December 13, 2022

Application Security

Fortinet Ships Emergency Patch for Already-Exploited VPN Flaw

Fortinet on Monday issued an emergency patch to cover a severe vulnerability in its FortiOS SSL-VPN product, warning that hackers have already exploited the...

Ryan Naraine

December 12, 2022

Cybercrime

Enterprises Warned About Zix-Themed Credential Phishing Attacks

Enterprise users have been warned that cybercriminals may be trying to phish their credentials by luring them with fake emails that appear to be...

Eduard Kovacs

September 28, 2021

Cloud Security

Proofpoint to Acquire Tessian for AI-Powered Email Security Tech

Proofpoint removes a formidable competitor from the crowded email security market and adds technology to address risk from misdirected emails.

Ryan Naraine

October 30, 2023

Cloud Security

Microsoft’s Verified Publisher Status Abused in Email Theft Campaign

Microsoft and Proofpoint are warning organizations that use cloud services about a recent consent phishing attack that abused Microsoft’s ‘verified publisher’ status.

Eduard Kovacs

January 31, 2023
