Joshua T. Goodman - Redmond WA, US Robert L. Rounthwaite - Fall City WA, US Daniel Gwozdz - Sammamish WA, US John D. Mehr - Seattle WA, US Nathan D. Howell - Seattle WA, US Micah C. Rupersburg - Seattle WA, US Bryan T. Starbuck - Duvall WA, US
Assignee:
Microsoft Corporation - Redmond WA
International Classification:
H04L 29/00
US Classification:
726 13
Abstract:
The present invention involves a system and method that facilitate extracting data from messages for spam filtering. The extracted data can be in the form of features, which can be employed in connection with machine learning systems to build improved filters. Data associated with origination information as well as other information embedded in the body of the message that allows a recipient of the message to contact and/or respond to the sender of the message call be extracted as features. The features, or a subset thereof, can be normalized and/or deobfuscated prior to being employed as features of the machine learning systems. The (deobfuscated) features can be employed to populate a plurality of feature lists that facilitate spam detection and prevention. Exemplary features include an email address, an IP address, a URL, an embedded image pointing to a URL, and/or portions thereof.
John D. Mehr - Seattle WA, US Nathan D. Howell - Seattle WA, US
Assignee:
Microsoft Corporation - Redmond WA
International Classification:
G06F 15/173
US Classification:
709238, 709206, 709207, 709235
Abstract:
Message header spam filtering is described. In an embodiment, a message is received that includes header entries arranged in an ordered sequence which indicates a path by which the message was communicated. The header entries are parsed to categorize each header entry as a header type where the header types are listed in the ordered sequence. A quantity of each different header type is determined, and a determination is made as to whether the message is likely a spam message based at least in part on the quantity corresponding to a particular header type. In another embodiment, a numeric representation of the ordered sequence is created where the numeric representation includes unique integers assigned to each different header type. A determination is made as to whether the message is likely a spam message based at least in part on the numeric representation of the ordered sequence of header types.
Robert L. Rounthwaite - Fall City WA, US Joshua T. Goodman - Redmond WA, US David E. Heckerman - Bellevue WA, US John D. Mehr - Seattle WA, US Nathan D. Howell - Seattle WA, US Micah C. Rupersburg - Seattle WA, US Dean A. Slawson - Redmond WA, US
Assignee:
Microsoft Corporation - Redmond WA
International Classification:
G06F 15/16
US Classification:
709206, 709207, 709223, 709224, 713154
Abstract:
The subject invention provides for a feedback loop system and method that facilitate classifying items in connection with spam prevention in server and/or client-based architectures. The invention makes uses of a machine-learning approach as applied to spam filters, and in particular, randomly samples incoming email messages so that examples of both legitimate and junk/spam mail are obtained to generate sets of training data. Users which are identified as spam-fighters are asked to vote on whether a selection of their incoming email messages is individually either legitimate mail or junk mail. A database stores the properties for each mail and voting transaction such as user information, message properties and content summary, and polling results for each message to generate training data for machine learning systems. The machine learning systems facilitate creating improved spam filter(s) that are trained to recognize both legitimate mail and spam mail and to distinguish between them.
John D. Mehr - Seattle WA, US Nathan D Howell - Seattle WA, US Paul S Rehfuss - Seattle WA, US
Assignee:
Microsoft Corporation - Redmond WA
International Classification:
G06F 15/16
US Classification:
709206, 709207, 709202, 706 16, 707 10, 715205
Abstract:
Techniques are presented for assigning reputations to email senders. In one implementation, real-time statistics and heuristics are constructed, stored, analyzed, and used to formulate a sender reputation level for use in evaluating and controlling a given sender's connection to an message transfer agent or email recipient. A sender with an unfavorable reputation may be denied a connection before resources are spent receiving and processing email messages from the sender. A sender with a favorable reputation may be rewarded by having safeguards removed from the connection, which also saves system resources. The statistics and heuristics may include real-time analysis of traffic patterns and delivery characteristics used by an email sender, analysis of content, and historical or time-sliced views of all of the above.
Origination/Destination Features And Lists For Spam Prevention
Joshua T. Goodman - Redmond WA, US Robert L. Rounthwaite - Fall City WA, US Daniel Gwozdz - Sammamish WA, US John D. Mehr - Seattle WA, US Nathan D. Howell - Seattle WA, US Micah C. Rupersburg - Seattle WA, US Bryan T. Starbuck - Duvall WA, US
Assignee:
Microsoft Corporation - Redmond WA
International Classification:
H04L 29/06
US Classification:
726 13, 726 22
Abstract:
The present invention involves a system and method that facilitate extracting data from messages for spam filtering. The extracted data can be in the form of features, which can be employed in connection with machine learning systems to build improved filters. Data associated with origination information as well as other information embedded in the body of the message that allows a recipient of the message to contact and/or respond to the sender of the message can be extracted as features. The features, or a subset thereof, can be normalized and/or deobfuscated prior to being employed as features of the machine learning systems. The (deobfuscated) features can be employed to populate a plurality of feature lists that facilitate spam detection and prevention. Exemplary features include an email address, an IP address, a URL, an embedded image pointing to a URL, and/or portions thereof.
John D. Mehr - Seattle WA, US Nathan D. Howell - Seattle WA, US Micah C. Rupersburg - Seattle WA, US
Assignee:
Microsoft Corporation - Redmond WA
International Classification:
G06F 15/16
US Classification:
709206
Abstract:
The present invention involves a system and method that facilitate extracting data from messages for spam filtering. The extracted data can be in the form of features, which can be employed in connection with machine learning systems to build improved filters. Data associated with the subject line, timestamps, and the message body can be extracted and employed to generate one or more features. In particular, subject lines and message bodies can be examined for consecutive, repeating characters, blobs, the association or distance between such characters, blobs and non-blob portions of the message. The values or counts obtained can be broken down into one or more ranges corresponding to a degree of spaminess. Presence and type of attachments to messages, percentage of non-white-space and non-numeric characters of a message, and determining message delivery times can be used to identify spam. A time-based delta can be computed to facilitate determining the delivery time.
Bryan T. Starbuck - Duvall WA, US Robert L. Rounthwaite - Fall City WA, US David E. Heckerman - Bellevue WA, US Joshua T. Goodman - Redmond WA, US Eliot C. Gillum - Los Gatos CA, US Nathan D. Howell - Seattle WA, US Kenneth R. Aldinger - Redmond WA, US
Assignee:
Microsoft Corporation - Redmond WA
International Classification:
G06F 15/16
US Classification:
709206, 709201, 709203, 709204
Abstract:
The subject invention provides for an advanced and robust system and method that facilitates detecting spam. The system and method include components as well as other operations which enhance or promote finding characteristics that are difficult or the spammer to avoid and finding characteristics in non-spam that are difficult for spammers to duplicate. Exemplary characteristics include examining origination features in pairs, analyzing character and/or number sequences, strings, and sub-strings, detecting various entropy levels of one or more character sequences, strings and/or sub-strings as well as analyzing message and/or feature sizes.
Manjunath Bharadwaj - Bellevue WA, US Nathan Howell - Seattle WA, US Wei Jiang - Redmond WA, US
Assignee:
Microsoft Corporation - Redmond WA
International Classification:
G06F 11/00
US Classification:
370230, 3703953
Abstract:
Several approaches to selectively filtering network traffic are described. One approach involves a system for selectively filtering network traffic. The system includes a helper application, which is coupled to a networking program, and is used to identify a user-initiated request. A network filter driver is coupled to the networking program, for intercepting the user-initiated request. A filtering service is coupled to both the helper application and the network filter driver, and is used to determine if the user-initiated request is allowable. If the request is allowable, the filtering service is configured to generate a special identifier, which the helper application is configured to include in a subsequent request. The filtering service is configured to allow a subsequent request which includes the special identifier, and the network filter driver's configured to strip a special identifier from subsequent requests.
Sonia Rykiel Jul 2019 - Apr 2020
Head of Machine Learning
Godaddy Oct 2013 - Jun 2019
Senior Principal Architect
Alpha Heavy Industries Oct 2013 - Jun 2019
Principal
Ebay Jan 2009 - May 2011
Principal Software Engineer
Positronic Industries Sep 2008 - Dec 2008
Software Development Engineer
Skills:
Distributed Systems Machine Learning Scalability Big Data C++ Haskell Algorithms Text Classification Nosql Time Series Analysis Sentiment Analysis Compiler Development High Performance Computing Spam Filtering Kernel Programming Reverse Engineering Llvm Compilers
Nathan Howell (Nate)'s profile on Myspace, the leading social entertainment destination powered by the passion of our fans.
Googleplus
Nathan Howell
Work:
Alpha Heavy Industries - Principal (2011) EBay - Principal Developer (2009-2011) Positronic (acquired by eBay) - Software Developer (2008-2008) Microsoft - Senior Developer (1999-2008)
Nathan Howell
Nathan Howell
Nathan Howell
Nathan Howell
Nathan Howell
Nathan Howell
Nathan Howell
Flickr
Youtube
Nathan Howell - Football Highlights, McNairy ...
Nathan Howell's Senior Year at McNairy Central High School - Football ...
Duration:
9m 49s
Nathan Howell - Senior 2011 McNairy Central H...
Highlights of Nathan Howell's 2010 senior football season at McNairy C...
Duration:
9m 50s
The Nathan P. Howell House - 238 Main Street,...
The historic Nathan P. Howell house is the largest property on renowne...
Duration:
59s
4th Grade- Natha Howell Elementary (Mrs. Reyn...
Duration:
12m 57s
Nathan Howell Stand Up Second City Chicago Sh...
Description.
Duration:
6m 56s
nate howell unplugged
nate howell in concert and dk2020 doing the lights.
Cooke Tomlin and Chandler Pittman scored for the Mustangs, while Nathan Howell scored for the Bobcats. Madison's win sets up the 15 A-AA championship matchup with Trinity Christian Academy. The Lions beat Lexington 4-1 in the other semifinal. ...