Student Seminar Report & Project Report With Presentation (PPT,PDF,DOC,ZIP)

Full Version: spam detection algorithm source code pdf
You're currently viewing a stripped down version of our content. View the full version with proper formatting.

Guest

spam detection algorithm source code pdf
ABSTRACT:

Compromised machines are one of the key security threats on the Internet; they are often used to launch various security attacks such as spamming and spreading malware, DDoS, and identity theft. Given that spamming provides a key economic incentive for attackers to recruit the large number of compromised machines, we focus on the detection of the compromised machines in a network that are involved in the spamming activities, commonly known as spam zombies. We develop an effective spam zombie detection system named SPOT by monitoring outgoing messages of a network. SPOT is designed based on a powerful statistical tool called Sequential Probability Ratio Test, which has bounded false positive and false negative error rates. In addition, we also evaluate the performance of the developed SPOT system using a two-month e-mail trace collected in a large US campus network. Our evaluation studies show that SPOT is an effective and efficient system in automatically detecting compromised machines in a network.

For example, among the 440 internal IP addresses observed in the e-mail trace, SPOT identifies 132 of them as being associated with compromised machines. Out of the 132 IP addresses identified by SPOT, 126 can be either independently confirmed (110) or highly likely (16) to be compromised. Moreover, only seven internal IP addresses associated with compromised machines in the trace are missed by SPOT. In addition, we also compare the performance of SPOT with two other spam zombie detection algorithms based on the number and percentage of spam messages originated or forwarded by internal machines, respectively, and show that SPOT outperforms these two detection algorithms.

EXISTING SYSTEM:

Major security challenge on the Internet is the existence of the large number of compromised machines. Such machines have been increasingly used to launch various security attacks including spamming and spreading malware, DDoS, and identity theft

DISADVANTAGES OF EXISTING SYSTEM:

They are often used to launch various security attacks such as spamming and spreading malware, DDoS, and identity theft.

A major security challenge on the Internet is the existence of the large number of compromised machines.

Their approaches are better suited for large e-mail service providers to understand the aggregate global characteristics of spamming botnets instead of being deployed by individual networks to detect internal compromised machines. Moreover, their approaches cannot support the online detection requirement in the network environment considered in this paper.

The existing algorithm is less effective.

Identifying and cleaning compromised machines in a network remain a significant challenge for system administrators of networks of all sizes.

PROPOSED SYSTEM:

In this paper, we focus on the detection of the compromised machines in a network that are used for sending spam messages, which are commonly referred to as spam zombies.

The nature of sequentially observing outgoing messages gives rise to the sequential detection problem. In this paper, we will develop a spam zombie detection system, named SPOT, by monitoring outgoing messages. SPOT is designed based on a statistical method called Sequential Probability Ratio Test (SPRT), As a simple and powerful statistical method, SPRT has a number of desirable features. It minimizes the expected number of observations required to reach a decision among all the sequential and non-sequential statistical tests with no greater error rates. This means that the SPOT detection system can identify a compromised machine quickly.

In proposed system to develop an effective spam zombie detection system named SPOT.

SPOT is used to monitoring outgoing messages of a network.

SPOT is designed based on a statistical method called sequential probability ratio test (SPRT).

SPOT can be used to test between two hypotheses whether the machine is compromised or not.

ADVANTAGES OF PROPOSED SYSEM:

SPOT is an effective and efficient system in automatically detecting compromised machines in a network. For example, among the 440 internal IP addresses observed in the e-mail trace, SPOT identifies 132 of them as being associated with compromised machines. Out of the 132 IP addresses identified by SPOT, 126 can be either independently confirmed (110) or are highly likely (16) to be compromised.

SPOT has bounded false positive and false negative error rates.

It also minimizes the number of required observations to detect a spam zombie.