PLAGIARISM AUTO-DETECTION
#1


[attachment=5158]



PLAGIARISM AUTO-DETECTION


PLAGIARISM AUTO-DETECTION IN ARABIC SCRIPTS USING
STATEMENT-BASED FINGERPRINTS MATCHING AND
FUZZY-SET INFORMATION RETRIEVAL



SALHA MOHAMMED ALZAHRANI
A project report submitted in partial fulfilment of the
requirements for the award of the degree of
Master of Science (Computer Science)



ABSTRACT
Many plagiarism detection techniques and tools have been developed mainly
for English scripts. It has been found that different methods use different document
descriptors ranging from characters to document structure. There is possibly no
research involved in Arabic plagiarism detection although Arabic is the academic
language in Arab universities and schools. Therefore in this study, two techniques
have been developed for Arabic; three least-frequent 4-grams fingerprints matching
and fuzzy-set IR using statement-based document representation. Two statements are
treated as either similar if their fingerprints matched in the first technique, or if the
degree of similarity computed by the second technique exceeded the threshold value.
The corpora used in this study has 100 document collected from Arabic Wikipedia
with 3763 statements and 54346 non-stopped, stemmed words in total. Another 15
query documents with 943 statements were constructed with different degree of
plagiarism. Preprocessing operations were applied on the corpus collection and query
documents, such as removing stop words and stemming. Resulted documents were
stored into a database. In this study, preliminary experiments were carried out using
WCopyFind and a naïve algorithm and results are still accurate, just not optimal.
Thus, more investigation of three least-frequent 4-grams fingerprints matching and
fuzzy-set IR techniques has been done to handle more practices of plagiarism
effectively, such as rewording, rephrasing and restructuring of the statements. Our
results using both techniques with Arabic are as successful as with English taking
into account Arabic natural language processing is much more complex than English.
The main conclusion is that Arabic plagiarism best can be handled with fuzzy-set IR
since it outperforms the three least-frequent 4-grams fingerprints matching in terms
of detecting similar, but not necessarily the same, statements.





Reply

Important Note..!

If you are not satisfied with above reply ,..Please

ASK HERE

So that we will collect data for you and will made reply to the request....OR try below "QUICK REPLY" box to add a reply to this page
Popular Searches: arabic plagiarism, online plagiarism detection free, plagiarism detection for free, plagiarism detection methods, turnitin plagiarism detection free, ppt for plagiarism detection of image, computer plagiarism examples famous,

[-]
Quick Reply
Message
Type your reply to this message here.

Image Verification
Please enter the text contained within the image into the text box below it. This process is used to prevent automated spam bots.
Image Verification
(case insensitive)

Possibly Related Threads...
Thread Author Replies Views Last Post
  SUSPICIOUS EMAIL DETECTION seminar class 11 7,856 21-04-2016, 11:16 AM
Last Post: dhanabhagya
  DATA LEAKAGE DETECTION project topics 16 13,185 31-07-2015, 02:59 PM
Last Post: seminar report asees
  An Acknowledgement-Based Approach for the Detection of routing misbehavior in MANETs mechanical engineering crazy 2 2,991 26-05-2015, 03:04 PM
Last Post: seminar report asees
  An Acknowledgment-Based Approach For The Detection Of Routing Misbehavior In MANETs electronics seminars 7 4,741 27-01-2015, 12:09 AM
Last Post: Guest
  Credit Card Fraud Detection Using Hidden Markov Models alagaddonjuan 28 20,761 04-09-2014, 11:31 PM
Last Post: Charlescic
  Digital Image Processing Techniques for the Detection and Removal of Cracks in Digiti electronics seminars 4 4,911 22-07-2013, 09:37 PM
Last Post: Guest
  OBSTACLE DETECTION AND AVOIDANCE ROBOT seminar surveyer 5 7,617 24-06-2013, 10:44 AM
Last Post: computer topic
  Hybrid Intrusion Detection with Weighted Signature Generation over Anomalous Internet electronics seminars 6 3,327 26-04-2013, 01:58 PM
Last Post: Guest
  Intelligent system for Gas, Human detection and Temperature Monitor control using GSM seminar surveyer 3 3,500 17-04-2013, 11:37 PM
Last Post: [email protected]
  Fuzzy Impulse Noise Detection and Reduction project topics 1 1,864 05-12-2012, 03:58 PM
Last Post: seminar details

Forum Jump: