THE CONSTRUCTION OF TECHNOLOGY PROJECT DETECTION SYSTEM AND ITS ALGORITHM
ABSTRACT
Technology project detection is a form of document copy detection. It makes the declaration of technology projects more standardized and is an important means of judging whether the same project has been submitted repeatedly for an award. To encourage research scientists and stimulate them to realize their full potential, a reward system has been designed to recognize their scientific and technological achievements and to motivate them to contribute with greater enthusiasm in the future. Because the awards are limited in share, the declared technology projects must be checked to guarantee that awards are granted fairly and impartially. This check is an approximate (near-duplicate) detection performed on each declared project.
This thesis first proposes a backward longest-match segmentation algorithm for Chinese sentences, which improves segmentation accuracy. Second, based on the structural characteristics of science and technology projects, it designs an architecture for the project detection system and a corresponding detection algorithm. Different similarity detection algorithms are applied to different project components to calculate their similarity. For example, for the body text, the similarity algorithm is built on a VSM (vector space model) combined with the proposed N-gram, so that similarity can be judged when texts have the same word frequencies but different word order. The thesis also presents the construction and implementation of each functional module of the detection system: the pre-processing module, the content analysis module, the similarity calculation module, and the results display module. In addition, the system provides detection options, so testers may select which project components to check. Finally, a large number of experiments have been carried out on this basis, and they show that the system has strong practicality to h