使用clonedigger来检查python中的重复代码

最新推荐文章于 2024-02-13 08:18:46 发布

ace_fei

最新推荐文章于 2024-02-13 08:18:46 发布

阅读量1.9k

点赞数

分类专栏： Python 文章标签： python

本文链接：https://blog.csdn.net/ace_fei/article/details/52513281

版权

Python 专栏收录该内容

33 篇文章 0 订阅

订阅专栏

安装Clonedigger

$ sudo pip install clonedigger
$ clonedigger -help
Usage: To run Clone Digger type:
python clonedigger.py [OPTION]... [SOURCE FILE OR DIRECTORY]...

The typical usage is:
python clonedigger.py source_file_1 source_file_2 ...
  or
python clonedigger.py path_to_source_tree
Don't forget to remove automatically generated sources, tests and third party libraries from the source tree.

Notice:
The semantics of threshold options is discussed in the paper "Duplicate code detection using anti-unification", which can be downloaded from the site http://clonedigger.sourceforge.net . All arguments are optional. Supported options are:


Options:
  -h, --help            show this help message and exit
  -l LANGUAGE, --language=LANGUAGE
                        the programming language
  --no-recursion        do not traverse directions recursively
  -o OUTPUT, --output=OUTPUT
                        the name of the output file ("output.html" by default)
  --clustering-threshold=CLUSTERING_THRESHOLD
                        read the paper for semantics
  --distance-threshold=DISTANCE_THRESHOLD
                        the maximum amount of differences between pair of
                        sequences in clone pair (5 by default). Larger value
                        leads to larger amount of false positives
  --hashing-depth=HASHING_DEPTH
                        default value if 1, read the paper for semantics.
                        Computation can be speeded up by increasing this value
                        (but some clones can be missed)
  --size-threshold=SIZE_THRESHOLD
                        the minimum clone size. The clone size for its turn is
                        equal to the count of lines of code in its the largest
                        fragment
  --clusterize-using-dcup
                        mark each statement with its D-cup value instead of
                        the most similar pattern. This option together with
                        --hashing-depth=0 make it possible to catch all
                        considered clones (but it is slow and applicable only
                        to small programs)
  --dont-print-time     do not print time
  -f, --force
  --force-diff          force highlighting of differences based on the diff
                        algorithm
  --fast                find only clones, which differ in variable and
                        function names and constants
  --ignore-dir=IGNORE_DIRS
                        exclude directories from parsing
  --eclipse-output=ECLIPSE_OUTPUT
                        for internal usage only
  --cpd-output          output as PMDs CPDs XML format. If output file not
                        defined, output.xml is generated
  --report-unifiers
  --func-prefixes=F_PREFIXES
                        skip functions/methods with these prefixes (provide a
                        CSV string as argument)
  --file-list=FILE_LIST
                        a file that contains a list of file names that must be
                        processed by Clone Digger

使用Clonedigger

clonedigger -l python --ignore-dir=/path_to/exclude_file -o ./output.html -f --size-threshold=10 /path_to/src/

集成到Jenkins，使用violations插件生成trend图

$ clonedigger  --cpd-output -o ./cpdout.xml  /path_to/src/

这里写图片描述

ace_fei

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
使用clonedigger来检查python中的重复代码

安装Clonedigger$ sudo pip install clonedigger$ clonedigger -helpUsage: To run Clone Digger type:python clonedigger.py [OPTION]... [SOURCE FILE OR DIRECTORY]...The typical usage is:python clonedigger.
复制链接

扫一扫