Main Phases
Graph Extraction, Sentence Embedding, Image Generation, Classification
1. Graph Extraction
Main idea: normalize the function's source code, then perform static analysis to extract the function's program dependence graph (PDG).
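The normalization step could look roughly like the sketch below. It assumes C-like source, and it assumes normalization means stripping comments and mapping user-defined names to symbolic ones; the notes do not specify either. The function name `normalize_function`, the `FUN`/`VAR` naming scheme, and the caller-supplied name lists are all hypothetical. PDG extraction itself is not shown, since it would be delegated to a static-analysis tool.

```python
import re

def normalize_function(source: str, user_vars=(), user_funcs=()) -> str:
    """Strip comments and rename user-defined identifiers (illustrative only)."""
    # Remove /* ... */ and // comments. Naive: does not protect string literals.
    source = re.sub(r"/\*.*?\*/", "", source, flags=re.DOTALL)
    source = re.sub(r"//[^\n]*", "", source)

    # Hypothetical naming scheme: FUN1, FUN2, ... and VAR1, VAR2, ...
    mapping = {name: f"FUN{i}" for i, name in enumerate(user_funcs, 1)}
    mapping.update({name: f"VAR{i}" for i, name in enumerate(user_vars, 1)})

    def rename(match):
        # Identifiers that are not in the mapping (keywords, types) are kept.
        return mapping.get(match.group(0), match.group(0))

    source = re.sub(r"[A-Za-z_]\w*", rename, source)

    # Drop blank lines and trailing whitespace.
    lines = [line.rstrip() for line in source.splitlines() if line.strip()]
    return "\n".join(lines)
```

For example, calling it on `int add(int a, int b) { ... }` with `user_vars=["a", "b"]` and `user_funcs=["add"]` rewrites the signature in terms of `FUN1`, `VAR1`, and `VAR2`, so that functions differing only in naming normalize to the same text.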
2. Sentence Embedding
Main idea: treat each line of code as a sentence and embed it into a fixed-length vector.
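To make "fixed-length vector" concrete, here is a minimal stand-in that hashes the tokens of a line into a small bag-of-tokens vector. This is not a trained sentence-embedding model and is not claimed to be the method used in this pipeline; `embed_line`, the tokenizer regex, and the dimension of 8 are assumptions chosen only to show that lines of any length map to vectors of the same size.

```python
import hashlib
import re

def embed_line(line: str, dim: int = 8) -> list:
    """Map one line of code to a dim-length vector (hashing stand-in)."""
    vec = [0.0] * dim
    # Split into identifiers, numbers, and single punctuation characters.
    tokens = re.findall(r"[A-Za-z_]\w*|\d+|\S", line)
    for tok in tokens:
        # A stable hash, so the same token always lands in the same slot.
        digest = hashlib.md5(tok.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % dim] += 1.0
    # Average over tokens so long and short lines are on the same scale.
    return [v / len(tokens) for v in vec] if tokens else vec
```

A learned embedding would place semantically similar lines close together, which this hashing sketch does not attempt.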
3. Image Generation
Main idea: after sentence embedding, apply centrality analysis to adjust each vector so that it reflects the relative importance of its sentence (that is, its node in the graph). Three centrality measures correspond to the three channels of an image, and together they form the image.
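One way this could work is sketched below. The notes do not name the three centralities or say how a vector is adjusted, so the choice of degree, Katz, and closeness centrality, and the choice to multiply each line's embedding by its centrality score, are assumptions. All function names are hypothetical. Nodes are line indices, edges are directed PDG dependencies, and `embeddings` maps each node to its vector.

```python
from collections import deque

def degree_centrality(nodes, edges):
    """Incident edges (in + out) per node, scaled by n - 1."""
    deg = {v: 0 for v in nodes}
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    scale = max(len(nodes) - 1, 1)
    return {v: deg[v] / scale for v in nodes}

def closeness_centrality(nodes, edges):
    """Closeness on the undirected view of the graph, via BFS from each node."""
    adj = {v: set() for v in nodes}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    result = {}
    for src in nodes:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            cur = queue.popleft()
            for nxt in adj[cur]:
                if nxt not in dist:
                    dist[nxt] = dist[cur] + 1
                    queue.append(nxt)
        reached, total = len(dist) - 1, sum(dist.values())
        # Scale by the fraction of nodes reached so small components score lower.
        frac = reached / max(len(nodes) - 1, 1)
        result[src] = (reached / total) * frac if total else 0.0
    return result

def katz_centrality(nodes, edges, alpha=0.1, beta=1.0, iterations=50):
    """Fixed-iteration Katz score: beta plus alpha times predecessors' scores."""
    preds = {v: [] for v in nodes}
    for u, v in edges:
        preds[v].append(u)
    score = {v: beta for v in nodes}
    for _ in range(iterations):
        score = {v: beta + alpha * sum(score[u] for u in preds[v]) for v in nodes}
    return score

def build_image(nodes, edges, embeddings):
    """Return a 3 x len(nodes) x dim nested list, one channel per centrality."""
    channels = []
    for centrality in (degree_centrality, katz_centrality, closeness_centrality):
        weights = centrality(nodes, edges)
        channels.append([[weights[v] * x for x in embeddings[v]] for v in nodes])
    return channels
```

Each channel has one row per line of code, so a line that many other lines depend on appears with larger values than a peripheral one. The Katz iteration here uses a fixed `alpha` and a fixed number of steps, which is adequate for small graphs but is not a general convergence guarantee.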
4. Classification
Main idea: train a CNN model on the generated images to detect vulnerabilities.
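The sketch below shows only the forward pass of a very small CNN over such an image, written in plain Python so it stays self-contained. The architecture is an assumption: each kernel spans `k` consecutive lines across the full vector width and all three channels, followed by ReLU, max pooling over positions, and a sigmoid output. The names `conv_over_lines` and `predict` are hypothetical, the weights are placeholders, and training (for example, gradient descent on labeled functions in a deep-learning framework) is not shown.

```python
import math

def conv_over_lines(image, kernel, bias=0.0):
    """Slide a kernel of shape [channels][k][dim] down the line axis, then ReLU."""
    k = len(kernel[0])
    num_lines = len(image[0])
    outputs = []
    for start in range(num_lines - k + 1):
        total = bias
        for c, channel in enumerate(image):
            for i in range(k):
                row = channel[start + i]
                total += sum(w * x for w, x in zip(kernel[c][i], row))
        outputs.append(max(total, 0.0))  # ReLU
    return outputs

def predict(image, kernels, out_weights, out_bias=0.0):
    """Return a score between 0 and 1; higher means more likely vulnerable."""
    # Max-pool each kernel's responses into one feature (0.0 if image is too short).
    features = [max(conv_over_lines(image, kern), default=0.0) for kern in kernels]
    logit = out_bias + sum(w * f for w, f in zip(out_weights, features))
    return 1.0 / (1.0 + math.exp(-logit))
```

Max pooling over line positions lets functions with different numbers of lines produce a feature vector of the same size, one entry per kernel.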