The CUDA code is 417 times faster than the CPU code.
poisson5point (no preconditioner)
1000*1000 cg: 1021, converged
gmres(50): 261
gmres(100): 217
cg_m: 1269, run times are all about the same
bicgstab: infinity, does not converge
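
For reference, the unpreconditioned cg run on the 5-point Poisson problem can be sketched in a few lines of NumPy. This is only an illustration, not the CUDA code measured here: the stencil scaling (4 on the diagonal, -1 per neighbor), the zero boundary values, the all-ones right-hand side, the relative tolerance of 1e-6, and the names apply_poisson5pt and cg are all assumptions. The grid is kept small (32*32), so the iteration count will not match the numbers listed above.

```python
import numpy as np

def apply_poisson5pt(u):
    """Apply the 5-point stencil (4 on the diagonal, -1 per neighbor) to a 2D grid.

    Values outside the grid are treated as zero (assumed boundary condition).
    """
    out = 4.0 * u
    out[1:, :] -= u[:-1, :]   # neighbor in the previous row
    out[:-1, :] -= u[1:, :]   # neighbor in the next row
    out[:, 1:] -= u[:, :-1]   # neighbor in the previous column
    out[:, :-1] -= u[:, 1:]   # neighbor in the next column
    return out

def cg(b, tol=1e-6, max_iter=5000):
    """Plain conjugate gradient without a preconditioner, starting from x = 0.

    Returns (x, iterations, converged). Stops when ||r|| <= tol * ||b||.
    """
    b = np.asarray(b, dtype=float)
    x = np.zeros_like(b)
    r = b.copy()              # residual b - A x for x = 0
    p = r.copy()              # first search direction
    rs = np.sum(r * r)
    b_norm = np.sqrt(np.sum(b * b))
    for k in range(1, max_iter + 1):
        Ap = apply_poisson5pt(p)
        alpha = rs / np.sum(p * Ap)
        x += alpha * p
        r -= alpha * Ap
        rs_new = np.sum(r * r)
        if np.sqrt(rs_new) <= tol * b_norm:
            return x, k, True
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x, max_iter, False

# Small demo grid (assumed size; the notes above use 1000*1000).
x, iters, converged = cg(np.ones((32, 32)))
print("iterations:", iters, "converged:", converged)
```

The operator is applied matrix-free here to keep the sketch short; a sparse-matrix version would behave the same way for cg.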
smooth_aggregate: 2.9 seconds
Preconditioner statistics
Number of Levels: 5
Operator Complexity: 1.32831745
Grid Complexity: 1.148772
level unknowns nonzeros: