- Blog posts (634)
- Resources (32)

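Original MachineLP's CV and NLP
Deep learning: have you run into all of these pitfalls?
(1) Detection: Faster RCNN: RPN, anchor, sliding windows; DL notes: the Zen of YOLOV3
(2) Recognition: tf40: image retrieval (triplet_loss) with Conditional Similarity Networks
(3) OCR: OCR survey; OCR project git: https://github.com/MachineLP/OCR_repo
1. tf20: CNN for recognizing character CAPTCHAs
2. ID card recognition: generating ID card numbers and Chinese characters...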
2020-05-17 11:07:03
1862

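Original MachineLP Blog Index
MachineLP: Things develop according to their own trends and patterns. When you are in the middle of a trend, seize the opportunity it offers and find a way to stand out; even if you do not succeed, you will understand the pulse of the times better and gain valuable knowledge and experience. If the trend has already receded, pushing in that direction only brings confusion and frustration, and helps neither the times nor yourself. But the waves of the times are like waves on a beach, one always following another; as long as you stand by the sea, inside this industry, ...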
2017-11-02 10:48:28
39676
9
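Original Stable Diffusion: How does it work?
..., clearly the diffusion process takes place in the latent space; in effect, the image is compressed. Stable Diffusion first trains an autoencoder that learns to compress images into a low-dimensional representation. Stable Diffusion's original name was "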
2024-05-01 11:19:55
307
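Original Diffusion: How does it work?
Which technologies are behind the explosive rise of AIGC? The first to mention is the GAN (generative adversarial network): as the name suggests, two models inside the AI are pitted against each other in order to generate results as close to reality as possible. The algorithm has a serious problem, however. Because the standard the models are compared against is a set of existing samples, the generated content is essentially an imitation that approaches existing content ever more closely, and imitation means it cannot truly surpass or break through. To address the shortcomings of GANs, researchers proposed ..., which is the technical core of many of the AIGC image generation models that have emerged this year, including the open-source Stable Diffusion model.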
2024-05-01 11:09:43
1245
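Original GPT-1, GPT-2, GPT-3, InstructGPT / ChatGPT and GPT-4: A Summary
Before GPT-1, NLP usually relied on supervised models. Each task had some labeled data, and a supervised model was developed from that data. This approach has several problems. First, it requires labeled data, and NLP, unlike CV, has no well-labeled dataset comparable to ImageNet. Second, the models trained for these different tasks are not very general. For example, a model trained for translation is hard to use directly for classification.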
2024-03-17 14:52:04
937
Original OpenCV: Finding Connected Components in Binary Images
Python code:
import cv2 as cv
import numpy as np

def connected_components_demo(src):
    src = cv.GaussianBlur(src, (3, 3), 0)
    gray = cv.cvtColor(src, cv.COLOR_BGR2GRAY)
    ret, binary = cv.threshold(gray, 0, 255, cv.THRESH_BINARY | cv.THRESH_OTSU)
2021-08-11 15:04:03
664
1
Original OpenCV: Image Binarization and Denoising
Python code:
import cv2 as cv
import numpy as np

def method_1(image):
    gray = cv.cvtColor(image, cv.COLOR_BGR2GRAY)
    t, binary = cv.threshold(gray, 0, 255, cv.THRESH_BINARY | cv.THRESH_OTSU)
    return binary

def method_2(image):
    blurred = .
2021-08-11 14:59:18
1758
Original OpenCV: Adaptive Thresholding for Image Binarization
Python code:
import cv2 as cv
import numpy as np
#
# THRESH_BINARY = 0
# THRESH_BINARY_INV = 1
# THRESH_TRUNC = 2
# THRESH_TOZERO = 3
# THRESH_TOZERO_INV = 4
#
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src
2021-08-11 14:56:29
656
Original OpenCV: Threshold Search for Image Binarization – TRIANGLE
Python code:
import cv2 as cv
import numpy as np
#import tensorflow as tf
# tf.enable_eager_execution()
#
# THRESH_BINARY = 0
# THRESH_BINARY_INV = 1
# THRESH_TRUNC = 2
# THRESH_TOZERO = 3
# THRESH_TOZERO_INV = 4
#
src = cv.imread("./test.png")
cv.namedWin
2021-08-11 14:54:19
685
Original OpenCV: Threshold Search for Image Binarization – OTSU
Python code:
import cv2 as cv
import numpy as np
#
# THRESH_BINARY = 0
# THRESH_BINARY_INV = 1
# THRESH_TRUNC = 2
# THRESH_TOZERO = 3
# THRESH_TOZERO_INV = 4
#
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
2021-08-11 14:52:31
271
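The preview above is cut off before the thresholding call, so the following is only a minimal sketch of what Otsu's method computes, not the post's code. It is written in pure Python instead of calling `cv.threshold` with `cv.THRESH_OTSU`; the function name `otsu_threshold` and the sample pixel values are assumptions made for illustration.

```python
def otsu_threshold(pixels):
    """Return the gray level t (0-255) that maximizes between-class variance.

    Pixels <= t form one class and pixels > t the other.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with value <= t
    sum0 = 0  # sum of those pixel values
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue  # one class is empty, variance is undefined
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (m0 - m1) ** 2
        if var_between > best_var:
            best_t, best_var = t, var_between
    return best_t


# Hypothetical bimodal data: a dark cluster and a bright cluster.
pixels = [10] * 50 + [12] * 50 + [200] * 40 + [210] * 60
t = otsu_threshold(pixels)
binary = [255 if p > t else 0 for p in pixels]
print(t, binary.count(255))
```

Any threshold between the two clusters separates them equally well; this sketch keeps the first such level it encounters.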
Original OpenCV: Basic Threshold Operations
Python code:
import cv2 as cv
import numpy as np
#
# THRESH_BINARY = 0
# THRESH_BINARY_INV = 1
# THRESH_TRUNC = 2
# THRESH_TOZERO = 3
# THRESH_TOZERO_INV = 4
#
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
2021-08-11 14:51:02
209
Original OpenCV: Introduction to Binary Images
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
T = 127
# Convert to a grayscale image
gray = cv.cvtColor(src, cv.COLOR_BGR2GRAY)
h, w = gray.shape
T = cv.mean(gray)[0]
print("
2021-08-11 14:49:20
614
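The preview above computes the mean gray level `T` and is then truncated. As a hedged illustration of where such a threshold is typically used, the sketch below binarizes a tiny grayscale image at its mean in pure Python. The function name `mean_threshold` and the sample image are assumptions, and the comparison rule (`> T` maps to 255) is a common convention, not something the preview confirms.

```python
def mean_threshold(gray):
    """Binarize a grayscale image (list of rows of 0-255 ints) at its mean level."""
    pixels = [p for row in gray for p in row]
    t = sum(pixels) / len(pixels)
    # Pixels brighter than the mean become white, the rest black.
    binary = [[255 if p > t else 0 for p in row] for row in gray]
    return t, binary


# Hypothetical 2x3 image.
gray = [[10, 20, 200],
        [30, 220, 240]]
t, binary = mean_threshold(gray)
print(t)       # 120.0
print(binary)  # [[0, 0, 255], [0, 255, 255]]
```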
Original OpenCV: Image Template Matching
Python code:
import cv2 as cv
import numpy as np

def template_demo():
    src = cv.imread("./test.png")
    tpl = cv.imread("./test01.png")
    cv.imshow("input", src)
    cv.imshow("tpl", tpl)
    th, tw = tpl.shape[:2]
    result = cv.matchTemplate(src,
2021-08-11 14:47:33
326
Original OpenCV: Laplacian Pyramid
Python code:
import cv2 as cv
import numpy as np

def laplaian_demo(pyramid_images):
    level = len(pyramid_images)
    for i in range(level-1, -1, -1):
        if (i-1) < 0:
            h, w = src.shape[:2]
            expand = cv.pyrUp(pyramid_images
2021-08-11 14:46:04
321
Original OpenCV: Image Pyramids
Python code:
import cv2 as cv

def pyramid_down(pyramid_images):
    level = len(pyramid_images)
    print("level = ",level)
    for i in range(level-1, -1, -1):
        expand = cv.pyrUp(pyramid_images[i])
        cv.imshow("pyramid_down_"+str(i), expand)
2021-07-30 15:57:19
187
Original OpenCV: Canny Edge Detector
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
# t1 = 100, t2 = 3*t1 = 300
edge = cv.Canny(src, 100, 300)
cv.imshow("mask image", edge)
cv.imwrite("./edge.
2021-07-30 15:51:50
396
Original OpenCV: USM Sharpening Enhancement Algorithm
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
# sigma = 5, 15, 25
blur_img = cv.GaussianBlur(src, (0, 0), 5)
usm = cv.addWeighted(src, 1.5, blur_img, -0.5,
2021-07-30 15:50:43
619
Original OpenCV: Image Sharpening
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
# sharpen_op = np.array([[-1, -1, -1], [-1, 9, -1], [-1, -1, -1]], dtype=np.float32)
sharpen_op = np.array([
2021-07-23 15:31:06
1068
1
Original OpenCV: Image Gradients – Laplacian Operator (Second-Derivative Operator)
Python code:
import cv2 as cv
import numpy as np
image = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", image)
h, w = image.shape[:2]
src = cv.GaussianBlur(image, (0, 0), 1)
dst = cv.Laplacian(src, cv.CV_32F, ksize=
2021-07-23 15:11:05
810
Original OpenCV: More Gradient Operators
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
robert_x = np.array([[1, 0],[0, -1]], dtype=np.float32)
robert_y = np.array([[0, -1],[1, 0]], dtype=np.float
2021-07-21 20:18:32
163
Original OpenCV: Sobel Operator
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
h, w = src.shape[:2]
x_grad = cv.Sobel(src, cv.CV_32F, 1, 0)
y_grad = cv.Sobel(src, cv.CV_32F, 0, 1)
x_grad
2021-07-21 20:10:41
324
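To make the two `cv.Sobel` calls in the preview above more concrete, here is a minimal pure-Python sketch of the standard 3x3 Sobel kernels applied to a small grayscale image. It is an illustration under stated assumptions, not the post's code: the names `KX`, `KY`, `correlate3` and `sobel` are hypothetical, and border pixels are simply left at zero, whereas OpenCV extrapolates the border.

```python
# Standard 3x3 Sobel kernels: KX responds to horizontal intensity changes
# (vertical edges), KY to vertical changes (horizontal edges).
KX = [[-1, 0, 1],
      [-2, 0, 2],
      [-1, 0, 1]]
KY = [[-1, -2, -1],
      [0, 0, 0],
      [1, 2, 1]]


def correlate3(img, kernel, y, x):
    """Weighted sum of the 3x3 neighbourhood centred on (y, x)."""
    return sum(kernel[dy + 1][dx + 1] * img[y + dy][x + dx]
               for dy in (-1, 0, 1) for dx in (-1, 0, 1))


def sobel(img):
    """Return (gx, gy) gradient images; border pixels stay 0 in this sketch."""
    h, w = len(img), len(img[0])
    gx = [[0] * w for _ in range(h)]
    gy = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx[y][x] = correlate3(img, KX, y, x)
            gy[y][x] = correlate3(img, KY, y, x)
    return gx, gy


# Hypothetical image with a single vertical edge between columns 1 and 2.
img = [[0, 0, 10, 10] for _ in range(4)]
gx, gy = sobel(img)
print(gx[1][1], gy[1][1])  # 40 0
```

The vertical edge shows up only in the x gradient, which is why the two directions are usually computed separately and combined afterwards.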
Original OpenCV: Custom Filters
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
blur_op = np.ones([5, 5], dtype=np.float32)/25.
shape_op = np.array([[0, -1, 0], [-1, 5,
2021-07-21 20:06:07
505
1
Original OpenCV: A Fast Edge-Preserving Image Filtering Algorithm
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
h, w = src.shape[:2]
dst = cv.edgePreservingFilter(src, sigma_s=100, sigma_r=0.4, flags=cv.RECURS_FILTER)
re
2021-07-21 20:04:05
454
1
Original OpenCV: Integral Image Algorithm
Python code:
import cv2 as cv
import numpy as np

def get_block_sum(ii, x1, y1, x2, y2, index):
    tl = ii[y1, x1][index]
    tr = ii[y2, x1][index]
    bl = ii[y1, x2][index]
    br = ii[y2, x2][index]
    s = (br - bl - tr + tl)
    return s

def blur_
2021-07-21 19:53:27
234
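The `get_block_sum` helper in the preview above reads four corners of an integral image. The sketch below shows, in pure Python and for a single channel, how such a table can be built and queried. The names `integral_image` and `block_sum` are hypothetical, and the table carries an extra zero row and column, which is the layout `cv.integral` also uses.

```python
def integral_image(img):
    """Summed-area table with an extra zero row and column.

    ii[y][x] holds the sum of all img[r][c] with r < y and c < x.
    """
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def block_sum(ii, x1, y1, x2, y2):
    """Sum of the pixels in rows y1..y2-1 and columns x1..x2-1, from four lookups."""
    return ii[y2][x2] - ii[y1][x2] - ii[y2][x1] + ii[y1][x1]


# Hypothetical 3x3 image.
img = [[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]]
ii = integral_image(img)
print(block_sum(ii, 0, 0, 3, 3))  # 45, the whole image
print(block_sum(ii, 1, 1, 3, 3))  # 28, the bottom-right 2x2 block
```

Because every block sum costs four lookups regardless of block size, a box blur built on this table runs in time independent of the kernel size.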
Original [Speech Recognition] Adding Punctuation to Text -- BERT Prediction
Adding punctuation to text -- training BLSTM
bert4keras==0.5.9
# -*- coding:utf-8 -*-
import os
import re
import sys
import time
import jieba
import gensim
import logging
import numpy as np
import pandas as pd
from tqdm import tqdm
import multiprocessing
from bert4keras.snippets
2021-07-19 20:07:29
1112
3
Original [Speech Recognition] Adding Punctuation to Text -- Training BERT
Adding punctuation to text -- training BLSTM
bert4keras==0.5.9
# -*- coding:utf-8 -*-
import os
import re
import sys
import time
import jieba
import gensim
import logging
import numpy as np
import pandas as pd
from tqdm import tqdm
import multiprocessing
from bert4keras.snippe.
2021-07-19 20:04:29
507
Original [Speech Recognition] Adding Punctuation to Text -- LSTM Prediction
Adding punctuation to text -- training BLSTM
bert4keras==0.5.9
# -*- coding:utf-8 -*-
import os
import re
import sys
import time
import jieba
import gensim
import logging
import numpy as np
import pandas as pd
from tqdm import tqdm
import multiprocessing
from bert4keras.snippets
2021-07-19 20:00:23
696
1
Original [Speech Recognition] Adding Punctuation to Text -- Training BLSTM+ATTENTION
Adding punctuation to text -- training BLSTM
bert4keras==0.5.9
# -*- coding:utf-8 -*-
import os
import re
import sys
import time
import jieba
import gensim
import logging
import numpy as np
import pandas as pd
from tqdm import tqdm
import multiprocessing
from bert4keras.snippets
2021-07-19 19:51:20
374
Original [Speech Recognition] Adding Punctuation to Text -- Training BLSTM
Adding punctuation to text -- training BLSTM
bert4keras==0.5.9
# -*- coding:utf-8 -*-
import os
import re
import sys
import time
import jieba
import gensim
import logging
import numpy as np
import pandas as pd
from tqdm import tqdm
import multiprocessing
from bert4keras.snippets
2021-07-19 19:49:19
292
Original [Speech Recognition] Adding Punctuation to Text -- Training LSTM
Adding punctuation to text -- training LSTM
Keras version:
# -*- coding:utf-8 -*-
import os
import re
import sys
import time
import jieba
import gensim
import logging
import numpy as np
import pandas as pd
from tqdm import tqdm
import multiprocessing
from bert4keras.snippets import ..
2021-07-19 19:46:16
434
1
Original [Speech Recognition] Adding Punctuation to Text -- Preparing Training Data (for LSTM and BERT)
Paper:
U2: Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition
WeNet: Production First and Production Ready End-to-End Speech Recognition Toolkit, v1.
WeNet: Production Oriented Streaming and Non-streaming En.
2021-07-19 19:19:19
603
2
Original OpenCV: Edge-Preserving Filtering – Mean-Shift Blur
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
h, w = src.shape[:2]
dst = cv.pyrMeanShiftFiltering(src, 15, 30, termcrit=(cv.TERM_CRITERIA_MAX_ITER+cv.TERM
2021-07-18 15:06:29
288
Original OpenCV: Edge-Preserving Filtering – Gaussian Bilateral Blur
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
h, w = src.shape[:2]
dst = cv.bilateralFilter(src, 0, 100, 10)
result = np.zeros([h, w*2, 3], dtype=src.dtyp
2021-07-18 15:03:03
198
Original OpenCV: Image Denoising
Python code:
import cv2 as cv
import numpy as np

def add_salt_pepper_noise(image):
    h, w = image.shape[:2]
    nums = 10000
    # np.int was removed in NumPy 1.24; the builtin int is the equivalent dtype
    rows = np.random.randint(0, h, nums, dtype=int)
    cols = np.random.randint(0, w, nums, dtype=int)
2021-07-18 14:53:00
1082
Original OpenCV: Image Noise
Python code:
import cv2 as cv
import numpy as np

def add_salt_pepper_noise(image):
    h, w = image.shape[:2]
    nums = 10000
    # np.int was removed in NumPy 1.24; the builtin int is the equivalent dtype
    rows = np.random.randint(0, h, nums, dtype=int)
    cols = np.random.randint(0, w, nums, dtype=int)
    for i in rang
2021-07-18 14:36:00
744
Original OpenCV: Median Blur
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
dst = cv.medianBlur(src, 5)
cv.imshow("blur ksize=5", dst)
cv.waitKey(0)
cv.destroyAllWindows()
#include <open.
2021-07-18 14:32:05
180
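The preview above relies on `cv.medianBlur(src, 5)`. As a rough illustration of what a median filter does to isolated outliers, here is a pure-Python sketch with a 3x3 window instead of the 5x5 one in the post. The function name `median_blur3` and the test image are assumptions, and border pixels are copied unchanged, which is a simplification compared with OpenCV's border handling.

```python
from statistics import median


def median_blur3(img):
    """3x3 median filter on a grayscale image; border pixels are copied unchanged."""
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            window = [img[y + dy][x + dx]
                      for dy in (-1, 0, 1) for dx in (-1, 0, 1)]
            out[y][x] = median(window)
    return out


# Hypothetical flat gray image with one "salt" and one "pepper" pixel.
img = [[100] * 5 for _ in range(5)]
img[2][2] = 255  # salt
img[1][1] = 0    # pepper
out = median_blur3(img)
print(out[2][2], out[1][1])  # 100 100
```

Both outliers are replaced by the surrounding gray level, which is why median filtering is a common choice against salt-and-pepper noise.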
Original OpenCV: Mean Blur and Gaussian Blur
Python code:
import cv2 as cv
import numpy as np
src = cv.imread("./test.png")
cv.namedWindow("input", cv.WINDOW_AUTOSIZE)
cv.imshow("input", src)
dst1 = cv.blur(src, (5, 5))
dst2 = cv.GaussianBlur(src, (5, 5), sigmaX=15)
dst3 = cv.GaussianBlur(src, (0,
2021-07-18 14:30:45
166
Original OpenCV: Image Convolution
Python code:
import cv2 as cv
import numpy as np

def custom_blur(src):
    h, w, ch = src.shape
    print("h , w, ch", h, w, ch)
    result = np.copy(src)
    for row in range(1, h-1, 1):
        for col in range(1, w-1, 1):
            v1 = np.int32(src[
2021-07-18 14:29:56
398
Original OpenCV: Histogram Back Projection
Python code:
import cv2 as cv
import numpy as np
from matplotlib import pyplot as plt

def back_projection_demo():
    sample = cv.imread("./Mat.png")
    # hist2d_demo(sample)
    target = cv.imread("./test.png")
    # hist2d_demo(target)
    roi_hsv = cv
2021-07-18 14:23:58
195
Original [Speech Recognition] kaldi -- aidatatang_200zh Script Walkthrough: Triphones in Detail
# train tri1 [first triphone pass]
steps/train_deltas.sh --cmd "$train_cmd" \
2500 20000 data/train data/lang exp/mono_ali exp/tri1 || exit 1;
# steps/train_deltas.sh <num-leaves> <tot-gauss> <data-dir> <lang-dir> <alignment-di.
2021-07-17 00:30:12
431
Face Recognition(face_recognition) Using Hadoop Streaming API
2017-10-08
Caffe deep learning notes by Xue Kaiyu, worked example: sound recognition based on convolutional neural networks
2016-12-27
Deep Learning textbook
2016-12-09
Generating the info.txt file when training an XML file
2016-11-01
Stanford University machine learning course: complete personal notes
2016-04-19