Voiceprint Recognition

Prospect of Voiceprint Recognition Based on Deep Learning

Shuaijie Shan1 , Jianbao Liu1* and Yaowu Dun1 

1 College of Electrical Engineering, Naval University of Engineering, Wuhan, Hubei,

430000, China

* Corresponding author’s e-mail: [email protected]; [email protected]

Abstract. As a biometric technology, voiceprint recognition is as unique as human fingerprint

and pupil, so voiceprint recognition has great potential in practical application. Aiming at the

feature extraction method and voiceprint recognition method, this paper first introduces the

principle of voiceprint recognition, traditional MFCC, LPDA and other feature parameters and

their performance; secondly, the traditional voiceprint recognition methods such as GMM and

GMM-SVM are introduced, as well as their shortcomings and improvement schemes. Aiming at

the shortcomings of voiceprint recognition system based on traditional algorithms in accuracy,

robustness and real-time, this paper introduces the role of deep learning neural network in

different stages of voiceprint recognition, and introduces the characteristics and network

structure of some typical algorithms based on deep learning. Finally, according to the advantages

and disadvantages of deep learning in voiceprint recognition, the development prospect and

challenges of voiceprint recognition technology are analysed.

1. Introduction

As a biometric technology, voiceprint recognition can uniquely identify a person, so it is also known as

speaker recognition technology[1]. Compared with other biometric technologies, it has obvious

advantages of convenient data collection, simple equipment and high secrecy.

Voiceprint recognition was first proposed for the feasibility of human ear recognition mechanism

and machine listening recognition[2]. Voiceprint recognition technology can be divided into speaker

recognition technology and speaker confirmation technology according to different functions, and can

be divided into text correlation, text restriction and text independence according to different audio

content[3]. The early voiceprint recognition technology is mostly based on template matching[4],

probability equation analysis, dynamic time warping[5] and other methods; feature parameter extraction

is based on cepstrum, Fourier transform[6,7], MFCC and LPC[8]. In recent years, voiceprint features

are mostly based on the feature vector represented by I-vector[9]; recognition is based on Gaussian

mixture model and its optimization model[10,11].

Since 2014, deep learning has been gradually applied in the field of voiceprint recognition, such as

DNN, CNN and LSTM models; j-vector, d-vector and x-vector feature parameters have been

developed[12,13]. Then, aiming at the deep learning ability to extract highly abstract features and strong

nonlinear classification ability, an end-to-end neural network structure is designed and improved to

realize the integration of feature extraction and classification recognition[14,15].

This paper presents three steps of voiceprint recognition. This paper introduces the voiceprint feature

parameters and their performance, the voiceprint recognition method and its principle, and the voiceprint

recognition model based on deep learning. At the same time, the paper also summarizes the advantages

and disadvantages of traditional recognition methods and recognition methods based on deep learning.

2. Brief analysis of voiceprint recognition

2.1. Analysis and principle of voiceprint recognition

Voiceprint refers to the spectrum of sound waves carrying voice information detected by electro

acoustics. Because of the particularity of human body structure and the complex physical process

between organs, the sound is produced. Therefore, in theory, the short-term spectrum, sound source,

time series, rhythm and other characteristics of each person's voice are different, that is, voiceprints are

as unique and unique as fingerprints. Voiceprint recognition includes two steps: training and detection,

and its technical schematic diagram is shown in Figure 1.

                                  Figure 1. Schematic diagram of voiceprint recognition technology

2.2. Development of voiceprint recognition technology

The voiceprint features can be divided into Auditory feature that can be identified and described by

human ears and Acoustic feature that can be extracted from acoustic signals by mathematical methods.

 Mel cepstrum coefficient (MFCC): MFCC is derived from the distorted frequency scale based on

hum

  • 1
    点赞
  • 14
    收藏
    觉得还不错? 一键收藏
  • 3
    评论
# ethminer [![standard-readme compliant](https://img.shields.io/badge/readme style-standard-brightgreen.svg)](https://github.com/RichardLitt/standard-readme) [![Gitter](https://img.shields.io/gitter/room/nwjs/nw.js.svg)][Gitter] [![Releases](https://img.shields.io/github/downloads/ethereum-mining/ethminer/total.svg)][Releases] > Ethereum miner with OpenCL, CUDA and stratum support **Ethminer** is an Ethash GPU mining worker: with ethminer you can mine every coin which relies on an Ethash Proof of Work thus including Ethereum, Ethereum Classic, Metaverse, Musicoin, Ellaism, Pirl, Expanse and others. This is the actively maintained version of ethminer. It originates from [cpp-ethereum] project (where GPU mining has been discontinued) and builds on the improvements made in [Genoil's fork]. See [FAQ](#faq) for more details. ### Features - OpenCL mining - Nvidia CUDA mining - realistic benchmarking against arbitrary epoch/DAG/blocknumber - on-GPU DAG generation (no more DAG files on disk) - stratum mining without proxy - OpenCL devices picking - farm failover (getwork + stratum) ## Table of Contents - [Install](#install) - [Usage](#usage) - [Examples connecting some pools](#examples-connecting-some-pools) - [Build](#build) - [Continuous Integration and development builds](#continuous-integration-and-development-builds) - [Building from source](#building-from-source) - [CMake configuration options](#cmake-configuration-options) - [Maintainer](#maintainer) - [Contribute](#contribute) - [F.A.Q.](#faq) ## Install [![Releases](https://img.shields.io/github/downloads/ethereum-mining/ethminer/total.svg)][Releases] Standalone **executables** for _Linux_, _macOS_ and _Windows_ are provided in the [Releases] section. Download an archive for your operating system and unpack the content to a place accessible from command line. The ethminer is ready to go. | Builds | Release | Date | | ------ | ------- | ---- | | Last | [![GitHub release](https://img.shields.io/github/release/ethereum-mining/ethminer/all.svg)](https://github.com/ethereum-mining/ethminer/releases) | [![GitHub Release Date](https://img.shields.io/github/release-date-pre/ethereum-mining/ethminer.svg)](https://github.com/ethereum-mining/ethminer/releases) | | Stable | [![GitHub release](https://img.shields.io/github/release/ethereum-mining/ethminer.svg)](https://github.com/ethereum-mining/ethminer/releases) | [![GitHub Release Date](https://img.shields.io/github/release-date/ethereum-mining/ethminer.svg)](https://github.com/ethereum-mining/ethminer/releases) | ## Usage The **ethminer** is a command line program. This means you launch it either from a Windows command prompt or Linux console, or create shortcuts to predefined command lines using a Linux Bash script or Windows batch/cmd file. For a full list of available command, please run: ```sh ethminer --help ``` ### Examples connecting some pools Check our [samples](POOL_EXAMPLES_ETH.md) to see how to connect to different pools. ## Build ### Continuous Integration and development builds | CI | OS | Status | Development builds | | ------------- | ------------- | ----- | ----------------- | | [Travis CI] | Linux, macOS | [![Travis CI](https://img.shields.io/travis/ethereum-mining/ethminer.svg)][Travis CI] | ✗ No build artifacts, [Amazon S3 is needed] for this | | [AppVeyor] | Windows | [![AppVeyor](https://img.shields.io/appveyor/ci/ethereum-mining/ethminer.svg)][AppVeyor] | ✓ Build artifacts available for all PRs and branches | The AppVeyor system automatically builds a Windows .exe for every commit. The latest version is always available [on the landing page](https://ci.appveyor.com/project/ethereum-mining/ethminer) or you can [browse the history](https://ci.appveyor.com/project/ethereum-mining/ethminer/history) to access previous builds. To download the .exe on a build under 'JOB NAME' select 'Configuration: Release', choose 'ARTIFACTS' then download the zip file. ### Building from source This project uses [CMake] and [Hunter] package manager. 1. Make sure git submodules are up to date ```sh git submodule update --init --recursive ``` 2. Create a build directory. ```sh mkdir build; cd build ``` 3. Configure the project with CMake. Check out additional [configuration options](#cmake-configuration-options). ```sh cmake .. ``` Note: In Windows, it is possible to have issues with VS 2017 compilers, in that case, use VS 2017 installer to get VS 2015 compilers and use: ```sh cmake .. -G "Visual Studio 15 2017 Win64" -Tv140,host=x64 ``` 4. Build the project using [CMake Build Tool Mode]. This is a portable variant of `make`. ```sh cmake --build . ``` Note: In Windows, it is possible to have compiler issues if you don't specify build config. In that case use: ```sh cmake --build . --config Release ``` 5. _(Optional, Linux only)_ Install the built executable. ```sh sudo make install ``` #### OpenCL support on Linux If you're planning to use [OpenCL on Linux](https://github.com/ruslo/hunter/wiki/pkg.opencl#pitfalls) you have to install OpenGL libraries. E.g. on Ubuntu run: ```sh sudo apt-get install mesa-common-dev ``` #### Disable Hunter If you want to install dependencies yourself or use system package manager you can disable Hunter by adding [-DHUNTER_ENABLED=OFF](https://docs.hunter.sh/en/latest/reference/user-variables.html#hunter-enabled) to configuration options. ### CMake configuration options Pass these options to CMake configuration command, e.g. ```sh cmake .. -DETHASHCUDA=ON -DETHASHCL=OFF ``` - `-DETHASHCL=ON` - enable OpenCL mining, `ON` by default, - `-DETHASHCUDA=ON` - enable CUDA mining, `ON` by default. - `-DAPICORE=ON` - enable API Server, `ON` by default. - `-DETHDBUS=ON` - enable D-Bus support, `OFF` by default. ## Maintainer [![Gitter](https://img.shields.io/gitter/room/ethereum-mining/ethminer.svg)][Gitter] - Paweł Bylica [@chfast](https://github.com/chfast) ## Contribute [![Gitter](https://img.shields.io/gitter/room/ethereum-mining/ethminer.svg)][Gitter] To meet the community, ask general questions and chat about ethminer join [the ethminer channel on Gitter][Gitter]. All bug reports, pull requests and code reviews are very much welcome. ## License Licensed under the [GNU General Public License, Version 3](LICENSE). ## F.A.Q 1. Why is my hashrate with Nvidia cards on Windows 10 so low? The new WDDM 2.x driver on Windows 10 uses a different way of addressing the GPU. This is good for a lot of things, but not for ETH mining. For Kepler GPUs: I actually don't know. Please let me know what works best for good old Kepler. For Maxwell 1 GPUs: Unfortunately the issue is a bit more serious on the GTX750Ti, already causing suboptimal performance on Win7 and Linux. Apparently about 4MH/s can still be reached on Linux, which, depending on ETH price, could still be profitable, considering the relatively low power draw. For Maxwell 2 GPUs: There is a way of mining ETH at Win7/8/Linux speeds on Win10, by downgrading the GPU driver to a Win7 one (350.12 recommended) and using a build that was created using CUDA 6.5. For Pascal GPUs: You have to use the latest WDDM 2.1 compatible drivers in combination with Windows 10 Anniversary edition in order to get the full potential of your Pascal GPU. 2. Why is a GTX 1080 slower than a GTX 1070? Because of the GDDR5X memory, which can't be fully utilized for ETH mining (yet). 3. Are AMD cards also affected by slowdowns with increasing DAG size? Only GCN 1.0 GPUs (78x0, 79x0, 270, 280), but in a different way. You'll see that on each new epoch (30K blocks), the hashrate will go down a little bit. 4. Can I still mine ETH with my 2GB GPU? Not really, your VRAM must be above the DAG size (Currently about 2.15 GB.) to get best performance. Without it severe hash loss will occur. 5. What are the optimal launch parameters? The default parameters are fine in most scenario's (CUDA). For OpenCL it varies a bit more. Just play around with the numbers and use powers of 2. GPU's like powers of 2. 6. What does the `--cuda-parallel-hash` flag do? [@davilizh](https://github.com/davilizh) made improvements to the CUDA kernel hashing process and added this flag to allow changing the number of tasks it runs in parallel. These improvements were optimised for GTX 1060 GPUs which saw a large increase in hashrate, GTX 1070 and GTX 1080/Ti GPUs saw some, but less, improvement. The default value is 4 (which does not need to be set with the flag) and in most cases this will provide the best performance. 7. What is ethminer's relationship with [Genoil's fork]? [Genoil's fork] was the original source of this version, but as Genoil is no longer consistently maintaining that fork it became almost impossible for developers to get new code merged there. In the interests of progressing development without waiting for reviews this fork should be considered the active one and Genoil's as legacy code. 8. Can I CPU Mine? No, use geth, the go program made for ethereum by ethereum. 9. CUDA GPU order changes sometimes. What can I do? There is an environment var `CUDA_DEVICE_ORDER` which tells the Nvidia CUDA driver how to enumerates the graphic cards. Following values are valid: * FASTEST_FIRST (Default) - causes CUDA to guess which device is fastest using a simple heuristic. * PCI_BUS_ID - orders devices by PCI bus ID in ascending order. To prevent some unwanted changes in the order of your CUDA devices you **might set the environment to `PCI_BUS_ID`**. This can be done: * Linux: * Adapt /etc/environment file and add a line `CUDA_DEVICE_ORDER=PCI_BUS_ID` * Adapt your start script launching ethminer and add a line `export CUDA_DEVICE_ORDER=PCI_BUS_ID` * Windows: * Adapt your environment using the control panel (just search `setting environment windows control panel` using your favorite search engine) * Adapt your start (.bat) file launching ethminer and add a line `setx CUDA_DEVICE_ORDER=PCI_BUS_ID` or `set CUDA_DEVICE_ORDER=PCI_BUS_ID` * For more details about `setx` and `set` see [Amazon S3 is needed]: https://docs.travis-ci.com/user/uploading-artifacts/ [AppVeyor]: https://ci.appveyor.com/project/ethereum-mining/ethminer [CMake]: https://cmake.org/ [CMake Build Tool Mode]: https://cmake.org/cmake/help/latest/manual/cmake.1.html#build-tool-mode [cpp-ethereum]: https://github.com/ethereum/cpp-ethereum [Genoil's fork]: https://github.com/Genoil/cpp-ethereum [Gitter]: https://gitter.im/ethereum-mining/ethminer [Hunter]: https://docs.hunter.sh/ [Releases]: https://github.com/ethereum-mining/ethminer/releases [Travis CI]: https://travis-ci.org/ethereum-mining/ethminer

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论 3
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值