(520|600).666

Information Extraction from Speech and Text

Programming Assignment # 1

Due March 7, 2024.

You will model letters of English text using hidden Markov models. Some ordinary text has been selected and, to keep matters simple, all numerals and punctuation have been purged, upper-case letters have been lower-cased, and inter-word spacing, new lines, and paragraph breaks have all been normalized to single spaces. The alphabet of the resulting text is therefore the 26 lower-case English letters and the white-space, formally denoted as Y = {a, b, c, . . . , z, #}, with # denoting the white-space character.

The text is 35,000 characters long and has been divided into a 30,000-character training set, named A, and a 5,000-character test set, named B.
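
The cleaned text is supplied with the assignment, but the normalization described above amounts to something like the following sketch (assuming Python, which is used for all sketches in this document; the regular expressions are illustrative, not the instructor's actual preprocessing):

    import re

    def normalize(raw):
        """Lower-case, purge numerals and punctuation, and collapse all
        runs of white-space (spaces, new lines, paragraph breaks) to a
        single space, which the assignment writes as '#'."""
        text = raw.lower()
        text = re.sub(r"[^a-z\s]", "", text)       # keep letters and white-space
        return re.sub(r"\s+", " ", text).strip()   # normalize spacing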

1. Model this text with a fully connected 2-state HMM, with states 1 and 2. Let t1 denote the transition 1 → 2, and t2 denote the self-loop 1 → 1. Similarly, let t3 denote the transition 2 → 1, and t4 denote the self-loop 2 → 2, so that the transition probability matrix may be written as

       P = [ p(t2)  p(t1) ]
           [ p(t3)  p(t4) ]

   Let the initial state of the Markov chain be either 1 or 2 with equal probability. Let the emission probabilities be associated with the states, i.e. with the destination state of each transition, so that

       q(y|t2) ≡ q(y|t3) ≡ q(y|1)   and   q(y|t1) ≡ q(y|t4) ≡ q(y|2).

   Use the Baum-Welch algorithm and the training text A to estimate the transition probabilities p(tj), j = 1, 2, 3, 4, and the emission probabilities q(y|s), y ∈ Y and s ∈ {1, 2}. (A sketch of one possible implementation follows part (d) below.)

   (a) Initialize the transition probabilities to be slightly different from uniform, as

           p(t1) = 0.51 = p(t3) and p(t2) = 0.49 = p(t4).

       Initialize the emission probabilities to also be slightly different from uniform, as

           q(a|1) = q(b|1) = . . . = q(m|1) = 0.0370 = q(n|2) = q(o|2) = . . . = q(z|2),
           q(a|2) = q(b|2) = . . . = q(m|2) = 0.0371 = q(n|1) = q(o|1) = . . . = q(z|1),
           q(#|1) = 0.0367 = q(#|2).

       What would happen if all the probabilities were instead set to be exactly uniform, i.e. p(tj) = 1/2 for every j and q(y|s) = 1/27 for every y ∈ Y and s ∈ {1, 2}?

   (b) Plot the average log-probability of the training and test data after k iterations, as a function of the number of iterations, for k = 1, 2, . . . , 600.

   (c) Plot the emission probabilities of a few particular letters for each state, e.g.

           qk(a|1) versus qk(a|2) and qk(n|1) versus qk(n|2),

       as a function of the number of iterations, for k = 1, 2, . . . , 600.

   (d) Study the emission probability distributions q600(·|1) and q600(·|2) to see where they differ the most, as well as how the transition probabilities differ from their initial values. Try to explain what the machine has learned about English text.
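
For concreteness, here is a minimal sketch of the estimation loop for this model, assuming Python with NumPy; the assignment does not prescribe a language, and the function and file names are mine. It uses the per-step normalization mentioned in the Caution at the end of this assignment, and the loop is written for a general number of states, so it also serves Exercise 2 once a 4-state initializer is supplied.

    import numpy as np

    ALPHABET = "abcdefghijklmnopqrstuvwxyz#"        # Y: 26 letters plus white-space
    V = len(ALPHABET)                               # |Y| = 27

    def encode(text):
        """Map cleaned text to integer symbol indices."""
        lut = {ch: i for i, ch in enumerate(ALPHABET)}
        lut[" "] = lut["#"]                         # assume the files use real spaces
        return np.array([lut[ch] for ch in text])

    def init_2state():
        """The near-uniform initialization of Exercise 1(a)."""
        # Rows are source states, columns are destination states:
        #   P[0,0] = p(t2), P[0,1] = p(t1), P[1,0] = p(t3), P[1,1] = p(t4)
        P = np.array([[0.49, 0.51],
                      [0.51, 0.49]])
        q = np.empty((2, V))
        q[0, :13], q[0, 13:26], q[0, 26] = 0.0370, 0.0371, 0.0367   # a-m, n-z, '#'
        q[1, :13], q[1, 13:26], q[1, 26] = 0.0371, 0.0370, 0.0367
        return P, q

    def forward_backward(P, q, obs):
        """One scaled forward-backward pass (cf. the Caution below).

        Emissions are tied to the destination state of each transition,
        so the first emitting state is distributed as (uniform) @ P; the
        uniform initial distribution itself is kept fixed in this sketch."""
        n, T = P.shape[0], len(obs)
        pi = np.full(n, 1.0 / n) @ P
        alpha = np.empty((T, n)); c = np.empty(T)
        alpha[0] = pi * q[:, obs[0]]
        c[0] = alpha[0].sum(); alpha[0] /= c[0]
        for t in range(1, T):                       # scaled forward recursion
            alpha[t] = (alpha[t - 1] @ P) * q[:, obs[t]]
            c[t] = alpha[t].sum(); alpha[t] /= c[t]
        gamma = np.empty((T, n)); gamma[T - 1] = alpha[T - 1]
        beta = np.ones(n)                           # scaled backward recursion
        xi_sum = np.zeros((n, n))
        for t in range(T - 2, -1, -1):
            w = q[:, obs[t + 1]] * beta / c[t + 1]
            xi_sum += np.outer(alpha[t], w) * P     # summed transition posteriors
            beta = P @ w
            gamma[t] = alpha[t] * beta              # state posteriors (rows sum to 1)
        return gamma, xi_sum, np.log(c).sum() / T   # average log-prob per character

    def baum_welch(obs, P, q, iters=600):
        """EM loop; yields the reestimated parameters and the average
        log-probability evaluated under the parameters entering each pass."""
        for _ in range(iters):
            gamma, xi_sum, avg_ll = forward_backward(P, q, obs)
            P = xi_sum / xi_sum.sum(axis=1, keepdims=True)
            q_new = np.empty_like(q)
            for v in range(V):                      # emission counts, per symbol
                q_new[:, v] = gamma[obs == v].sum(axis=0)
            q = q_new / q_new.sum(axis=1, keepdims=True)
            yield P, q, avg_ll

    # Typical use (file name hypothetical):
    #   obs = encode(open("textA.txt").read().strip())
    #   P, q = init_2state()
    #   for k, (P, q, avg_ll) in enumerate(baum_welch(obs, P, q), start=1):
    #       ...record avg_ll and q for the plots in (b) and (c)...

The training curve for 1(b) is read off the yielded average log-probabilities; the test-set curve requires only a forward pass over B (no reestimation) with the parameters obtained after each iteration.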

2. Increasing Model Complexity: Repeat Exercises 1(a) through 1(d) with a fully connected 4-state HMM. Modify the initialization in 1(a) to account for 4 states.

3. Alternate Initialization of Output Probabilities: HMM estimation is sometimes sensitive to the initialization of the model parameters. You will now investigate an alternative to the initialization of Exercise 1(a).

   (a) Compute the relative frequency q(y) of the letters in Y from the entire text A.

   (b) Generate a vector of random numbers r(y), y ∈ Y, compute the average r̄ = (1/27) Σ_{y∈Y} r(y), and use it to create a zero-mean perturbation vector δ(y) = r(y) − r̄.

   (c) Choose a small λ > 0, though not too small, such that both

           q(y|1) = q(y) − λδ(y) > 0 and q(y|2) = q(y) + λδ(y) > 0 ∀ y ∈ Y.

       Note: q(·|1) and q(·|2) are bona fide probability assignments on Y. (Why?)

Use the two q(y|s) thus generated, along with the p(tj) from Exercise 1(a), to initialize the Baum-Welch iteration. Compare the resulting plots of average log-probability versus k with those of 1(b), as well as the final values of the average log-probabilities. (A sketch of this initialization follows.)
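
A sketch of this alternate initialization, under the same assumptions as the earlier sketch (encode and V as defined there); the value of lam and the random seed are illustrative choices, not prescribed by the assignment:

    import numpy as np

    rng = np.random.default_rng(0)                # fixed seed, for reproducibility

    def perturbed_init(freq, lam=0.01):
        """Split the relative frequencies q(y) into two emission rows,
        perturbed in opposite directions by lam * delta(y)."""
        r = rng.random(len(freq))                 # r(y): uniform random numbers
        delta = r - r.mean()                      # zero-mean perturbation delta(y)
        q = np.vstack([freq - lam * delta,        # q(y|1)
                       freq + lam * delta])       # q(y|2)
        assert (q > 0).all(), "lam too large; choose a smaller value"
        return q

    # freq is the relative frequency of each symbol in the training text A:
    #   obs_A = encode(open("textA.txt").read().strip())   # hypothetical file name
    #   counts = np.bincount(obs_A, minlength=V)
    #   q0 = perturbed_init(counts / counts.sum())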

Caution: Make sure you mitigate numerical underflow problems when computing the forward and backward probabilities. Use the normalization described in §2.8 if needed.
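
Section 2.8 refers to the course notes and is not reproduced here; the sketch below shows one standard realization of such a normalization (Rabiner-style scaling, an assumption on my part): each forward vector is renormalized to sum to one, and the log-probability is recovered exactly from the accumulated scale factors.

    import numpy as np

    def scaled_forward(P, q, obs, pi):
        """Forward pass with per-step renormalization.

        Unscaled, alpha_t shrinks by a factor < 1 per character and
        underflows long before t = 30,000.  Renormalizing alpha_t to
        sum to 1 and accumulating log(c_t) gives
            log P(y_1 .. y_T) = sum over t of log(c_t)
        with no underflow."""
        alpha = pi * q[:, obs[0]]
        c = alpha.sum(); alpha /= c
        log_prob = np.log(c)
        for t in range(1, len(obs)):
            alpha = (alpha @ P) * q[:, obs[t]]
            c = alpha.sum(); alpha /= c
            log_prob += np.log(c)
        return log_prob / len(obs)    # the average log-probability plotted in 1(b)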

Submission: Turn in all your plots and discussion, and your source code, via Gradescope; make sure your code is well documented. Points may be deducted for incomprehensible code. Your code may be rerun on different training and test data, or with a different initialization, to check its correctness; make sure it runs on a Linux machine with standard tools installed.
