Python中使用Word2Vector

最新推荐文章于 2024-08-19 09:22:40 发布

长弓Smile

最新推荐文章于 2024-08-19 09:22:40 发布

阅读量4.1k

点赞数 2

分类专栏：信息抽取与问答系统

本文链接：https://blog.csdn.net/u012485480/article/details/84036402

版权

在Windows环境下，使用Python 3.6进行Word2Vec训练时遇到'str' object has no attribute 'seek'异常及编码问题。通过修改源码参数类型，增加忽略错误参数解决文件打开问题。训练过程中出现的Windows警告可通过设置解决。实验结果显示了与'中国'和'男人'相似度最高的词汇列表。

摘要由CSDN通过智能技术生成

我的环境是win10 + python 3.6 （64位）

参考步骤：
https://blog.csdn.net/u012052268/article/details/78643260#word2vec的python应用

出现的问题：
1.出现异常 ‘str’ object has no attribute ‘seek’ 发生在word2vec.py中。
源码如下：

 try:
 # Assume it is a file-like object and try treating it as such
      # Things that don't have seek will trigger an exception
      self.source.seek(0)
      for line in itertools.islice(self.source, self.limit):
          line = utils.to_unicode(line).split()
          i = 0
          while i < len(line):
              yield line[i: i + self.max_sentence_length]
              i += self.max_sentence_length
 except AttributeError:
      # If it didn't work like a file, use it as a string filename
      with utils.smart_open(self.source) as fin:
          for line in