活蹦乱跳的小天音 2023-03-09 16:52 Acceptance rate: 75%

The difference between words(), raw(), and word_tokenize()

Comparing words() and raw()
Mainly, I would like to ask what the difference between these two is.

from nltk.corpus import PlaintextCorpusReader as txtR

# Load a single text file from the corpus directory.
corpora = txtR(r'D:\python test\1', ['total book1.txt'])
corpora.words()  # the text as a sequence of word and punctuation tokens
corpora.raw()    # the text as one unprocessed string

2 answers

    # corpora.words() returns a list-like sequence of tokens:
    ['Research', 'on', 'incidental', 'second', 'language', ...]
    # corpora.raw() returns the entire file as a single string:
    "Research on incidental second language (L2) vocabulary acquisition through reading has claimed that repeated encounters with unfamiliar words and the relative elaboration of processing these words facilitate word learning. However, so far both variables have been investigated in isolation. To help close this research gap, the current study investigates the differential effects of the variables 'word exposure frequency' and 'elaboration of word processing' on the initial word learning and subsequent word retention of advanced learners of L2 English. Whereas results showed equal effects for both variables on initial word learning, subsequent word retention was more contingent on elaborate processing of form-meaning relationships than on word frequency. These results, together with those of the studies reviewed, suggest that processing words again after reading (input output cycles) is superior to reading-only tasks. The findings have significant implications for adaptation and development of teaching materials that enhance L2 vocabulary learning."
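
To make the contrast concrete without needing the original file, here is a minimal standard-library sketch of what the two calls do. It assumes that PlaintextCorpusReader uses NLTK's WordPunctTokenizer by default and that the regular expression below matches that tokenizer's pattern; the helper names read_raw and read_words and the sample file are hypothetical, not part of NLTK.

```python
import re
import tempfile
from pathlib import Path

# Assumption: this pattern mirrors NLTK's WordPunctTokenizer, which
# PlaintextCorpusReader uses as its default word tokenizer.
WORD_PUNCT = re.compile(r"\w+|[^\w\s]+")


def read_raw(path):
    """Rough analogue of raw(): the file contents as one string."""
    return Path(path).read_text(encoding="utf-8")


def read_words(path):
    """Rough analogue of words(): the same text split into tokens."""
    return WORD_PUNCT.findall(read_raw(path))


# Hypothetical sample file standing in for 'total book1.txt'.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "sample.txt"
    sample.write_text(
        "Research on incidental second language (L2) vocabulary.",
        encoding="utf-8",
    )
    raw_text = read_raw(sample)
    tokens = read_words(sample)

print(repr(raw_text))  # one str, spacing and punctuation untouched
print(tokens)          # tokens, with punctuation split off as separate items
```

In other words, raw() hands back the text unchanged, while words() is that same text after tokenization. nltk.word_tokenize() is a separate function rather than a reader method: it takes a string (for example the result of raw()) and, as far as I know, applies a Treebank-style tokenizer that depends on the downloadable Punkt sentence model, so its tokens can differ from words(), for instance around contractions.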

    This answer was selected as the best answer by the question author.

Question events

  • Closed by the system on March 30
  • Answer accepted on March 22
  • Question created on March 9
