Unique Words

Problem Description
A common problem faced by electronic information providers is determining the number of unique words in a document. The case of a word does not affect its uniqueness. For example, The, tHE and THE are all considered equivalent. Punctuation can appear in these documents and is handled as follows:
1) Periods '.' and exclamation marks '!' may appear at the end of a sentence and should not be considered a word, or part of a word.
2) Dashes '-' appear between hyphenated words. The hyphenated words should be considered separately.
3) Commas ',', colons ':' and semicolons ';' appear within a sentence and should not be considered a word, or part of a word.
4) Apostrophes ' appear within contractions and possessive forms. These symbols should be treated as if they never appeared (i.e., as if they were deleted from the word).
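
The four punctuation rules above can be sketched in Python roughly as follows. This is an illustrative sketch, not the judge's reference solution: the function name `extract_words` is hypothetical, and it assumes (as the Input section states) that only letters, whitespace, and the punctuation listed above appear in the text.

```python
import re


def extract_words(line: str) -> list[str]:
    """Return the uppercase words found in one line of a document.

    Hypothetical helper; duplicates are kept, so the caller decides
    how to collect the unique words.
    """
    # Rule 4: apostrophes are deleted, so "bank's" becomes "banks".
    line = line.replace("'", "")
    # Rules 1-3: every remaining non-letter ('.', '!', '-', ',', ':', ';'
    # and whitespace) acts as a separator, so "two-part" yields two words.
    return [word.upper() for word in re.findall(r"[A-Za-z]+", line)]
```

Uppercasing every word up front makes the comparison case-insensitive and also matches the required output format, so no second pass is needed.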

Input
The input file contains a series of documents, each separated by an entire line of text containing only the word EOD. Each document will contain no more than 1,000 lines and at most 100 unique words. No input line will contain more than 80 characters. Numbers, control characters, and punctuation symbols not listed above will not appear in the text. An entire line containing only the string EOT identifies the end of the list of documents; note that the last document is terminated by EOT and not EOD.

Output
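For each document, the output should be a header line of the form WORDS IN DOCUMENT #n (as in the sample output), followed by an alphabetically sorted list of all unique words, with each unique word displayed in uppercase.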

Sample Input
The banker hammered home his two-part message! His message,
at times satirical, was that the bank's situation was a mess.
EOD
Hello world
EOD
This is a
final example
EOT

Sample Output
WORDS IN DOCUMENT #1
A
AT
BANKER
BANKS
HAMMERED
HIS
HOME
MESS
MESSAGE
PART
SATIRICAL
SITUATION
THAT
THE
TIMES
TWO
WAS
WORDS IN DOCUMENT #2
HELLO
WORLD
WORDS IN DOCUMENT #3
A
EXAMPLE
FINAL
IS
THIS
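
Putting the input and output formats together, a whole run could be sketched in Python as below. This is a minimal sketch under stated assumptions, not the solution from the linked answer: the generator name `unique_words_report` is hypothetical, and tolerating stray whitespace around the EOD and EOT markers is an assumption the problem statement does not make.

```python
import re


def unique_words_report(lines):
    """Yield the output lines for documents separated by EOD and ended by EOT.

    Hypothetical helper: `lines` is any iterable of input lines.
    """
    doc_number = 1
    seen = set()  # unique uppercase words of the current document
    for raw in lines:
        line = raw.rstrip("\r\n")
        marker = line.strip()  # assumption: ignore whitespace around markers
        if marker in ("EOD", "EOT"):
            yield f"WORDS IN DOCUMENT #{doc_number}"
            yield from sorted(seen)
            if marker == "EOT":
                return  # EOT ends the last document and the whole input
            doc_number += 1
            seen = set()
            continue
        # Delete apostrophes, then treat every other non-letter as a separator.
        cleaned = line.replace("'", "")
        seen.update(word.upper() for word in re.findall(r"[A-Za-z]+", cleaned))


# Usage with the second and third sample documents.
sample = ["Hello world", "EOD", "This is a", "final example", "EOT"]
print("\n".join(unique_words_report(sample)))
```

Because the function only iterates over lines, passing `sys.stdin` instead of a list should work for reading the judge's input. A set is used here for simplicity; with at most 100 unique words per document, the cost of sorting at each marker is negligible.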

1 Answer

threenewbee 2017-12-02 15:25
http://www.acmerblog.com/hdu-2397-unique-words-3612/

This answer was selected by the asker as the best answer.
