为什么是 while (!)！ "总是错的?"？

I've seen people trying to read files like this in a lot of posts lately.

Code

#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv)
{
    char * path = argc > 1 ? argv[1] : "input.txt";

    FILE * fp = fopen(path, "r");
    if( fp == NULL ) {
        perror(path);
        return EXIT_FAILURE;
    }

    while( !feof(fp) ) {  /* THIS IS WRONG */
        /* Read and process data from file… */
    }
    if( fclose(fp) == 0 ) {
        return EXIT_SUCCESS;
    } else {
        perror(path);
        return EXIT_FAILURE;
    }
}

What is wrong with this while( !feof(fp)) loop?

转载于:https://stackoverflow.com/questions/5431941/why-is-while-feof-file-always-wrong

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

5条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
℡Wang Yan 2014-10-24 22:28
关注
I'd like to provide an abstract, high-level perspective.

Concurrency and simultaneity

I/O operations interact with the environment. The environment is not part of your program, and not under your control. The environment truly exists "concurrently" with your program. As with all things concurrent, questions about the "current state" don't make sense: There is no concept of "simultaneity" across concurrent events. Many properties of state simply don't exist concurrently.

Let me make this more precise: Suppose you want to ask, "do you have more data". You could ask this of a concurrent container, or of your I/O system. But the answer is generally unactionable, and thus meaningless. So what if the container says "yes" – by the time you try reading, it may no longer have data. Similarly, if the answer is "no", by the time you try reading, data may have arrived. The conclusion is that there simply is no property like "I have data", since you cannot act meaningfully in response to any possible answer. (The situation is slightly better with buffered input, where you might conceivably get a "yes, I have data" that constitutes some kind of guarantee, but you would still have to be able to deal with the opposite case. And with output the situation is certainly just as bad as I described: you never know if that disk or that network buffer is full.)

So we conclude that it is impossible, and in fact unreasonable, to ask an I/O system whether it will be able to perform an I/O operation. The only possible way we can interact with it (just as with a concurrent container) is to attempt the operation and check whether it succeeded or failed. At that moment where you interact with the environment, then and only then can you know whether the interaction was actually possible, and at that point you must commit to performing the interaction. (This is a "synchronisation point", if you will.)

EOF

Now we get to EOF. EOF is the response you get from an attempted I/O operation. It means that you were trying to read or write something, but when doing so you failed to read or write any data, and instead the end of the input or output was encountered. This is true for essentially all the I/O APIs, whether it be the C standard library, C++ iostreams, or other libraries. As long as the I/O operations succeed, you simply cannot know whether further, future operations will succeed. You must always first try the operation and then respond to success or failure.

Examples

In each of the examples, note carefully that we first attempt the I/O operation and then consume the result if it is valid. Note further that we always must use the result of the I/O operation, though the result takes different shapes and forms in each example.

C stdio, read from a file:

for (;;) { size_t n = fread(buf, 1, bufsize, infile); consume(buf, n); if (n < bufsize) { break; } }

The result we must use is n, the number of elements that were read (which may be as little as zero).

C stdio, scanf:

for (int a, b, c; scanf("%d %d %d", &a, &b, &c) == 3; ) { consume(a, b, c); }

The result we must use is the return value of scanf, the number of elements converted.

C++, iostreams formatted extraction:

for (int n; std::cin >> n; ) { consume(n); }

The result we must use is std::cin itself, which can be evaluated in a boolean context and tells us whether the stream is still in the good() state.

C++, iostreams getline:

for (std::string line; std::getline(std::cin, line); ) { consume(line); }

The result we must use is again std::cin, just as before.

POSIX, write(2) to flush a buffer:

char const * p = buf; ssize_t n = bufsize; for (ssize_t k = bufsize; (k = write(fd, p, n)) > 0; p += k, n -= k) {} if (n != 0) { /* error, failed to write complete buffer */ }

The result we use here is k, the number of bytes written. The point here is that we can only know how many bytes were written after the write operation.

POSIX getline()

char *buffer = NULL; size_t bufsiz = 0; ssize_t nbytes; while ((nbytes = getline(&buffer, &bufsiz, fp)) != -1) { /* Use nbytes of data in buffer */ } free(buffer);

The result we must use is nbytes, the number of bytes up to and including the newline (or EOF if the file did not end with a newline).

Note that the function explicitly returns -1 (and not EOF!) when an error occurs or it reaches EOF.

You may notice that we very rarely spell out the actual word "EOF". We usually detect the error condition in some other way that is more immediately interesting to us (e.g. failure to perform as much I/O as we had desired). In every example there is some API feature that could tell us explicitly that the EOF state has been encountered, but this is in fact not a terribly useful piece of information. It is much more of a detail than we often care about. What matters is whether the I/O succeeded, more-so than how it failed.

A final example that actually queries the EOF state: Suppose you have a string and want to test that it represents an integer in its entirety, with no extra bits at the end except whitespace. Using C++ iostreams, it goes like this:

std::string input = " 123 "; // example std::istringstream iss(input); int value; if (iss >> value >> std::ws && iss.get() == EOF) { consume(value); } else { // error, "input" is not parsable as an integer }

We use two results here. The first is iss, the stream object itself, to check that the formatted extraction to value succeeded. But then, after also consuming whitespace, we perform another I/O/ operation, iss.get(), and expect it to fail as EOF, which is the case if the entire string has already been consumed by the formatted extraction.

In the C standard library you can achieve something similar with the strto*l functions by checking that the end pointer has reached the end of the input string.

The answer

while(!eof) is wrong because it tests for something that is irrelevant and fails to test for something that you need to know. The result is that you are erroneously executing code that assumes that it is accessing data that was read successfully, when in fact this never happened.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(4条)

报告相同问题？

关注问题

请问为什么错了？！getch() c++
2021-09-05 17:23

回答 1 已采纳 getch函数在C语言中使用时需包含的头文件为 conio.h ,应写为#include<conio.h>
为什么是 while (!)！ "总是错的?"？
2011-03-25 11:42

回答 5 已采纳 I'd like to provide an abstract, high-level perspective. Concurrency and simultaneity I/O operat
这里报的是什么错?为什么会报错？ java
2021-09-29 22:17

回答 2 已采纳改为： arr1[i] = (int) (Math.random() * 31);
缓冲区到底是什么？程序总是出人意料！
2023-09-09 11:34

从零开始的小菜鸡的博客 } 产生的问题真正的输出结果如下问题的原因为什么会这样呢？我们选择的数字是4。每次输入’n’时，程序打印了两条消息。每次从键盘输入的只是’n’字符吗？不是，每次从键盘输入的是字符’n’和换行符’\n’,...
这都能编错？为什么？求原因 c++
2022-10-20 22:03

回答 1 已采纳 a没有声明类型和大小27行多了个=
while语句，为什么最小公倍数错了 c语言
2022-11-02 21:04

回答 1 已采纳这是因为循环的时候m和n都变了，而最大公约数与最小公倍数的乘积等于m*n，是指开始的值。解决方法：再设置俩个变量记录初值，然后再最后计算最小公倍数就可以了。
while语句用n--和--n得到了一样的结果！？ c语言
2022-11-28 11:57

回答 2 已采纳就你这个代码来说，--n和n--对结果没有影响
java while(x )_Java中的while（x = false）和while（！x）有什么区别？
2021-02-26 20:28

FreVision优选的博客最近，我一直在尝试将try和catch语句包含在while循环中，因为我想确保从程序的其余部分中获取输入。我遇到了一个问题，在while条件(例如，while(！done))中，在变量前面使用感叹号(！)，而不是使用= false(例如，...
c++我这道题为什么会错？ c++
2021-08-18 08:24

回答 6 已采纳要在赋予数值时先取模 #include<bits/stdc++.h> using namespace std; const int M = 1000010; long long a[M]
为什么什么都输出不了？请问是哪里错了？ c语言
2022-12-10 20:42

回答 2 已采纳算法不对：1、a[j]会超出，j也要<3；2、cnt的计算方法不对，你这样的算法，如果4个数不同，cnt=6，而不是4，3个数不同，cnt=5，2个数不同，cnt=4或者3；3、cnt在循环内要
这个给链表排序为什么是错的？如何修改 c语言
2022-01-10 12:21

回答 2 已采纳修改调试通过，带头结点链表，供参考： #include <stdio.h> #include <stdlib.h> #include <string.h> ty
java while(x )_Java中while(x = false)和while(！x)之间有什么区别？
2021-02-26 20:29

qqc1024的博客我最近一直在处理在while循环中包含try和catch语句,因为我想确保从程序的其余部分包含输入.我遇到过一个问题,在while条件下在变量前面使用感叹号(！)(例如while(！done))而不是使用= false(例如while(done = false))...
为什么c语言创建的文本文件是乱码？ c语言有问必答
2021-12-02 17:03

回答 4 已采纳第10行的 while(a=getchar()!=EOF) 改成 while( (a=getchar()) !=EOF) 把a=getchar()用() 括起来。修改后运行结果如下图所示：
python的while,Python:while（真的！=真）循环
2021-04-27 09:47

鹏鹏仔的博客 = True) : translate() cont() 我的问题是，为什么这个程序有效？我将run设置为False，并将循环设置为run as run！=正确。没有问题，但是当我定义cont（）时，我将run设置为在用户输入“y”时采用True值。真的！=...
使用 npm install安装依赖时报错 npm ERR! Error while executing
2023-03-02 10:02

surprisejavascript的博客问题描述：vue-element-admin使用 npm install安装依赖时报错 npm ERR! Error while executing npm ERR! Error while executing: npm ERR! H:\Program Files\git\Git\cmd\git.EXE ls-remote -h -t ...
没有解决我的问题, 去提问

悬赏问题

¥30 这是哪个作者做的宝宝起名网站
¥60 版本过低apk如何修改可以兼容新的安卓系统
¥25 由IPR导致的DRIVER_POWER_STATE_FAILURE蓝屏
¥50 有数据，怎么建立模型求影响全要素生产率的因素
¥50 有数据，怎么用matlab求全要素生产率
¥15 TI的insta-spin例程
¥15 完成下列问题完成下列问题
¥15 C#算法问题, 不知道怎么处理这个数据的转换
¥15 YoloV5 第三方库的版本对照问题
¥15 请完成下列相关问题！

为什么是 while (!)！ "总是错的?"？

5条回答 默认 最新

Concurrency and simultaneity

EOF

Examples

The answer

悬赏问题

5条回答默认最新