在CSV中查找重复的列值

I'm importing a CSV that has 3 columns, one of these columns could have duplicate records.

I have 2 things to check:

1. The field 'NAME' is not null and is a string
2. The field 'ID' is unique

So far, I'm parsing the CSV file, once and checking that 1. (NAME is valid), which if it fails, it simply breaks out of the while loop and stops.

I guess the question is, how I'd check that ID is unique?

I have fields like the following:

NAME,  ID,
Bob,   1,
Tom,   2,
James, 1,
Terry, 3,
Joe,   4,

This would output something like `Duplicate ID on line 3'

Thanks

P.S this CSV file has more columns and can have around 100,000 records. I have simplified it for a specific reason to solve the duplicate column/field

Thanks

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

4条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dou4121 2014-01-17 10:59
关注
I went assuming a certain type of design, as stripped out the CSV part, but the idea will remain the same :

<?php /* Let's make an array of 100,000 rows (Be careful, you might run into memory issues with this, issues you won't have with a CSV read line by line)*/ $arr = []; for ($i = 0; $i < 100000; $i++) $arr[] = [rand(0, 1000000), 'Hey']; /* Now let's have fun */ $ids = []; foreach ($arr as $line => $couple) { if ($ids[$couple[0]]) echo "Id " . $couple[0] . " on line " . $line . " already used<br />"; else $ids[$couple[0]] = true; } ?>

100, 000 rows aren't that much, this will be enough. (It ran in 3 seconds at my place.)

EDIT: As pointed out, in_array is less efficient than key lookup. I've updated my code consequently.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(3条)

报告相同问题？

关注问题

如何在php中获取CSV列值 php
2016-06-02 10:25

回答 1 已采纳 try this one <?php $row = 1; if (($handle = fopen("abc.csv", "r")) !== FALSE) { while (($dat
在PHP中组合csv列值 php
2016-10-17 06:00

回答 2 已采纳 Use below code for your solution $file = ('my.csv'); $fh = fopen($file, 'rb'); $tag = array(); $q
使用php在csv文件中连续添加2列 php
2018-09-22 15:12

回答 1 已采纳 When you want to have multiple columns in a single line, you have to have an array per line. So yo
php mysql csv_php – 使用Laravel在MySQL中导入大型CSV文件
2021-03-16 23:30

Jackie Bao的博客我有一个csv文件,范围从50k到超过100k行数据.我目前正在使用Laravel w / Laravel Forge,MySQL和Maatwebsite Laravel Excel软件包.这是由最终用户而不是我自己使用,所以我在我的刀片视图上创建了一个简单的表单：{!! ...
使用csv在列中设置数据 php
2019-03-19 09:21

回答 3 已采纳 <?php $array1 = array('column1 value1', 'column1 value2', 'column1 value3'); $array2 = array('
PHP - CSV导出 - 在列数据中添加逗号 php
2016-03-22 12:11

回答 2 已采纳 You should have category values between "" so the comma is not interpreted. You can change code li
将csv的值存储在表数据库的不同列中 mysql php
2017-11-01 19:55

回答 2 已采纳 .............. $handle = fopen($fileTmpName, 'r'); $k = 0; $energies = []; wh
php判断是否是csv文件路径,php – 自动检测文件中CSV标题的存在
2021-04-18 11:30

TrungTPhan的博客短问题：如何自动检测CSV文件...我试图找出一种可靠的方式来自动检测CSV标题的存在，因此脚本可以决定是否将CSV文件的第一行用作键/列名称或立即开始解析数据。由于我需要的是一个布尔测试，我可以很容易地在自己...
在php中的四列中显示csv数据 php
2016-02-01 10:43

回答 2 已采纳 Here's a solution: Variable number of columns Unequal number of lines are filled from left to ri
使用PHP查找CSV文件中是否存在值 php
2014-08-20 14:11

回答 3 已采纳 Since you haven't provided a sample of your file, am submitting the following as an alternative.
如何在插入表之前检查csv文件是否有重复值 javascript jquery php
2019-07-04 07:22

回答 1 已采纳 Not sure what you mean by "duplicates data found on 2 columns" so here's two solutions in one : $
pandas计算含缺失值中列平均值_Pandas进阶修炼120题，给你深度和广度的船新体验...
2021-01-14 09:33

weixin_39941792的博客按照grammer列进行去除重复值 df.drop_duplicates(['grammer']) 9.计算popularity列平均值 df['popularity'].mean() 10.将grammer列转换为list df['grammer'].to_list() 11.将DataFrame保存为EXCEL df.to_excel('...
vue文件中怎么使用php_在PHP中使用文件
2020-08-27 05:22

culi4814的博客 vue文件中怎么使用phpYou may well be familiar with databases such as MySQL and Access which are an ever-increasingly common means of storing data. But data is also stored in files, like Word documents,...
php数据库数据如何去除重复数据呢？
2021-04-05 16:02

sfi799的博客正常无论是用框架开发还是原生php都很少有自带的去重复的方法，基本上都需要我们自己嵌入原生sql，下面直接给大家上源码 //原生sql去重只留一条 $res = Db::execute('DELETE from data WHERE (phone) in ...
php中怎么将md5,PHP中md5()函数的用法讲解
2021-04-24 20:05

shul mate的博客 PHP中md5()函数的用法讲解PHP md5() 函数实例计算字符串 "Hello" 的 MD5 散列：$str = "Hello";echo md5($str);?>定义和用法md5()函数计算字符串的 MD5 散列。md5()函数使用 RSA 数据安全，包括 MD5 报文摘要算法...
没有解决我的问题, 去提问

悬赏问题

¥15 乌班图ip地址配置及远程SSH
¥15 怎么让点阵屏显示静态爱心，用keiluVision5写出让点阵屏显示静态爱心的代码，越快越好
¥15 PSPICE制作一个加法器
¥15 javaweb项目无法正常跳转
¥15 VMBox虚拟机无法访问
¥15 skd显示找不到头文件
¥15 机器视觉中图片中长度与真实长度的关系
¥15 fastreport table 怎么只让每页的最下面和最顶部有横线
¥15 R语言卸载之后无法重装，显示电脑存在下载某些较大二进制文件行为，怎么办
¥15 java 的protected权限，问题在注释里

在CSV中查找重复的列值

4条回答 默认 最新

悬赏问题

4条回答默认最新