CSV至今和浮动

I'm currently writing a small program which converts CSV-files into structs to be used for further prosessing. The csv lines look like this

20140102,09:30,38.88,38.88,38.82,38.85,67004

I have 500 files, each about 20-30 MB. My code works just fine, but I can't help wondering if there isn't a better way to convert these files than what I'm doing now. First reading the file and converting to csv records (pseudo code)

    data, err := ioutil.ReadFile(path)
    if err != nil {
        ... 
    }
    r := csv.NewReader(bytes.NewReader(data))
    records, err := r.ReadAll()
    if err != nil {
        ... 
    }

Then looping over all the records and doing

    parsedTime, err := time.Parse("2006010215:04", record[0]+record[1])
    if err != nil {
        return model.ZorroT6{}, time.Time{}, err
    }

    t6.Date = ConvertToOle(parsedTime)
    if open, err := strconv.ParseFloat(record[2], 32); err == nil {
        t6.Open = float32(open)
    }
    if high, err := strconv.ParseFloat(record[3], 32); err == nil {
        t6.High = float32(high)
    }
    if low, err := strconv.ParseFloat(record[4], 32); err == nil {
        t6.Low = float32(low)
    }
    if close, err := strconv.ParseFloat(record[5], 32); err == nil {
        t6.Close = float32(close)
    }
    if vol, err := strconv.ParseInt(record[6], 10,32); err == nil {
        t6.Vol = int32(vol)
    }

For example I have to go through []byte -> string -> float64 -> float32 to get my float values. What could I do to improve this code?

EDIT: Just to be clear I don't really need to improve the performance, I'm just better trying to understand Go and what performance optimization that could be applied to a problem like this. For example it seems like a lot of overhead to create loads of strings and float64 when I have a byte slice and want a float32.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongza1708 2019-02-18 09:06
关注
There is only one problem I see that needs fix:

Do not use ioutil.ReadFile together with bytes.NewReader. It reads all the contents into the memory, which is inefficient when the file is large.

Instead, use os.Open(file), it perfectly provides a io.Reader that csv.NewReader can utilize. Do not forget to close the file and handle errors.

If you still want to improve performance:

Since your csv file is of fixed format, it is possible to using raw bytes instead provided by bufio instead of csv.

You can copy and paste the underlying code in strconv and time to avoid general code that is not of your need.

But I think they are not worth the trouble.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

CSV至今和浮动
2019-02-17 20:48

回答 1 已采纳 There is only one problem I see that needs fix: Do not use ioutil.ReadFile together with bytes.Ne
Prestashop产品csv导入用于添加和更新 php
2017-11-08 04:54

回答 1 已采纳 First off, i would recommend updating to a higher PS version. Secondly there are enough 'free' mo
C#修改CSV文件，和将数组存入到CSV c#
2017-02-01 07:39

回答 3 已采纳修改csv唯一的办法是读取全部数据到list，然后修改，再循环写回去。因为文本文件是不能随机修改的。在C#中用File.ReadAllLines读取文件，split后装入数组，然后你想怎么处理怎么
MySQL（十）：MySQL语法-进阶
2023-07-17 17:13

Prosper Lee的博客如：在人员管理系统中，删除一个人员，即需要删除人员的基本资料，也要删除和该人员相关的信息，如信箱，文章等等。TIMESTAMP 也接受不同的格式，比如 YYYYMMDDHHMMSS、YYMMDDHHMMSS、YYYYMMDD 或 YYMMDD。如果添加 ...
CSV到结构建议 json
2018-02-20 17:27

回答 1 已采纳 Getting rid of the first entry is as easy as billData = billData[1:]. That, or do an initial read
总结csv的内容
2018-08-16 12:07

回答 1 已采纳 Just use two maps to store the sums per server and per cluster. And since you're not interested in
将地图写入CSV
2017-11-27 18:11

回答 1 已采纳 Package csv func (*Writer) Write func (w *Writer) Write(record []string) error Writ
「实战案例」基于Python语言开发的信用评分卡
2021-10-13 10:40

python机器学习建模的博客今天，给各位数据粉带来的是参加过CDA 认证level II 课程培训的学员...它涉及到公司管理，企业债发行，企业融资，企业上市，企业并购，个人炒股和购买公司债券等多个场景。企业债发行企业主体信用评级越高，意味...
如何从S3读取CSV文件
2018-09-28 03:22

回答 1 已采纳 As the error says: cannot use body (type []byte) as type io.Reader in argument to csv.NewRea
csv文件在Windows出现乱码 windows
2017-09-11 05:39

回答 2 已采纳 1、乱码可能是因为操作系统之间文件格式不同而导致，可以尝试参考该博客[http://blog.csdn.net/zhangyang0402/article/details/5153649](http:
错误导入CSV文件Crontab linux php
2018-07-05 07:15

回答 1 已采纳 In crontab run the script, is not be in the folder, so you need to write full path or correct rela
python量化开发【初级入门】
2023-12-11 22:05

saas软件销售顾问的博客省掉重复造轮子的精力十、使用JQData查询行情数据 1、数据字典：股票数据：提供2005年至今沪深A股全面的行情、财务、基本面等数据行业概念数据：包含行业板块、概念板块数据指数数据：包含沪深市场多只指数数据 ...
csv写入数据换行问题 python 爬虫
2022-06-09 22:36

回答 1 已采纳你想要，换行呢？还是不换行呢？，你在写入文件的时候是追加模式，尽量用pandas，pandas可以随心所欲的操作表，合并表，拆分表
获取股票数据【实时更新股票数据、创建你的股票数据】、计算交易指标【买入、卖出信号、计算持仓收益、计算累计收益率】
2021-09-09 15:33

webor2006的博客 1、新建一个测试模块：为了规范，这里将测试相关的代码都放到另一个模块中： 2、获取股票行情数据并导入csv：控制台输出：看一下表格文件有木有生成？但是！！！你会发现表格中表头部分少了一个日期：这个时候...
全栈之路-前端篇 | 第三讲.基础前置知识【前端标准与研发工具】学习笔记
2023-02-18 18:10

全栈工程师修炼指南的博客万维网联盟（外语缩写：W3C）创建于1994年，是Web技术领域最具权威和影响力的国际中立性技术标准机构, W3C标准定义了一个用于应用程序开发的开放式Web平台，该平台具有前所未有的潜力，使开发人员能够构建丰富的交互...
没有解决我的问题, 去提问

悬赏问题

¥15 素材场景中光线烘焙后灯光失效
¥15 请教一下各位，为什么我这个没有实现模拟点击
¥15 执行 virtuoso 命令后，界面没有，cadence 启动不起来
¥50 comfyui下连接animatediff节点生成视频质量非常差的原因
¥20 有关区间dp的问题求解
¥15 多电路系统共用电源的串扰问题
¥15 slam rangenet++配置
¥15 有没有研究水声通信方面的帮我改俩matlab代码
¥15 ubuntu子系统密码忘记
¥15 保护模式-系统加载-段寄存器

CSV至今和浮动

1条回答 默认 最新

悬赏问题

1条回答默认最新