总结csv的内容

Context I'm working on creating a little program that can summarize the contents of an absolute mess of a bill, which is in csv form.

The bill has three columns I'm interested in:

Event type. Here, I'm only interested in the rows where this column reads CHARGE
The cost. Self explanatory.
Resource name, containing Server and cluster names. The format is servername.clustername.

The idea is to select the rows that are labeled as charge, split them up first by cluster and then by server name, and sum up the total costs for each.

I can't help but feel like this should be easy, but I've been scratching my head on this for a while now, and just can't seem to figure it out. At this point I ought to state that I am fairly new to programming and entirely new to GO.

Here's what I have so far:

package main

import (
    "encoding/csv"
    "log"
    "os"
    "sort"
    "strings"
)



func main() {
    rows := readBill("bill-2018-April.csv")
    rows = calculateSummary(rows)
    writeSummary("bill-2018-April-output", rows)

}

func readBill(name string) [][]string {

    f, err := os.Open(name)

    if err != nil {
        log.Fatalf("Cannot open '%s': %s
", name, err.Error())
    }

    defer f.Close()

    r := csv.NewReader(f)

    rows, err := r.ReadAll()

    if err != nil {
        log.Fatalln("Cannot read CSV data:", err.Error())
    }

    return rows
}

type charges struct {
    impactType string
    cost       float64
    resName    string
}
func createCharges(rows [][]string){
    charges:= []charges{}
    for i,r:=range rows {
        var c charges
        c.impactType :=r [i][10]
        c.cost := r [i][15]
        c.resName := r [i][20]
        charges = append()
    }
    return charges
}

So, as far as I can tell, I should now have isolated the columns I am interested in (i.e. columns 10, 15 and 20). Is what I have so far even correct?

How would I go about singling out the rows reading "CHARGE" and slicing everything up by cluster and server?

Summing things up shouldn't be too tricky, but for whatever reason, this is really stumping me.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

dpkt31779 2018-08-16 12:42

关注

Just use two maps to store the sums per server and per cluster. And since you're not interested in the whole CSV but only some rows, reading everything is kind of wasteful. Just skip the rows you don't care about:

package main

import (
    "encoding/csv"
    "fmt"
    "io"
    "log"
    "strconv"
    "strings"
)

func main() {
    b := `
,,,,,,,,,,CHARGE,,,,,100.00,,,,,s1.c1
,,,,,,,,,,IGNORE,,,,,,,,,,
,,,,,,,,,,CHARGE,,,,,200.00,,,,,s2.c1
,,,,,,,,,,CHARGE,,,,,300.00,,,,,s3.c2
`

    r := csv.NewReader(strings.NewReader(b))

    byServer := make(map[string]float64)
    byCluster := make(map[string]float64)

    for i := 0; ; i++ {
        row, err := r.Read()
        if err == io.EOF {
            break
        }
        if err != nil {
            log.Fatal(err)
        }

        if row[10] != "CHARGE" {
            continue
        }

        cost, err := strconv.ParseFloat(row[15], 64)
        if err != nil {
            log.Fatalf("row %d: malformed cost: %v", i, err)
        }

        xs := strings.SplitN(row[20], ".", 2)
        if len(xs) != 2 {
            log.Fatalf("row %d: malformed resource name", i)
        }

        server, cluster := xs[0], xs[1]

        byServer[server] += cost
        byCluster[cluster] += cost
    }

    fmt.Printf("byServer: %+v
", byServer)
    fmt.Printf("byCluster: %+v
", byCluster)
}

// Output:
// byServer: map[s2:200 s3:300 s1:100]
// byCluster: map[c1:300 c2:300]

Try it on the playground: https://play.golang.org/p/1e9mJf4LyYE

本回答被题主选为最佳回答 , 对您是否有帮助呢?

报告相同问题？

关注问题

总结csv的内容
2018-08-16 12:07

回答 1 已采纳 Just use two maps to store the sums per server and per cluster. And since you're not interested in
txt转化成csv内容丢失 json python
2022-07-08 14:15

回答 1 已采纳 city_infos=city_infos.replace("True","true") 换成这个city_infos=city_infos.replace("'notShowCurrentConf
在PHP中清理CSV内容 php
2014-03-25 01:09

回答 1 已采纳 You can try using array_walk() to run mysql_escape_string() or your database's equivalent to be do
python读写csv文件方法详细总结
2020-09-19 03:41

在本文中小编给各位分享的是关于python读写csv文件方法的详细内容，对此有需要的朋友们跟着学习参考下。
将地图写入CSV
2017-11-27 18:11

回答 1 已采纳 Package csv func (*Writer) Write func (w *Writer) Write(record []string) error Writ
CSV到结构建议 json
2018-02-20 17:27

回答 1 已采纳 Getting rid of the first entry is as easy as billData = billData[1:]. That, or do an initial read
如何从S3读取CSV文件
2018-09-28 03:22

回答 1 已采纳 As the error says: cannot use body (type []byte) as type io.Reader in argument to csv.NewRea
基于Pandas读取csv文件Error的总结
2021-01-20 04:12

OSError：报错1 <span xss=removed>pandas\_libs\parsers.pyx in pandas._libs.parsers.TextReader.__cinit__ (pandas\_libs\parsers.c:4209)() pandas\_libs\parsers.pyx in pandas._libs.parsers.TextReader._...
将csv内容放入多维数组中 php
2014-01-08 23:57

回答 2 已采纳 $fp = fopen($csvFile, 'r'); $master = array(); while( $line = fgetcsv( $fp ) ) { // 23,cars,43
python 写CSV python
2022-03-08 15:33

回答 2 已采纳 file = open('whatever.csv', 'w') for i in range(0, len(link_start)): trajOD, etaOD = traj_judge(
如何将大型csv文件拆分为多个csv文件 php
2018-08-21 14:09

回答 2 已采纳 The script you show is reading the WHOLE .csv file into an in memory array. Its not surprising it
python将excel转换为csv的代码方法总结
2020-09-19 04:29

在本篇文章里小编给大家分享了关于python如何将excel转换为csv的实例方法和代码内容，需要的朋友们学习下。
处理大型csv文件并限制goroutines
2019-05-27 11:46

回答 3 已采纳 I striped out the progress bar as i did not want to bother about it, but overall this is closer to
csv文件内容转义_CSV文件的转义处理
2020-12-23 02:13

小波思基的博客原文：http://blog.csdn.net/maqingli20/article/details/7095132------------------------------------------------------------------------------------CSV文件是一种适合程序格式化输出数据的文件格式。...CSV的...
数据的CSV文件存取
2020-12-23 01:15

本文的主要内容是基于中国大学mooc（慕课）中的“Python数据分析与可视化”课程进行整理和总结。 CSV，Comma-Separated Value，逗号分隔符，CSV是一种常见的文件格式，用于存储批量数据，常用于存储一维和二维数据。...
没有解决我的问题, 去提问

悬赏问题

¥15 无线电能传输系统MATLAB仿真问题
¥50 如何用脚本实现输入法的热键设置
¥20 我想使用一些网络协议或者部分协议也行，主要想实现类似于traceroute的一定步长内的路由拓扑功能
¥30 深度学习，前后端连接
¥15 孟德尔随机化结果不一致
¥15 apm2.8飞控罗盘bad health，加速度计校准失败
¥15 求解O-S方程的特征值问题给出边界层布拉休斯平行流的中性曲线
¥15 谁有desed数据集呀
¥20 手写数字识别运行c仿真时，程序报错错误代码sim211-100
¥15 关于#hadoop#的问题

码龄粉丝数原力等级 --

总结csv的内容

1条回答默认最新

码龄粉丝数原力等级 --

悬赏问题

总结csv的内容

1条回答 默认 最新

悬赏问题

1条回答默认最新