dongzen2675 2015-05-05 08:18
浏览 94
已采纳

Golang:从文本文件中读取无效的JSON

I have a txt file with the following sample data:

host{
      Entry {
          id: "foo"
      }
       Entry {
          id: "bar"
      }
    }

port{
      Entry {
          id: "lorem"
      }
       Entry {
          id: "ipsum"
      }
    }

It has +300 of those Entry values. I'd like to read the file and extract the id values belonging to the port section. It's not valid JSON so I can't use the json decoder, is there any other way of extracting the values?

  • 写回答

2条回答 默认 最新

  • douqiao6563 2015-05-05 08:37
    关注

    If the structure is the same throughout and all you want is the id values you can do something like this (on the Playground):

    package main
    
    import (
        "fmt"
        "strings"
    )
    
    func main() {
        // This will work only if ids don't have spaces
        fields := strings.Fields(input1)
        for i, field := range fields {
            if field == "id:" {
                fmt.Println("Got an id: ", fields[i+1][1:len(fields[i+1])-1])
            }
        }
        fmt.Println()
    
        // This will extract all strings enclosed in ""
        for i1, i2 := 0, 0;; {
            i := strings.Index(input2[i1:], "\"") // find the first " starting after the last match
            if i > 0 { // if we found one carry on
                i1 = i + 1 + i1 // set the start index to the absolute position in the string
                i2 = strings.Index(input2[i1:], "\"") // find the second "
                fmt.Println(input2[i1 : i1+i2]) // print the string between ""
                i1 += i2 + 1 // set the new starting index to after the last match
            } else { // otherwise we are done
                break
            }
        }
    
    
        // Reading the text line by line and only processing port sections
        parts := []string{"port{", "  Entry {", "      id: \"foo bar\"", "  }", "   Entry {", "      id: \"more foo bar\"", "  }", "}"}        
        isPortSection := false
        for _, part := range parts {
            if string.HasPrefix(part, "port"){
                isPortSection = true
            }
            if string.HasPrefix(part, "host"){
                isPortSection = false
            }
            if isPortSection && strings.HasPrefix(strings.TrimSpace(part),"id:") {
                line := strings.TrimSpace(part)
                fmt.Println(line[5:len(line)-1])
            }
        }
    }
    
    var input1 string = `port{
      Entry {
          id: "foo"
      }
       Entry {
          id: "bar"
      }
    }`
    
    var input2 string = `port{
      Entry {
          id: "foo bar"
      }
       Entry {
          id: "more foo bar"
      }
    }`
    

    Prints:

    Got an id:  foo
    Got an id:  bar
    
    foo bar
    more foo bar
    

    Instead of printing them in the loop you can stick them into a slice or map or do whatever you want/need to. And of course instead of using the string literal you read in the lines from your file.

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 WPF 大屏看板表格背景图片设置
  • ¥15 这个主板怎么能扩出一两个sata口
  • ¥15 不是,这到底错哪儿了😭
  • ¥15 2020长安杯与连接网探
  • ¥15 关于#matlab#的问题:在模糊控制器中选出线路信息,在simulink中根据线路信息生成速度时间目标曲线(初速度为20m/s,15秒后减为0的速度时间图像)我想问线路信息是什么
  • ¥15 banner广告展示设置多少时间不怎么会消耗用户价值
  • ¥16 mybatis的代理对象无法通过@Autowired装填
  • ¥15 可见光定位matlab仿真
  • ¥15 arduino 四自由度机械臂
  • ¥15 wordpress 产品图片 GIF 没法显示