douxinghuai3150 2013-12-10 23:17

Parsing ASCII files into a MySQL table

For a project, I need to load some word definitions into a database. The definitions are spread across multiple DB files, but the files I have were made for a C program and are (I believe) plain ASCII. I need to parse through the files line by line and add the data to a MySQL database.

I would prefer using PHP and/or MySQL.

I tried writing a PHP script to go through the files, but it timed out, is intensive on my system, and in most cases doesn't complete.

I heard about LOAD DATA INFILE from MySQL but have no clue how to use it with this.

The file names differ from file to file and have no specific extension. However, all of them can be read as plain text, and I am sure they all follow the same format.

I uploaded the contents of one file here.

You can see that some lines are useless, but the lines starting with { are good. The pattern is essentially this: the first word is the dictionary term, the content within () is the definition, and the parts within "" are sample sentences.

All I need to extract are the terms, definitions and sentences.
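
The pattern described above can be sketched as a small line parser. This is only a minimal sketch, written in Python rather than PHP, and it rests on assumptions not confirmed by the thread: that the definition runs from the first "(" to the last ")" on the line, and that the term is the first word after "{". The names `parse_line` and `parse_file` are hypothetical. Reading one line at a time keeps memory use flat, which may help with the time-outs mentioned earlier.

```python
import re


def parse_line(line):
    """Return (term, definition, sentences) for a line starting with '{', else None."""
    line = line.strip()
    if not line.startswith("{"):
        return None  # header, comment, or otherwise useless line

    # Assumption: the definition spans the first "(" to the last ")".
    start, end = line.find("("), line.rfind(")")
    if start == -1 or end <= start:
        return None

    # Assumption: the term is the first word after "{", minus brackets and commas.
    words = (w.strip("[],") for w in line[1:start].split())
    term = next((w for w in words if w), None)
    if term is None:
        return None

    gloss = line[start + 1:end]
    # Sample sentences are the parts wrapped in double quotes.
    sentences = re.findall(r'"([^"]*)"', gloss)
    # Whatever remains once the quoted parts are removed is the definition.
    definition = re.sub(r'"[^"]*"', "", gloss).strip(" ;")
    return term, definition, sentences


def parse_file(path):
    """Yield parsed entries from one file, reading it line by line."""
    with open(path, encoding="ascii", errors="replace") as handle:
        for line in handle:
            entry = parse_line(line)
            if entry is not None:
                yield entry
```

Entries whose definition itself contains a quoted phrase, or whose term carries extra markers, would need more careful handling than this sketch provides.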

The definitions are provided by Princeton University and the license is open source (and I will be crediting them).

1 answer

  • duanjie5570 2013-12-10 23:34

    Unless you want to reinvent the wheel, I would go with something like wordnet2sql. It outputs an SQL script that you can use to create your MySQL tables.

    You can find the database specifications on Princeton's website.

    LOAD DATA is useful for CSV files, but not so much for special database formats.
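
If LOAD DATA is still wanted, one option is to convert the special format into a delimited file first and load that. The following is a rough sketch in Python under stated assumptions: the entries have already been extracted as (term, definition, sentences) tuples, and the table `definitions`, its three columns, the file name `definitions.tsv`, and the " | " separator between sentences are all hypothetical. It escapes backslashes, tabs, and newlines so that MySQL's default field escaping reads them back correctly.

```python
def escape_field(value):
    """Escape characters that would break a tab-separated row."""
    return value.replace("\\", "\\\\").replace("\t", "\\t").replace("\n", "\\n")


def to_tsv(rows):
    """Turn (term, definition, sentences) tuples into tab-separated text."""
    lines = []
    for term, definition, sentences in rows:
        # Hypothetical choice: keep all sample sentences in one column.
        fields = [term, definition, " | ".join(sentences)]
        lines.append("\t".join(escape_field(f) for f in fields))
    return "\n".join(lines) + "\n"


def write_tsv(rows, path="definitions.tsv"):
    """Write the rows to a file that LOAD DATA can read."""
    with open(path, "w", encoding="utf-8", newline="") as handle:
        handle.write(to_tsv(rows))


# Statement to run in MySQL once the file exists. Table and column names
# are assumptions; LOCAL must be enabled on both client and server.
LOAD_SQL = """
LOAD DATA LOCAL INFILE 'definitions.tsv'
INTO TABLE definitions
FIELDS TERMINATED BY '\\t'
LINES TERMINATED BY '\\n'
(term, definition, sentences);
"""
```

Loading in bulk this way avoids issuing one INSERT per line, which is a common cause of slow import scripts.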

    Accepted by the asker as the best answer.
