?yb? 2010-02-18 12:17 采纳率: 100%
浏览 4053
已采纳

如何将数据框列转换为数字类型?

如何将数据帧列转换为数字类型?

  • 写回答

16条回答 默认 最新

  • local-host 2010-02-19 00:31
    关注

    Since (still) nobody got check-mark, I assume that you have some practical issue in mind, mostly because you haven't specified what type of vector you want to convert to numeric. I suggest that you should apply transform function in order to complete your task.

    Now I'm about to demonstrate certain "conversion anomaly":

    # create dummy data.frame
    d <- data.frame(char = letters[1:5], 
                    fake_char = as.character(1:5), 
                    fac = factor(1:5), 
                    char_fac = factor(letters[1:5]), 
                    num = 1:5, stringsAsFactors = FALSE)
    

    Let us have a glance at data.frame

    > d
      char fake_char fac char_fac num
    1    a         1   1        a   1
    2    b         2   2        b   2
    3    c         3   3        c   3
    4    d         4   4        d   4
    5    e         5   5        e   5
    

    and let us run:

    > sapply(d, mode)
           char   fake_char         fac    char_fac         num 
    "character" "character"   "numeric"   "numeric"   "numeric" 
    > sapply(d, class)
           char   fake_char         fac    char_fac         num 
    "character" "character"    "factor"    "factor"   "integer" 
    

    Now you probably ask yourself "Where's an anomaly?" Well, I've bumped into quite peculiar things in R, and this is not the most confounding thing, but it can confuse you, especially if you read this before rolling into bed.

    Here goes: first two columns are character. I've deliberately called 2nd one fake_char. Spot the similarity of this character variable with one that Dirk created in his reply. It's actually a numerical vector converted to character. 3rd and 4th column are factor, and the last one is "purely" numeric.

    If you utilize transform function, you can convert the fake_char into numeric, but not the char variable itself.

    > transform(d, char = as.numeric(char))
      char fake_char fac char_fac num
    1   NA         1   1        a   1
    2   NA         2   2        b   2
    3   NA         3   3        c   3
    4   NA         4   4        d   4
    5   NA         5   5        e   5
    Warning message:
    In eval(expr, envir, enclos) : NAs introduced by coercion
    

    but if you do same thing on fake_char and char_fac, you'll be lucky, and get away with no NA's:

    > transform(d, fake_char = as.numeric(fake_char), 
                   char_fac = as.numeric(char_fac))
    
      char fake_char fac char_fac num
    1    a         1   1        1   1
    2    b         2   2        2   2
    3    c         3   3        3   3
    4    d         4   4        4   4
    5    e         5   5        5   5
    

    If you save transformed data.frame and check for mode and class, you'll get:

    > D <- transform(d, fake_char = as.numeric(fake_char), 
                        char_fac = as.numeric(char_fac))
    
    > sapply(D, mode)
           char   fake_char         fac    char_fac         num 
    "character"   "numeric"   "numeric"   "numeric"   "numeric" 
    > sapply(D, class)
           char   fake_char         fac    char_fac         num 
    "character"   "numeric"    "factor"   "numeric"   "integer"
    

    So, the conclusion is: Yes, you can convert character vector into a numeric one, but only if it's elements are "convertible" to numeric. If there's just one character element in vector, you'll get error when trying to convert that vector to numerical one.

    And just to prove my point:

    > err <- c(1, "b", 3, 4, "e")
    > mode(err)
    [1] "character"
    > class(err)
    [1] "character"
    > char <- as.numeric(err)
    Warning message:
    NAs introduced by coercion 
    > char
    [1]  1 NA  3  4 NA
    

    And now, just for fun (or practice), try to guess the output of these commands:

    > fac <- as.factor(err)
    > fac
    ???
    > num <- as.numeric(fac)
    > num
    ???
    

    Kind regards to Patrick Burns! =)

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(15条)

报告相同问题?

悬赏问题

  • ¥15 如何在scanpy上做差异基因和通路富集?
  • ¥20 关于#硬件工程#的问题,请各位专家解答!
  • ¥15 关于#matlab#的问题:期望的系统闭环传递函数为G(s)=wn^2/s^2+2¢wn+wn^2阻尼系数¢=0.707,使系统具有较小的超调量
  • ¥15 FLUENT如何实现在堆积颗粒的上表面加载高斯热源
  • ¥30 截图中的mathematics程序转换成matlab
  • ¥15 动力学代码报错,维度不匹配
  • ¥15 Power query添加列问题
  • ¥50 Kubernetes&Fission&Eleasticsearch
  • ¥15 報錯:Person is not mapped,如何解決?
  • ¥15 c++头文件不能识别CDialog