Drop factor levels in a subsetted data frame

I have a data frame containing a factor. When I create a subset of this data frame using subset() or another indexing function, a new data frame is created. However, the factor variable retains all of its original levels -- even when they do not exist in the new data frame.

This creates headaches when doing faceted plotting or using functions that rely on factor levels.

What is the most succinct way to remove levels from a factor in my new data frame?

Here's my example:

df <- data.frame(letters = letters[1:5],
                 numbers = 1:5,
                 # needed on R >= 4.0, where strings are no longer factors by default
                 stringsAsFactors = TRUE)

levels(df$letters)
## [1] "a" "b" "c" "d" "e"

subdf <- subset(df, numbers <= 3)
##   letters numbers
## 1       a       1
## 2       b       2
## 3       c       3    

## but the levels are still there!
levels(subdf$letters)
## [1] "a" "b" "c" "d" "e"
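As a small sketch of why the leftover levels matter, a function such as table() still reports the unused levels with zero counts. The name `counts` is only illustrative, and `stringsAsFactors = TRUE` is set explicitly on the assumption that the code may run on R 4.0 or later:

```r
# Rebuild the example so this snippet runs on its own
df <- data.frame(letters = letters[1:5],
                 numbers = 1:5,
                 stringsAsFactors = TRUE)
subdf <- subset(df, numbers <= 3)

# table() tabulates over every level, including the unused "d" and "e"
counts <- table(subdf$letters)
counts
##
## a b c d e
## 1 1 1 0 0

# The subset has 3 rows but the factor still carries 5 levels
nlevels(subdf$letters)
## [1] 5
```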

Reposted from: https://stackoverflow.com/questions/1195826/drop-factor-levels-in-a-subsetted-data-frame

MAO-EYE 2009-07-28 22:41
All you should have to do is to apply factor() to your variable again after subsetting:

> subdf$letters
[1] a b c
Levels: a b c d e
> subdf$letters <- factor(subdf$letters)
> subdf$letters
[1] a b c
Levels: a b c

EDIT

From the factor page example:

factor(ff) # drops the levels that do not occur
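Since `ff` is not defined in this thread, here is a minimal stand-in to try the same call. The factor `ff` below and the name `ff_used` are made up for illustration and are not the objects from the help page:

```r
# A factor with five declared levels, only three of which occur
ff <- factor(c("a", "b", "c"), levels = letters[1:5])
levels(ff)
## [1] "a" "b" "c" "d" "e"

# Calling factor() again keeps only the levels that actually occur
ff_used <- factor(ff)
levels(ff_used)
## [1] "a" "b" "c"
```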

For dropping levels from all factor columns in a dataframe, you can use:

subdf <- subset(df, numbers <= 3)
subdf[] <- lapply(subdf, function(x) if(is.factor(x)) factor(x) else x)
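A shorter alternative to the lapply() approach, assuming R 2.12.0 or later: base R provides droplevels(), which accepts either a single factor or a whole data frame. This is a sketch of that alternative rather than part of the answer above, and `stringsAsFactors = TRUE` is set explicitly in case the code runs on R 4.0 or later:

```r
# Rebuild the example so this snippet runs on its own
df <- data.frame(letters = letters[1:5],
                 numbers = 1:5,
                 stringsAsFactors = TRUE)

# Drop unused levels from every factor column of the subset in one call
subdf <- droplevels(subset(df, numbers <= 3))
levels(subdf$letters)
## [1] "a" "b" "c"

# droplevels() also works on a single factor
one_factor <- droplevels(df$letters[df$numbers <= 2])
levels(one_factor)
## [1] "a" "b"
```

Non-factor columns such as `numbers` are left unchanged by droplevels().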
This answer was chosen by the asker as the best answer.
