[Question] Missing values in raw data were filled in with odd placeholder text

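Original poster: jasonfghx (工人)   2018-03-25 19:34:49
[Question type]:
Programming help (I want to do something in R, but I don't know how to write it in R)
[Software familiarity]:
Beginner (I have written code in other languages, but I'm not familiar with the syntax)
[Problem description]:
Sorry to bother you. In my raw data, some empty cells have been filled in with placeholders such as -, ?, and NA,
but they are actually missing values. Is there a way to replace them with blanks
so that is.na recognizes them as missing values? Thanks!!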
[Code example]:
wdbc <- read.table("DM_Assignment_2_Kidney.csv", header = FALSE, sep = ",")
wdbc1 <- wdbc
names(wdbc) <- c("Age", "Blood Pressure", "Specific Gravity", "Albumin", "Sugar",
                 "Red Blood Cells", "Pus Cell", "Pus Cell clumps", "Bacteria",
                 "Blood Glucose Random", "Blood Urea", "Serum Creatinine",
                 "Sodium", "Potassium", "Hemoglobin", "Packed Cell Volume",
                 "White Blood Cell Count", "Red Blood Cell Count", "Hypertension",
                 "Diabetes Mellitus", "Coronary Artery Disease", "Appetite",
                 "Pedal Edema", "Anemia", "Class")
# Try to turn every cell containing "?" into NA, one column at a time
for (i in 1:25) {
  wdbc1[, i] <- gsub("[?]", NA, wdbc[, i])
}
[Keywords]:
missing values
Author: celestialgod (天)   2018-03-25 19:39:00
which(is.na(x), arr.ind = TRUE) returns the row and column positions of the NAs.
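
A minimal sketch of that call, assuming x is a small data frame. The column names are borrowed from the question, but the values are made up for illustration:

```r
# Toy data frame standing in for the kidney data (values are invented)
x <- data.frame(
  Age   = c(48, NA, 62),
  Sugar = c(NA, 0, 3)
)

# is.na() on a data frame returns a logical matrix of the same shape;
# arr.ind = TRUE makes which() report (row, col) pairs instead of flat indices
na_pos <- which(is.na(x), arr.ind = TRUE)
print(na_pos)
```

Each row of na_pos is one missing cell, with its row index in the "row" column and its column index in the "col" column. Note that this only finds cells that are already NA; placeholders such as "?" or "-" have to be converted first.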
Original poster: jasonfghx (工人)   2018-03-25 19:50:00
Thanks
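Author: celestialgod (天)   2018-03-25 20:13:00
Is it a matrix or a data.frame?
For a matrix, use gsub("[-? ]", "NA", x) and then look for the positions that equal "NA". (Correcting my earlier comments Orz: the character class also needs a space for blanks, and the pattern comes first in the argument list, with x last.)
If the string has length zero, use x[nchar(x) == 0] = "NA" (not x[nchar(x)], as I first wrote).
For a data.frame there are many ways to do it QQ, depending on whether you use base functions, dplyr, or data.table. The approach is the same, though: on the character columns, do the two things above, that is, replace the zero-length strings and replace the special characters.
Also, if you are reading the data from a file, you can use the na.strings argument of read.table to turn the strings you want into NA directly.
https://pastebin.com/B8xMx9mP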
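
The two routes described above might look roughly like this in base R. This is only a sketch: the inline CSV text and the toy raw data frame are invented stand-ins for DM_Assignment_2_Kidney.csv, and the list of placeholder strings is an assumption based on the question. Option 2 matches whole cells with %in% rather than using the gsub call from the reply, because a character class such as "[-? ]" would also rewrite a minus sign or space inside a real value (for example "-5" would become "NA5"), and because it assigns a real NA rather than the string "NA".

```r
# Made-up CSV text standing in for the real file
csv_text <- "48,80,?,normal
-,70,1.01,abnormal
62,,1.02,NA"

# Option 1: declare the placeholders as missing while reading.
# na.strings lists every string to read as NA. strip.white trims
# whitespace around unquoted fields before that comparison is made.
wdbc <- read.table(text = csv_text, header = FALSE, sep = ",",
                   na.strings = c("?", "-", "NA", ""),
                   strip.white = TRUE, stringsAsFactors = FALSE)
str(wdbc)

# Option 2: the data is already in memory as character columns.
raw <- data.frame(
  V1 = c("48", "-", "62"),
  V2 = c("80", "70", ""),
  V3 = c(" ?", "1.01", "-5"),
  stringsAsFactors = FALSE
)

placeholders <- c("?", "-", "NA", "")  # assumed list, adjust to the data
clean <- raw
for (col in names(clean)) {
  v <- trimws(clean[[col]])        # drop padding so " ?" matches "?"
  v[v %in% placeholders] <- NA     # whole-cell match: "-5" is left alone
  clean[[col]] <- v
}
print(is.na(clean))
```

With Option 1 the numeric columns come back as numbers straight away, since the placeholders never reach the type detection step. With Option 2 the columns are still character afterwards and need a separate conversion.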
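Original poster: jasonfghx (工人)   2018-03-25 20:27:00
Thanks, I'll study this.
After gsub, all the columns that were in my data frame became character. Is there a fix for this?
Never mind, I solved it myself.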
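
The original poster did not say how they solved it. For readers with the same problem: gsub() always returns a character vector, so every column assigned from it becomes character. One possible fix, sketched here on made-up values, is to let type.convert() from the utils package guess each column's type again after the placeholders have been turned into NA:

```r
# Character columns such as gsub() leaves behind (values are invented)
wdbc1 <- data.frame(
  Age      = c("48", NA, "62"),
  Appetite = c("good", "poor", NA),
  stringsAsFactors = FALSE
)

# type.convert() re-detects the type of each column from its contents;
# as.is = TRUE keeps non-numeric columns as character instead of factor.
# Assigning into wdbc1[] keeps the data frame structure and column names.
wdbc1[] <- lapply(wdbc1, type.convert, as.is = TRUE)
str(wdbc1)
```

Here Age comes back as a numeric type with its NA preserved, while Appetite stays character. Reading the file with na.strings in the first place avoids the round trip through character altogether.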
