title: “R语言入门–玩转字符串”
output: html_document
date: “2024-03-13”
代码的路上常新的才是迷人的,坚韧的才是长久的!
玩转字符串
— -str_split 是R中stringr包中的一个函数,用于将字符串进行拆分。- str_split函数中的simplify参数用于控制拆分结果的形式。如果simplify设置为TRUE,拆分的结果将被转换为矩阵或数组;如果设置为FALSE,拆分的结果将保存在列表中
- 1.检测字符串长度
x <- "I love my life."
x
### 1.检测字符串长度
str_length(x) ## 单个字符的长度 包括空格
length(x) ## 字符串的长度
[1] “I love my life.”
[1] 15
[1] 1
- 2.字符串拆分
### 2.字符串拆分 str_split将字符串进行拆分
str_split(x," ") #以空格为标准拆分
class(str_split(x," ")) #拆分的字符文件是列表
x2 = str_split(x," ")[[1]];x2 #字符拆分简写格式
[[1]]
[1] “I” “love” “my” “life.”
[1] “list”
[1] “I” “love” “my” “life.”
y = c("dd 150","pp 140","hh 152") #拆分的字符文件是多元素组成的列表
str_split(y," ")
str_split(y," ",simplify = T) #拆分的字符文件是列表
class(str_split(y," ",simplify = T))
## 矩阵不可局部修改数据类型 数据框可以
[1] “I love my life.”
[1] 15
[1] 1
Show in New Window
[[1]]
[1] “I” “love” “my” “life.”
[1] “list”
[1] “I” “love” “my” “life.”
Show in New Window
[[1]]
[1] “dd” “150”
[[2]]
[1] “pp” “140”
[[3]]
[1] “hh” “152”
[,1] [,2]
[1,] “dd” “150”
[2,] “pp” “140”
[3,] “hh” “152”
[1] “matrix” “array”
- 3.按位置提取字符串
x = "I love my life."
str_sub(x,3,6)
##[1] "birch"
[1] “love”
- 4.字符检测
x =c("I love my life","you","he","she","me")
str_detect(x,"v")
str_starts(x,"h") #
str_ends(x,"e")
[1] TRUE FALSE FALSE FALSE FALSE
[1] FALSE FALSE TRUE FALSE FALSE
[1] TRUE FALSE TRUE TRUE TRUE
- 5.字符串替换
x =c("I love my life","yoou","he")
str_replace(x,"o","A") #只能替换每个字符串中第1个
str_replace_all(x,"o","A") #可都替换掉
str_replace_all(x,"o|v","A")
[1] “I lAve my life” “yAou” “he”
[1] “I lAve my life” “yAAu” “he”
[1] “I lAAe my life” “yAAu” “he”
- 6.字符删除
x =c("I love my life","yoou","he")
str_remove(x," ")
str_remove_all(x," ")
[1] “Ilove my life” “yoou” “he”
[1] “Ilovemylife” “yoou” “he”
引自生信技能树