问题分析
微信开发中带有emoji表情昵称的微信用户无法自动登录的问题
异常信息:
Incorrect string value: '\xF0\x9F\x8D\x80' for column 'nickname' at row 1;
原因: emoji表情是用4个字节编码,mysql默认是默认varchar字段属于utf-8只能用存进3个字节编码的字符串内容
一:mysql版本5.7以下
1.1:使用base64转码后存入数据库,
1.2:过滤emoji表情(可能有些表情无法过滤)
过滤工具类
public class EmojiFilter {
private static boolean isEmojiCharacter(char codePoint) {
return (codePoint == 0x0) || (codePoint == 0x9) || (codePoint == 0xA)
|| (codePoint == 0xD)
|| ((codePoint >= 0x20) && (codePoint <= 0xD7FF))
|| ((codePoint >= 0xE000) && (codePoint <= 0xFFFD))
|| ((codePoint >= 0x10000) && (codePoint <= 0x10FFFF));
}
/**
* 过滤emoji 或者 其他非文字类型的字符
*
* @param source
* @return
*/
public static String filterEmoji(String source) {
if (StringUtils.isBlank(source)) {
return source;
}
StringBuilder buf = null;
int len = source.length();
for (int i = 0; i < len; i++) {
char codePoint = source.charAt(i);
if (isEmojiCharacter(codePoint)) {
if (buf == null) {
buf = new StringBuilder(source.length());
}
buf.append(codePoint);
}
}
if (buf == null) {
return source;
} else {
if (buf.length() == len) {
buf = null;
return source;
} else {
return buf.toString();
}
}
}
1.3:使用别人封装的一个类,专门解决emoji问题的。这个在git上有开源的代码。在pom工程中引入
<dependency>
<groupId>com.vdurmont</groupId>
<artifactId>emoji-java</artifactId>
<version>4.0.0</version>
</dependency>
使用方法:
EmojiParser.removeAllEmojis(str)
二:mysql版本5.7以上
2.1:修改数据库、表、列的字符集
ALTER DATABASE database_name CHARACTER SET = utf8mb4 COLLATE = utf8mb4_unicode_ci;
ALTER TABLE table_name CONVERT TO CHARACTER SET utf8mb4 COLLATE utf8mb4_unicode_ci;
ALTER TABLE table_name CHANGE column_name VARCHAR(50) CHARACTER SET utf8mb4 COLLATE utf8mb4_unicode_ci;
2.2 : 修改MySQL配置文件 my.cnf(windows下是 my.ini)
[client]
default-character-set = utf8mb4
[mysql]
default-character-set = utf8mb4
[mysqld]
character-set-client-handshake = FALSE
character-set-server = utf8mb4
collation-server = utf8mb4_unicode_ci
init_connect='SET NAMES utf8mb4'
注意: 如果后台代码使用了 mysql-connector-x.x.x.jar ,那么mysql-connector的版本必须高于 5.1.13 ,否则不支持 utf8mb4。