I. Background
Full exception log:
com.alibaba.otter.canal.protocol.exception.CanalClientException: deserializer failed
at com.alibaba.otter.canal.client.CanalMessageDeserializer.deserializer(CanalMessageDeserializer.java:54) ~[canal.client-1.1.5.jar:?]
at com.alibaba.otter.canal.client.impl.SimpleCanalConnector.receiveMessages(SimpleCanalConnector.java:331) ~[canal.client-1.1.5.jar:?]
at com.alibaba.otter.canal.client.impl.SimpleCanalConnector.getWithoutAck(SimpleCanalConnector.java:323) ~[canal.client-1.1.5.jar:?]
at com.xxx.xxx.dts.receiver.CanalReceiver.processData(CanalReceiver.java:76) ~[classes/:?]
at jdk.internal.reflect.GeneratedMethodAccessor118.invoke(Unknown Source) ~[?:?]
at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) ~[?:?]
at java.base/java.lang.reflect.Method.invoke(Method.java:568) ~[?:?]
at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:352) ~[spring-aop-6.1.1.jar:6.1.1]
at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:196) ~[spring-aop-6.1.1.jar:6.1.1]
at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163) ~[spring-aop-6.1.1.jar:6.1.1]
at org.springframework.aop.framework.CglibAopProxy$CglibMethodInvocation.proceed(CglibAopProxy.java:765) ~[spring-aop-6.1.1.jar:6.1.1]
at org.springframework.aop.interceptor.AsyncExecutionInterceptor.lambda$invoke$0(AsyncExecutionInterceptor.java:115) ~[spring-aop-6.1.1.jar:6.1.1]
at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) ~[?:?]
at com.xxx.ssc.tracer.support.TracerTaskDecoratorDelegate.lambda$decorate$0(TracerTaskDecoratorDelegate.java:30) ~[logging-tracer-core-1.0.0-20240315.083455-8.jar:1.0.0-SNAPSHOT]
at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) [?:?]
at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) [?:?]
at java.base/java.lang.Thread.run(Thread.java:833) [?:?]
Caused by: com.alibaba.otter.canal.protocol.exception.CanalClientException: something goes wrong with reason: something goes wrong with channel:[id: 0x22216bed, /xxx.xxx.xxx.xxx:55323 => /xxx.xxx.xxx.xxx:11111], exception=com.alibaba.otter.canal.meta.exception.CanalMetaManagerException: batchId:1132 is not the firstly:1105
at com.alibaba.otter.canal.client.CanalMessageDeserializer.deserializer(CanalMessageDeserializer.java:46) ~[canal.client-1.1.5.jar:?]
... 16 more
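II. Root cause
The key line is: CanalMetaManagerException: batchId:1132 is not the firstly:1105
The batchId sent in the ack (1132) does not match the batchId of the batch that was fetched from the binlog and is still waiting for its ack (1105).
canal therefore reports an error and cannot complete the ack. Because the ack never completes, the batchId does not advance either, so data synchronization stays stuck.
On the consumer side, the root cause is that batchId was not handled in a thread-safe way: by the time the thread that fetched a batch calls ack, batchId has already been changed by another thread.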
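To make the failure mode concrete, here is a minimal sketch. It assumes the server only accepts an ack for the oldest outstanding batch, which is what the error message suggests. OrderedAckModel is a simplified stand-in written for this post, not canal's actual implementation, and SharedFieldRaceDemo is a hypothetical consumer that keeps the batchId in a field shared by several workers.

```java
import java.util.ArrayDeque;
import java.util.Deque;

/** Simplified stand-in for the server-side bookkeeping; NOT canal's real code. */
class OrderedAckModel {
    // Batches handed out but not yet acked, oldest first.
    private final Deque<Long> outstanding = new ArrayDeque<>();
    private long nextBatchId = 1;

    /** Hands out a new batch id, playing the role of getWithoutAck. */
    synchronized long get() {
        long id = nextBatchId++;
        outstanding.addLast(id);
        return id;
    }

    /** Accepts an ack only for the oldest outstanding batch. */
    synchronized void ack(long batchId) {
        Long first = outstanding.peekFirst();
        if (first == null || first != batchId) {
            throw new IllegalStateException(
                    "batchId:" + batchId + " is not the firstly:" + first);
        }
        outstanding.removeFirst();
    }
}

class SharedFieldRaceDemo {
    // Hypothetical consumer bug: the batch id lives in a field shared by all workers.
    static long sharedBatchId;

    public static void main(String[] args) {
        OrderedAckModel server = new OrderedAckModel();
        sharedBatchId = server.get();   // worker A fetches batch 1
        sharedBatchId = server.get();   // worker B fetches batch 2 and overwrites the field
        try {
            server.ack(sharedBatchId);  // worker A acks, but sends 2 instead of 1
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage()); // prints "batchId:2 is not the firstly:1"
        }
    }
}
```

The two workers are interleaved by hand here so the result is deterministic. In a real consumer the same overwrite happens whenever two threads run the fetch and ack steps at the same time.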
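III. Solution
1. Make the consumer code thread-safe
1.1 Put fetching the batchId and acking with that batchId inside the same synchronized block
1.2 Store the batchId in a ThreadLocal so that a given thread always acks with the batchId it fetched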
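The consumer-side fix can be sketched roughly as follows. BatchConnector is a hypothetical, trimmed-down interface that mirrors the getWithoutAck, ack and rollback calls of canal's CanalConnector (the real getWithoutAck returns a Message whose getId() is the batchId). SafeCanalConsumer and its handler parameter are also made up for illustration. The sketch holds one lock across fetch, processing and ack, and keeps the batchId in a local variable instead of a shared field.

```java
import java.util.concurrent.locks.ReentrantLock;

/** Hypothetical minimal view of the connector; in real code CanalConnector plays this role. */
interface BatchConnector {
    long getWithoutAck(int batchSize); // returns the batch id of the fetched batch
    void ack(long batchId);
    void rollback(long batchId);
}

class SafeCanalConsumer {
    private final BatchConnector connector;
    private final ReentrantLock lock = new ReentrantLock();

    SafeCanalConsumer(BatchConnector connector) {
        this.connector = connector;
    }

    /** Fetches one batch, processes it, and acks it with the very same id. */
    void processOnce(int batchSize, Runnable handler) {
        lock.lock(); // only one thread at a time may be between fetch and ack
        try {
            // Local variable: no other thread can overwrite it.
            final long batchId = connector.getWithoutAck(batchSize);
            try {
                handler.run();
                connector.ack(batchId);
            } catch (RuntimeException e) {
                connector.rollback(batchId); // give the batch back so it can be fetched again
                throw e;
            }
        } finally {
            lock.unlock();
        }
    }
}
```

A local variable gives the same per-thread guarantee that 1.2 gets from a ThreadLocal, and the lock covers 1.1. The trade-off is that batches are consumed one at a time, which is what in-order acking needs anyway.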
2. Set a parameter on the canal server
Edit canal.properties:
canal.instance.parser.parallel = false
Problem solved!