机器学习和图像处理技术对肿瘤标志物进行分类

Cancer is one of the deadliest diseases so far found in the human beings. During the years 2012 and 2014, the death rate due to this deadly disease increased by nearly 6%. In 2012, there were 478,180 losses out of 2,934,314 cases stated. In 2013 there were 465,169 demises out of 3,016,628 cases. In 2014, 491,598 people deceased in out of 2,820,179 cases. In 2017, 9.6 million people are projected to have died from the various types of cancer. Every sixth bereavement in the world is due to cancer, making it the second prominent cause of deaths. Such a disease has its beginning with a small lump in any of the body parts, this is known to be a tumor. We cannot claim all tumors to be cancer tumors but most of the cancers begin with a tumor. So we are proposing a model to classify the tumor occurring in human as normal ones or cancer tumors, once they are categorized as Cancer Tumors, the type of cancer is predicted and the stage and immediate precautions are also prescribed.

癌症是迄今为止人类发现的最致命的疾病之一。 在2012年和2014年期间,由于这种致命疾病导致的死亡率增加了近6%。 2012年,在所述的2,934,314宗案件中,有478,180宗损失。 2013年,在3,016,628例病例中,有465,169例死亡。 2014年,在2,820,179起案件中,有491,598人死亡。 2017年,预计有960万人死于各种类型的癌症。 世界上每六个丧亲之死都是由于癌症,使之成为第二大死亡原因。 这种疾病的开始是在任何身体部位都有小肿块,这被称为肿瘤。 我们不能断言所有肿瘤都是癌瘤,但是大多数癌瘤都是始于肿瘤。 因此,我们提出了一种模型,将人类发生的肿瘤分类为正常肿瘤或癌症肿瘤,一旦将它们分类为癌症肿瘤,就可以预测癌症的类型,并规定分期和立即采取的预防措施。

问题识别: (Problem Identification:)

In this drastically changing lifestyle of people, there are many possibilities of cancer occurring, this disease occurs due to varied reasons like, organic or general human factors, such as age, gender, inherited genetic defects and skin type, environmental contact, for instance to radon and UV radiation, and fine particulate matter, job related risk factors, including consumption of carcinogens such as many chemicals, radioactive materials and asbestos and last but not least lifestyle-related factors such as simple food which we consume day to day as all the fruits and vegetables we eat has loads and loads of chemicals in the form of fertilizers and pesticides and the all junk food we consume leads us in the intake of cancer causing agents, so in these inevitable conditions every one of us irrespective of the life we live has a chance for this disease, but there are cases where people have survived from this, these are due to early detection and diagnosis so we aim at detecting this disease at an earlier stage with its early symptoms like tumor markers.

在这种急剧变化的人们的生活方式中,有多种癌症发生的可能性,这种疾病的发生是由于各种原因,例如有机的或一般的人为因素,例如年龄,性别,遗传遗传缺陷和皮肤类型,环境接触等。和紫外线辐射以及细颗粒物是与工作相关的危险因素,包括致癌物如许多化学物质,放射性物质和石棉的消耗,以及与生活有关的最重要因素,如我们日常所消耗的简单食物我们吃的水果和蔬菜含有大量的化肥和杀虫剂形式的化学物质,我们食用的所有垃圾食品都会导致我们摄入致癌物质,因此,在这些不可避免的情况下,我们每个人,不论我们的生活如何可能会患上这种疾病,但是在某些情况下人们已经从中幸存下来,这是由于早期发现和诊断所致,因此我们的目标是发现这种疾病 在较早的阶段,它具有诸如肿瘤标志物的早期症状。

建议的解决方案: (Proposed Solution:)

Image for post
Image by Author — Cancer Detection System with ML terms
作者提供的图像—带有ML术语的癌症检测系统

Tests for cancers are vast so one of the tests can be done with the use of tumor markers. The tumor image is captured in CT scan or MRI scan these kind of X- Ray images the unique features or tumor markers corresponding to cancer and its type are detected if any by our proposed model. The features from the already available patient’s data sets are extracted and stored; both the features are extracted and sent to the model for comparison. The type of cancer is predicted if the tumor is confirmed to be cancerous, we here can use two types of data sets both numerical and textual ones and image data sets our model should be flexible to use both and manipulate them with higher accuracy.

癌症的检测方法非常广泛,因此可以使用肿瘤标志物进行检测之一。 在CT扫描或MRI扫描中捕获的肿瘤图像是这类X射线图像,通过我们提出的模型可以检测出与癌症及其类型相对应的独特特征或肿瘤标志物。 从已经可用的患者数据集中提取特征并进行存储; 提取两个特征并将其发送到模型进行比较。 如果癌症被证实是癌性的,则可以预测癌症的类型,我们在这里可以使用两种类型的数据集,包括数字和文本数据集,图像数据集我们的模型应该灵活地使用这两种数据集,并以更高的精度进行操作。

使用的技术: (Technologies Used:)

The tech stack for implementing the above discussed model can be,

用于实现上述模型的技术栈可以是,

Image for post
Image by Author — Tech Stack for Tumor based Cancer Detection System
作者提供的图像—用于基于肿瘤的癌症检测系统的技术堆栈

For the above suggested solution we need many machine learning libraries like pandas, Sciki-tlearn, Scipy, Numpy, Matplotlib, Keras, Tensorflow, Opencv, KNN, regression, recommender systems and lot more. For instance to read all the data (numeric) and to explore features we need Numpy and pandas, to plot features extracted from Opencv or pandas exploration in graphs or visual means we need Matplotlib, to analyze images, layering for neural networks, extract, compare, match feature points and come to fruitful conclusions we require Keras, Tensorflow, Opencv libraries and to finally predict the outcomes we use KNN classification, regression prediction techniques and finally we use recommender systems to provide suggestions for the problem faced by which the situation can be kept under control. Artificial intelligence is used in this because we make the model to learn the features, changes and all the factors some by supervised learning and some using unsupervised learning and out if that knowledge we gain future predictions to control the factors and it is impossible by human to interpret certain digital images and analyze voluminous data sets hence we use machine learning and artificial intelligence models

对于上述建议的解决方案,我们需要许多机器学习库,如熊猫,Sciki-tlearn,Scipy,Numpy,Matplotlib,Keras,Tensorflow,Opencv,KNN,回归,推荐系统等。 例如,读取所有数据(数值)并探索我们需要Numpy和pandas的特征,以图形或视觉方式绘制从Opencv或pandas探索中提取的特征,我们需要Matplotlib,分析图像,为神经网络分层,提取,比较,匹配特征点并得出富有成效的结论,我们需要Keras,Tensorflow,Opencv库,最后要预测结果,我们使用KNN分类,回归预测技术,最后我们使用推荐系统为可能遇到的问题提供建议受到控制。 之所以使用人工智能,是因为我们使模型成为学习特征,变化和所有因素的模型,其中一些是通过有监督的学习来进行的,有些是通过无监督的学习来进行的,如果该知识我们能够获得未来的预测来控制这些因素,而人类则不可能解释某些数字图像并分析大量数据集,因此我们使用机器学习和人工智能模型

Image for post
Image by Author — Tumor based Cancer Detection System workflow
作者提供的图像—基于肿瘤的癌症检测系统工作流程

包含的详细信息: (Details to be included:)

The future enhancements of this proposed model are, this can be even modeled into a mobile application and using machine learning algorithms and the collected data sets we can predict for each individual to be tested for cancer or advises on lifestyle changes can be given, also the prediction of the amount of cancer causing agents in food are calculated and change in food habits can be suggested and healthier diets can be prescribed also a personalized ‘online food suggester’ and other cancer preventive measures can be recommended and to those with the symptoms can be advised to go to the doctors with the doctor details.

该提议模型的未来增强功能是,甚至可以将其建模为移动应用程序,并使用机器学习算法,并且可以为每个要测试的癌症患者预测预测的收集数据集或就生活方式的改变提供建议,可以计算出食物中致癌因子的预测值,可以建议改变饮食习惯,可以建议饮食更健康,还可以推荐个性化的“在线食物建议者”和其他预防癌症的措施,并且对于有症状的人可以建议带医生详细去看医生。

当前涉及的技术初创企业: (Tech Startups that are involved currently:)

There are many AI based startups to detect this disease at an earlier stage to name a few

有很多基于AI的初创公司可以在早期发现这种疾病,仅举几例

Sascan is one of them which facilitates multi-spectral camera as a non-intrusive and immediate solution to monitor and detect cancer cells in the mouth. The camera takes pictures of the interior of the mouth with lighting at different wavelengths of light. “The processed images are analyzed in real time to decide whether the tissue is abnormal or not,” explains Dr Narayanan.

Sascan是其中之一,它促进了多光谱相机作为一种非侵入性的即时解决方案来监视和检测口腔中的癌细胞。 照相机用不同波长的光在口腔内部拍照。 Narayanan博士解释说:“对处理过的图像进行实时分析,以确定组织是否异常。”

Aman is another one used to address this challenge by using exosomes, which can be used for molecular diagnosis of cancer without biopsy or scanning. Exocan’s technology-based assay analyses a patient’s biofluids (blood, saliva or urine) to afford exact diagnosis. The results are more precise and cost efficient than usual methods the results are available within just two days. The diagnostic assay, presently under progress, is set to be verified in greater experiments in the upcoming days.

阿曼(Aman)是另一种通过使用外泌体来应对这一挑战的方法,这种外泌体无需活检或扫描即可用于癌症的分子诊断。 Exocan基于技术的检测方法可以分析患者的生物流体(血液,唾液或尿液),以提供准确的诊断。 结果比通常的方法更精确和更具成本效益,仅需两天即可获得结果。 目前正在进行的诊断分析方法将在未来几天的更大实验中得到验证。

Theranosis is one more startup, which is working on a type of fluid biopsy that detects live cancer cells in blood circulation. Their pioneering “micro-fluidics lab-on-a-chip” technology takes circulating tumor cells (CTCs) in marginal blood. “The CTCs are live cancer cells that metastasize to other organs via the bloodstream and grow into secondary cancers. So, CTCs are more important than DNA which are derived from dead cancer cells,” Dr Kannan explains.

Theranosis又是一家新兴公司,正在开展一种流体活检,以检测血液循环中的活癌细胞。 他们开创性的“微流控单芯片实验室”技术可利用边缘血液中的循环肿瘤细胞(CTC)。 “ CTC是活癌细胞,它们通过血流转移到其他器官,然后发展为继发性癌症。 因此,CTC比源自死亡癌细胞的DNA更重要。” Kannan博士解释说。

One more company named Onward is also building a histopathology tool that assists labs procedure twice the amount of samples with the similar options. “One of the tools we are building for a specific set of bio markers will provide deeper insights and quantified clinical information that will help the oncologist narrow down their choice of treatment decisions which can go a long way in improving patient outcomes,” Dinesh explains. Furthermore, their machine-learning workings for radiology scans mammograms and helps locate suspicious calcifications and masses. It also determines whether these are benign or malignant at an early stage.

另一家名为Onward的公司也正在构建一种组织病理学工具,该工具可以通过类似的选择帮助实验室处理两倍于样本量的样品。 Dinesh解释说:“我们正在为一组特定的生物标记物构建的工具之一将提供更深入的见解和量化的临床信息,这将帮助肿瘤学家缩小对治疗决定的选择范围,这对改善患者的治疗效果大有帮助。” 此外,他们的放射学机器学习技术可扫描乳房X线照片,并帮助找到可疑的钙化和肿块。 它还可以在早期确定它们是良性还是恶性的。

外卖: (Takeaways:)

This usage of technologies will provide an early detection of this deadly disease cancer and will prevent individuals from becoming prey to this disease with the help of enhanced researches, required data sets and experiments. The precision of the results rendered by this model would be more accurate as it is developed with the latest technology ‘Tensorflow — Keras’. This idea can be modeled using Artificial intelligence and machine learning which would serve every human being to overcome the fear of cancer occurrence or curing it at an earlier stage.

这种技术的使用将提供对这种致命疾病癌症的早期检测,并在增强的研究,所需的数据集和实验的帮助下,防止个人成为该疾病的牺牲品。 通过使用最新技术“ Tensorflow-Keras”开发的此模型所呈现结果的精度将更加准确。 可以使用人工智能和机器学习对这一想法进行建模,这将使每个人都能克服对癌症发生的恐惧或在更早的阶段治愈它。

翻译自: https://medium.com/bloggers-bay/machine-learning-and-image-processing-techniques-to-categorize-tumor-markers-24898fe1fc51

  • 2
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值