- 标题
- 摘要
- 关键词
- 实验方案
- 产品
-
An enhanced diabetic retinopathy detection and classification approach using deep convolutional neural network
摘要: The objective of this study is to propose an alternative, hybrid solution method for diagnosing diabetic retinopathy from retinal fundus images. In detail, the hybrid method is based on using both image processing and deep learning for improved results. In medical image processing, reliable diabetic retinopathy detection from digital fundus images is known as an open problem and needs alternative solutions to be developed. In this context, manual interpretation of retinal fundus images requires the magnitude of work, expertise, and over-processing time. So, doctors need support from imaging and computer vision systems and the next step is widely associated with use of intelligent diagnosis systems. The solution method proposed in this study includes employment of image processing with histogram equalization, and the contrast limited adaptive histogram equalization techniques. Next, the diagnosis is performed by the classification of a convolutional neural network. The method was validated using 400 retinal fundus images within the MESSIDOR database, and average values for different performance evaluation parameters were obtained as accuracy 97%, sensitivity (recall) 94%, specificity 98%, precision 94%, FScore 94%, and GMean 95%. In addition to those results, a general comparison of with some previously carried out studies has also shown that the introduced method is efficient and successful enough at diagnosing diabetic retinopathy from retinal fundus images. By employing the related image processing techniques and deep learning for diagnosing diabetic retinopathy, the proposed method and the research results are valuable contributions to the associated literature.
关键词: Image processing,Deep learning,Convolutional neural network,Diabetic retinopathy
更新于2025-09-23 15:23:52
-
Computer-Assisted Diagnosis for Diabetic Retinopathy Based on Fundus Images Using Deep Convolutional Neural Network
摘要: Diabetic retinopathy (DR) is a complication of long-standing diabetes, which is hard to detect in its early stage because it only shows a few symptoms. Nowadays, the diagnosis of DR usually requires taking digital fundus images, as well as images using optical coherence tomography (OCT). Since OCT equipment is very expensive, it will benefit both the patients and the ophthalmologists if an accurate diagnosis can be made, based solely on reading digital fundus images. In the paper, we present a novel algorithm based on deep convolutional neural network (DCNN). Unlike the traditional DCNN approach, we replace the commonly used max-pooling layers with fractional max-pooling. Two of these DCNNs with a different number of layers are trained to derive more discriminative features for classification. After combining features from metadata of the image and DCNNs, we train a support vector machine (SVM) classifier to learn the underlying boundary of distributions of each class. For the experiments, we used the publicly available DR detection database provided by Kaggle. We used 34,124 training images and 1,000 validation images to build our model and tested with 53,572 testing images. The proposed DR classifier classifies the stages of DR into five categories, labeled with an integer ranging between zero and four. The experimental results show that the proposed method can achieve a recognition rate up to 86.17%, which is higher than previously reported in the literature. In addition to designing a machine learning algorithm, we also develop an app called 'Deep Retina.' Equipped with a handheld ophthalmoscope, the average person can take fundus images by themselves and obtain an immediate result, calculated by our algorithm. It is beneficial for home care, remote medical care, and self-examination.
关键词: deep convolutional neural network,mobile app,fractional max-pooling,support vector machine,diabetic retinopathy,fundus images,teaching-learning-based optimization
更新于2025-09-23 15:23:52
-
Deep structure tensor graph search framework for automated extraction and characterization of retinal layers and fluid pathology in retinal SD-OCT scans
摘要: Maculopathy is a group of retinal disorders that affect macula and cause severe visual impairment if not treated in time. Many computer-aided diagnostic methods have been proposed over the past that automatically detect macular diseases. However, to our best knowledge, no literature is available that provides an end-to-end solution for analyzing healthy and diseased macular pathology. This paper proposes a vendor-independent deep convolutional neural network and structure tensor graph search-based segmentation framework (CNN-STGS) for the extraction and characterization of retinal layers and fluid pathology, along with 3-D retinal profiling. CNN-STGS works by first extracting nine layers from an optical coherence tomography (OCT) scan. Afterward, the extracted layers, combined with a deep CNN model, are used to automatically segment cyst and serous pathology, followed by the autonomous 3-D retinal profiling. CNN-STGS has been validated on publicly available Duke datasets (containing a cumulative of 42,281 scans from 439 subjects) and Armed Forces Institute of Ophthalmology dataset (containing 4,260 OCT scans of 51 subjects), which are acquired through different OCT machinery. The performance of the CNN-STGS framework is validated through the marked annotations, and it significantly outperforms the existing solutions in various metrics. The proposed CNN-STGS framework achieved a mean Dice coefficient of 0.906 for segmenting retinal fluids, along with an accuracy of 98.75% for characterizing cyst and serous fluid from diseased retinal OCT scans.
关键词: convolutional neural network (CNN),Optical coherence tomography (OCT),maculopathy,ophthalmology,graph search
更新于2025-09-23 15:23:52
-
Disparate effects of DOM extracted from coastal seawaters and freshwaters on photodegradation of 2,4-Dihydroxybenzophenone
摘要: As the rapid development of deep learning techniques, extensive interest has been taken into the applications of deep learning methods on challenging problems of different domains. In view of the recent success of convolutional neural network (CNN) in various tasks of audio analysis, a comparative performance study of different the-state-of-the-art CNN architectures on a large-scale whale-call classification task is investigated in this paper. On the basis of deep neural network models, distinctive features of whale sub-populations are extracted to obtain higher level abstract representations for the accurate classification, which is significantly superior to the traditional classification approaches using manual features based on expert knowledge. In particular, a large open-source acoustic dataset recorded by audio sensors carried by whales in different locations is employed for performance comparison. Based on the experiments, it is found that the advancement of popular CNN architectures significantly improve the accuracy on the whale-call classification task. The accuracy and computational efficiency varies with the change of the CNN architectures. Xception provides the best performance among all four CNN architectures while an ensemble of CNN models can produce even better results.
关键词: deep learning,whale call classification,convolutional neural network
更新于2025-09-23 15:23:52
-
DeepSpectra: An end-to-end deep learning approach for quantitative spectral analysis
摘要: Learning patterns from spectra is critical for the development of chemometric analysis of spectroscopic data. Conventional two-stage calibration approaches consist of data preprocessing and modeling analysis. Misuse of preprocessing may introduce artifacts or remove useful patterns and result in worse model performance. An end-to-end deep learning approach incorporated Inception module, named DeepSpectra, is presented to learn patterns from raw data to improve the model performance. DeepSpectra model is compared to three CNN models on the raw data, and 16 preprocessing approaches are included to evaluate the preprocessing impact by testing four open accessed visible and near infrared spectroscopic datasets (corn, tablets, wheat, and soil). DeepSpectra model outperforms the other three convolutional neural network models on four datasets and obtains better results on raw data than in preprocessed data for most scenarios. The model is compared with linear partial least square (PLS) and nonlinear artificial neural network (ANN) methods and support vector machine (SVR) on raw and preprocessed data. The results show that DeepSpectra approach provides improved results than conventional linear and nonlinear calibration approaches in most scenarios. The increased training samples can improve the model repeatability and accuracy.
关键词: model accuracy,Inception,convolutional neural network,chemometrics,repeatability
更新于2025-09-23 15:23:52
-
[IEEE 2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC) - Chongqing (2018.6.27-2018.6.29)] 2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC) - Research on Single-Frame Super-Resolution Reconstruction Algorithm for Low Resolution Cell Images Based on Convolutional Neural Network
摘要: At the problem of low resolution and low contrast of cell images collected by lens-less cell imaging system, a novel cell super-resolution reconstruction network (CSRN) based on convolutional neural network is proposed. First, the cell image is collected by lens-less cell imaging system, and then the cell image is down-sampled by bicubic interpolation to obtain low-resolution cell image. Then, the low-resolution cell image is input into the CSRN network for super-resolution reconstruction. The experimental results show that the proposed CSRN network can effectively improve the resolution and contrast of cell images, and the reconstruction effect is better than that of the traditional bicubic interpolation and SRCNN network.
关键词: convolutional neural network,cell image,super-resolution,lens-less imaging
更新于2025-09-23 15:23:52
-
SCN: Switchable Context Network for Semantic Segmentation of RGB-D Images
摘要: Context representations have been widely used to profit semantic image segmentation. The emergence of depth data provides additional information to construct more discriminating context representations. Depth data preserves the geometric relationship of objects in a scene, which is generally hard to be inferred from RGB images. While deep convolutional neural networks (CNNs) have been successful in solving semantic segmentation, we encounter the problem of optimizing CNN training for the informative context using depth data to enhance the segmentation accuracy. In this paper, we present a novel switchable context network (SCN) to facilitate semantic segmentation of RGB-D images. Depth data is used to identify objects existing in multiple image regions. The network analyzes the information in the image regions to identify different characteristics, which are then used selectively through switching network branches. With the content extracted from the inherent image structure, we are able to generate effective context representations that are aware of both image structures and object relationships, leading to a more coherent learning of semantic segmentation network. We demonstrate that our SCN outperforms state-of-the-art methods on two public datasets.
关键词: Context representation,convolutional neural network (CNN),RGB-D images,semantic segmentation
更新于2025-09-23 15:23:52
-
A CNN With Multiscale Convolution and Diversified Metric for Hyperspectral Image Classification
摘要: Recently, researchers have shown the powerful ability of deep methods with multilayers to extract high-level features and to obtain better performance for hyperspectral image classification. However, a common problem of traditional deep models is that the learned deep models might be suboptimal because of the limited number of training samples, especially for the image with large intraclass variance and low interclass variance. In this paper, novel convolutional neural networks (CNNs) with multiscale convolution (MS-CNNs) are proposed to address this problem by extracting deep multiscale features from the hyperspectral image. Moreover, deep metrics usually accompany with MS-CNNs to improve the representational ability for the hyperspectral image. However, the usual metric learning would make the metric parameters in the learned model tend to behave similarly. This similarity leads to obvious model’s redundancy and, thus, shows negative effects on the description ability of the deep metrics. Traditionally, determinantal point process (DPP) priors, which encourage the learned factors to repulse from one another, can be imposed over these factors to diversify them. Taking advantage of both the MS-CNNs and DPP-based diversity-promoting deep metrics, this paper develops a CNN with multiscale convolution and diversified metric to obtain discriminative features for hyperspectral image classification. Experiments are conducted over four real-world hyperspectral image data sets to show the effectiveness and applicability of the proposed method. Experimental results show that our method is better than original deep models and can produce comparable or even better classification performance in different hyperspectral image data sets with respect to spectral and spectral–spatial features.
关键词: deep metric learning,determinantal point process (DPP),image classification,multiscale features,Convolutional neural network (CNN),hyperspectral image
更新于2025-09-23 15:23:52
-
Relighting humans
摘要: Relighting of human images has various applications in image synthesis. For relighting, we must infer albedo, shape, and illumination from a human portrait. Previous techniques rely on human faces for this inference, based on spherical harmonics (SH) lighting. However, because they often ignore light occlusion, inferred shapes are biased and relit images are unnaturally bright particularly at hollowed regions such as armpits, crotches, or garment wrinkles. This paper introduces the first attempt to infer light occlusion in the SH formulation directly. Based on supervised learning using convolutional neural networks (CNNs), we infer not only an albedo map, illumination but also a light transport map that encodes occlusion as nine SH coefficients per pixel. The main difficulty in this inference is the lack of training datasets compared to unlimited variations of human portraits. Surprisingly, geometric information including occlusion can be inferred plausibly even with a small dataset of synthesized human figures, by carefully preparing the dataset so that the CNNs can exploit the data coherency. Our method accomplishes more realistic relighting than the occlusion-ignored formulation.
关键词: convolutional neural network,light transport,inverse rendering
更新于2025-09-23 15:23:52
-
Multi-label chest X-ray image classification via category-wise residual attention learning
摘要: This paper considers the problem of multi-label thorax disease classification on chest X-ray images. Identifying one or more pathologies from a chest X-ray image is often hindered by the pathologies unrelated to the targets. In this paper, we address the above problem by proposing a category-wise residual attention learning (CRAL) framework. CRAL predicts the presence of multiple pathologies in a class-specific attentive view. It aims to suppress the obstacles of irrelevant classes by endowing small weights to the corresponding feature representation. Meanwhile, the relevant features would be strengthened by assigning larger weights. Specifically, the proposed framework consists of two modules: feature embedding module and attention learning module. The feature embedding module learns high-level features with a convolutional neural network (CNN) while the attention learning module focuses on exploring the assignment scheme of different categories. The attention module can be flexibly integrated into any feature embedding networks with end-to-end training. The comprehensive experiments are conducted on the Chest X-ray14 dataset. CRAL yields the average AUC score of 0.816 which is a new state of the art.
关键词: Image classification,Chest X-ray,Convolutional neural network,Residual attention
更新于2025-09-23 15:23:52