- 标题
- 摘要
- 关键词
- 实验方案
- 产品
-
Automatic lung segmentation in low-dose chest CT scans using convolutional deep and wide network (CDWN)
摘要: Computed tomography (CT) imaging is the preferred imaging modality for diagnosing lung-related complaints. Automatic lung segmentation is the most common prerequisite to develop a computerized diagnosis system for analyzing chest CT images. In this paper, a convolutional deep and wide network (CDWN) is proposed to segment lung region from the chest CT scan for further medical diagnosis. Earlier lung segmentation techniques depend on handcrafted features, and their performance relies on the features considered for segmentation. The proposed model automatically segments the lung from complete CT scan in two laps: (1) learning the required ?lters to extract hierarchical feature representations at convolutional layers, (2) dense prediction with spatial features through learnable deconvolutional layers. The model has been trained and evaluated with low-dose chest CT scan images on LIDC-IDRI database. The proposed CDWN reaches the average Dice coef?cient of 0.95 and accuracy of 98% in segmenting the lung regions from 20 test images and maintains consistent results for all test images. The experimental results con?rm that the proposed approach achieves a superior performance compared to other state-of-the-art methods for lung segmentation.
关键词: Medical imaging,Image processing and analysis,Deep learning,Automatic lung segmentation,Convolutional neural network
更新于2025-09-23 15:21:01
-
Classification of foodborne bacteria using hyperspectral microscope imaging technology coupled with convolutional neural networksa??
摘要: Foodborne pathogens have become ongoing threats in the food industry, whereas their rapid detection and classification at an early stage are still challenging. To address early and rapid detection, hyperspectral microscope imaging (HMI) technology combined with convolutional neural networks (CNN) was proposed to classify foodborne bacterial species at the cellular level. HMI technology can simultaneously obtain both spatial and spectral information of different live bacterial cells, while two CNN frameworks, U-Net and one-dimensional CNN (1D-CNN), were employed to accelerate the data analysis process. U-Net was used for automating cellular regions of interest (ROI) segmentation, which generated accurate cell-ROI masks in a shorter timeframe than the conventional Otsu or Watershed methods. The 1D-CNN was employed for classifying the spectral profiles extracted from cell-ROI and resulted in a higher accuracy (90%) than k-nearest neighbor (81%) and support vector machine (81%). Overall, the CNN-assisted HMI technology showed potential for foodborne bacteria detection.
关键词: Convolutional neural network,Machine learning,Hyperspectral microscopy,Food safety,Foodborne pathogen,Rapid classification
更新于2025-09-23 15:19:57
-
Deep learning-based roadway crack classification using laser-scanned range images: A comparative study on hyperparameter selection
摘要: In recent years, deep learning-based crack detection methods have been widely explored and applied due to their high versatility and adaptability. In civil engineering applications, recent research on crack detection through deep convolutional neural network (DCNN) includes road pavement crack detection, bridge inspection, defects detection in shield tunnel lining, etc. Despite the increasing popularity of DCNN on crack detection, many challenges have yet to be properly addressed. For crack detection using three-dimensional (3D) range (i.e., elevation) images, disturbances such as surface variation can negatively affect the detection performance. Besides, some typical non-crack patterns such as grooves can be easily misidentified as cracks, i.e., false positives. Another issue lies in the selection of hyperparameters related with the design of a DCNN architecture. For example, the hyperparameters which are related with network structure (e.g., kernel size, network depth and width) and training (e.g., mini-batch size and learning rate) can impact the network performance to a significant extent. Therefore, they need to be properly determined for optimal performance. However, for deep learning-based roadway crack classification using laser-scanned range images, a comprehensive discussion on the hyperparameter selection/tuning has not been thoroughly performed. This study develops a hyperparameter selection process involving a series of experiments on laser-scanned range images with high diversities, investigating the optimal joint hyperparameter configuration on network structure and training for DCNN-based roadway crack classification. In a comparative study, 36 DCNN architectures with varying layouts are developed for crack classification. These architecture candidates differ in kernel sizes (e.g., 3 × 3, 7 × 7, and 11 × 11), network depths (from 5 to 8 weight layers), and widths (from 16 to 96 kernels in each convolutional layer). The 7-layer DCNN with constant 7 × 7 kernels and increasing network widths yields the highest classification performance among the proposed 36 DCNN classifiers, which may be because it can best reflect the complexity of the acquired laser-scanned roadway range images. Once the optimal architecture layout is determined, further discussion on the selection of min-batch sizes, learning rates, dropout factor and leaky rectified linear unit (LReLU) factor is performed. Experimental results show the optimal architecture with associated training configuration can achieve consistent and accurate performance, under the contamination of surface variations and grooved patterns in laser-scanned range images. Discussion on the hyperparameter selection can provide insights for the development of DCNN in similar applications using laser-scanned range images.
关键词: Roadway crack,Groove,Laser-scanned range image,Hyperparameter selection,Deep convolutional neural network
更新于2025-09-23 15:19:57
-
Prediction of Ia??V Characteristic Curve for Photovoltaic Modules Based on Convolutional Neural Network
摘要: Photovoltaic (PV) modules are exposed to the outside, which is affected by radiation, the temperature of the PV module back-surface, relative humidity, atmospheric pressure and other factors, which makes it difficult to test and analyze the performance of photovoltaic modules. Traditionally, the equivalent circuit method is used to analyze the performance of PV modules, but there are large errors. In this paper—based on machine learning methods and large amounts of photovoltaic test data—convolutional neural network (CNN) and multilayer perceptron (MLP) neural network models are established to predict the I–V curve of photovoltaic modules. Furthermore, the accuracy and the fitting degree of these methods for current–voltage (I–V) curve prediction are compared in detail. The results show that the prediction accuracy of the CNN and MLP neural network model is significantly better than that of the traditional equivalent circuit models. Compared with MLP models, the CNN model has better accuracy and fitting degree. In addition, the error distribution concentration of CNN has better robustness and the pre-test curve is smoother and has better nonlinear segment fitting effects. Thus, the CNN is superior to MLP model and the traditional equivalent circuit model in complex climate conditions. CNN is a high-confidence method to predict the performance of PV modules.
关键词: convolutional neural network,photovoltaic module,current–voltage curve,multilayer perceptron
更新于2025-09-23 15:19:57
-
Real-time penetration state monitoring using convolutional neural network for laser welding of tailor rolled blanks
摘要: In this paper, an innovative monitoring system capable of diagnosing the penetration state during the laser welding process is introduced, which consists of two main blocks: a coaxial visual monitoring platform and a penetration state diagnosis unit. The platform can capture coaxial images of the interaction zone during the laser welding through a partially transmitting mirror and a high-speed camera. An image dataset representing four welding states was created for training and validation. The unit mainly consists of an embedded power-efficient computing TX2 and image processing algorithms based on a convolution neural network (CNN). Experiment results show that the platform can stably capture state-of-the-art welding images. The CNN used for a diagnosis of the penetration state is optimized using an optimal network structure and hyperparameters, applying a super-Gaussian function to initialize the weights of the convolutional layer. Its latency on TX2 is less than 2 ms, satisfying the real-time requirement. During the real laser welding of tailor-rolled blanks, a penetration state diagnosis with an accuracy of 94.6 % can be achieved even if the illumination changes significantly. The similar accuracy between the validating set and a real laser welding demonstrates that the proposed monitoring system has strong robustness. The precision and recall ratios of the CNN are higher than those of other methods such as a histogram of oriented gradients and local binary pattern.
关键词: Laser welding,Coaxial visual monitoring,Penetration state diagnosis,Convolutional neural network (CNN),Tailor-rolled blank
更新于2025-09-23 15:19:57
-
[IEEE 2018 IEEE 7th Global Conference on Consumer Electronics (GCCE) - Nara, Japan (2018.10.9-2018.10.12)] 2018 IEEE 7th Global Conference on Consumer Electronics (GCCE) - Human Skin Segmentation Using Fully Convolutional Neural Networks
摘要: In recent years, skin segmentation has attracted much of attention from computer vision field. Normally, researchers use a simple pre-trained model or define a fixed threshold in color space to deal with skin segmentation. However, it is highly possible to failure in many conditions. In addition, convolutional neural network (CNN) has achieved great success in computer vision. This paper we present a fully convolutional neural network method in skin segmentation. A hand-crafted skin dataset has provided in this study. In the experiment, we attempt many CNN structures to determine the best one. According to the experimental result, we obtained a considerable result in three well-known skin datasets.
关键词: deep learning,skin segmentation,Convolutional neural network,human skin dataset
更新于2025-09-23 15:19:57
-
[IEEE 2018 Digital Image Computing: Techniques and Applications (DICTA) - Canberra, Australia (2018.12.10-2018.12.13)] 2018 Digital Image Computing: Techniques and Applications (DICTA) - Adversarial Context Aggregation Network for Low-Light Image Enhancement
摘要: Image captured in the low-light environments usually suffers from the low dynamic ranges and noise which degrade the quality of the image. Recently, convolutional neural network (CNN) has been employed for low-light image enhancement to simultaneously perform the brightness enhancement and noise removal. Although conventional CNN based techniques exhibit superior performance compared to traditional non-CNN based methods, they often produce the image with visual artifacts due to the small receptive field in their network. In order to cope with this problem, we propose an adversarial context aggregation network (ACA-net) for low-light image enhancement, which effectively aggregates the global context via full-resolution intermediate layers. In the proposed method, we first increase the brightness of a low-light image using the two different gamma correction functions and then feed the brightened images to CNN to obtain the enhanced image. To this end, we train ACA network using L1 pixel-wise reconstruction loss and adversarial loss which encourages the network to generate a natural image. Experimental results show that the proposed method achieves state-of-the-art results in terms of peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM).
关键词: context aggregation,Low-light image enhancement,Convolutional neural network,generative adversarial network
更新于2025-09-19 17:15:36
-
A multi-faceted CNN architecture for automatic classification of mobile LiDAR data and an algorithm to reproduce point cloud samples for enhanced training
摘要: Mobile Laser Scanning (MLS) data of outdoor environment are typically characterised by occlusion, noise, clutter, large data size and high quantum of information which makes their classification a challenging problem. This paper presents three deep Convolutional Neural Network (CNN) architectures in three dimension (3D), namely single CNN (SCN), multi-faceted CNN (MFC) and MFC with reproduction (MFCR) for automatic classification of MLS data. The MFC uses multiple facets of an MLS sample as inputs to different SCNs, thus providing additional information during classification. The MFC, once trained, is used to reproduce additional samples with the help of existing samples. The reproduced samples are employed to further refine the MFC training parameters, thus giving a new method called MFCR. The three architectures are evaluated on an ensemble of 3D outdoor MLS data consisting of four classes, i.e. tree, pole, house and ground covered with low vegetation along with car samples from KITTI dataset. The total accuracy and kappa values of classifications reached up to (i) 86.0% and 81.3% for the SCN (ii) 94.3% and 92.4% for the MFC and (iii) 96.0% and 94.6% for the MFCR, respectively. The paper has demonstrated the use of multiple facets to significantly improve classification accuracy over the SCN. Finally, a unique approach has been developed for reproduction of samples which has shown potential to improve the accuracy of classification. Unlike previous works on the use of CNN for structured point cloud of indoor objects, this work shows the utility of different proposed CNN architectures for classification of varieties of outdoor objects, viz., tree, pole, house and ground which are captured as unstructured point cloud by MLS.
关键词: Sample reproduction,Mobile Laser Scanning (MLS),Automatic classification,Convolutional Neural Network (CNN)
更新于2025-09-19 17:15:36
-
Unit panel node detection by CNN on FAST reflector
摘要: The Five-hundred-meter Aperture Spherical radio Telescope (FAST) has an active reflector. During observations, the reflector will be deformed into a paraboloid 300 meters in diameter. To improve its surface accuracy, we propose a scheme for photogrammetry to measure the positions of 2226 nodes on the reflector. The way to detect the nodes in the photos is the key problem in this application of photogrammetry. This paper applies a convolutional neural network (CNN) with candidate regions to detect the nodes in the photos. Experimental results show a high recognition rate of 91.5%, which is much higher than the recognition rate for traditional edge detection.
关键词: photogrammetry,FAST,telescopes,nodes detect,convolutional neural network
更新于2025-09-19 17:15:36
-
[IEEE 2018 7th European Workshop on Visual Information Processing (EUVIP) - Tampere, Finland (2018.11.26-2018.11.28)] 2018 7th European Workshop on Visual Information Processing (EUVIP) - A Hybrid Approach to Hand Detection and Type Classification in Upper-Body Videos
摘要: Detection of hands in videos and their classification into left and right types are crucial in various human-computer interaction and data mining systems. A variety of effective deep learning methods have been proposed for this task, such as region-based convolutional neural networks (R-CNNs), however the large number of their proposal windows per frame deem them computationally intensive. For this purpose we propose a hybrid approach that is based on substituting the 'selective search' R-CNN module by an image processing pipeline assuming visibility of the facial region, as for example in signing and cued speech videos. Our system comprises two main phases: preprocessing and classification. In the preprocessing stage we incorporate facial information, obtained by an AdaBoost face detector, into a skin-tone based segmentation scheme that drives Kalman filtering based hand tracking, generating very few candidate windows. During classification, the extracted proposal regions are fed to a CNN for hand detection and type classification. Evaluation of the proposed hybrid approach on four well-known datasets of gestures and signing demonstrates its superior accuracy and computational efficiency over the R-CNN and its variants.
关键词: region-based convolutional neural network (R-CNN),hand type classification,Hand detection,AdaBoost face detection,Kalman filtering
更新于2025-09-19 17:15:36