Tonnesenmccoy9974
80-0.97 recall and a 0.69-0.84 precision at a rate of 8.6 ms/word.
96% of biodiversity literature titles. For three different data sets, they resulted in a 0.80-0.97 recall and a 0.69-0.84 precision at a rate of 8.6 ms/word.
Statistical methods used by most morphologists to validate species boundaries (such as principal component analysis [PCA] and non-metric multidimensional scaling [nMDS]) are limiting because these methods are mostly used as visualization methods, and because the groups are identified by taxonomists (i.e., supervised), adding human bias. Here, we use a spectral clustering algorithm for the unsupervised discovery of species boundaries followed by the analysis of the cluster-defining characters.
We used spectral clustering, nMDS, and PCA on 16 morphological characters within the genus
to group 93 individuals from 10 taxa. A radial basis function kernel was used for the spectral clustering with user-specified tuning values (gamma). The goodness of the discovered clusters using each gamma value was quantified using eigengap, a normalized mutual information score, and the Rand index. Finally, mutual information-based character selection and a
-test were used to identify cluster-defining characters.
Spectral clustering revealed five, nine, and 12 clusters of taxa in the species complexes examined here. Character selection identified at least four characters that defined these clusters.
Together with our proposed character analysis methods, spectral clustering enabled the unsupervised discovery of species boundaries along with an explanation of their biological significance. Our results suggest that spectral clustering combined with a character selection analysis can enhance morphometric analyses and is superior to current clustering methods for species delimitation.
Together with our proposed character analysis methods, spectral clustering enabled the unsupervised discovery of species boundaries along with an explanation of their biological significance. Our results suggest that spectral clustering combined with a character selection analysis can enhance morphometric analyses and is superior to current clustering methods for species delimitation.Recent advances in sequencing and informatic technologies have led to a deluge of publicly available genomic data. While it is now relatively easy to sequence, assemble, and identify genic regions in diploid plant genomes, functional annotation of these genes is still a challenge. Over the past decade, there has been a steady increase in studies utilizing machine learning algorithms for various aspects of functional prediction, because these algorithms are able to integrate large amounts of heterogeneous data and detect patterns inconspicuous through rule-based approaches. The goal of this review is to introduce experimental plant biologists to machine learning, by describing how it is currently being used in gene function prediction to gain novel biological insights. In this review, we discuss specific applications of machine learning in identifying structural features in sequenced genomes, predicting interactions between different cellular components, and predicting gene function and organismal phenotypes. Finally, we also propose strategies for stimulating functional discovery using machine learning-based approaches in plants.
Trichomes are hair-like appendages extending from the plant epidermis. They serve many important biotic roles, including interference with herbivore movement. Olaparib PARP inhibitor Characterizing the number, density, and distribution of trichomes can provide valuable insights on plant response to insect infestation and define the extent of plant defense capability. Automated trichome counting would speed up this research but poses several challenges, primarily because of the variability in coloration and the high occlusion of the trichomes.
We developed a simplified method for image processing for automated and semi-automated trichome counting. We illustrate this process using 30 leaves from 10 genotypes of soybean (
) differing in trichome abundance. We explored various heuristic image-processing methods including thresholding and graph-based algorithms to facilitate trichome counting. Of the two automated and two semi-automated methods for trichome counting tested and with the help of regression analysis, the semi-automated manually annotated trichome intersection curve method performed best, with an accuracy of close to 90% compared with the manually counted data.
We address trichome counting challenges including occlusion by combining image processing with human intervention to propose a semi-automated method for trichome quantification. This provides new opportunities for the rapid and automated identification and quantification of trichomes, which has applications in a wide variety of disciplines.
We address trichome counting challenges including occlusion by combining image processing with human intervention to propose a semi-automated method for trichome quantification. This provides new opportunities for the rapid and automated identification and quantification of trichomes, which has applications in a wide variety of disciplines.
High-resolution cameras are very helpful for plant phenotyping as their images enable tasks such as target vs. background discrimination and the measurement and analysis of fine above-ground plant attributes. However, the acquisition of high-resolution images of plant roots is more challenging than above-ground data collection. An effective super-resolution (SR) algorithm is therefore needed for overcoming the resolution limitations of sensors, reducing storage space requirements, and boosting the performance of subsequent analyses.
We propose an SR framework for enhancing images of plant roots using convolutional neural networks. We compare three alternatives for training the SR model (i) training with non-plant-root images, (ii) training with plant-root images, and (iii) pretraining the model with non-plant-root images and fine-tuning with plant-root images. The architectures of the SR models were based on two state-of-the-art deep learning approaches a fast SR convolutional neural network and an SR generative adversarial network.