Issues in searching molecular sequence databases
- PMID: 8162065
- DOI: 10.1038/ng0294-119
Abstract
Sequence similarity search programs are versatile tools for the molecular biologist, frequently able to identify possible DNA coding regions and to provide clues to gene and protein structure and function. While much attention has been paid to the precise algorithms these programs employ and to their relative speeds, there is a constellation of associated issues that are equally important for realizing the full potential of these methods. Here, we consider a number of these issues, including the choice of scoring systems, the statistical significance of alignments, the masking of uninformative or potentially confounding sequence regions, the nature and extent of sequence redundancy in the databases, and network access to similarity search services.