Skip to main content

Articles

Page 3 of 9

  1. High-throughput sequencing technologies have led to explosive growth of genomic databases; one of which will soon reach hundreds of terabytes. For many applications we want to build and store indexes of these ...

    Authors: Christina Boucher, Travis Gagie, Alan Kuhnle, Ben Langmead, Giovanni Manzini and Taher Mun
    Citation: Algorithms for Molecular Biology 2019 14:13
  2. We study the problem of identifying differentially mutated subnetworks of a large gene–gene interaction network, that is, subnetworks that display a significant difference in mutation frequency in two sets of ...

    Authors: Morteza Chalabi Hajkarim, Eli Upfal and Fabio Vandin
    Citation: Algorithms for Molecular Biology 2019 14:10
  3. With long reads getting even longer and cheaper, large scale sequencing projects can be accomplished without short reads at an affordable cost. Due to the high error rates and less mature tools, de novo assemb...

    Authors: Riku Walve, Pasi Rastas and Leena Salmela
    Citation: Algorithms for Molecular Biology 2019 14:8
  4. Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows–Wh...

    Authors: Lavinia Egidi, Felipe A. Louza, Giovanni Manzini and Guilherme P. Telles
    Citation: Algorithms for Molecular Biology 2019 14:6
  5. Network connectivity problems are abundant in computational biology research, where graphs are used to represent a range of phenomena: from physical interactions between molecules to more abstract relationship...

    Authors: Jimmy Wu, Alex Khodaverdian, Benjamin Weitz and Nir Yosef
    Citation: Algorithms for Molecular Biology 2019 14:5
  6. Sequencing technologies keep on turning cheaper and faster, thus putting a growing pressure for data structures designed to efficiently store raw data, and possibly perform analysis therein. In this view, ther...

    Authors: Nicola Prezza, Nadia Pisanti, Marinella Sciortino and Giovanna Rosone
    Citation: Algorithms for Molecular Biology 2019 14:3
  7. Absolute fast converging (AFC) phylogeny estimation methods are ones that have been proven to recover the true tree with high probability given sequences whose lengths are polynomial in the number of number of...

    Authors: Qiuyi Zhang, Satish Rao and Tandy Warnow
    Citation: Algorithms for Molecular Biology 2019 14:2
  8. A key factor in computational drug design is the consistency and reliability with which intermolecular interactions between a wide variety of molecules can be described. Here we present a procedure to efficien...

    Authors: Martin S. Engler, Bertrand Caron, Lourens Veen, Daan P. Geerke, Alan E. Mark and Gunnar W. Klau
    Citation: Algorithms for Molecular Biology 2019 14:1
  9. Motif analysis methods have long been central for studying biological function of nucleotide sequences. Functional genomics experiments extend their potential. They typically generate sequence lists ranked by ...

    Authors: Morten Muhlig Nielsen, Paula Tataru, Tobias Madsen, Asger Hobolth and Jakob Skou Pedersen
    Citation: Algorithms for Molecular Biology 2018 13:17
  10. Superbubbles are distinctive subgraphs in direct graphs that play an important role in assembly algorithms for high-throughput sequencing (HTS) data. Their practical importance derives from the fact they are c...

    Authors: Fabian Gärtner, Lydia Müller and Peter F. Stadler
    Citation: Algorithms for Molecular Biology 2018 13:16
  11. Genome sequences and genome annotation data have become available at ever increasing rates in response to the rapid progress in sequencing technologies. As a consequence the demand for methods supporting compa...

    Authors: Fabian Gärtner, Christian Höner zu Siederdissen, Lydia Müller and Peter F. Stadler
    Citation: Algorithms for Molecular Biology 2018 13:15
  12. Liquid chromatography combined with tandem mass spectrometry is an important tool in proteomics for peptide identification. Liquid chromatography temporally separates the peptides in a sample. The peptides tha...

    Authors: Yves Frank, Tomas Hruz, Thomas Tschager and Valentin Venzin
    Citation: Algorithms for Molecular Biology 2018 13:14
  13. One way to estimate the evolutionary distance between two given genomes is to determine the minimum number of large-scale mutations, or genome rearrangements, that are necessary to transform one into the other. I...

    Authors: Andre R. Oliveira, Guillaume Fertin, Ulisses Dias and Zanoni Dias
    Citation: Algorithms for Molecular Biology 2018 13:13
  14. Most phylogenetic studies using molecular data treat gaps in multiple sequence alignments as missing data or even completely exclude alignment columns that contain gaps.

    Authors: Alexander Donath and Peter F. Stadler
    Citation: Algorithms for Molecular Biology 2018 13:12
  15. The double cut and join (DCJ) model of genome rearrangement is well studied due to its mathematical simplicity and power to account for the many events that transform gene order. These studies have mostly been...

    Authors: Pijus Simonaitis and Krister M. Swenson
    Citation: Algorithms for Molecular Biology 2018 13:9
  16. Patterns with wildcards in specified positions, namely spaced seeds, are increasingly used instead of k-mers in many bioinformatics applications that require indexing, querying and rapid similarity search, as the...

    Authors: Samuele Girotto, Matteo Comin and Cinzia Pizzi
    Citation: Algorithms for Molecular Biology 2018 13:8
  17. An important task in a metagenomic analysis is the assignment of taxonomic labels to sequences in a sample. Most widely used methods for taxonomy assignment compare a sequence in the sample to a database of kn...

    Authors: Nidhi Shah, Stephen F. Altschul and Mihai Pop
    Citation: Algorithms for Molecular Biology 2018 13:7
  18. For a combination of reasons (including data generation protocols, approaches to taxon and gene sampling, and gene birth and loss), estimated gene trees are often incomplete, meaning that they do not contain a...

    Authors: Sarah Christensen, Erin K. Molloy, Pranjal Vachaspati and Tandy Warnow
    Citation: Algorithms for Molecular Biology 2018 13:6
  19. A profile-comparison method with position-specific scoring matrix (PSSM) is among the most accurate alignment methods. Currently, cosine similarity and correlation coefficients are used as scoring functions of...

    Authors: Kazunori D. Yamada
    Citation: Algorithms for Molecular Biology 2018 13:5
  20. Microbial typing methods are commonly used to study the relatedness of bacterial strains. Sequence-based typing methods are a gold standard for epidemiological surveillance due to the inherent portability of s...

    Authors: João A. Carriço, Maxime Crochemore, Alexandre P. Francisco, Solon P. Pissis, Bruno Ribeiro-Gonçalves and Cátia Vaz
    Citation: Algorithms for Molecular Biology 2018 13:4
  21. Reconstructing the genome of a species from short fragments is one of the oldest bioinformatics problems. Metagenomic assembly is a variant of the problem asking to reconstruct the circular genomes of all bact...

    Authors: Nidia Obscura Acosta, Veli Mäkinen and Alexandru I. Tomescu
    Citation: Algorithms for Molecular Biology 2018 13:3
  22. In the absence of horizontal gene transfer it is possible to reconstruct the history of gene families from empirically determined orthology relations, which are equivalent to event-labeled gene trees. Knowledge o...

    Authors: Nikolai Nøjgaard, Manuela Geiß, Daniel Merkle, Peter F. Stadler, Nicolas Wieseke and Marc Hellmuth
    Citation: Algorithms for Molecular Biology 2018 13:2
  23. Various approaches to alignment-free sequence comparison are based on the length of exact or inexact word matches between pairs of input sequences. Haubold et al. (J Comput Biol 16:1487–1500, 2009) showed how the...

    Authors: Burkhard Morgenstern, Svenja Schöbel and Chris-André Leimeister
    Citation: Algorithms for Molecular Biology 2017 12:27
  24. Suffix arrays, augmented by additional data structures, allow solving efficiently many string processing problems. The external memory construction of the generalized suffix array for a string collection is a ...

    Authors: Felipe A. Louza, Guilherme P. Telles, Steve Hoffmann and Cristina D. A. Ciferri
    Citation: Algorithms for Molecular Biology 2017 12:26
  25. Multiple sequence alignment (MSA) plays a key role in biological sequence analyses, especially in phylogenetic tree construction. Extreme increase in next-generation sequencing results in shortage of efficient...

    Authors: Shixiang Wan and Quan Zou
    Citation: Algorithms for Molecular Biology 2017 12:25
  26. In order to find correlated pairs of positions between proteins, which are useful in predicting interactions, it is necessary to concatenate two large multiple sequence alignments such that the sequences that ...

    Authors: William R. Taylor
    Citation: Algorithms for Molecular Biology 2017 12:24
  27. Genomic DNA frequently undergoes rearrangement of the gene order that can be localized by comparing the two DNA sequences. In mitochondrial genomes different mechanisms are likely at work, at least some of whi...

    Authors: Marwa Al Arab, Matthias Bernt, Christian Höner zu Siederdissen, Kifah Tout and Peter F. Stadler
    Citation: Algorithms for Molecular Biology 2017 12:22
  28. Mixtures of beta distributions are a flexible tool for modeling data with values on the unit interval, such as methylation levels. However, maximum likelihood parameter estimation with beta distributions suffe...

    Authors: Christopher Schröder and Sven Rahmann
    Citation: Algorithms for Molecular Biology 2017 12:21
  29. This paper addresses the problem of finding attractors in biological regulatory networks. We focus here on non-deterministic synchronous and asynchronous multi-valued networks, modeled using automata networks ...

    Authors: Emna Ben Abdallah, Maxime Folschette, Olivier Roux and Morgan Magnin
    Citation: Algorithms for Molecular Biology 2017 12:20
  30. Numerous cellular differentiation processes can be captured using discrete qualitative models of biological regulatory networks. These models describe the temporal evolution of the state of the network subject...

    Authors: Louis Fippo Fitime, Olivier Roux, Carito Guziolowski and Loïc Paulevé
    Citation: Algorithms for Molecular Biology 2017 12:19
  31. Isometric gene tree reconciliation is a gene tree/species tree reconciliation problem where both the gene tree and the species tree include branch lengths, and these branch lengths must be respected by the rec...

    Authors: Broňa Brejová, Askar Gafurov, Dana Pardubská, Michal Sabo and Tomáš Vinař
    Citation: Algorithms for Molecular Biology 2017 12:17
  32. In this work, we present a new coarse grained representation of RNA dynamics. It is based on adjacency matrices and their interactions patterns obtained from molecular dynamics simulations. RNA molecules are w...

    Authors: Sven Jager, Benjamin Schiller, Philipp Babel, Malte Blumenroth, Thorsten Strufe and Kay Hamacher
    Citation: Algorithms for Molecular Biology 2017 12:15
  33. The gene family-free framework for comparative genomics aims at providing methods for gene order analysis that do not require prior gene family assignment, but work directly on a sequence similarity graph. We ...

    Authors: Daniel Doerr, Metin Balaban, Pedro Feijão and Cedric Chauve
    Citation: Algorithms for Molecular Biology 2017 12:14
  34. Cancer is an evolutionary process characterized by the accumulation of somatic mutations in a population of cells that form a tumor. One frequent type of mutations is copy number aberrations, which alter the n...

    Authors: Mohammed El-Kebir, Benjamin J. Raphael, Ron Shamir, Roded Sharan, Simone Zaccaria, Meirav Zehavi and Ron Zeira
    Citation: Algorithms for Molecular Biology 2017 12:13
  35. Given a peptide as a string of amino acids, the masses of all its prefixes and suffixes can be found by a trivial linear scan through the amino acid masses. The inverse problem is the ideal ...

    Authors: Thomas Tschager, Simon Rösch, Ludovic Gillet and Peter Widmayer
    Citation: Algorithms for Molecular Biology 2017 12:12
  36. In a computed protein multiple sequence alignment, the coreness of a column is the fraction of its substitutions that are in so-called core columns of the gold-standard reference alignment of its proteins. In ben...

    Authors: Dan DeBlasio and John Kececioglu
    Citation: Algorithms for Molecular Biology 2017 12:11
  37. Frameshift translation is an important phenomenon that contributes to the appearance of novel coding DNA sequences (CDS) and functions in gene evolution, by allowing alternative amino acid translations of gene...

    Authors: Safa Jammali, Esaie Kuitche, Ayoub Rachati, François Bélanger, Michelle Scott and Aïda Ouangraoua
    Citation: Algorithms for Molecular Biology 2017 12:10
  38. A basic task in bioinformatics is the counting of k-mers in genome sequences. Existing k-mer counting tools are most often optimized for small k < 32 and suffer from excessive memory resource consumption or degra...

    Authors: Marius Erbert, Steffen Rechner and Matthias Müller-Hannemann
    Citation: Algorithms for Molecular Biology 2017 12:9