Articles

Page 3 of 10

GrpClassifierEC: a novel classification approach based on the ensemble clustering space

Advances in molecular biology have resulted in big and complicated data sets, therefore a clustering approach that able to capture the actual structure and the hidden patterns of the data is required. Moreover...

Authors: Loai Abdallah and Malik Yousef

Citation: Algorithms for Molecular Biology 2020 15:3

Content type: Research Published on: 13 February 2020
- View Full Text
- View PDF
Finding all maximal perfect haplotype blocks in linear time

Recent large-scale community sequencing efforts allow at an unprecedented level of detail the identification of genomic regions that show signatures of natural selection. Traditional methods for identifying su...

Authors: Jarno Alanko, Hideo Bannai, Bastien Cazaux, Pierre Peterlongo and Jens Stoye

Citation: Algorithms for Molecular Biology 2020 15:2

Content type: Research Published on: 10 February 2020
- View Full Text
- View PDF
Non-parametric correction of estimated gene trees using TRACTION

Estimated gene trees are often inaccurate, due to insufficient phylogenetic signal in the single gene alignment, among other causes. Gene tree correction aims to improve the accuracy of an estimated gene tree ...

Authors: Sarah Christensen, Erin K. Molloy, Pranjal Vachaspati, Ananya Yammanuru and Tandy Warnow

Citation: Algorithms for Molecular Biology 2020 15:1

Content type: Research Published on: 4 January 2020
- View Full Text
- View PDF
Kohdista: an efficient method to index and query possible Rmap alignments

Genome-wide optical maps are ordered high-resolution restriction maps that give the position of occurrence of restriction cut sites corresponding to one or more restriction enzymes. These genome-wide optical m...

Authors: Martin D. Muggli, Simon J. Puglisi and Christina Boucher

Citation: Algorithms for Molecular Biology 2019 14:25

Content type: Software article Published on: 12 December 2019
- View Full Text
- View PDF
NANUQ: a method for inferring species networks from gene trees under the coalescent model

Species networks generalize the notion of species trees to allow for hybridization or other lateral gene transfer. Under the network multispecies coalescent model, individual gene trees arising from a network ...

Authors: Elizabeth S. Allman, Hector Baños and John A. Rhodes

Citation: Algorithms for Molecular Biology 2019 14:24

Content type: Research Published on: 6 December 2019
- View Full Text
- View PDF
TMRS: an algorithm for computing the time to the most recent substitution event from a multiple alignment column

As the number of sequenced genomes grows, researchers have access to an increasingly rich source for discovering detailed evolutionary information. However, the computational technologies for inferring biologi...

Authors: Hisanori Kiryu, Yuto Ichikawa and Yasuhiro Kojima

Citation: Algorithms for Molecular Biology 2019 14:23

Content type: Research Published on: 18 November 2019
- View Full Text
- View PDF
Adjacency-constrained hierarchical clustering of a band similarity matrix with application to genomics

Genomic data analyses such as Genome-Wide Association Studies (GWAS) or Hi-C studies are often faced with the problem of partitioning chromosomes into successive regions based on a similarity matrix of high-re...

Authors: Christophe Ambroise, Alia Dehman, Pierre Neuvial, Guillem Rigaill and Nathalie Vialaneix

Citation: Algorithms for Molecular Biology 2019 14:22

Content type: Research Published on: 15 November 2019
- View Full Text
- View PDF
Super short operations on both gene order and intergenic sizes

The evolutionary distance between two genomes can be estimated by computing a minimum length sequence of operations, called genome rearrangements, that transform one genome into another. Usually, a genome is mode...

Authors: Andre R. Oliveira, Géraldine Jean, Guillaume Fertin, Ulisses Dias and Zanoni Dias

Citation: Algorithms for Molecular Biology 2019 14:21

Content type: Research Published on: 5 November 2019
- View Full Text
- View PDF
Bayesian localization of CNV candidates in WGS data within minutes

Full Bayesian inference for detecting copy number variants (CNV) from whole-genome sequencing (WGS) data is still largely infeasible due to computational demands. A recently introduced approach to perform Forw...

Authors: John Wiedenhoeft, Alex Cagan, Rimma Kozhemyakina, Rimma Gulevich and Alexander Schliep

Citation: Algorithms for Molecular Biology 2019 14:20

Content type: Software article Published on: 23 September 2019
- View Full Text
- View PDF
Implications of non-uniqueness in phylogenetic deconvolution of bulk DNA samples of tumors

Tumors exhibit extensive intra-tumor heterogeneity, the presence of groups of cellular populations with distinct sets of somatic mutations. This heterogeneity is the result of an evolutionary process, describe...

Authors: Yuanyuan Qi, Dikshant Pradhan and Mohammed El-Kebir

Citation: Algorithms for Molecular Biology 2019 14:19

Content type: Research Published on: 3 September 2019
- View Full Text
- View PDF
A branching process for homology distribution-based inference of polyploidy, speciation and loss

The statistical distribution of the similarity or difference between pairs of paralogous genes, created by whole genome doubling, or between pairs of orthologous genes in two related species is an important so...

Authors: Yue Zhang, Chunfang Zheng and David Sankoff

Citation: Algorithms for Molecular Biology 2019 14:18

Content type: Research Published on: 1 August 2019
- View Full Text
- View PDF
A multi-labeled tree dissimilarity measure for comparing “clonal trees” of tumor progression

We introduce a new dissimilarity measure between a pair of “clonal trees”, each representing the progression and mutational heterogeneity of a tumor sample, constructed by the use of single cell or bulk high t...

Authors: Nikolai Karpov, Salem Malikic, Md. Khaledur Rahman and S. Cenk Sahinalp

Citation: Algorithms for Molecular Biology 2019 14:17

Content type: Research Published on: 27 July 2019
- View Full Text
- View PDF
A cubic algorithm for the generalized rank median of three genomes

The area of genome rearrangements has given rise to a number of interesting biological, mathematical and algorithmic problems. Among these, one of the most intractable ones has been that of finding the median ...

Authors: Leonid Chindelevitch, Sean La and Joao Meidanis

Citation: Algorithms for Molecular Biology 2019 14:16

Content type: Research Published on: 26 July 2019
- View Full Text
- View PDF
A general framework for genome rearrangement with biological constraints

This paper generalizes previous studies on genome rearrangement under biological constraints, using double cut and join (DCJ). We propose a model for weighted DCJ, along with a family of optimization problems ...

Authors: Pijus Simonaitis, Annie Chateau and Krister M. Swenson

Citation: Algorithms for Molecular Biology 2019 14:15

Content type: Research Published on: 19 July 2019
- View Full Text
- View PDF
Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge

Divide-and-conquer methods, which divide the species set into overlapping subsets, construct a tree on each subset, and then combine the subset trees using a supertree method, provide a key algorithmic framewo...

Authors: Erin K. Molloy and Tandy Warnow

Citation: Algorithms for Molecular Biology 2019 14:14

Content type: Research Published on: 19 July 2019
- View Full Text
- View PDF
Prefix-free parsing for building big BWTs

High-throughput sequencing technologies have led to explosive growth of genomic databases; one of which will soon reach hundreds of terabytes. For many applications we want to build and store indexes of these ...

Authors: Christina Boucher, Travis Gagie, Alan Kuhnle, Ben Langmead, Giovanni Manzini and Taher Mun

Citation: Algorithms for Molecular Biology 2019 14:13

Content type: Research Published on: 24 May 2019
- View Full Text
- View PDF
Linear time minimum segmentation enables scalable founder reconstruction

We study a preprocessing routine relevant in pan-genomic analyses: consider a set of aligned haplotype sequences of complete human chromosomes. Due to the enormous size of such data, one would like to represe...

Authors: Tuukka Norri, Bastien Cazaux, Dmitry Kosolobov and Veli Mäkinen

Citation: Algorithms for Molecular Biology 2019 14:12

Content type: Research Published on: 17 May 2019
- View Full Text
- View PDF
An average-case sublinear forward algorithm for the haploid Li and Stephens model

Hidden Markov models of haplotype inheritance such as the Li and Stephens model allow for computationally tractable probability calculations using the forward algorithm as long as the representative reference ...

Authors: Yohei M. Rosen and Benedict J. Paten

Citation: Algorithms for Molecular Biology 2019 14:11

Content type: Research Published on: 2 April 2019
- View Full Text
- View PDF
Differentially mutated subnetworks discovery

We study the problem of identifying differentially mutated subnetworks of a large gene–gene interaction network, that is, subnetworks that display a significant difference in mutation frequency in two sets of ...

Authors: Morteza Chalabi Hajkarim, Eli Upfal and Fabio Vandin

Citation: Algorithms for Molecular Biology 2019 14:10

Content type: Research Published on: 30 March 2019
- View Full Text
- View PDF
Repairing Boolean logical models from time-series data using Answer Set Programming

Boolean models of biological signalling-regulatory networks are increasingly used to formally describe and understand complex biological processes. These models may become inconsistent as new data become avail...

Authors: Alexandre Lemos, Inês Lynce and Pedro T. Monteiro

Citation: Algorithms for Molecular Biology 2019 14:9

Content type: Research Published on: 25 March 2019
- View Full Text
- View PDF
Kermit: linkage map guided long read assembly

With long reads getting even longer and cheaper, large scale sequencing projects can be accomplished without short reads at an affordable cost. Due to the high error rates and less mature tools, de novo assemb...

Authors: Riku Walve, Pasi Rastas and Leena Salmela

Citation: Algorithms for Molecular Biology 2019 14:8

Content type: Research Published on: 20 March 2019
- View Full Text
- View PDF
Reconciling multiple genes trees via segmental duplications and losses

Reconciling gene trees with a species tree is a fundamental problem to understand the evolution of gene families. Many existing approaches reconcile each gene tree independently. However, it is well-known that...

Authors: Riccardo Dondi, Manuel Lafond and Celine Scornavacca

Citation: Algorithms for Molecular Biology 2019 14:7

Content type: Research Published on: 20 March 2019
- View Full Text
- View PDF
External memory BWT and LCP computation for sequence collections with applications

Sequencing technologies produce larger and larger collections of biosequences that have to be stored in compressed indices supporting fast search operations. Many compressed indices are based on the Burrows–Wh...

Authors: Lavinia Egidi, Felipe A. Louza, Giovanni Manzini and Guilherme P. Telles

Citation: Algorithms for Molecular Biology 2019 14:6

Content type: Research Published on: 8 March 2019
- View Full Text
- View PDF
Connectivity problems on heterogeneous graphs

Network connectivity problems are abundant in computational biology research, where graphs are used to represent a range of phenomena: from physical interactions between molecules to more abstract relationship...

Authors: Jimmy Wu, Alex Khodaverdian, Benjamin Weitz and Nir Yosef

Citation: Algorithms for Molecular Biology 2019 14:5

Content type: Research Published on: 8 March 2019
- View Full Text
- View PDF
Semi-nonparametric modeling of topological domain formation from epigenetic data

Hi-C experiments capturing the 3D genome architecture have led to the discovery of topologically-associated domains (TADs) that form an important part of the 3D genome organization and appear to play a role in...

Authors: Emre Sefer and Carl Kingsford

Citation: Algorithms for Molecular Biology 2019 14:4

Content type: Research Published on: 5 March 2019
- View Full Text
- View PDF
SNPs detection by eBWT positional clustering

Sequencing technologies keep on turning cheaper and faster, thus putting a growing pressure for data structures designed to efficiently store raw data, and possibly perform analysis therein. In this view, ther...

Authors: Nicola Prezza, Nadia Pisanti, Marinella Sciortino and Giovanna Rosone

Citation: Algorithms for Molecular Biology 2019 14:3

Content type: Research Published on: 6 February 2019
- View Full Text
- View PDF
Constrained incremental tree building: new absolute fast converging phylogeny estimation methods with improved scalability and accuracy

Absolute fast converging (AFC) phylogeny estimation methods are ones that have been proven to recover the true tree with high probability given sequences whose lengths are polynomial in the number of number of...

Authors: Qiuyi Zhang, Satish Rao and Tandy Warnow

Citation: Algorithms for Molecular Biology 2019 14:2

Content type: Research Published on: 6 February 2019
- View Full Text
- View PDF
Automated partial atomic charge assignment for drug-like molecules: a fast knapsack approach

A key factor in computational drug design is the consistency and reliability with which intermolecular interactions between a wide variety of molecules can be described. Here we present a procedure to efficien...

Authors: Martin S. Engler, Bertrand Caron, Lourens Veen, Daan P. Geerke, Alan E. Mark and Gunnar W. Klau

Citation: Algorithms for Molecular Biology 2019 14:1

Content type: Research Published on: 5 February 2019
- View Full Text
- View PDF
Regmex: a statistical tool for exploring motifs in ranked sequence lists from genomics experiments

Motif analysis methods have long been central for studying biological function of nucleotide sequences. Functional genomics experiments extend their potential. They typically generate sequence lists ranked by ...

Authors: Morten Muhlig Nielsen, Paula Tataru, Tobias Madsen, Asger Hobolth and Jakob Skou Pedersen

Citation: Algorithms for Molecular Biology 2018 13:17

Content type: Software article Published on: 8 December 2018
- View Full Text
- View PDF
Superbubbles revisited

Superbubbles are distinctive subgraphs in direct graphs that play an important role in assembly algorithms for high-throughput sequencing (HTS) data. Their practical importance derives from the fact they are c...

Authors: Fabian Gärtner, Lydia Müller and Peter F. Stadler

Citation: Algorithms for Molecular Biology 2018 13:16

Content type: Research Published on: 1 December 2018
- View Full Text
- View PDF
Coordinate systems for supergenomes

Genome sequences and genome annotation data have become available at ever increasing rates in response to the rapid progress in sequencing technologies. As a consequence the demand for methods supporting compa...

Authors: Fabian Gärtner, Christian Höner zu Siederdissen, Lydia Müller and Peter F. Stadler

Citation: Algorithms for Molecular Biology 2018 13:15

Content type: Research Published on: 24 September 2018
- View Full Text
- View PDF
Improved de novo peptide sequencing using LC retention time information

Liquid chromatography combined with tandem mass spectrometry is an important tool in proteomics for peptide identification. Liquid chromatography temporally separates the peptides in a sample. The peptides tha...

Authors: Yves Frank, Tomas Hruz, Thomas Tschager and Valentin Venzin

Citation: Algorithms for Molecular Biology 2018 13:14

Content type: Research Published on: 29 August 2018
- View Full Text
- View PDF
Sorting signed circular permutations by super short operations

One way to estimate the evolutionary distance between two given genomes is to determine the minimum number of large-scale mutations, or genome rearrangements, that are necessary to transform one into the other. I...

Authors: Andre R. Oliveira, Guillaume Fertin, Ulisses Dias and Zanoni Dias

Citation: Algorithms for Molecular Biology 2018 13:13

Content type: Research Published on: 26 July 2018
- View Full Text
- View PDF
Split-inducing indels in phylogenomic analysis

Most phylogenetic studies using molecular data treat gaps in multiple sequence alignments as missing data or even completely exclude alignment columns that contain gaps.

Authors: Alexander Donath and Peter F. Stadler

Citation: Algorithms for Molecular Biology 2018 13:12

Content type: Research Published on: 16 July 2018
- View Full Text
- View PDF
Locus-aware decomposition of gene trees with respect to polytomous species trees

Horizontal gene transfer (HGT), a process of acquisition and fixation of foreign genetic material, is an important biological phenomenon. Several approaches to HGT inference have been proposed. However, most o...

Authors: Michał Aleksander Ciach, Anna Muszewska and Paweł Górecki

Citation: Algorithms for Molecular Biology 2018 13:11

Content type: Research Published on: 4 June 2018
- View Full Text
- View PDF
A fast and accurate enumeration-based algorithm for haplotyping a triploid individual

Haplotype assembly, reconstructing haplotypes from sequence data, is one of the major computational problems in bioinformatics. Most of the current methodologies for haplotype assembly are designed for diploid...

Authors: Jingli Wu and Qian Zhang

Citation: Algorithms for Molecular Biology 2018 13:10

Content type: Research Published on: 1 June 2018
- View Full Text
- View PDF
Finding local genome rearrangements

The double cut and join (DCJ) model of genome rearrangement is well studied due to its mathematical simplicity and power to account for the many events that transform gene order. These studies have mostly been...

Authors: Pijus Simonaitis and Krister M. Swenson

Citation: Algorithms for Molecular Biology 2018 13:9

Content type: Research Published on: 4 May 2018
- View Full Text
- View PDF
FSH: fast spaced seed hashing exploiting adjacent hashes

Patterns with wildcards in specified positions, namely spaced seeds, are increasingly used instead of k-mers in many bioinformatics applications that require indexing, querying and rapid similarity search, as the...

Authors: Samuele Girotto, Matteo Comin and Cinzia Pizzi

Citation: Algorithms for Molecular Biology 2018 13:8

Content type: Research Published on: 22 March 2018
- View Full Text
- View PDF
Outlier detection in BLAST hits

An important task in a metagenomic analysis is the assignment of taxonomic labels to sequences in a sample. Most widely used methods for taxonomy assignment compare a sequence in the sample to a database of kn...

Authors: Nidhi Shah, Stephen F. Altschul and Mihai Pop

Citation: Algorithms for Molecular Biology 2018 13:7

Content type: Research Published on: 22 March 2018
- View Full Text
- View PDF
OCTAL: Optimal Completion of gene trees in polynomial time

For a combination of reasons (including data generation protocols, approaches to taxon and gene sampling, and gene birth and loss), estimated gene trees are often incomplete, meaning that they do not contain a...

Authors: Sarah Christensen, Erin K. Molloy, Pranjal Vachaspati and Tandy Warnow

Citation: Algorithms for Molecular Biology 2018 13:6

Content type: Research Published on: 15 March 2018
- View Full Text
- View PDF
Derivative-free neural network for optimizing the scoring functions associated with dynamic programming of pairwise-profile alignment

A profile-comparison method with position-specific scoring matrix (PSSM) is among the most accurate alignment methods. Currently, cosine similarity and correlation coefficients are used as scoring functions of...

Authors: Kazunori D. Yamada

Citation: Algorithms for Molecular Biology 2018 13:5

Content type: Research Published on: 15 February 2018
- View Full Text
- View PDF
Fast phylogenetic inference from typing data

Microbial typing methods are commonly used to study the relatedness of bacterial strains. Sequence-based typing methods are a gold standard for epidemiological surveillance due to the inherent portability of s...

Authors: João A. Carriço, Maxime Crochemore, Alexandre P. Francisco, Solon P. Pissis, Bruno Ribeiro-Gonçalves and Cátia Vaz

Citation: Algorithms for Molecular Biology 2018 13:4

Content type: Research Published on: 15 February 2018
- View Full Text
- View PDF
A safe and complete algorithm for metagenomic assembly

Reconstructing the genome of a species from short fragments is one of the oldest bioinformatics problems. Metagenomic assembly is a variant of the problem asking to reconstruct the circular genomes of all bact...

Authors: Nidia Obscura Acosta, Veli Mäkinen and Alexandru I. Tomescu

Citation: Algorithms for Molecular Biology 2018 13:3

Content type: Research Published on: 7 February 2018
- View Full Text
- View PDF
Time-consistent reconciliation maps and forbidden time travel

In the absence of horizontal gene transfer it is possible to reconstruct the history of gene families from empirically determined orthology relations, which are equivalent to event-labeled gene trees. Knowledge o...

Authors: Nikolai Nøjgaard, Manuela Geiß, Daniel Merkle, Peter F. Stadler, Nicolas Wieseke and Marc Hellmuth

Citation: Algorithms for Molecular Biology 2018 13:2

Content type: Research Published on: 6 February 2018
- View Full Text
- View PDF
Gene tree parsimony for incomplete gene trees: addressing true biological loss

Species tree estimation from gene trees can be complicated by gene duplication and loss, and “gene tree parsimony” (GTP) is one approach for estimating species trees from multiple gene trees. In its standard ...

Authors: Md Shamsuzzoha Bayzid and Tandy Warnow

Citation: Algorithms for Molecular Biology 2018 13:1

Content type: Research Published on: 19 January 2018
- View Full Text
- View PDF
Phylogeny reconstruction based on the length distribution of k-mismatch common substrings

Various approaches to alignment-free sequence comparison are based on the length of exact or inexact word matches between pairs of input sequences. Haubold et al. (J Comput Biol 16:1487–1500, 2009) showed how the...

Authors: Burkhard Morgenstern, Svenja Schöbel and Chris-André Leimeister

Citation: Algorithms for Molecular Biology 2017 12:27

Content type: Research Published on: 11 December 2017
- View Full Text
- View PDF
Generalized enhanced suffix array construction in external memory

Suffix arrays, augmented by additional data structures, allow solving efficiently many string processing problems. The external memory construction of the generalized suffix array for a string collection is a ...

Authors: Felipe A. Louza, Guilherme P. Telles, Steve Hoffmann and Cristina D. A. Ciferri

Citation: Algorithms for Molecular Biology 2017 12:26

Content type: Research Published on: 7 December 2017
- View Full Text
- View PDF
HAlign-II: efficient ultra-large multiple sequence alignment and phylogenetic tree reconstruction with distributed and parallel computing

Multiple sequence alignment (MSA) plays a key role in biological sequence analyses, especially in phylogenetic tree construction. Extreme increase in next-generation sequencing results in shortage of efficient...

Authors: Shixiang Wan and Quan Zou

Citation: Algorithms for Molecular Biology 2017 12:25

Content type: Research Published on: 29 September 2017
- View Full Text
- View PDF
Algorithms for matching partially labelled sequence graphs

In order to find correlated pairs of positions between proteins, which are useful in predicting interactions, it is necessary to concatenate two large multiple sequence alignments such that the sequences that ...

Authors: William R. Taylor

Citation: Algorithms for Molecular Biology 2017 12:24

Content type: Research Published on: 25 September 2017
- View Full Text
- View PDF
Biologically feasible gene trees, reconciliation maps and informative triples

The history of gene families—which are equivalent to event-labeled gene trees—can be reconstructed from empirically estimated evolutionary event-relations containing pairs of orthologous, paralogous or xenologous...

Authors: Marc Hellmuth

Citation: Algorithms for Molecular Biology 2017 12:23

Content type: Research Published on: 29 August 2017
- View Full Text
- View PDF

How was your experience today?

Rating Please select one rating

Awful

Bad

Good

Great

Thank you for your feedback.

Tell us why (opens in a new tab)

Articles

Algorithms for Molecular Biology

Contact us