Buscar
Mostrando ítems 11-20 de 44
MarDRe: efficient MapReduce-based removal of duplicate DNA reads in the cloud
(Oxford University Press, 2017)
[Abstract] This article presents MarDRe, a de novo cloud-ready duplicate and near-duplicate removal tool that can process single- and paired-end reads from FASTQ/FASTA datasets. MarDRe takes advantage of the widely adopted ...
A 2D algorithm with asymmetric workload for the UPC conjugate gradient method
(Springer New York LLC, 2014)
[Abstract] This paper examines four different strategies, each one with its own data distribution, for implementing the parallel conjugate gradient (CG) method and how they impact communication and overall performance. ...
HSRA: Hadoop-based spliced read aligner for RNA sequencing data
(Public Library of Science, 2018-07-31)
[Abstract] Nowadays, the analysis of transcriptome sequencing (RNA-seq) data has become the standard method for quantifying the levels of gene expression. In RNA-seq experiments, the mapping of short reads to a reference ...
MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems
(Oxford University Press, 2016)
[Abstracts] MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-scale input ...
Volatile STT-RAM Scratchpad Design and Data Allocation for Low Energy
(Association for Computing Machinery, 2015)
[Abstract] On-chip power consumption is one of the fundamental challenges of current technology scaling. Cache memories consume a sizable part of this power, particularly due to leakage energy. STT-RAM is one of several ...
CPPC: a compiler‐assisted tool for portable checkpointing of message‐passing applications
(John Wiley & Sons Ltd., 2010-11-19)
[Abstract] With the evolution of high‐performance computing toward heterogeneous, massively parallel systems, parallel applications have developed new checkpoint and restart necessities. Whether due to a failure in the ...
Compiler-Assisted Checkpointing of Parallel Codes: The Cetus and LLVM Experience
(Springer New York LLC, 2013)
[Abstract] With the evolution of high-performance computing, parallel applications have developed an increasing necessity for fault tolerance, most commonly provided by checkpoint and restart techniques. Checkpointing tools ...
Parallel feature selection for distributed-memory clusters
(2019)
[Abstract]: Feature selection is nowadays an extremely important data mining stage in the field of machine learning due to the appearance of problems of high dimensionality. In the literature there are numerous feature ...
Multithreaded and Spark parallelization of feature selection filters
(2016)
[Abstract]: Vast amounts of data are generated every day, constituting a volume that is challenging to analyze. Techniques such as feature selection are advisable when tackling large datasets. Among the tools that provide ...
Analysis of I/O Performance on an Amazon EC2 Cluster Compute and High I/O Platform
(Springer Netherlands, 2013-12)
[Abstract] Cloud computing is currently being explored by the scientific community to assess its suitability for High Performance Computing (HPC) environments. In this novel paradigm, compute and storage resources, as well ...