Search
Now showing items 1-10 of 21
Servet: A Benchmark Suite for Autotuning on Multicore Clusters
(Institute of Electrical and Electronics Engineers, 2010-05-24)
[Abstract] MapReduce is a powerful tool for processing large data sets used by many applications running in distributed environments. However, despite the increasing number of computationally intensive problems that require ...
Acceleration of a Feature Selection Algorithm Using High Performance Computing
(MDPI AG, 2020-09-01)
[Abstract]
Feature selection is a subfield of data analysis that is on reducing the dimensionality of datasets, so that subsequent analyses over them can be performed in affordable execution times while keeping the same ...
Multithreaded and Spark parallelization of feature selection filters
(2016)
[Abstract]: Vast amounts of data are generated every day, constituting a volume that is challenging to analyze. Techniques such as feature selection are advisable when tackling large datasets. Among the tools that provide ...
Parallel-FST: A feature selection library for multicore clusters
(Elsevier, 2022-11)
[Abstract]: Feature selection is a subfield of machine learning focused on reducing the dimensionality of datasets by performing a computationally intensive process. This work presents Parallel-FST, a publicly available ...
MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems
(Oxford University Press, 2016)
[Abstracts] MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-scale input ...
HSRA: Hadoop-based spliced read aligner for RNA sequencing data
(Public Library of Science, 2018-07-31)
[Abstract] Nowadays, the analysis of transcriptome sequencing (RNA-seq) data has become the standard method for quantifying the levels of gene expression. In RNA-seq experiments, the mapping of short reads to a reference ...
UPCBLAS: a library for parallel matrix computations in Unified Parallel C
(John Wiley & Sons Ltd., 2012-09-25)
[Abstract] The popularity of Partitioned Global Address Space (PGAS) languages has increased during the last years thanks to their high programmability and performance through an efficient exploitation of data locality, ...
A 2D algorithm with asymmetric workload for the UPC conjugate gradient method
(Springer New York LLC, 2014)
[Abstract] This paper examines four different strategies, each one with its own data distribution, for implementing the parallel conjugate gradient (CG) method and how they impact communication and overall performance. ...
PATO: genome-wide prediction of lncRNA-DNA triple helices
(Oxford University Press, 2023-03)
[Abstract]: Motivation: Long non-coding RNA (lncRNA) plays a key role in many biological processes. For instance, lncRNA regulates chromatin using different molecular mechanisms, including direct RNA-DNA hybridization via ...
Parallel feature selection for distributed-memory clusters
(2019)
[Abstract]: Feature selection is nowadays an extremely important data mining stage in the field of machine learning due to the appearance of problems of high dimensionality. In the literature there are numerous feature ...