Oxford University Press, Bioinformatics, 21(37), p. 3865-3873, 2021
DOI: 10.1093/bioinformatics/btab427
Full text: Unavailable
Abstract Motivation Genome-wide association studies can reveal important genotype–phenotype associations; however, data quality and interpretability issues must be addressed. For drug discovery scientists seeking to prioritize targets based on the available evidence, these issues go beyond the single study. Results Here, we describe rational ranking, filtering and interpretation of inferred gene–trait associations and data aggregation across studies by leveraging existing curation and harmonization efforts. Each gene–trait association is evaluated for confidence, with scores derived solely from aggregated statistics, linking a protein-coding gene and phenotype. We propose a method for assessing confidence in gene–trait associations from evidence aggregated across studies, including a bibliometric assessment of scientific consensus based on the iCite relative citation ratio, and meanRank scores, to aggregate multivariate evidence. This method, intended for drug target hypothesis generation, scoring and ranking, has been implemented as an analytical pipeline, available as open source, with public datasets of results, and a web application designed for usability by drug discovery scientists. Availability and implementation Web application, datasets and source code via https://unmtid-shinyapps.net/tiga/. Supplementary information Supplementary data are available at Bioinformatics online.