Contents

Disease Ontology (DO)1 aims to provide an open source ontology for the integration of biomedical data that is associated with human disease. We developed DOSE2 package to promote the investigation of diseases. DOSE provides five methods including Resnik, Lin, Jiang, Rel and Wang for measuring semantic similarities among DO terms and gene products; Hypergeometric model and Gene Set Enrichment Analysis (GSEA) were also implemented for extracting disease association insight from genome wide expression profiles.

1 Citation

If you use DOSE in published research, please cite G. Yu (2015).

G Yu, LG Wang, GR Yan, QY He. DOSE: an R/Bioconductor package for Disease Ontology Semantic and Enrichment analysis. Bioinformatics 2015, 31(4):608-609. http://dx.doi.org/10.1093/bioinformatics/btu684.

2 Overview

DOSE provides five methods for measureing semantic similarity among DO terms and genes. It implemented over-representation analysis to associate disease with gene list (e.g. differential expressed genes) and gene set enrichment analysis to associate disease with genome wide expression profiles. The enrichment analyses support Disease Ontology (DO)1, Network of Cancer Gene (NCG)3 and DisGeNET4. In addition, several visualization methods were developed to help interpreting semantic and enrichment results.

3 Vignette Entry

More documents can be found in https://guangchuangyu.github.io/DOSE.

4 Session Information

Here is the output of sessionInfo() on the system on which this document was compiled:

## R version 3.3.2 (2016-10-31)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 16.04.1 LTS
## 
## locale:
##  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
##  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=C              
##  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
##  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
##  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
## [1] DOSE_3.0.10     BiocStyle_2.2.1
## 
## loaded via a namespace (and not attached):
##  [1] Rcpp_0.12.9          plyr_1.8.4           tools_3.3.2         
##  [4] digest_0.6.11        RSQLite_1.1-2        evaluate_0.10       
##  [7] memoise_1.0.0        tibble_1.2           gtable_0.2.0        
## [10] igraph_1.0.1         fastmatch_1.0-4      DBI_0.5-1           
## [13] yaml_2.1.14          parallel_3.3.2       gridExtra_2.2.1     
## [16] fgsea_1.0.2          stringr_1.1.0        knitr_1.15.1        
## [19] S4Vectors_0.12.1     IRanges_2.8.1        stats4_3.3.2        
## [22] rprojroot_1.2        grid_3.3.2           qvalue_2.6.0        
## [25] Biobase_2.34.0       data.table_1.10.0    AnnotationDbi_1.36.1
## [28] BiocParallel_1.8.1   GOSemSim_2.0.4       rmarkdown_1.3       
## [31] reshape2_1.4.2       GO.db_3.4.0          DO.db_2.9           
## [34] ggplot2_2.2.1        magrittr_1.5         splines_3.3.2       
## [37] backports_1.0.5      scales_0.4.1         htmltools_0.3.5     
## [40] BiocGenerics_0.20.0  assertthat_0.1       colorspace_1.3-2    
## [43] stringi_1.1.2        lazyeval_0.2.0       munsell_0.4.3

References

1. Schriml, L. M. et al. Disease ontology: A backbone for disease semantic integration. Nucleic Acids Research 40, D940–D946 (2011).

2. Yu, G., Wang, L.-G., Yan, G.-R. & He, Q.-Y. DOSE: An r/bioconductor package for disease ontology semantic and enrichment analysis. Bioinformatics 31, 608–609 (2015).

3. A., O., D., G. M., M., T. P. & C., F. D. NCG 5.0: Updates of a manually curated repository of cancer genes and associated properties from cancer mutational screenings. Nucleic Acids Research 44, D992–D999 (2016).

4. Janet, P. et al. DisGeNET: A discovery platform for the dynamical exploration of human diseases and their genes. Database 2015, bav028 (2015).