phyloviz online: web-based phylogene8c data analysis and ... · phyloviz online: web-based...

1
PHYLOViZ Online: Web-based Phylogene8c Data Analysis And Visualiza8on For Allelic Profiles And SNP Data Bruno Gonçalves 1 , Alexandre Francisco 2,3 , Cá8a Vaz 2,4 , Mário Ramirez 1 , João Carriço 1 (1) Ins8tuto de Microbiologia, Ins8tuto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal (2) INESC-ID, R. Alves Redol 9, 1000-029 Lisboa, Portugal (3) Ins8tuto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal (4) Ins8tuto Superior de Engenharia de Lisboa, Ins8tuto Politécnico de Lisboa, 007 Lisboa, Portugal Acknowledgments BRG was supported by grant SFRH/BD/101448/2014 from the Fundação para a Ciência e Tecnologia, Portugal. RESTful API Modify the Minimum Spanning Tree Interac>ve Distance Matrix User Specific Area Introduc>on Conclusions References PHYLOViZ Online is a user-friendly web applica8on , an online version of the standalone PHYLOViZ(1), developed to allow users to perform profile based data analyses without sofware installa8on, and to enable an easy data access, analysis, and sharing from any Internet enabled computer. It allows data set storage and visualiza8on through dis8nct visual strategies, and a programma8c access to data through a RESTful API allowing it to be seamlessly integrated into any third party web service or sofware. PHYLOViZ Online is freely available at hEps://online.phyloviz.net STORE your own data sets. SHARE them through an user specific URL. Or by making your data sets PUBLICLY available. RETRIEVE programma8cally public or registered data. UPLOAD data. RUN available tree algorithms on stored data. More informa8on at: hEps://online.phyloviz.net/api Ahribute colours to nodes according to auxiliary data fields. LINK all nodes at a given distance (nLV graph) HIDE all links above a given distance threshold (Tree cut-off). a) VISUALIZE pairwise distances of profiles in a heatmap display. b) ORDER cells according to auxiliary data fields. c) EXPLORE the distance matrix using a dynamic table . PHYLOViZ Online is a freely available applica8on that provides an effec8ve way for users to visualize, perform visual analy8cs and share annotated data sets. It also complies with the increasing requirement to make complete data sets available and reproducible so that they can be independently scru8nized. PHYLOViZ Online source code is freely available under GPLv3 licence at hhps://github.com/bfrgoncalves/Online-PhyloVIZ/tree/postgres_version .A set of Node.js modules is also available at hhps://www.npm.com/package/phyloviz_bundle for developers to incorporate PHYLOViZ tree visualiza8on into their sofware. Figure 1: PHYLOViZ Online Tree visualiza8on of a data set that consists of 145 strains of Streptococcus pneumoniae whose par8al genome sequences are publicly available(2, 3). The profile data consists on a 237 loci cgMLST scheme. The Minimum Spanning Tree was computed using the goeBURST algorithm(2) and Colours represent dis8nct serotypes provided using an auxiliary data file. Figure 3: Interac8ve Distance Matrix constructed from a node selec8on of a cgMLST Streptococcus pneumoniae data set. Matrix cells were ordered according to serotype and colours represent distances computed through pairwise comparisons of profiles. Selected nodes on the distance matrix (red border colour) have their informa8on displayed in the table according to the selected auxiliary data fields. Figure 2: Different features available that modify the Minimum Spanning Tree. a) Colour assignment. Colours assigned by serotype field exis8ng in an auxiliary data file. b) n Locus Variant. All nodes with distances equal or above 12 are linked. C) Tree cut-off . All links with a distance value above 60 were deleted. 1. Francisco,A.P., Vaz,C., Monteiro,P.T., Melo-Cris8no,J., Ramirez,M. and Carriço,J.A. (2012) PHYLOViZ: phylogene>c inference and data visualiza>on for sequence based typing methods. BMC Bioinforma3cs 2015 16:1, 13, 87. 2. Chewapreecha,C., Harris,S.R., Croucher,N.J., Turner,C., Maronen,P., Cheng,L., Pessia,A., Aanensen,D.M., Mather,A.E., Page,A.J., et al. (2014) Dense genomic sampling iden>fies highways of pneumococcal recombina>on. Nature gene3cs, 46, 305–309. 3. Croucher,N.J., Finkelstein,J.A., Pelton,S.I., Mitchell,P.K., Lee,G.M., Parkhill,J., Bentley,S.D., Hanage,W.P. and Lipsitch,M. (2013) Popula>on genomics of post-vaccine changes in pneumococcal epidemiology. Nature gene3cs, 45, 656–663. 4. Francisco,A.P., Bugalho,M., Ramirez,M. and Carriço,J.A. (2009) Global op>mal eBURST analysis of mul>locus typing data using a graphic matroid approach. BMC Bioinforma3cs 2015 16:1, 10, 152. Applica>on Programming Interface for increased interoperability: Scan to see the Video Tutorial!

Upload: others

Post on 06-Apr-2020

11 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: PHYLOViZ Online: Web-based Phylogene8c Data Analysis And ... · PHYLOViZ Online: Web-based Phylogene8c Data Analysis And Visualizaon For Allelic Profiles And SNP Data Bruno Gonçalves1,

PHYLOViZOnline:Web-based

Phylogene8cDataAnalysisAndVisualiza8onForAllelicProfilesAndSNPData

BrunoGonçalves1,AlexandreFrancisco2,3,Cá8aVaz2,4,MárioRamirez1,JoãoCarriço1

(1)Ins8tutodeMicrobiologia,Ins8tutodeMedicinaMolecular,FaculdadedeMedicina,UniversidadedeLisboa,Lisboa,Portugal

(2)INESC-ID,R.AlvesRedol9,1000-029Lisboa,Portugal(3)Ins8tutoSuperiorTécnico,UniversidadedeLisboa,Lisboa,Portugal

(4)Ins8tutoSuperiordeEngenhariadeLisboa,Ins8tutoPolitécnicodeLisboa,007Lisboa,Portugal

AcknowledgmentsBRG was supported by grant SFRH/BD/101448/2014 from theFundaçãoparaaCiênciaeTecnologia,Portugal.

RESTfulAPI

ModifytheMinimumSpanningTree

Interac>veDistanceMatrix

UserSpecificArea

Introduc>on

ConclusionsReferences

PHYLOViZ Online is a user-friendly web applica8on, an online version of thestandalone PHYLOViZ(1), developed to allow users to perform profile baseddataanalyseswithoutsofwareinstalla8on,andtoenableaneasydataaccess,analysis, and sharing from any Internet enabled computer. It allows data setstorageandvisualiza8onthroughdis8nctvisualstrategies,andaprogramma8caccesstodatathroughaRESTfulAPIallowingittobeseamlesslyintegratedintoanythirdpartywebserviceorsofware.

PHYLOViZOnlineisfreelyavailableathEps://online.phyloviz.net

•  STOREyourowndatasets.•  SHAREthemthroughanuserspecificURL.•  OrbymakingyourdatasetsPUBLICLYavailable.

•  RETRIEVEprogramma8callypublicorregistereddata.•  UPLOADdata.•  RUNavailabletreealgorithmsonstoreddata.Moreinforma8onat:hEps://online.phyloviz.net/api

•  Ahributecolourstonodesaccordingtoauxiliarydatafields.•  LINKallnodesatagivendistance(nLVgraph)•  HIDEalllinksaboveagivendistancethreshold(Treecut-off).

a) VISUALIZE pairwise distances of profiles in aheatmapdisplay.b)ORDERcellsaccordingtoauxiliarydatafields.c)EXPLOREthedistancematrixusingadynamictable.

PHYLOViZOnlineisafreelyavailableapplica8onthatprovidesaneffec8vewayforuserstovisualize,performvisual analy8csand shareannotateddata sets. It also complieswith theincreasingrequirementtomakecompletedatasetsavailableandreproducible sothattheycan be independently scru8nized. PHYLOViZ Online source code is freely available underGPLv3licenceathhps://github.com/bfrgoncalves/Online-PhyloVIZ/tree/postgres_version.AsetofNode.jsmodules isalsoavailableathhps://www.npm.com/package/phyloviz_bundlefordeveloperstoincorporatePHYLOViZtreevisualiza8onintotheirsofware.

Figure1:PHYLOViZOnlineTreevisualiza8onofadata set thatconsistsof145strains of Streptococcus pneumoniae whose par8al genome sequences arepubliclyavailable(2,3).Theprofiledataconsistsona237 locicgMLSTscheme.TheMinimum Spanning Treewas computed using the goeBURST algorithm(2)andColoursrepresentdis8nctserotypesprovidedusinganauxiliarydatafile.

Figure 3: Interac8ve Distance Matrix constructed from a node selec8on of a cgMLSTStreptococcuspneumoniaedata set.Matrix cellswereorderedaccording to serotypeandcolours representdistances computed throughpairwisecomparisonsofprofiles. Selectednodeson thedistancematrix (redborder colour)have their informa8ondisplayed in thetableaccordingtotheselectedauxiliarydatafields.

Figure 2:Different features available that modify theMinimum Spanning Tree. a) Colour assignment.Colours assigned by serotype field exis8ng in an auxiliary data file. b) n Locus Variant. All nodes withdistancesequalorabove12are linked.C)Treecut-off .All linkswitha distancevalueabove60weredeleted.

1.  Francisco,A.P.,Vaz,C.,Monteiro,P.T.,Melo-Cris8no,J.,Ramirez,M.andCarriço,J.A.(2012)PHYLOViZ:phylogene>cinferenceanddatavisualiza>onfor

sequencebasedtypingmethods.BMCBioinforma3cs201516:1,13,87.2.  Chewapreecha,C., Harris,S.R., Croucher,N.J., Turner,C.,Maronen,P., Cheng,L., Pessia,A., Aanensen,D.M.,Mather,A.E., Page,A.J., et al. (2014)Dense

genomicsamplingiden>fieshighwaysofpneumococcalrecombina>on.Naturegene3cs,46,305–309.3.  Croucher,N.J.,Finkelstein,J.A.,Pelton,S.I.,Mitchell,P.K.,Lee,G.M.,Parkhill,J.,Bentley,S.D.,Hanage,W.P.andLipsitch,M.(2013)Popula>ongenomicsof

post-vaccinechangesinpneumococcalepidemiology.Naturegene3cs,45,656–663.4.  Francisco,A.P., Bugalho,M., Ramirez,M. and Carriço,J.A. (2009)Global op>mal eBURST analysis ofmul>locus typing data using a graphicmatroid

approach.BMCBioinforma3cs201516:1,10,152.

Applica>onProgrammingInterfaceforincreasedinteroperability:

ScantoseetheVideoTutorial!