The GSID HIV Data Browser has been developed in collaboration with the Genome Bioinformatics Group at the University of California, Santa Cruz (UCGBG), which is a cross-departmental team within the Center of Biomolecular Sciences and Engineering (CBSE). Under the guidance of Jim Kent and Fan Hsu, UCGBG developed a relational database containing the significant clinical data and sequence information pertaining to the infected subjects participating in the VAX003 and VAX004 Phase III clinical trials.
GSID has made this valuable resource of information available to the HIV vaccine research community via a web-accessible database. Additionally, serological samples associated with the viral sequences contained on the website are also available to investigators for further in-depth analysis.
The GSID HIV Data Browser contains three main “view” pages:
Subject View Page
This view allows the user to capture all of the available information for individual subjects identified by a blinded subject identification number. As was required during the conduct of the trial, both GSID and UCGBG will continue to blind the information contained in the database to ensure volunteer (subject) confidentiality and privacy. A filter control page is available to allow the user to set search parameters to retrieve and display information. The database includes the following demographic and clinical information (a summary of the data is available in PDF format here):
• Age
• Race
• Gender
• Risk group
• Geographic location
• Weight
• Immunization status: vaccine or placebo recipient
• Estimated study days of infection
• CD4 counts
• Viral load measurements
Table View Page
This view allows users to construct tables containing selected information from multiple subjects. In this view, users have the ability to sort, display information broadly or based upon data specific criteria and output the text in tab delineated format. Additionally, the user is able to retrieve associated DNA or protein sequences from this view.
Sequence View Page
Sequence view contains tools which allow for sequences to be aligned with each other, with reference sequences or with consensus sequences. The database contains three sequences per infected subject, as well as phylogenetic and positive selection analysis data. The primary alignment tool included in this view is called BLAT. More information about BLAT and the other tools, including proteomics tools, is available at the GSID HIV Data Browser website.