strapdown: '/styling/strapdown/v/0.2/strapdown.js'};
<html> <body> <h1>Welcome</h1> <p> The data files listed below are formatted for visualization in the Integrated Genome Browser, available from <a href="https://bioviz.org">BioViz.org</a>. </p> <p> Data files are from <a href="http://www.thellungiella.org">thellungiella.org</a>. Files were downloaded in December 2012 and reformatted for visualization in IGB. </p> <p> For information about the genome sequence and annotations, please contact Maheshi Dassanayake and Dong-Ha Oh. For information about this QuickLoad site or Integrated Genome Browser, contact Ann Loraine. </p> <p> The genome sequence and annotations for Thellungiella parvula (common name Anatolian cress) were published in the journal Nature Genetics in 2011. See: <a href="http://www.ncbi.nlm.nih.gov/pubmed/21822265">The genome of the extremophile crucifer Thellungiella parvula</a>. </p> <h2>About the files</h2> <p> Files with extension .gz were compressed and indexed using bgzip and tabix tools from Heng Li. </p> <p> The BED file T_parvula_May_2012.bed.gz is BED14 (bed detail) format, developed originally in the UCSC Genome Bioinformatics group and described on their site. </p> <p> In IGB, BED14 field 13 is designated the "gene name" field and field 14 is desigated as the description field. Both can be searched using the coordinates box or the Search tab once the file is loaded into IGB. </p> <p> T_parvula_May_2012.bed.gz was created from TpV84_ORFs.gff from thellungiella.org. In this case, BED14 fields 13 and 4 are identical and field 14 contains the Note attribute from the GFF extra feature field. The time stamp of the GFF file we used was Dec. 28, 2011 and the file was downloaded in Dec. 2012. The original file contained transcriptional start site and polyadenylation signals that were not included in the reformatted BED file because the format does not support these feature types except as field 2, which represents the start of transcription for a gene model. Boundaries of CDS, five_prime_UTR, and three_prime_UTR features were used to delimit exon boundaries in the BED output. For information about software used to perform the format conversion, please see log messages associated with this file. </p> <p> The .2bit file contains the genome sequence for parvula assembly version 2, released May, 2012. The fasta file was converted to .2bit format using Jim Kent's faToTwoBit tool. </p> </body> </html>
Name | Last modified | Size | Description | |
---|---|---|---|---|
Parent Directory | - | |||
T_parvula_May_2012.bed.gz.tbi | 2023-06-16 14:28 | 80K | Tabix index file | |
T_parvula_May_2012.2bit | 2023-06-16 14:28 | 30M | Sequence data in blat 2bit format | |
genome.txt | 2023-06-16 14:28 | 24K | Chromosome lengths and assembly information | |
T_parvula_May_2012.bed.gz | 2023-06-16 14:28 | 1.5M | Annotation or junction file | |
annots.xml | 2023-06-16 14:28 | 398 | Annotation and data file list |