User Tools

Site Tools


data_type:genetic_data:dna_packaging

Genetic Data- Secondary Structure

MeSH ID: D042003

Description:
The folding of an organism's DNA molecule into a compact, orderly structure that fits within the limited space of a cell or virus particle.

Best practice for sharing this type of data:
Data containing secondary structure elements may be stored as FASTA files and are commonly displayed as a sequence using DSSP code. The three general sets of DSSP code are H (alpha helix), E (beta strand/sheet), and C (loop/turn/coil). The sequence of each secondary structure element may be displayed as Tabular data, along with the primary structure sequence and other relevant information such as protein name and a protein ID for database storage. Tabular data should be saved as a .txt or .csv file. The first row(s) should contain information about the dataset, such as the data file name, author, today's date, when the data within the file were last modified, and companion file names. Please also state which symbol has been used to denote missing data (NA is preferred). Column headings should describe the content of each column and contain only numbers, letters, and underscores - no spaces or special characters. Lowercase letters are preferred. Row names should be consistent with those used in the article and in other related datasets

Most suitable repositories:
Secondary structure data may be submitted to repositories such as CATH Protein Structure Classification, Rfam, RNAcentral, rPredictor, Structural Biology Data Grid, and wwPDB

Best practice for indicating re-use of existing data:
For public datasets please provide a DOI or other stable identified for the dataset itself *and* include a citation for the dataset in the reference list. Be sure to indicate exactly which data has been re-used, particularly when multiple versions of the dataset exist. In many cases, this is best achieved by sharing the code used to extract the part of the data that you analyzed. In some cases it may be best to share the exact dataset(s) you analyzed as well.

For access-controlled data authors should provide a link to instructions for obtaining access (e.g. here is the information page for ADNI (Alzheimer's Disease Neuroimaging Initiative): http://adni.loni.usc.edu/data-samples/access-data/).

When re-using a private dataset from a previous study please contact the data owners to discuss how the data can be made public.

Most suitable repositories:
Not applicable

data_type/genetic_data/dna_packaging.txt · Last modified: 2021/04/23 18:48 by samantha