The transPLANT variation archive has been developed to provide a system for the persistent storage and analysis of variation data from plant species. We are accepting submissions of variant data in VCF format (version 4 and above) on known reference sequences, i.e. sequences present in the databases of the INSDC: ENA, GenBank, or DDBJ.
Submitted data will be:
- Persistently archived via the ENA.
- Uniquely accessioned per submission and at each variant locus.
Data from different submissions referencing the same locus will be assigned a common, non-redundant identifier.
- Propagated to new versions of the reference sequence.
When a new version of a reference sequence is submitted to the INSDC, all accessioned variants will be mapped onto the new reference sequence (where possible).
- Made available for download.
Both the original (and accessioned) submission and any subsequent updates will be made available for download.
- Made available in the Ensembl Plants interface.
Where the submitted data is located on a reference sequence that is used in the Ensembl Plants database, it will be visible there.
This work is being carried out in the context of the development of the European Variation Archive, a new resource to organize and process genomic variation data for all species.