Converting GenBank ASN.1 data file to XML:
- Obtain GenBank ASN.1 data file at: ftp://ftp.ncbi.nlm.nih.gov/ncbi-asn1/. Here daily-nc directory contains individual files for each day's new or updated entries since close-of-data for the last GenBank Release in ASN.1 format.
Additional documentation:
/ncbi-asn1/README.asn1
/ncbi-asn1/daily-nc/README.asn1.daily-nc
- Download the appropriate datatool binary for your platform:
ftp://ftp.ncbi.nlm.nih.gov/toolbox/ncbi_tools++/BIN/CURRENT/datatool/ - Download NCBI data specification file:
https://ncbi.nlm.nih.gov/data_specs/asn/NCBI_all.asn - Run the program:
./datatool -m NCBI_all.asn -d gbest225.aso -t Bioseq-set -px gbest225.xml
Here:
gbest225.aso
is the name of the source GenBank data file in ASN binary format
Bioseq-set
is the name of the data type in the source file
gbest225.xml
is the name of the output file in XML format