Misplaced Pages

Nexus file

Article snapshot taken from[REDACTED] with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
File format used in bioinformatics For the NeXus format used in neutron, x-ray, and muon scattering, see Nexus (data format).
Nexus format
Filename extensionsusually .nex or .nxs
Internet media typeapplication/octet-stream
Magic number'#NEXUS\n'
Developed byMaddison DR, Swofford DL, Maddison WP
Initial releaseDecember 1997 (27 years ago) (1997-12)
Type of formatbioinformatics
Open format?Yes

The extensible NEXUS file format is widely used in phylogenetics, evolutionary biology, and bioinformatics. It stores information about taxa, morphological character states, DNA and protein sequence alignments, distances, and phylogenetic trees. The NEXUS format also allows the storage of data that can facilitate analyses, such as sets of characters or taxa. Many popular phylogenetic programs, including PAUP*, MrBayes, Mesquite, MacClade, and SplitsTree, use this format. Nexus file names typically have the extension .nxs or .nex .

Syntax

A NEXUS file is made out of a fixed header #NEXUS followed by multiple blocks. Each block starts with BEGIN block_name; and ends with END;. The keywords are case-insensitive. Comments are enclosed inside square brackets . Each of the pre-defined types of blocks may appear only once.

Block Name Description
TAXA Specifies the OTUs (operational taxonomic units) in data set
CHARACTERS Specifies the character data (e.g., homologous morphological characters or a multiple sequence alignment)
DATA Equivalent to a CHARACTERS block that includes the NewTaxa subcommand in the Dimensions command
TREES Stores trees in Newick format
DISTANCES Stores distance matrices
SETS Assigns names to sets of characters (CHARSET) or OTUs (TAXSET)
ASSUMPTIONS Assumptions about the data or directions regarding data treatment (e.g., the character exclusion status)

The following example NEXUS uses the TAXA, CHARACTERS, and TREES blocks:

#NEXUS
Begin TAXA;
  Dimensions ntax=4;
  TaxLabels Alpha Beta Gamma Delta;
End;
Begin CHARACTERS;
  Dimensions nchar=15;
  Format datatype=dna missing=? gap=- matchchar=.;
  Matrix
    Alpha   atgctagctagctcg
    Beta    ......??...-.a.
    Gamma   ...t.......-.g. 
    Delta   ...t.......-.a.           
  ;
End;
Begin TREES;
  Tree tree1 = ((Alpha,Beta),Gamma,Delta);
END;

See also

References

  1. Maddison DR, Swofford DL, Maddison WP (1997). "NEXUS: An extensible file format for systematic information". Systematic Biology. 46 (4): 590–621. doi:10.1093/sysbio/46.4.590. PMID 11975335.
  2. PAUP* Archived 2006-09-03 at the Wayback Machine — Phylogenetic Analysis Using Parsimony *and other methods
  3. MrBayes
  4. Mesquite: A modular system for evolutionary analysis
  5. MacClade
  6. Huson and Bryant, Application of Phylogenetic Networks in Evolutionary Studies, Mol Biol Evol (2005) 23 (2): 254-267. https://doi.org/10.1093/molbev/msj030
  7. Detailed NEXUS specification

External links

Bioinformatics
Databases
Software
Other
Institutions
Organizations
Meetings
File formats
Related topics
Categories:
Nexus file Add topic