mol and mol2

mol and mol2

The keyword mol gives the initial molecular coordinates. For some tasks like transition state search, another structure has to be given in option mol2.

Explicit XYZ Coordinates

You can simply give XYZ coordinates in Angstrom:

XYZ File Name

You can give the XYZ file name (path can be included if necessary):

PDB File Name

You can give the PDB file name (path can be included if necessary):

Theoretical Background

XYZ File Format

An XYZ file is a text format employed to depict the geometry of a molecule. Its structure is delineated as such:

  1. Number of atoms: The initial line consists of an integer representing the total number of atoms in the molecule.
  2. Comment line: The second line typically comprises a comment that can be utilized to provide information regarding the molecule.
  3. Each subsequent line contains the elemental symbol of an atom and its corresponding Cartesian coordinates (x, y, z), typically expressed in Angstroms.

PDB File Format

The protein data bank (PDB) file format is a comprehensive textual format widely used to store three-dimensional structural information of biological macromolecules, such as proteins and nucleic acids. Each PDB file contains detailed data, including the molecule's name, source species, sequence, secondary structure, and crystallographic parameters. In Qbics, only atomic records is used, i.e., lines starting with ATOM and HETATM.

Each atomic record in a PDB file provides specific information about an atom:

  1. Record Type: Indicates the type of record, such as ATOM or HETATM.
  2. Atom Serial Number: A unique identifier for each atom.
  3. Atom Name: The element symbol and possibly additional characters to distinguish atoms within the same residue.
  4. Residue Name: The name of the residue (e.g., ALA for alanine).
  5. Residue Sequence Number: The position of the residue in the sequence.
  6. X, Y, Z Coordinates: The atom's coordinates in Angstrom.
  7. Occupancy: The fraction of the molecule present at this position. This is a crystallographic parameter NOT used by Qbics.
  8. Temperature Factor (B-factor): Indicates the atomic displacement or mobility. This is a crystallographic parameter NOT used by Qbics.
  9. Element Symbol: The chemical element symbol.
Warning

Some software generates a PDB file format without "Element Symbol" column. In this case, Qbics will guess the element symbol based on the "Atom Name" column. However, this is dangerous and should be avoided since "Atom Name" is in principle arbitrary and can be misleading. For example, CLA can mean either chlorine or carbon atom in different PDB files. Therefore, it is recommended to always include the "Element Symbol" column in the PDB file for Qbics calculations.

Input Examples

Example: Transition State Search Using mol and mol2 to Give Reactant and Product

Search the transition state using NEB at xTB level of theory, with reactant and product structure giving in mol and mol2, respectively:

In this file, strucutures defined in mol and mol2 are shown below: