Basis Set Input
Summary
Introduction

CRYSTAL performs ab initio calculations on periodic systems within the linear combination of atomic orbitals (LCAO) approximation. That is, the crystalline orbitals (CO) are treated as linear combinations of Bloch functions (BF), \[ \begin{equation} \label{eq:COdef} \psi_{i}({\bf r};{\bf k}) = \sum_{\mu} a_{\mu,i} ({\bf k}) \phi_{\mu}({\bf r};{\bf k}) \end{equation} \] \[ \begin{equation} \label{BFdef} \phi_{\mu}({\bf r};{\bf k}) = \sum_{\bf g} \varphi_{\mu} ({\bf r}-{\bf A}_\mu-{\bf g})\; e^{i {\bf k} \cdot {\bf g}} \end{equation} \]

  defined in terms of local functions, hereafter referred to as atomic orbitals (AO). Each AO is expressed as a linear combination of a fixed number of Gaussian type functions (GTF): \[ \begin{equation} \label{AOdef} \varphi_\mu ({\bf r}-{\bf A}_\mu - {\bf g}) = \sum_j^{n_G} d_j\; G( \alpha_j; {\bf r}-{\bf A}_\mu - {\bf g}) \end{equation} \]

The GTFs of a contraction share the same centre A and have fixed coefficients, d, and exponents, α, defined in the input.

r: coordinate of an electron
g: direct lattice vector; the sum over g extends over all the (infinitely many) vectors of the direct lattice
k: wave vector defining a point in reciprocal space
A: coordinate of an atom in the reference cell
a: variational coefficients; they multiply the BFs, and the sum over μ is limited to the number of basis functions
d: coefficients of the primitive gaussians in the contraction, fixed for a given basis set; the sum over j is limited to the number of functions in the contraction

The AOs belonging to a given atom are grouped into shells.  
A shell can contain either all AOs with the same quantum numbers n and l (for instance 3s, 2p, 3d shells), or all the AOs with the same principal quantum number n and different l (sp shells, in which the s and p gaussians share the same exponents).

A single, normalized, s-type GTF, the adjoined gaussian, is associated with each shell. The exponent of the adjoined gaussian is the smallest exponent of the gaussians in the contraction.
The adjoined gaussian is used to estimate the AO overlap and select the level of approximation to be adopted for the evaluation of the integrals.
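
As a small illustration of the adjoined gaussian, the following Python sketch selects the smallest exponent of a contraction and evaluates the normalization constant of an s-type GTF, N = (2α/π)^(3/4). The function names are hypothetical and are not part of CRYSTAL; the exponents are those of the Mg 1s STO-3G shell used in Example 1 of this chapter.

```python
import math

def adjoined_exponent(exponents):
    """Exponent of the adjoined gaussian: the smallest exponent in the contraction."""
    return min(exponents)

def s_gtf_norm(alpha):
    """Normalization constant of the s-type GTF exp(-alpha * r**2)."""
    return (2.0 * alpha / math.pi) ** 0.75

# Exponents of the Mg 1s shell (STO-3G), copied from Example 1
mg_1s = [299.2374, 54.50647, 14.75158]
alpha = adjoined_exponent(mg_1s)
print("adjoined exponent:", alpha)
print("normalization constant:", s_gtf_norm(alpha))
```

How CRYSTAL turns the overlap between adjoined gaussians into integral-selection criteria is not reproduced here; the sketch only shows which gaussian is meant.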

This chapter briefly discusses the basis set input section. Defining the basis set is the first step in uniquely defining the level of calculation. The molecular/crystalline basis set must be balanced, meaning that each centre must have the same variational freedom in describing the electrons formally attributed to it.
Basis sets of different quality on different atoms (for example, a minimal basis set on some atoms and split valence + polarization on others) may give rise to spurious effects that are exploited during the SCF iterations and can prevent the calculation from converging.

A few simple examples show how the basis set is specified in the CRYSTAL input.

Basis set input

The definition of the basis set is in the second input block. A basis set and an initial electronic configuration must be given for each atom with a different conventional atomic number in the crystal structure input. CRYSTAL can use either general basis sets, including s, p, d and f functions, or standard Pople basis sets (internally stored). Both all-electron and effective core pseudopotential (ECP) calculations can be performed. In the latter case, the ECP must be inserted in this section as well.

When the basis set input has been specified, several optional keywords can be used, related to modification of the electronic configuration, use of ghost functions, and printing options.

Standard input

The basis set input format is strictly related to the mathematical definition of basis set given above. 
For each atom (as many blocks as there are different types of atoms in the crystal structure) the following must be specified:
        the conventional atomic number and the number of shells, ns, of the atomic basis set

        for each shell (ns blocks of records): type of basis set (0-1-2), type of shell (0-1-2-3-4),
               number of primitive GTFs ng, formal shell electronic charge, scale factor

                for each primitive (ng records, given for basis set type 0 only):
                       exponent, contraction coefficient, [second contraction coefficient, sp shells only]

The definition of atomic basis sets ends with the record:  

99 0  

that is, the conventional atomic number 99 with 0 shells. Optional basis set keywords may follow.
The "conventional atomic number" links the basis set to the atoms entered in the geometry input.

Basis set input is closed by the keyword END.
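
The record structure described above can be sketched as a small Python reader. This is an illustrative sketch, not the parser used by CRYSTAL: the function name `parse_basis_block` is hypothetical, numbers are assumed to be written in a form Python's `float` accepts (no Fortran `D` exponents), and any text following the numeric fields of a record is ignored. The sample data is the oxygen STO-3G set of Example 1.

```python
def parse_basis_block(text):
    """Read atomic basis sets up to the terminating '99 0' record.

    Returns {NAT: [shell, ...]}; each shell is a dict holding the five
    shell fields and a list of primitives (empty for basis set types 1 and 2).
    """
    rows = [line.split() for line in text.splitlines() if line.strip()]
    atoms = {}
    i = 0
    while i < len(rows):
        nat, n_shells = int(rows[i][0]), int(rows[i][1])
        i += 1
        if nat == 99 and n_shells == 0:   # end of the basis set definitions
            break
        shells = []
        for _ in range(n_shells):
            bs_type, shell_type, ng = (int(x) for x in rows[i][:3])
            charge, scale = float(rows[i][3]), float(rows[i][4])
            i += 1
            primitives = []
            if bs_type == 0:              # primitives are listed for type 0 only
                n_cols = 3 if shell_type == 1 else 2   # sp shells: two coefficients
                for _ in range(ng):
                    primitives.append(tuple(float(x) for x in rows[i][:n_cols]))
                    i += 1
            shells.append({"bs_type": bs_type, "shell_type": shell_type,
                           "ng": ng, "charge": charge, "scale": scale,
                           "primitives": primitives})
        atoms[nat] = shells
    return atoms

example = """8 2
0 0 3 2.0 1.0
130.7093200 0.15432897
23.8088610 0.53532814
6.4436083 0.44463454
0 1 3 6.0 1.0
5.0331513 -0.09996723 0.15591627
1.1695961 0.39951283 0.60768372
0.3803890 0.70011547 0.39195739
99 0
END
"""
parsed = parse_basis_block(example)
for shell in parsed[8]:
    print(shell["shell_type"], shell["ng"], shell["charge"])
```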

In CRYSTAL three basis set types are available:

0 general basis set: exponent and contraction coefficients defined in input;
1 Pople STO-nG type basis set;
2 Pople 3(6)-21G type basis set;

The shell types available correspond to:

shell type code   shell type   number of AOs   AO order                                               max shell charge
0                 s            1               s                                                      2
1                 sp           4               s, x, y, z                                             8
2                 p            3               x, y, z                                                6
3                 d            5               2z^2-x^2-y^2, xz, yz, x^2-y^2, xy                      10
4                 f            7               (2z^2-3x^2-3y^2)z, (4z^2-x^2-y^2)x, (4z^2-x^2-y^2)y,   0 - polarization only
                                               (x^2-y^2)z, xyz, (x^2-3y^2)x, (3x^2-y^2)y

d shells include 5 d orbitals, f shells include 7 orbitals.
For sp shells two contraction coefficients must be specified, for the s and p AOs, respectively.

Standard polarization functions can be added to 3(6)-21G basis sets of atoms up to Z=18, by inserting a record describing the polarization shell.

The formal shell electronic charge is the number of electrons attributed to each shell as the initial electronic configuration. The electronic configuration of the atoms is used only in the calculation of the atomic wave function (when the guess for the SCF is a superposition of atomic densities). It may correspond to a neutral atom or to an ion (for MgO: Mg and O, or Mg++ and O--). In either case the cell must be neutral, that is, the net charge in the cell must be zero.
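
The neutrality requirement can be checked by hand before running a calculation. The following Python sketch is a hypothetical helper, not a CRYSTAL feature: it sums nuclear charges and formal shell charges, and also rejects a shell charge above the maximum listed in the shell type table above. Both the neutral-atom and the ionic configuration of MgO give a neutral cell.

```python
# Maximum formal charge per shell type code, taken from the shell type table
# (f shells are polarization only, hence 0).
MAX_SHELL_CHARGE = {0: 2, 1: 8, 2: 6, 3: 10, 4: 0}

def net_cell_charge(atoms):
    """atoms: list of (Z, [(shell_type, formal_charge), ...]), one entry per atom in the cell."""
    total = 0.0
    for z, shells in atoms:
        for shell_type, charge in shells:
            if charge > MAX_SHELL_CHARGE[shell_type]:
                raise ValueError(f"shell type {shell_type} cannot hold {charge} electrons")
        total += z - sum(charge for _, charge in shells)
    return total

# Neutral atoms: Mg (2, 8, 2) and O (2, 6)
mgo_neutral = [(12, [(0, 2.0), (1, 8.0), (1, 2.0)]),
               (8, [(0, 2.0), (1, 6.0)])]
# Ions: Mg++ (2, 8, 0) and O-- (2, 8, 0)
mgo_ionic = [(12, [(0, 2.0), (1, 8.0), (1, 0.0)]),
             (8, [(0, 2.0), (1, 8.0), (1, 0.0)])]

print(net_cell_charge(mgo_neutral), net_cell_charge(mgo_ionic))
```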

Example 1: General basis set input (basis set type 0) for MgO (Hartree-Fock) with the STO-3G Pople basis set (H - Ne: W.J. Hehre, R.F. Stewart and J.A. Pople, J. Chem. Phys. 2657 (1969); Na - Ar: W.J. Hehre, R. Ditchfield, R.F. Stewart and J.A. Pople, J. Chem. Phys. 2769 (1970)).
MgO bulk
CRYSTAL
0 0 0
225
4.21
2
12 0.0 0.0 0.0
8 0.5 0.5 0.5
END
12 3
0 0 3 2.0 1.0
299.2374000 0.1543289673
54.5064700 0.5353281423
14.7515800 0.4446345422
0 1 3 8.0 1.0
15.1218200 -0.09996722919 0.1559162750
3.5139870 0.39951282610 0.6076837186
1.1428570 0.70011546890 0.3919573931
0 1 3 2.0 1.0
1.3954480 -0.2196203690 0.01058760429
0.3893260 0.2255954336 0.59516700530
0.1523800 0.9003984260 0.46200101200
8 2
0 0 3 2.0 1.0
130.7093200 0.15432897
23.8088610 0.53532814
6.4436083 0.44463454
0 1 3 6.0 1.0
5.0331513 -0.09996723 0.15591627
1.1695961 0.39951283 0.60768372
0.3803890 0.70011547 0.39195739
99 0
END
SHRINK
8 8
END

Example 2: MgO (Hartree-Fock) with a STO-3G Pople basis set, internally stored as in Gaussian 94.

MgO bulk
CRYSTAL
0 0 0
225
4.21
2
12 0.0 0.0 0.0
8 0.5 0.5 0.5
END
12 3
1 0 3 2. 0
1 1 3 8. 0
1 1 3 2. 0
8 2
1 0 3 2. 0
1 1 3 6. 0
99 0
END
SHRINK
8 8
END

In the output, the basis set exponents and coefficients are printed in an extended table. They are identical for examples 1 and 2, since both inputs define the same basis set in a different form.

 *******************************************************************************
LOCAL ATOMIC FUNCTIONS BASIS SET
*******************************************************************************
ATOM X(AU) Y(AU) Z(AU) N. TYPE EXPONENT S COEF P COEF D/F/G COEF
*******************************************************************************
1 MG 0.000 0.000 0.000
1 S
2.992E+02 1.543E-01 0.000E+00 0.000E+00
5.451E+01 5.353E-01 0.000E+00 0.000E+00
1.475E+01 4.446E-01 0.000E+00 0.000E+00
2- 5 SP
1.512E+01-9.997E-02 1.559E-01 0.000E+00
3.514E+00 3.995E-01 6.077E-01 0.000E+00
1.143E+00 7.001E-01 3.920E-01 0.000E+00
6- 9 SP
1.395E+00-2.196E-01 1.059E-02 0.000E+00
3.893E-01 2.256E-01 5.952E-01 0.000E+00
1.524E-01 9.004E-01 4.620E-01 0.000E+00
2 O 3.978 3.978 3.978
10 S
1.307E+02 1.543E-01 0.000E+00 0.000E+00
2.381E+01 5.353E-01 0.000E+00 0.000E+00
6.444E+00 4.446E-01 0.000E+00 0.000E+00
11- 14 SP
5.033E+00-9.997E-02 1.559E-01 0.000E+00
1.170E+00 3.995E-01 6.077E-01 0.000E+00
3.804E-01 7.001E-01 3.920E-01 0.000E+00
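
The atomic coordinates in this table are printed in atomic units (bohr). As a quick check, the following Python lines convert the fractional coordinate 0.5 of the oxygen atom, with the lattice parameter 4.21 Angstrom of the input, into bohr. The conversion factor written below is an assumption of this sketch; the constant used internally by CRYSTAL may differ in the last digits.

```python
BOHR_IN_ANGSTROM = 0.529177  # assumed conversion factor (1 bohr in Angstrom)

def frac_to_bohr(frac, a_angstrom):
    """Cartesian coordinate in bohr along a cubic axis of length a_angstrom."""
    return frac * a_angstrom / BOHR_IN_ANGSTROM

x_oxygen = frac_to_bohr(0.5, 4.21)
print(f"{x_oxygen:.3f}")
```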

The influence of the basis set can be seen here for bulk silicon.

Input for pre-defined basis sets

Since CRYSTAL14, a set of internally stored pre-defined basis sets is available through the keyword BASISSET. Note that when this keyword is used, the END that closes the geometry input section, and the records 99 0 and END that close the "standard" basis set input section (see above), are no longer needed.
The dataset of available basis sets includes (available atomic numbers in parentheses):

Keyword Description
STO-3G Pople's STO-3G minimal basis set (1--53)
STO-6G Pople's STO-6G minimal basis set (1--36)
POB-DZVP POB double-\(\zeta\) valence + polarization set for solid state systems (1--35, 49, 74)
POB-DZVPP POB double-\(\zeta\) valence basis set + a double set of polarization functions for solid state systems (1--35, 49, 83)
POB-TZVP POB triple-\(\zeta\) valence + polarization basis set for solid state systems (1--35, 49, 83)

The same calculation as in MgO examples 1 and 2 can be set up by specifying two keywords: BASISSET and STO-3G.

MgO bulk
CRYSTAL
0 0 0
225
4.21
2
12 0.0 0.0 0.0
8 0.5 0.5 0.5
BASISSET
STO-3G
SHRINK
8 8
END

Warning: Pre-defined basis sets cannot be modified. Different conventional atomic numbers (see below) cannot be specified.

Exercise: define a STO-3G basis set for Chlorine.
Exercise: define a STO-3G basis set for Sodium.
Exercise: define a STO-3G basis set for NaCl
Exercise: define an extended basis set for NaCl referring to the CRYSTAL basis set library.

Notes on the meaning of conventional atomic number

The conventional atomic number, NAT, links the basis set with the atoms defined in the crystal structure.

NAT<200: all-electron basis set. Given Z, NAT=Z or NAT'=Z+100.
NAT>200: valence-electron basis set. Given Z, NAT=Z+200 or NAT'=Z+300. A core pseudopotential (ECP) must be defined.

A maximum of two different basis sets may be given for the same chemical species in positions not symmetry related, using the conventional atomic number NAT and NAT'.
Atoms with equal conventional atomic number are associated with the same basis set.

The atomic number Z is given by the remainder of the division of the conventional atomic number by 100 (Example: NAT=108, Z=8, Oxygen, all electron; NAT=208, Z=8, Oxygen, ECP).

A conventional atomic number 0 defines ghost atoms, that is points in space with an associated basis set, but lacking a nuclear charge.
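
The rules above can be summarized in a few lines of Python. The function name `decode_nat` and the returned labels are hypothetical, chosen only for this illustration.

```python
def decode_nat(nat):
    """Interpret a conventional atomic number according to the rules above."""
    if nat == 0:
        return {"Z": 0, "kind": "ghost"}
    kind = "valence-electron (ECP)" if nat > 200 else "all-electron"
    return {"Z": nat % 100, "kind": kind}

for nat in (8, 108, 208, 0):
    print(nat, decode_nat(nat))
```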

The following example illustrates the use of different conventional atomic numbers for the same chemical species in non-equivalent positions.

In this example (test 35 of the CRYSTAL test cases), a three-layer slab model of the MgO(001) surface is created (SLABCUT) and a CO molecule is added (ATOMINSE) above the surface to simulate an adsorption process. Two different atomic basis sets are used for oxygen: in MgO the oxygen (NAT=8, Z=8) basis set is optimized for O--, whereas in the CO molecule the oxygen (NAT=108, Z=8) basis set is a standard molecular one.

TEST35 - MGO SLAB (001), 3 LAYER + CO ADSORPTION
CRYSTAL
0 0 0
225
4.21
2
12 0. 0. 0.
8 0.5 0.5 0.5
SLABCUT
0 0 1
1 3
BREAKSYM
ATOMINSE CO molecule added
2
6 1.488 -1.488 4.605
108 1.488 -1.488 5.729
END
12 3
0 0 8 2. 1.
68371.875 0.0002226
9699.34009 0.0018982
2041.176786 0.0110451
529.862906 0.0500627
159.186000 0.169123
54.6848 0.367031
21.2357 0.400410
8.74604 0.14987
0 1 6 8. 1.
156.795 -0.00624 0.00772
31.0339 -0.07882 0.06427
9.6453 -0.07992 0.2104
3.7109 0.29063 0.34314
1.61164 0.57164 0.3735
0.64294 0.30664 0.23286
0 1 1 0. 1.
0.4 1. 1.
8 3 user defined basis set
0 0 8 2. 1.
4000. 0.00144
1355.58 0.00764
248.545 0.05370
69.5339 0.16818
23.8868 0.36039
9.27593 0.38612
3.82034 0.14712
1.23514 0.07105
0 1 5 8. 1.
52.1878 -0.00873 0.00922
10.3293 -0.08979 0.07068
3.21034 -0.04079 0.20433
1.23514 0.37666 0.34958
0.536420 0.42248 0.27774
0 1 1 0. 1.
0.210000 1. 1.
6 3 Pople 6-21G basis set
2 0 6 2. 0.
2 1 2 4. 0.
2 1 1 0. 0.
108 3
2 0 6 2. 0.
2 1 2 6. 0.
2 1 1 0. 0.
99 0
END
SHRINK
8 8
FMIXING
30
END

Exercise: Use the MgO bulk input to create a Mg defect in a 32-atom super cell of MgO. Then specify in the input deck that the first neighbors of the defect have a different basis set from the other oxygen atoms.

Effective Core Pseudopotentials

CRYSTAL can perform valence-electron only calculations with the aid of effective core pseudopotentials (ECP). The idea behind pseudopotentials is to treat the core electrons as effective averaged potentials rather than actual particles. Thus, pseudopotentials are modifications to the Hamiltonian.
Atoms with conventional atomic number > 200 have a valence-only basis set, the core electrons being described by an effective core pseudopotential.
The definition of the ECP is inserted in the atomic basis set input block, after the record giving the conventional atomic number and before the records giving the variational basis set.
The variational basis set must be the basis set supplied with the ECP.

The following ECPs are available as internal data in the CRYSTAL code:

ECP                          Keyword
Hay and Wadt large core      HAYWLC
Hay and Wadt small core      HAYWSC
Durand and Barthelat         BARTHE or DURAND

Warning: The data defining the pseudopotentials were included in CRYSTAL92 and never modified.

The keyword INPUT allows pseudopotentials to be entered from external data, when available. This is the case for the Stuttgart-Dresden pseudopotentials, for which updated data can be found at the web site: http://www.uni-stuttgart.de/theochem/

See CRYSTAL User Manual (subject index ECP) for complete explanation of user defined ECP.

Ghost atoms and ghost functions

A conventional atomic number 0 defines ghost atoms, that is points in space with an associated basis set, but lacking a nuclear charge. Ghost atoms have two main applications.

The first application is the transformation of an atom into a "ghost", by removing the nuclear charge and the shell electron charge attributed to that centre, but leaving the basis set centred at the atomic position. This is done after the basis set definition with the optional keyword GHOSTS. The number of atoms to be transformed and the corresponding atomic labels must be specified. If the system is forced to maintain the original symmetry (KEEPSYMM), all the atoms symmetry related to the given one are turned into ghosts. Attention must be paid to the neutrality of the cell.

Ghost atoms and ghost functions can be very useful when treating defects like vacancies in materials (e.g. F center in oxides, C vacancy in diamond), because they allow a build up of charge density in the defective region by giving variational freedom in that site.
The first step when using a periodic model for a local defect in a crystalline material is the creation of a super cell, so that defects in neighboring cells do not interact. See the tutorial on defects in materials.

Note: the keyword ATOMREMO (see the geometry input section) also creates a vacancy but, unlike GHOSTS, it removes the basis functions as well as the nuclear charge and the electronic charge.

A second application is the use of ghost functions to evaluate the basis set superposition error (BSSE) in periodic systems by means of the counterpoise method. For instance, consider the adsorption energy of a monolayer of CO on the (001) surface of MgO. It is the difference between the energy of CO+MgO and the sum of the energies of the CO monolayer and the clean MgO (001) surface. If each of the two separate layers is computed with the basis set of its own atoms only, this difference overestimates the strength of the adsorption.

\(E_{adsorption} = E_{CO+MgO} - (E_{CO} +E_{MgO})\)

As the basis sets used in the calculation are generally far from complete, both the adsorbate layer and the surface layer may use the additional variational freedom offered by each other's basis functions to lower the energy of the complex system. This gives a non-physical stabilizing contribution to the energy of the surface-adsorbate complex (and may also lead to artificial charge transfer if the basis set description of the two systems is unbalanced). Hence there may be an error (BSSE) in the interaction energy which is connected with the superposition of the basis functions of the two subsystems.
Within the counterpoise method, the correction to the BSSE can be computed by means of the GHOSTS keyword.  
In the example above, the two contributions of CO monolayer and clean MgO (001) surface are computed starting from the geometry of the complex system, and transforming into ghosts the atoms of the surface and the CO monolayer, respectively.
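
Written as formulas, the procedure just described reads as follows. The superscript notation is introduced here only for illustration: \(E_{CO}^{[MgO\,ghosts]}\) denotes the energy of the CO monolayer computed at the geometry of the complex with the surface atoms turned into ghosts, and \(E_{MgO}^{[CO\,ghosts]}\) the energy of the surface with the CO atoms turned into ghosts.

\[ E_{adsorption}^{CP} = E_{CO+MgO} - \left( E_{CO}^{[MgO\,ghosts]} + E_{MgO}^{[CO\,ghosts]} \right) \]

\[ \Delta E_{BSSE} = \left( E_{CO} - E_{CO}^{[MgO\,ghosts]} \right) + \left( E_{MgO} - E_{MgO}^{[CO\,ghosts]} \right), \qquad E_{adsorption}^{CP} = E_{adsorption} + \Delta E_{BSSE} \]

Here \(E_{CO}\) and \(E_{MgO}\) are assumed to be computed at the same geometry as in the complex. For a variational method, adding ghost functions can only lower the energy of a subsystem or leave it unchanged, so \(\Delta E_{BSSE}\) is expected to be non-negative and the corrected adsorption is weaker than the uncorrected one.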

See test35 (MgO+CO) and test36 (CO monolayer with Mg and O ghosts).

A rather peculiar application of ghost atoms is to add bond basis functions. An example is supplied by test 28 of the CRYSTAL test cases. The variational basis set includes s functions only: to simulate p functions on Lithium, s functions are added around the Li atom. Warning: this is a numerical test, created to compare the results of CRYSTAL with other programs handling s functions only (Mulliken population analysis is based on the assumption that the basis functions are centred on an atomic position). The ghost atom is inserted in the geometry input as an atom with conventional atomic number 0.

TEST28 - LIH - GHOST ATOMS TO SIMULATE P FUNCTIONS ON LI
CRYSTAL
0 0 1
225
4.1237
3
3 0. 0. 0.
1 -0.5 -0.5 -0.5
0 .0 .0 -.37166
END
3 1
0 0 10 2.0 1.0
700. .001423
220. .003949
70. .016438
20. .089593
5. .316432
1.5 .487303
.51 .254013
.26 -.019926
.13 .018290
.065 -.004509
1 1
0 0 8 2. 1.
120. .000668
40. .001863
12. .007809
4. .032719
1.2 .114281
0.45 .117408
0.13 .636552
0.065 .184384
0 1
0 0 1 0. 1.
1.2 1. 1. 1.
99 0
END
TOLINTEG
4 4 4 4 8
SHRINK
4 4
FMIXING
30
END

Looking at the neighbors analysis, we see 6 "ghosts", s function only, around Lithium.

NEIGHBORS OF THE NON-EQUIVALENT ATOMS

N = NUMBER OF NEIGHBORS AT DISTANCE R
ATOM N R/ANG R/AU NEIGHBORS (ATOM LABELS AND CELL INDICES)
1 LI 6 1.5326 2.8962 3 XX 0 0 0 4 XX 0 0 0 5 XX 0 0 0
6 XX 0 0 0 7 XX 0 0 0 8 XX 0 0 0
1 LI 6 2.0619 3.8963 2 H -1 0 0 2 H 0-1 0 2 H 0 0-1
2 H -1-1 0 2 H -1 0-1 2 H 0-1-1

Other optional keywords

CHEMOD

It may be useful to allow atoms with the same basis set to have different electronic configurations. The formal shell charges attributed in the basis set input may be modified for selected atoms, identified by their label, by inserting the keyword CHEMOD. The user is advised to run a TESTGEOM first, to be sure which atoms/labels must be altered.
An example is an oxygen vacancy (GHOSTS) in a three-layer MgO(001) slab (SLABCUT) with a Li defect (ATOMSUBS). The CRYSTAL input file (TEST37) follows:

TEST37 - MGO-LI WITH OXYGEN VACANCY
CRYSTAL
0 0 0
225
4.21
2
12 0. 0. 0.
8 0.5 0.5 0.5
SLAB
0 0 1
1 3
SUPERCEL
2 0
0 2
ATOMSUBS
1
9 3
ATOMDISP displacement of atoms
1
9 0. 0. -.294 atom labeled 9, is displaced -0.294 angs. in z direction
NEIGHPRT
10
END
12 3
0 0 8 2. 1.
68371.875 0.0002226
9699.34009 0.0018982
2041.176786 0.0110451
529.862906 0.0500627
159.186000 0.169123
54.6848 0.367031
21.2357 0.400410
8.74604 0.14987
0 1 6 8. 1.
156.795 -0.00624 0.00772
31.0339 -0.07882 0.06427
9.6453 -0.07992 0.2104
3.7109 0.29063 0.34314
1.61164 0.57164 0.3735
0.64294 0.30664 0.23286
0 1 1 0. 1.
0.4 1. 1.
8 3
0 0 8 2. 1.
4000. 0.00144
1355.58 0.00764
248.545 0.05370
69.5339 0.16818
23.8868 0.36039
9.27593 0.38612
3.82034 0.14712
1.23514 0.07105
0 1 5 8. 1.
52.1878 -0.00873 0.00922
10.3293 -0.08979 0.07068
3.21034 -0.04079 0.20433
1.23514 0.37666 0.34958
0.536420 0.42248 0.27774
0 1 1 0. 1.
0.210000 1. 1.
3 3
0 0 6 2.0 1.0
700. .001421
220. .003973
70. .01639
20. .089954
5. .31565
1.5 .4946
0 0 1 1.0 1.0
0.5 1.
0 2 1 0.0 1.0
0.6 1
99 0
GHOSTS
1
5 at the site of atom labeled 5, Oxygen, a vacancy is created
CHEMOD
2
5 1. 0. 0. one electron is attributed to the vacancy as initial guess
9 2. 0. 0. atom labeled 9, Li, has an ionic configuration.
END
UHF
SHRINK
4 4
SPINLOCK
1 100
FMIXING
30
MULPOPAN
TOLDEE
7
NODIRECT
END

After the formation of the super cell, the atoms are listed in the output in the following order:

 **** ATOMS BELONGING TO THE SUPERCELL
LABEL AT.NO. COORDINATES (ANGSTROM AND FRACTIONAL)
1 12 1.4885 -1.4885 2.1050 0.2500 -0.2500
2 12 1.4885 1.4885 2.1050 0.2500 0.2500
3 12 -1.4885 -1.4885 2.1050 -0.2500 -0.2500
4 12 -1.4885 1.4885 2.1050 -0.2500 0.2500
5 8 -0.0000 0.0000 2.1050 -0.0000 0.0000
6 8 -0.0000 2.9769 2.1050 -0.0000 0.5000
7 8 2.9769 0.0000 2.1050 0.5000 0.0000
8 8 2.9769 2.9769 2.1050 0.5000 0.5000
9 12 0.0000 0.0000 0.0000 0.0000 0.0000
10 12 0.0000 2.9769 0.0000 0.0000 0.5000
11 12 2.9769 -0.0000 0.0000 0.5000 -0.0000
12 12 2.9769 2.9769 0.0000 0.5000 0.5000
13 8 -1.4885 1.4885 0.0000 -0.2500 0.2500
14 8 -1.4885 -1.4885 0.0000 -0.2500 -0.2500
15 8 1.4885 1.4885 0.0000 0.2500 0.2500
16 8 1.4885 -1.4885 0.0000 0.2500 -0.2500
17 12 -1.4885 1.4885 -2.1050 -0.2500 0.2500
18 12 -1.4885 -1.4885 -2.1050 -0.2500 -0.2500
19 12 1.4885 1.4885 -2.1050 0.2500 0.2500
20 12 1.4885 -1.4885 -2.1050 0.2500 -0.2500
21 8 0.0000 -0.0000 -2.1050 0.0000 -0.0000
22 8 0.0000 2.9769 -2.1050 0.0000 0.5000
23 8 -2.9769 -0.0000 -2.1050 -0.5000 -0.0000
24 8 -2.9769 2.9769 -2.1050 -0.5000 0.5000

Atom 9, Mg, is at the centre of the cell, at a distance of 5.95 Angstrom from the nearest Mg. It is substituted with a Lithium (ATOMSUBS), then the Lithium is shifted (ATOMDISP) towards an Oxygen. The symmetry is reduced. The final geometry is:

 ATOMS IN THE ASYMMETRIC UNIT 12 - ATOMS IN THE UNIT CELL: 24
ATOM X/A Y/B Z(ANGSTROM)
*******************************************************************************
1 T 12 MG 2.500000000000E-01 -2.500000000000E-01 2.105000000000E+00
2 F 12 MG 2.500000000000E-01 2.500000000000E-01 2.105000000000E+00
3 F 12 MG -2.500000000000E-01 -2.500000000000E-01 2.105000000000E+00
4 F 12 MG -2.500000000000E-01 2.500000000000E-01 2.105000000000E+00
5 T 8 O -1.665334536938E-16 5.551115123126E-17 2.105000000000E+00
6 T 8 O -1.665334536938E-16 -5.000000000000E-01 2.105000000000E+00
7 F 8 O -5.000000000000E-01 5.551115123126E-17 2.105000000000E+00
8 T 8 O -5.000000000000E-01 -5.000000000000E-01 2.105000000000E+00
9 T 3 LI 2.660361080736E-17 1.470449196433E-17 -2.940000000000E-01
10 T 12 MG 2.660361080736E-17 -5.000000000000E-01 0.000000000000E+00
11 F 12 MG -5.000000000000E-01 1.470449196433E-17 0.000000000000E+00
12 T 12 MG -5.000000000000E-01 -5.000000000000E-01 0.000000000000E+00
13 T 8 O 2.500000000000E-01 2.500000000000E-01 0.000000000000E+00
14 F 8 O 2.500000000000E-01 -2.500000000000E-01 0.000000000000E+00
15 F 8 O -2.500000000000E-01 2.500000000000E-01 0.000000000000E+00
16 F 8 O -2.500000000000E-01 -2.500000000000E-01 0.000000000000E+00
17 T 12 MG -2.500000000000E-01 2.500000000000E-01 -2.105000000000E+00
18 F 12 MG -2.500000000000E-01 -2.500000000000E-01 -2.105000000000E+00
19 F 12 MG 2.500000000000E-01 2.500000000000E-01 -2.105000000000E+00
20 F 12 MG 2.500000000000E-01 -2.500000000000E-01 -2.105000000000E+00
21 T 8 O 6.504660432771E-17 -3.729435768845E-17 -2.105000000000E+00
22 T 8 O 6.504660432771E-17 -5.000000000000E-01 -2.105000000000E+00
23 F 8 O -5.000000000000E-01 -3.729435768845E-17 -2.105000000000E+00
24 T 8 O -5.000000000000E-01 -5.000000000000E-01 -2.105000000000E+00

The electronic configuration of the atoms is Mg++, O--, Li neutral. When Li substitutes Mg, the cell is no longer neutral. An oxygen "ghost" is created (all its shell charges are set to 0), and the charge of one electron is then attributed to the ghost.

 *******************************************************************************
* ATOMS TRANSFORMED INTO GHOSTS : 1
5( 8)
SYMMETRY MAY BE BROKEN

*******************************************************************************
* ELECTRONIC CONFIGURATION MODIFIED IN 2 ATOMS
*******************************************************************************
5( 8) 1.0 0.0 0.0
9( 3) 2.0 0.0 0.0

The Li charge is modified to the electronic configuration of the Li+ ion. The final result is a neutral cell with an unpaired electron, so a UHF Hamiltonian is required.

In summary, a three-layer slab is cut out of the MgO bulk, the 2D cell is enlarged by defining a 2x2 super cell, and a Li defect is created by substituting a Mg atom. After the definition of the defective structure, the basis set is given in input for each element type. An ionic configuration is used for Mg and O, whereas an atomic electronic configuration is used for Li. The oxygen vacancy is then created with the GHOSTS keyword, which leaves the ghost functions in the vacancy. However, the basis set of the vacancy is defined for an oxygen atom, so the corresponding electronic configuration must be changed to define an unpaired electron in the vacancy. Finally, the initial atomic configuration of Li is modified to be ionic.
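
The electron bookkeeping of this example can be verified with a few lines of Python. This is a hand check written for this tutorial, not CRYSTAL output; the electron counts per atom are the sums of the formal shell charges in the TEST37 input (Mg: 2+8+0, O: 2+8+0, Li: 2+1+0), and the helper name is hypothetical.

```python
def cell_summary(atoms):
    """atoms: list of (label, nuclear_charge, electrons). Returns (net_charge, n_electrons)."""
    n_electrons = sum(electrons for _, _, electrons in atoms)
    net_charge = sum(z for _, z, _ in atoms) - n_electrons
    return net_charge, n_electrons

mg = ("Mg++", 12, 10)
ox = ("O--", 8, 10)

# After ATOMSUBS only: 11 Mg++, 12 O--, one neutral Li
after_atomsubs = [mg] * 11 + [ox] * 12 + [("Li", 3, 3)]
# After GHOSTS: one oxygen loses its nuclear charge and its electrons
after_ghosts = [mg] * 11 + [ox] * 11 + [("Li", 3, 3), ("ghost", 0, 0)]
# After CHEMOD: one electron on the ghost, Li becomes Li+
after_chemod = [mg] * 11 + [ox] * 11 + [("Li+", 3, 2), ("ghost", 0, 1)]

for name, cell in [("ATOMSUBS", after_atomsubs),
                   ("GHOSTS", after_ghosts),
                   ("CHEMOD", after_chemod)]:
    print(name, cell_summary(cell))
```

The cell is charged after the substitution alone and neutral once the ghost is created; CHEMOD does not change the total, it only moves one electron of the initial guess from Li to the vacancy. The total number of electrons is odd, which is why an open-shell treatment is needed.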

Choice of the basis set and related problems

The choice of the basis set is a fundamental step in defining the level of calculation and its accuracy. This is of particular importance when treating periodic systems where a large variety of chemical bonding can be found.  

The choice of the basis set is a compromise between accuracy and cost. Nevertheless, we think that accuracy must be the main goal of ab initio calculations. Good quality basis sets should therefore always be used, in spite of their computational cost, to avoid producing meaningless numbers.

Where to get additional information and web resources

A more detailed discussion of the choice of the basis set can be found in Chapter "Basis set" of the CRYSTAL User's Manual.

A library of basis sets for periodic systems is available at the CRYSTAL web site.
A click-able periodic table is shown. A text file with a list of basis sets is displayed by clicking on each element type.  References to publications and hints about optimization, where relevant, are also supplied.

An introductory guide to Gaussian basis set in solid-state electronic structure calculations has been written by Mike Towler (TCM Group, University of Cambridge, UK). The library contains basis sets for almost all the atoms beyond zinc in the periodic table. However, it contains basis sets which have never been used in published calculations, and also sets which have simply been optimized in atomic SCF calculations but not tested in a crystalline compound.

The EMSL library supplies a wide selection of atomic basis sets optimized for molecules, which are a good starting point for generating an atomic basis set to be used in a crystalline compound.
The utility program gaustocry converts a basis set input from GAUSSIAN format to CRYSTAL format.
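
To show what such a conversion involves, here is a minimal Python sketch. It is not the gaustocry program and makes no claim about how that utility works: the function name is hypothetical, the Gaussian-format layout assumed is the common one (an element header line, then shell lines of the form "label, number of primitives, scale factor" followed by the primitives, closed by a line of asterisks), and the formal shell charges, which the Gaussian format does not carry, must be supplied by the user. The sample data is the oxygen STO-3G set of Example 1.

```python
# CRYSTAL shell type codes, as listed in the shell type table of this chapter
SHELL_CODE = {"S": 0, "SP": 1, "P": 2, "D": 3, "F": 4}

def gaussian_to_crystal(text, nat, shell_charges):
    """Rewrite one atom's Gaussian-format basis set as CRYSTAL type 0 records."""
    rows = [line.split() for line in text.splitlines()
            if line.strip() and not line.strip().startswith("*")]
    rows = rows[1:]                      # skip the element header line
    shells, i = [], 0
    while i < len(rows):
        label, ng, scale = rows[i][0].upper(), int(rows[i][1]), float(rows[i][2])
        shells.append((SHELL_CODE[label], ng, scale, rows[i + 1:i + 1 + ng]))
        i += 1 + ng
    if len(shells) != len(shell_charges):
        raise ValueError("one formal shell charge per shell is required")
    out = [f"{nat} {len(shells)}"]
    for (code, ng, scale, primitives), charge in zip(shells, shell_charges):
        out.append(f"0 {code} {ng} {charge:.1f} {scale:.1f}")
        out.extend(" ".join(tokens) for tokens in primitives)  # numbers copied unchanged
    return "\n".join(out)

oxygen_gaussian = """O 0
S 3 1.00
130.7093200 0.15432897
23.8088610 0.53532814
6.4436083 0.44463454
SP 3 1.00
5.0331513 -0.09996723 0.15591627
1.1695961 0.39951283 0.60768372
0.3803890 0.70011547 0.39195739
****
"""
converted = gaussian_to_crystal(oxygen_gaussian, 8, [2.0, 6.0])
print(converted)
```

The sketch only rearranges records. It does not address normalization conventions, the choice between Cartesian and pure d functions, or the removal of very diffuse functions that are often unsuitable for crystalline systems.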

Exercise: Take a molecular 6-31G(d,p) basis set from the EMSL site for H, C, N, and O, in the Gaussian code format, and translate it into the CRYSTAL code format.