MayaChemTools

Previous  TOC  NextAtomTypes/EStateAtomTypes.pmCode | PDF | PDFA4

NAME

EStateAtomTypes

SYNOPSIS

use AtomTypes::EStateAtomTypes;

use AtomTypes::EStateAtomTypes qw(:all);

DESCRIPTION

EStateAtomTypes class provides the following methods:

new, AssignAtomTypes, GetAllPossibleEStateAtomTypes, GetAllPossibleEStateNonHydrogenAtomTypes, GetEStateAtomTypesData, StringifyEStateAtomTypes

The following functions are available:

GetAllPossibleEStateAtomTypes, GetAllPossibleEStateNonHydrogenAtomTypes, GetEStateAtomTypesData

EStateAtomTypes is derived from AtomTypes class which in turn is derived from ObjectProperty base class that provides methods not explicitly defined in EStateAtomTypes, AtomTypes or ObjectProperty classes using Perl's AUTOLOAD functionality. These methods are generated on-the-fly for a specified object property:

Set<PropertyName>(<PropertyValue>);
$PropertyValue = Get<PropertyName>();
Delete<PropertyName>();

The data file EStateAtomTypes.csv distributed with MayaChemTools release contains all possible electrotopological state (E-state) [ Ref 75-78 ] atom types.

E-state atom types for various different atom groups [Appendix Table 1 in Ref 76, Appendix III in Ref 77 ] are defined using central atom environments indicating its topological and valence state along with bonded hydrogens.

The current release of MayaChemTools implements an extended E-state atom assignment methodology which is able to assign atom types to any valid non-hydrogen atom in any atom group instead of a fixed set of E-state atom types [ Ref 77].

Let:

As = Atom symbol corresponding to element symbol

H<n> = Number of implicit and explicit hydrogens for atom

s = Single bond to non-hydrogen atoms attached to atom
s<x> = Symbol s repeated x times to indicate multiple single bonds

d = Double bond to non-hydrogen atoms attached to atom
d<x> = Symbol d repeated x times to indicate multiple double bonds

t = Triple bond to non-hydrogen atoms attached to atom
t<x> = Symbol t repeated x times to indicate multiple triple bonds

a = Aromatic to bond non-hydrogen atoms attached to atom
t<x> = Symbol a repeated x times to indicate multiple aromatic bonds

p = Plus or positive formal charge
m = Minus or negative formal charge

Then, E-state atom type specification for non-hydrogen or heavy atoms corresponds to:

t<x>d<x>a<x>s<x>AsH<n>p or t<x>d<x>a<x>s<x>AsH<n>m
Notes:

o p and n with values of 0 are not shown.
o s, d, t, and a bond symbol with values of zero are not shown.
o s and d bonds which are also aromatic don't contribute to the count of single and double bonds; instead, aromatic bond count reflect these bonds.

Hydrogen E-state [ Ref 76-77 ] atom type definitions are:

HGroup AtomType

-OH HsOH
-SH HsSH

-NH2 HsNH2
>NH HssNH
=NH HdNH
:NH: HaaNH
-NH3+ HsNH3p
>NH2+ HssNH2p
>NH-+ HsssNHp

#CH HtCH
=CH2 HdCH2 - H attached to a terminal vinyl group
=CH- HdsCH - H attached a non-terminal vinyl group
:CH: HaaCH

>CHF HCHF
-CH2F HCH2F
>CHCl HCHCl
-CH2Cl HCH2Cl

CHn (saturated) HCsats - H attached to sp3 carbon attached to saturated carbon(s)
CHn (unsatd.) HCsatu - H attached to sp3 carbon attached to unsaturated carbon(s)

CHn (aromatic) Havin - H attached to a non-terminal vinyl group, =CH-, attached to an aromatic carbon

CHn Hother - H attached to any other type of C, N, O or S
AHn Hmisc - H not attached to C, N, O or S
Notes:

o - : Single bond; = : Double bond; # : Triple bond
o Hother atom type capture Hydrogen atom groups not explicitly defined.
o HGroup doesn't explicitly corresponds to functional groups
o -OH group could be a hydroxyl group or part of carboxylic acid group and so on
o -NH2 group could be primary amine or part of an amide group and so on

Examples of E-state atom types for non-hydrogen or heavy atoms:

sCH3, dCH2, dsCH, ddC, aasC, sNH2 and so on

METHODS

new
$NewEStateAtomTypes = new AtomTypes::EStateAtomTypes(%NamesAndValues);

Using specified EStateAtomTypes property names and values hash, new method creates a new object and returns a reference to newly created EStateAtomTypes object. By default, the following properties are initialized:

Molecule = ''
Type = 'EState'
IgnoreHydrogens = 0

Examples:

$EStateAtomTypes = new AtomTypes::EStateAtomTypes( 'Molecule' => $Molecule, 'IgnoreHydrogens' => 0);
AssignAtomTypes
$EStateAtomTypes->AssignAtomTypes();

Assigns E-state atom types to all the atoms in a molecule and returns EStateAtomTypes.

GetAllPossibleEStateAtomTypes
$AllAtomTypesDataRef = $EStateAtomTypes-> GetAllPossibleEStateAtomTypes();
$AllAtomTypesDataRef = AtomTypes::EStateAtomTypes:: GetAllPossibleEStateAtomTypes();

Returns all possible EState atom types corresponding to hydrogen and non-hydrogen atoms as an array reference.

GetAllPossibleEStateNonHydrogenAtomTypes
$AtomTypesDataRef = $EStateAtomTypes->
                     GetAllPossibleEStateNonHydrogenAtomTypes();
$AtomTypesDataRef = AtomTypes::EStateAtomTypes::
                     GetAllPossibleEStateNonHydrogenAtomTypes();

Returns all possible EState atom types corresponding to non-hydrogen atoms as an array reference.

GetEStateAtomTypesData
$AtomTypesDataMapRef = $EStateAtomTypes->GetEStateAtomTypesData();
$AtomTypesDataMapRef = AtomTypes::EStateAtomTypes:: GetEStateAtomTypesData();

Returns EState atom types and associated data loaded from EState data file as a reference to hash with the following hash data format:

@{$EStateAtomTypesDataMap{AtomTypes}} - Array of all possible atom types for all atoms
@{$EStateAtomTypesDataMap{NonHydrogenAtomTypes}} - Array of all possible atom types for non-hydrogen atoms
@{$EStateAtomTypesDataMap->{ColLabels}} - Array of column labels
%{$EStateAtomTypesDataMap->{DataCol<Num>}} - Hash keys pair: DataCol<Num>, AtomType
StringifyEStateAtomTypes
$String = $EStateAtomTypes->StringifyEStateAtomTypes();

Returns a string containing information about EStateAtomTypes object.

AUTHOR

Manish Sud

SEE ALSO

AtomTypes.pm, AtomicInvariantsAtomTypes.pm, DREIDINGAtomTypes.pm, FunctionalClassAtomTypes.pm, MMFF94AtomTypes.pm, SLogPAtomTypes.pm, SYBYLAtomTypes.pm, TPSAAtomTypes.pm, UFFAtomTypes.pm

COPYRIGHT

Copyright (C) 2024 Manish Sud. All rights reserved.

This file is part of MayaChemTools.

MayaChemTools is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 3 of the License, or (at your option) any later version.

 

 

Previous  TOC  NextDecember 11, 2024AtomTypes/EStateAtomTypes.pm