RDKitGenerateMolecularFrameworks.py - Generate Bemis Murcko molecular frameworks
RDKitGenerateMolecularFrameworks.py [--infileParams <Name,Value,...>] [--mode <GraphFrameworks or AtomicFrameworks> ] [ --outfileParams <Name,Value,...> ] [--overwrite] [--removeDuplicates <yes or no>] [--sort <yes or no>] [--sortOrder <ascending or descending>] [--useChirality <yes or no>] [-w <dir>] -i <infile> -o <outfile>
RDKitGenerateMolecularFrameworks.py -h | --help | -e | --examples
Generate Bemis Murcko [ Ref 133 ] molecular frameworks for molecules. Two types of molecular frameworks can be generated: Graph or atomic frameworks. The graph molecular framework is a generic framework. The atom type, hybridization, and bond order is ignore during its generation. All atoms are set to carbon atoms and all bonds are single bonds. The atom type, hybridization, and bond order is preserved during generation of atomic molecular frameworks.
The supported input file formats are: SD (.sdf, .sd), SMILES (.smi, .csv, .tsv, .txt)
The supported output file formats are: SD (.sdf, .sd), SMILES (.smi)
Print examples.
Print this help message.
Input file name.
A comma delimited list of parameter name and value pairs for reading molecules from files. The supported parameter names for different file formats, along with their default values, are shown below:
Possible values for smilesDelimiter: space, comma or tab.
Type of molecular frameworks to generate for molecules. Possible values: GraphFrameworks or AtomicFrameworks. The graph molecular framework is a generic framework. The atom type, hybridization, and bond order is ignore during its generation. All atoms are set to carbon atoms and all bonds are single bonds. The atom type, hybridization, and bond order is preserved during the generation of atomic molecular frameworks.
Output file name.
A comma delimited list of parameter name and value pairs for writing molecules to files. The supported parameter names for different file formats, along with their default values, are shown below:
Default value for compute2DCoords: yes for SMILES input file; no for all other file types.
Overwrite existing files.
Remove duplicate molecular frameworks. Possible values: yes or no. The duplicate molecular franworks are identified using canonical SMILES. The removed frameworks are written to a separate output file.
Sort molecular frameworks by heavy atom count. Possible values: yes or no.
Sorting order for molecular frameworks. Possible values: ascending or descending.
Use stereochemistry for generation of canonical SMILES strings to identify duplicate molecular frameworks.
Location of working directory which defaults to the current directory.
To generate graph molecular framworks for molecules and write out a SMILES file, type:
To generate graph molecular framworks, remove duplicate frameworks for molecules and write out SD files for unique and duplicate frameworks, type:
To generate atomic molecular framworks, remove duplicate frameworks, sort framworks by heavy atom count in ascending order, write out SMILES files for unique and duplicate frameworks, type:
To generate graph molecular framworks for molecules in a CSV SMILES file, SMILES strings in column 1, name in olumn 2, emove duplicate frameworks, sort framworks by heavy atom count in decending order and write out a SD file, type:
RDKitConvertFileFormat.py, RDKitDrawMolecules.py, RDKitSearchFunctionalGroups.py, RDKitSearchSMARTS.py
Copyright (C) 2024 Manish Sud. All rights reserved.
The functionality available in this script is implemented using RDKit, an open source toolkit for cheminformatics developed by Greg Landrum.
This file is part of MayaChemTools.
MayaChemTools is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 3 of the License, or (at your option) any later version.