the goodman group
university of cambridge  


   practical six: databases

Practical Six: Databases

(1) Searching the Cambridge Crystallographic Database

(a) Type 'quest filename' to start the search program, which is called Quest. Draw this molecule as the substructure which the program will search for in the database.

(b) After drawing the substructure, click DEFINE-STRUCTURE, to make the program remember it, then click on TO-SEARCH. You now tell the program what you want it to remember of the structures that it finds. Click on FDAT and FCON, to remember to geometry and the connectivity of the structures.

(c) Click on QUEST. We are only interested in structures whose three dimensional structure is in the database, so click on INSIST-ON- COORDS. Click on the picture of the substructure that should be visible, to define the search question. Now click on START-SEARCH.

Be careful. It is not possible to interrupt a search once it has begun, unless you find a structure. It is best, therefore, to begin with a loosely defined search, which is likely to find many structures, rather than one which may be too tightly defined. If you discover that you are finding too many structures, you can easily stop and redefine your search. If your search criterion is so tight that you find no structures, you will have to wait until the program has looked through the entire database, before you can try again.

(d) The program will search through the entire database to find structures matching the substructure you have specified. Watch the original window to see how it is progressing. When a structure is found, you can rotate it to check it is what you want. Then click on KEEP, to remember the data.

(e) When the entire database has been searched, exit from Quest. We can look at the structures that have been found by turning them into MacroModel files. Type 'csd2mmod filename' to do this. There will now be a file called filename.mmod which contains all the structures that were found during the search.

(f) Read these structures into MacroModel, and calculate the average oxygen-boron bond length. How strong is the association? Molecular orbital calculations suggest that fluorine or other electronegative groups on the boron will tend to eclipse the carbonyl. Is the data you have collected consistent with this?

(2) Brookhaven Protein Database

The Brookhaven Protein database is stored in the directory '/usr/bigdisk/brookhaven' Type 'ls /usr/brookhaven' to get a list of the files and their cryptic code names. These files can be read by MacroModel. Copy the file to your own directory, type "gunzip filename.pdb.gz", and then read it using MacroModel, after selecting the PDB option in the Read window.

Alternatively, it can be searched using quest. Type "quest filename -db PDB". When the search is complete, type "pdbget -l -d -f filename.pid" to copy the pdb files that have been found into your current directory.

A small selection, including index files, from the Brookhaven database is in /usr/local/brookhaven directory. It will be quicker to look in this directory than in the /usr/brookhaven directory.




© Goodman Group, 2005-2024; privacy; last updated April 22, 2024

department of chemistry University of Cambridge