MongoChem Schema

From wiki.openchemistry.org
Revision as of 16:08, 3 January 2013 by Marcus.hanwell (talk | contribs) (moved ChemData Schema to MongoChem Schema: Project name change)
Jump to navigation Jump to search

This page describes the schema for the mongo database used by ChemData.

Molecules Collection

The data is stored in a format similar to Chemical JSON.

{ 
  "name" : "phenol"
  "atomCount" : 13,
  "heavyAtomCount" : 7,
  "formula" : "C6H6O",
  "inchi" : "InChI=1S/C6H6O/c7-6-4-2-1-3-5-6/h1-5,7H",
  "inchikey" : "ISWSIDIOOBJBQZ-UHFFFAOYSA-N",
  "descriptors" : {
    "tpsa" : 20.2,
    "xlogp3" : 0,
    "mass" : 94.11,
    "rotatable-bonds" : 0,
    "vabc" : 89.95
  },
  "annotations" : {
      ["john", Date(), "ran test #4 on phenol"]
  },
  "diagram" : BinData(0, "...")
}

The data is indexed by inchikey (unique) and heavyAtomCount (non-unique).

Quantum Collection

{ 
  "moleculeId" : "4ffef58953f0182db0000000",
  "name" : "C15H6N1S.39.86.sp.upbe10.tv.n.s",
  "calculation" : {
    "guess" : { ... },
    "theory" : "B3LYP",
    "basis" : "TZVP" 
  },
  "energy" : {
    "total" : -1923.12,
    "alpha" : {
      "homo" : -0.3,
      "lumo" : -0.1,
      "gap" : 0.2 
    },
    "beta" : : {
      "homo" : -0.3,
      "lumo" : -0.1,
      "gap" : 0.2 
    },
  },
  "totalDipoleMoment" : 3.3101,
  "files" : {
    "log" : [  ],
    "binary" : [ ]
  }
}

moleculeId maps to the ObjectId in the molecules collection