Don Berndt • dberndt@usf.edu • dberndt.link/home

SYNCHEM @ Stony Brook

As desribed for a fall 1994 research seminar by Herbert Gelernter: "The Stony Brook SYNCHEM system is a large knowledge-based domain-specific heuristic problem-solving program which has synthetic organic chemistry as its domain of expertise. SYNCHEM is able to discover synthesis routes for complex and chemically interesting molecules without guidance on the part of its user. Its main components are a chemist-oriented user interface (called KIS) for problem submission, output interpretation and knowledge base maintenance and refinement, a self-guided problem-space search module (the inference engine), a system maintenance module (called HUG), and the knowledge base itself."

In write-up, Prof. Gelernter went on to identify two challenges: 1) to improve reaction library enhancements (knowledge acquisition) through machine learning, and 2) to expand computational resources through parallelism (using the Linda framework). I was

"Despite the availability of a highly-refined user interface to the knowledge base that provides the chemist with easily understood graphical access to the reaction library, and even though a new reaction schema of moderate complexity can be entered into the library and validated for both syntactic and semantic correctness in an hour or so, it has been very difficult to engage domain experts to the long-term dedication and intensity of commitment necessary to create a production-quality knowledge base. We are currently addressing this problem via a research program in inductive and deductive machine learning."

"As the knowledge base grows larger, we will be forced to confront the inevitable inflation in the costs entailed by the substantial increase in both the selection and expressiveness of applicable operators that will have to be examined at each node in the combinatorially expanded region of the search space to be explored. We expect to meet the need for additional computing horsepower by exploiting the inherent suitability of the heuristic search paradigm for distributed/parallel execution. In particular, we plan to adopt the network Linda uncoupled process approach for distribution of the AND/OR problem-space search over the CS department's CER multiprocessor workstation network."

A quick note about Professor Herbert Gelernter, the director of the SYNCHEM project for roughly 25 years and an early artificial intelligence researcher. Dr. Gelernter was a physicist by training (and practice), but was also among the first wave of scientists and engineers that formed the modern day discipline of computer science. He was among the original dozen or so researchers that developed the area of artificial intelligence, building a "geometry-theorem proving machine" at IBM in the late 1950s, one of the earliest examples of an AI program. Check out the, some of his early work from a book-length collection called Computers and Thought, see XXXX. At Stony Brook, I got to work on the second generation SYNCHEM2 program under his direction both as a student and staff programmer. The development of SYNCHEM (and then SYNCHEM2) spanned more than two decades and is still probably one of the most interesting and complex AI artifacts ever built. I still cannot believe my good fortune to study with a true pioneer in the field (and an inspiring man). I still keep an old copy of Computers and Thought on my desk as a reminder.