ViewVC Help
View File | Revision Log | Show Annotations | View Changeset | Root Listing
root/group/trunk/tengDissertation/Appendix.tex
(Generate patch)

Comparing trunk/tengDissertation/Appendix.tex (file contents):
Revision 2811 by tim, Wed Jun 7 02:24:27 2006 UTC vs.
Revision 2831 by tim, Thu Jun 8 21:02:53 2006 UTC

# Line 1 | Line 1
1   \appendix
2 < \chapter{\label{chapt:appendix}APPENDIX}
2 > \chapter{\label{chapt:oopse}Object-Oriented Parallel Simulation Engine}
3  
4   Designing object-oriented software is hard, and designing reusable
5   object-oriented scientific software is even harder. Absence of
6   applying modern software development practices is the bottleneck of
7 < Scientific Computing community\cite{Wilson}. For instance, in the
8 < last 20 years , there are quite a few MD packages that were
7 > Scientific Computing community\cite{Wilson2006}. For instance, in
8 > the last 20 years , there are quite a few MD packages that were
9   developed to solve common MD problems and perform robust simulations
10   . However, many of the codes are legacy programs that are either
11   poorly organized or extremely complex. Usually, these packages were
# Line 14 | Line 14 | documents which is crucial to the maintenance and exte
14   coordination to enforce design and programming guidelines. Moreover,
15   most MD programs also suffer from missing design and implement
16   documents which is crucial to the maintenance and extensibility.
17 + Along the way of studying structural and dynamic processes in
18 + condensed phase systems like biological membranes and nanoparticles,
19 + we developed and maintained an Object-Oriented Parallel Simulation
20 + Engine ({\sc OOPSE}). This new molecular dynamics package has some
21 + unique features
22 + \begin{enumerate}
23 +  \item {\sc OOPSE} performs Molecular Dynamics (MD) simulations on non-standard
24 + atom types (transition metals, point dipoles, sticky potentials,
25 + Gay-Berne ellipsoids, or other "lumpy"atoms with orientational
26 + degrees of freedom), as well as rigid bodies.
27 +  \item {\sc OOPSE} uses a force-based decomposition algorithm using MPI on cheap
28 + Beowulf clusters to obtain very efficient parallelism.
29 +  \item {\sc OOPSE} integrates the equations of motion using advanced methods for
30 + orientational dynamics in NVE, NVT, NPT, NPAT, and NP$\gamma$T
31 + ensembles.
32 +  \item {\sc OOPSE} can carry out simulations on metallic systems using the
33 + Embedded Atom Method (EAM) as well as the Sutton-Chen potential.
34 +  \item {\sc OOPSE} can perform simulations on Gay-Berne liquid crystals.
35 +  \item  {\sc OOPSE} can simulate systems containing the extremely efficient
36 + extended-Soft Sticky Dipole (SSD/E) model for water.
37 + \end{enumerate}
38  
39 + \section{\label{appendixSection:architecture }Architecture}
40 +
41 + Mainly written by \texttt{C/C++} and \texttt{Fortran90}, {\sc OOPSE}
42 + uses C++ Standard Template Library (STL) and fortran modules as the
43 + foundation. As an extensive set of the STL and Fortran90 modules,
44 + {\sc Base Classes} provide generic implementations of mathematical
45 + objects (e.g., matrices, vectors, polynomials, random number
46 + generators) and advanced data structures and algorithms(e.g., tuple,
47 + bitset, generic data, string manipulation). The molecular data
48 + structures for the representation of atoms, bonds, bends, torsions,
49 + rigid bodies and molecules \textit{etc} are contained in the {\sc
50 + Kernel} which is implemented with {\sc Base Classes} and are
51 + carefully designed to provide maximum extensibility and flexibility.
52 + The functionality required for applications is provide by the third
53 + layer which contains Input/Output, Molecular Mechanics and Structure
54 + modules. Input/Output module not only implements general methods for
55 + file handling, but also defines a generic force field interface.
56 + Another important component of Input/Output module is the meta-data
57 + file parser, which is rewritten using ANother Tool for Language
58 + Recognition(ANTLR)\cite{Parr1995, Schaps1999} syntax. The Molecular
59 + Mechanics module consists of energy minimization and a wide
60 + varieties of integration methods(see Chap.~\ref{chapt:methodology}).
61 + The structure module contains a flexible and powerful selection
62 + library which syntax is elaborated in
63 + Sec.~\ref{appendixSection:syntax}. The top layer is made of the main
64 + program of the package, \texttt{oopse} and it corresponding parallel
65 + version \texttt{oopse\_MPI}, as well as other useful utilities, such
66 + as \texttt{StatProps} (see Sec.~\ref{appendixSection:StaticProps}),
67 + \texttt{DynamicProps} (see
68 + Sec.~\ref{appendixSection:appendixSection:DynamicProps}),
69 + \texttt{Dump2XYZ} (see
70 + Sec.~\ref{appendixSection:appendixSection:Dump2XYZ}), \texttt{Hydro}
71 + (see Sec.~\ref{appendixSection:appendixSection:hydrodynamics})
72 + \textit{etc}.
73 +
74 + \begin{figure}
75 + \centering
76 + \includegraphics[width=\linewidth]{architecture.eps}
77 + \caption[The architecture of {\sc OOPSE}] {Overview of the structure
78 + of {\sc OOPSE}} \label{appendixFig:architecture}
79 + \end{figure}
80 +
81   \section{\label{appendixSection:desginPattern}Design Pattern}
82  
83   Design patterns are optimal solutions to commonly-occurring problems
# Line 49 | Line 112 | the modern scientific software applications, such as J
112  
113   As one of the latest advanced techniques emerged from
114   object-oriented community, design patterns were applied in some of
115 < the modern scientific software applications, such as JMol, OOPSE
116 < \cite{Meineke05} and PROTOMOL \cite{Matthey05} \textit{etc}.
115 > the modern scientific software applications, such as JMol, {\sc
116 > OOPSE}\cite{Meineke05} and PROTOMOL\cite{Matthey05} \textit{etc}.
117 > The following sections enumerates some of the patterns used in {\sc
118 > OOPSE}.
119  
120   \subsection{\label{appendixSection:singleton}Singleton}
121 < The Singleton pattern ensures that only one instance of a class is
122 < created. All objects that use an instance of that class use the same
123 < instance.
121 > The Singleton pattern not only provides a mechanism to restrict
122 > instantiation of a class to one object, but also provides a global
123 > point of access to the object. Currently implemented as a global
124 > variable, the logging utility which reports error and warning
125 > messages to the console in {\sc OOPSE} is a good candidate for
126 > applying the Singleton pattern to avoid the global namespace
127 > pollution.Although the singleton pattern can be implemented in
128 > various ways  to account for different aspects of the software
129 > designs, such as lifespan control \textit{etc}, we only use the
130 > static data approach in {\sc OOPSE}. {\tt IntegratorFactory} class
131 > is declared as
132 > \begin{lstlisting}[float,caption={[A classic Singleton design pattern implementation(I)] Declaration of {\tt IntegratorFactory} class.},label={appendixScheme:singletonDeclaration}]
133  
134 + class IntegratorFactory {
135 +  public:
136 +    static IntegratorFactory* getInstance();
137 +    protected:
138 +      IntegratorFactory();
139 +    private:
140 +      static IntegratorFactory* instance_;
141 + };
142 +
143 + \end{lstlisting}
144 + The corresponding implementation is
145 + \begin{lstlisting}[float,caption={[A classic implementation of Singleton design pattern (II)] Implementation of {\tt IntegratorFactory} class.},label={appendixScheme:singletonImplementation}]
146 +
147 + IntegratorFactory::instance_ = NULL;
148 +
149 + IntegratorFactory* getInstance() {
150 +  if (instance_ == NULL){
151 +    instance_ = new IntegratorFactory;
152 +  }
153 +  return instance_;
154 + }
155 +
156 + \end{lstlisting}
157 + Since constructor is declared as {\tt protected}, a client can not
158 + instantiate {\tt IntegratorFactory} directly. Moreover, since the
159 + member function {\tt getInstance} serves as the only entry of access
160 + to {\tt IntegratorFactory}, this approach fulfills the basic
161 + requirement, a single instance. Another consequence of this approach
162 + is the automatic destruction since static data are destroyed upon
163 + program termination.
164 +
165   \subsection{\label{appendixSection:factoryMethod}Factory Method}
61 The Factory Method pattern is a creational pattern which deals with
62 the problem of creating objects without specifying the exact class
63 of object that will be created. Factory Method solves this problem
64 by defining a separate method for creating the objects, which
65 subclasses can then override to specify the derived type of product
66 that will be created.
166  
167 + Categoried as a creational pattern, the Factory Method pattern deals
168 + with the problem of creating objects without specifying the exact
169 + class of object that will be created. Factory Method is typically
170 + implemented by delegating the creation operation to the subclasses.
171  
172 < \subsection{\label{appendixSection:visitorPattern}Visitor}
173 < The purpose of the Visitor Pattern is to encapsulate an operation
174 < that you want to perform on the elements of a data structure. In
175 < this way, you can change the operation being performed on a
176 < structure without the need of changing the classes of the elements
74 < that you are operating on.
172 > Registers a creator with a type identifier. Looks up the type
173 > identifier in the internal map. If it is found, it invokes the
174 > corresponding creator for the type identifier and returns its
175 > result.
176 > \begin{lstlisting}[float,caption={[The implementation of Factory pattern (I)].},label={appendixScheme:factoryDeclaration}]
177  
178 + class IntegratorFactory {
179 +  public:
180 +    typedef std::map<string, IntegratorCreator*> CreatorMapType;
181  
182 < \subsection{\label{appendixSection:templateMethod}Template Method}
182 >    bool registerIntegrator(IntegratorCreator* creator) {
183 >      return creatorMap_.insert(creator->getIdent(), creator).second;
184 >    }
185 >
186 >    Integrator* createIntegrator(const string& id, SimInfo* info) {
187 >      Integrator* result = NULL;
188 >      CreatorMapType::iterator i = creatorMap_.find(id);
189 >      if (i != creatorMap_.end()) {
190 >        result = (i->second)->create(info);
191 >      }
192 >      return result;
193 >    }
194 >
195 >  private:
196 >    CreatorMapType creatorMap_;
197 > };
198 > \end{lstlisting}
199 > \begin{lstlisting}[float,caption={[The implementation of Factory pattern (III)]Souce code of creator classes.},label={appendixScheme:integratorCreator}]
200 >
201 > class IntegratorCreator {
202 >  public:
203 >    IntegratorCreator(const string& ident) : ident_(ident) {}
204 >
205 >    const string& getIdent() const { return ident_; }
206 >
207 >    virtual Integrator* create(SimInfo* info) const = 0;
208 >
209 >  private:
210 >    string ident_;
211 > };
212 >
213 > template<class ConcreteIntegrator>
214 > class IntegratorBuilder : public IntegratorCreator {
215 >  public:
216 >    IntegratorBuilder(const string& ident) : IntegratorCreator(ident) {}
217 >    virtual  Integrator* create(SimInfo* info) const {
218 >      return new ConcreteIntegrator(info);
219 >    }
220 > };
221 > \end{lstlisting}
222 >
223 > \subsection{\label{appendixSection:visitorPattern}Visitor}
224  
225 + The purpose of the Visitor Pattern is to encapsulate an operation
226 + that you want to perform on the elements. The operation being
227 + performed on a structure can be switched without changing the
228 + interfaces  of the elements. In other words, one can add virtual
229 + functions into a set of classes without modifying their interfaces.
230 + The UML class diagram of Visitor patten is shown in
231 + Fig.~\ref{appendixFig:visitorUML}. {\tt Dump2XYZ} program in
232 + Sec.~\ref{appendixSection:Dump2XYZ} uses Visitor pattern
233 + extensively.
234 +
235 + \begin{figure}
236 + \centering
237 + \includegraphics[width=\linewidth]{visitor.eps}
238 + \caption[The architecture of {\sc OOPSE}] {Overview of the structure
239 + of {\sc OOPSE}} \label{appendixFig:visitorUML}
240 + \end{figure}
241 +
242 + \begin{lstlisting}[float,caption={[The implementation of Visitor pattern (I)]Source code of the visitor classes.},label={appendixScheme:visitor}]
243 +
244 + class BaseVisitor{
245 +  public:
246 +    virtual void visit(Atom* atom);
247 +    virtual void visit(DirectionalAtom* datom);
248 +    virtual void visit(RigidBody* rb);
249 + };
250 +
251 + \end{lstlisting}
252 +
253 + \begin{lstlisting}[float,caption={[The implementation of Visitor pattern (II)]Source code of the element classes.},label={appendixScheme:element}]
254 +
255 + class StuntDouble {
256 +  public:
257 +    virtual void accept(BaseVisitor* v) = 0;
258 + };
259 +
260 + class Atom: public StuntDouble {
261 +  public:
262 +    virtual void accept{BaseVisitor* v*} {
263 +      v->visit(this);
264 +    }
265 + };
266 +
267 + class DirectionalAtom: public Atom {
268 +  public:
269 +    virtual void accept{BaseVisitor* v*} {
270 +      v->visit(this);
271 +    }
272 + };
273 +
274 + class RigidBody: public StuntDouble {
275 +  public:
276 +    virtual void accept{BaseVisitor* v*} {
277 +      v->visit(this);
278 +    }
279 + };
280 +
281 + \end{lstlisting}
282 +
283   \section{\label{appendixSection:concepts}Concepts}
284  
285   OOPSE manipulates both traditional atoms as well as some objects
286   that {\it behave like atoms}.  These objects can be rigid
287   collections of atoms or atoms which have orientational degrees of
288 < freedom.  Here is a diagram of the class heirarchy:
289 <
290 < %\begin{figure}
291 < %\centering
292 < %\includegraphics[width=3in]{heirarchy.eps}
293 < %\caption[Class heirarchy for StuntDoubles in {\sc oopse}-3.0]{ \\
294 < %The class heirarchy of StuntDoubles in {\sc oopse}-3.0. The
295 < %selection syntax allows the user to select any of the objects that
296 < %are descended from a StuntDouble.} \label{oopseFig:heirarchy}
297 < %\end{figure}
298 <
288 > freedom.  A diagram of the class heirarchy is illustrated in
289 > Fig.~\ref{oopseFig:heirarchy}. Every Molecule, Atom and
290 > DirectionalAtom in {\sc OOPSE} have their own names which are
291 > specified in the {\tt .md} file. In contrast, RigidBodies are
292 > denoted by their membership and index inside a particular molecule:
293 > [MoleculeName]\_RB\_[index] (the contents inside the brackets depend
294 > on the specifics of the simulation). The names of rigid bodies are
295 > generated automatically. For example, the name of the first rigid
296 > body in a DMPC molecule is DMPC\_RB\_0.
297 > \begin{figure}
298 > \centering
299 > \includegraphics[width=\linewidth]{heirarchy.eps}
300 > \caption[Class heirarchy for ojects in {\sc OOPSE}]{ A diagram of
301 > the class heirarchy.
302   \begin{itemize}
303   \item A {\bf StuntDouble} is {\it any} object that can be manipulated by the
304   integrators and minimizers.
# Line 100 | Line 307 | DirectionalAtom}s which behaves as a single unit.
307   \item A {\bf RigidBody} is a collection of {\bf Atom}s or {\bf
308   DirectionalAtom}s which behaves as a single unit.
309   \end{itemize}
310 <
311 < Every Molecule, Atom and DirectionalAtom in {\sc oopse} have their
105 < own names which are specified in the {\tt .md} file. In contrast,
106 < RigidBodies are denoted by their membership and index inside a
107 < particular molecule: [MoleculeName]\_RB\_[index] (the contents
108 < inside the brackets depend on the specifics of the simulation). The
109 < names of rigid bodies are generated automatically. For example, the
110 < name of the first rigid body in a DMPC molecule is DMPC\_RB\_0.
310 > } \label{oopseFig:heirarchy}
311 > \end{figure}
312  
313   \section{\label{appendixSection:syntax}Syntax of the Select Command}
314  
315   The most general form of the select command is: {\tt select {\it
316 < expression}}
316 > expression}}. This expression represents an arbitrary set of
317 > StuntDoubles (Atoms or RigidBodies) in {\sc OOPSE}. Expressions are
318 > composed of either name expressions, index expressions, predefined
319 > sets, user-defined expressions, comparison operators, within
320 > expressions, or logical combinations of the above expression types.
321 > Expressions can be combined using parentheses and the Boolean
322 > operators.
323  
117 This expression represents an arbitrary set of StuntDoubles (Atoms
118 or RigidBodies) in {\sc oopse}. Expressions are composed of either
119 name expressions, index expressions, predefined sets, user-defined
120 expressions, comparison operators, within expressions, or logical
121 combinations of the above expression types. Expressions can be
122 combined using parentheses and the Boolean operators.
123
324   \subsection{\label{appendixSection:logical}Logical expressions}
325  
326   The logical operators allow complex queries to be constructed out of
# Line 202 | Line 402 | expression}}
402   Users can define arbitrary terms to represent groups of
403   StuntDoubles, and then use the define terms in select commands. The
404   general form for the define command is: {\bf define {\it term
405 < expression}}
405 > expression}}. Once defined, the user can specify such terms in
406 > boolean expressions
407  
207 Once defined, the user can specify such terms in boolean expressions
208
408   {\tt define SSDWATER SSD or SSD1 or SSDRF}
409  
410   {\tt select SSDWATER}
# Line 250 | Line 449 | and other atoms of type $B$, $g_{AB}(r)$.  StaticProps
449   some or all of the configurations that are contained within a dump
450   file. The most common example of a static property that can be
451   computed is the pair distribution function between atoms of type $A$
452 < and other atoms of type $B$, $g_{AB}(r)$.  StaticProps can also be
453 < used to compute the density distributions of other molecules in a
454 < reference frame {\it fixed to the body-fixed reference frame} of a
455 < selected atom or rigid body.
452 > and other atoms of type $B$, $g_{AB}(r)$.  {\tt StaticProps} can
453 > also be used to compute the density distributions of other molecules
454 > in a reference frame {\it fixed to the body-fixed reference frame}
455 > of a selected atom or rigid body.
456  
457   There are five seperate radial distribution functions availiable in
458   OOPSE. Since every radial distrbution function invlove the
# Line 309 | Line 508 | The options available for {\tt StaticProps} are as fol
508   their body-fixed frames.} \label{oopseFig:gofr}
509   \end{figure}
510  
511 + Due to the fact that the selected StuntDoubles from two selections
512 + may be overlapped, {\tt StaticProps} performs the calculation in
513 + three stages which are illustrated in
514 + Fig.~\ref{oopseFig:staticPropsProcess}.
515 +
516 + \begin{figure}
517 + \centering
518 + \includegraphics[width=\linewidth]{staticPropsProcess.eps}
519 + \caption[A representation of the three-stage correlations in
520 + \texttt{StaticProps}]{This diagram illustrates three-stage
521 + processing used by \texttt{StaticProps}. $S_1$ and $S_2$ are the
522 + numbers of selected stuntdobules from {\tt -{}-sele1} and {\tt
523 + -{}-sele2} respectively, while $C$ is the number of stuntdobules
524 + appearing at both sets. The first stage($S_1-C$ and $S_2$) and
525 + second stages ($S_1$ and $S_2-C$) are completely non-overlapping. On
526 + the contrary, the third stage($C$ and $C$) are completely
527 + overlapping} \label{oopseFig:staticPropsProcess}
528 + \end{figure}
529 +
530   The options available for {\tt StaticProps} are as follows:
531   \begin{longtable}[c]{|EFG|}
532   \caption{StaticProps Command-line Options}
# Line 370 | Line 588 | The options available for DynamicProps are as follows:
588   different vectors).  The ability to use two selection scripts to
589   select different types of atoms is already present in the code.
590  
591 + For large simulations, the trajectory files can sometimes reach
592 + sizes in excess of several gigabytes. In order to effectively
593 + analyze that amount of data. In order to prevent a situation where
594 + the program runs out of memory due to large trajectories,
595 + \texttt{dynamicProps} will estimate the size of free memory at
596 + first, and determine the number of frames in each block, which
597 + allows the operating system to load two blocks of data
598 + simultaneously without swapping. Upon reading two blocks of the
599 + trajectory, \texttt{dynamicProps} will calculate the time
600 + correlation within the first block and the cross correlations
601 + between the two blocks. This second block is then freed and then
602 + incremented and the process repeated until the end of the
603 + trajectory. Once the end is reached, the first block is freed then
604 + incremented, until all frame pairs have been correlated in time.
605 + This process is illustrated in
606 + Fig.~\ref{oopseFig:dynamicPropsProcess}.
607 +
608 + \begin{figure}
609 + \centering
610 + \includegraphics[width=\linewidth]{dynamicPropsProcess.eps}
611 + \caption[A representation of the block correlations in
612 + \texttt{dynamicProps}]{This diagram illustrates block correlations
613 + processing in \texttt{dynamicProps}. The shaded region represents
614 + the self correlation of the block, and the open blocks are read one
615 + at a time and the cross correlations between blocks are calculated.}
616 + \label{oopseFig:dynamicPropsProcess}
617 + \end{figure}
618 +
619   The options available for DynamicProps are as follows:
620   \begin{longtable}[c]{|EFG|}
621   \caption{DynamicProps Command-line Options}
# Line 396 | Line 642 | Dump2XYZ can transform an OOPSE dump file into a xyz f
642  
643   \subsection{\label{appendixSection:Dump2XYZ}Dump2XYZ}
644  
645 < Dump2XYZ can transform an OOPSE dump file into a xyz file which can
646 < be opened by other molecular dynamics viewers such as Jmol and VMD.
647 < The options available for Dump2XYZ are as follows:
645 > {\tt Dump2XYZ} can transform an OOPSE dump file into a xyz file
646 > which can be opened by other molecular dynamics viewers such as Jmol
647 > and VMD\cite{Humphrey1996}. The options available for Dump2XYZ are
648 > as follows:
649  
650  
651   \begin{longtable}[c]{|EFG|}
# Line 428 | Line 675 | converted. \\
675       & {\tt -{}-refsele} &  In order to rotate the system, {\tt -{}-originsele} and {\tt -{}-refsele} must be given to define the new coordinate set. A StuntDouble which contains a dipole (the direction of the dipole is always (0, 0, 1) in body frame) is specified by {\tt -{}-originsele}. The new x-z plane is defined by the direction of the dipole and the StuntDouble is specified by {\tt -{}-refsele}.
676   \end{longtable}
677  
678 < \subsection{\label{appendixSection:hydrodynamics}Hydrodynamics}
678 > \subsection{\label{appendixSection:hydrodynamics}Hydro}
679  
680 + {\tt Hydro} can calculate resistance and diffusion tensors at the
681 + center of resistance. Both tensors at the center of diffusion can
682 + also be reported from the program, as well as the coordinates for
683 + the beads which are used to approximate the arbitrary shapes. The
684 + options available for Hydro are as follows:
685   \begin{longtable}[c]{|EFG|}
686   \caption{Hydrodynamics Command-line Options}
687   \\ \hline
# Line 442 | Line 694 | converted. \\
694    -i & {\tt -{}-input}  &             input dump file \\
695    -o & {\tt -{}-output} &             output file prefix  (default=`hydro') \\
696    -b & {\tt -{}-beads}  &                   generate the beads only, hydrodynamics calculation will not be performed (default=off)\\
697 <     & {\tt -{}-model}  &                 hydrodynamics model (support ``AnalyticalModel'', ``RoughShell'' and ``BeadModel'') \\
697 >     & {\tt -{}-model}  &                 hydrodynamics model (supports ``AnalyticalModel'', ``RoughShell'' and ``BeadModel'') \\
698   \end{longtable}

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines