ODBUserManual.html

<center><h1>Omniscient Debugger User Manual</h1></center>


Omniscient debugging is the concept of debugging by retaining state changes
and "navigating" backwards in time to find the problem. The ODB is a
proof-of-concept implementation written in Java. It is neither optimized nor
integrated. None-the-less, I believe you will find it to be the most
effective debugger you have ever used.<br><br>

<img align=center src="ODB1.png">
<br>
<b>Figure 1</b>
<br>
<br>

The ODB is based on the idea of collecting "time stamps" at each point of
interest in a program and then allowing the programmer to use those time
stamps to explore the history of that program run. "Interesting" points
include: setting a variable value (local, instance, array element, static),
making a method call, throwing/catching an exception, and creating a new
object. The debugger inserts code into the class files to collect these
stamps. (The instrumentation is done using the wonderful BCEL package which
is available from www.Jakarta.Apache.org/bcel.) When the program runs, time
stamps will be recorded.  
<br><br>

The ODB does not require any preprocessing, or any special repository. 
There are no limitations on the program being debugged. It
is not even absolutely necessary to have the source code available.  After
installing (see below), the programmer runs the debugger by calling:
<br><br>

<code>% debug TargetProgram arg1 arg2 arg3</code>
<br><br>

The debugger will display a little control window and start the program. The
main debugger window will pop up when the program calls <code>exit()</code> or when the
programmer pushes the "Stop Recording" button in the control window. The
programmer will then be able to navigate through the time stamps to find
interesting events.
<br><br>
<img align=center src="Controller.jpg">
<br>
<br>

Once the debugger main window is up, all recording will be turned off, even
if the program is still running. You may resume recording using the control
window.
<br><br>

<h4>Data Display Formats</h4>

The ODB needs a print
string, so it supplies its own format showing the class name and an index
number: <code>&lt;MyObject_75&gt;</code>, chopping off the package. You may add
an additional string to it: <code>&lt;MyObject_75 square&gt;</code> (see defaults file).
<br><br>

Class objects are displayed as just the class name (e.g., <code>Person</code> ). Boolean
values, numbers, and characters are displayed as expected: <code>true</code>, <code>false</code>,
<code>123</code>, <code>45.678</code>, <code>'X' (88)</code>. Bytes are also displayed like characters
with both the ASCII character and the decimal value in parenthesis.  
<br><br>

Long names and strings (more than 20 characters) will be elided and printed as:
<code>&lt;TestLongName..0></code> in the trace pane. In the objects pane more
room is provided (500 characters). Very long strings and large objects may be printed out to
the terminal via the menu item <i>Objects->Print</i>.

<br><br>


Method traces are displayed with the object (or class object for static methods),
followed by the method name, the first three arguments, and finally an
arrow and the return value. Methods that haven't returned just show
<code>****</code>. Methods that throw (or propagate) an exception show <code>****&lt;Exception></code>.
They are indented appropriately, hence in figure 1, <code>sort()</code>
called <code>average()</code> once, <code>new()</code> twice, then <code>start()</code>, and finally <code>sort()</code>
recursively. The recursive call to <code>sort()</code>
called <code>average()</code>, etc.
<br><br>


Returns from methods are displayed on separate lines as <code>sort -> void</code>.
If the return line would be right after the call line, it will not be
displayed. If the program catches an exception from more than one method
call deep, the display will appear jagged because there won't be any intervening
"return lines".
<br><br>

Constructors will be shown like static methods (<code>MyObject.new() -> &lt;MyObject_1></code>),
as will super methods.

<h4>The Display Panes</h4>

The ODB attempts to show all the data of interest on the screen at
the same time. Popup dialogues are only used for finding source files and selecting values, which
are rarely done. The panes are all updated to display the same time
stamp.
<br>

<h5>Threads Pane </h5>

Threads are displayed in the format <code>&lt;Thread-3></code>, where "Thread-3" is
whatever is returned by <code>getName()</code>. If the currently selected time is
previous to the creation of the thread (before the thread's first time
stamp), it is shown as <code>-- &lt;Thread-3> --</code>, and as <code>&lt;Thread-3> Dead</code> if after
the thread has exited (after the last time stamp for a thread whose
<code>isAlive()</code> method returns false). If the thread is blocked, waiting for a
lock or in a call to <code>wait()</code> or <code>join()</code>, it is shown as <code>&lt;Thread-10> &lt;MyObj_2></code>. 
Adding the thread to the objects pane will show this as a
pseudo-instance variable <code>_blockedOn</code>.
<br>

<h5>Stack Pane</h5>

All methods on the stack are displayed here. Single-clicking on a line will
update the Code pane, the arguments, and the local variables for that stack
pane. <i>The current time stamp will not be changed.</i> Pushing any button will
act on the actual time, not what is displayed in the code pane. (It's not
clear what is most "intuitive" here.) The menu command <i>Code->Goto Stack Frame</i>
will revert to that time stamp.
<br>

<h5>Locals Pane </h5>

All arguments and local variables will be displayed here. If
the program has been compiled without the -g flag, local variable names
will displayed as <code>var1</code>, etc.
<br>

<h5>'this' Pane</h5>

This pane displays the current 'this' object (or the class object for
static methods).
<br>

<h5>Trace Pane</h5>

For each thread, this shows the trace of all method calls
that were made.  Only three arguments are shown. Only ten are recorded.
<br>

<h5>Code Pane</h5>

This displays the current line of code. The buttons all move
forwards or backwards starting from this line. The two exceptions are:
when a higher stack frame has been selected (see Stack Pane) and when a
time stamp occurs in a source file that can't be found, in which case the code pane
will be empty.
<br>

<h5>I/O Pane</h5>

This shows the all the lines written by <code>System.out.println()</code> and <code>System.err.println()</code>. (Obviously,
only calls from instrumented code.) If the line is preceded by "--", that
indicates that the current time is previous to the time that line was
written.  (Eventually this pane will show calls to <code>print(), write()</code>, etc.
going to any I/O stream.)
<br>

<h5>Objects Pane</h5>

This shows whatever objects the user has copied here via double-clicking on
objects in other windows. Double-clicking on an instance variable will add
that object to the pane. Double-clicking on an object in this pane will
"close up" that object, hiding all its instance variables.  Double-clicking
a second time will open it back up. Individual instance variables may also
be removed selectively (see <i>Objects->Remove</i> and <i>Objects->Retain Only</i>).
<br><br>

Double-clicking on an instance variable will expand the value
of that instance variable in-place (like the array <code>int[20]_0</code> in
figure 1). If the value is a primitive or is an object with no visible instance
variables, nothing will happen. Pushing the <i>Previous Value</i> button, etc.
will advance to the next value of the instance variable, <i>not</i> the
next value of the instance variable's instance variable. For example, the
previous value of <code>array</code> in figure 1 is <code>null</code>, so
the array will disappear if you go there. If you select an
instance variable's instance variable, the commands will work on that, which
could cause it to disappear if the instance variable also changed value.
(This recursive stuff is a little confusing to explain. Just try it.)
<br><br>
Values which have not yet been
set (e.g., instance variables of objects before they have been created) will
be displayed as "--". Variables which changed values from the previously
selected time stamp will be displayed with a leading "*".
<br><br>

Currently only a single level of recursion is supported. Double-clicking on an
instance variable's instance variable will copy that 
instance variable to the top level of the pane. An instance variable's value may also be
copied to the top level via <i>Objects->Add Instance Variable</i>.
<br><br>

The Objects menu allows you to display class object of the selected object (and the static
variables thereof) <i>Objects->Add Class</i>.
<br>


<h4>Navigation</h4>
Navigation has never been a issue in traditional debuggers because there
wasn't much to think about. You pushed <i>Continue</i>, the program resumed
and ran until it hit the next breakpoint, and that was it. We need to be
a bit more formal because there are so many more ways to get from here
to there and back again.
<br><br>

The ODB provides a variety of methods for navigating through the program
run. They are all aimed at making it as simple as possible for the programmer
to get to "interesting" points in the program and see what state it was in.
The ODB implements what I believe are the most "obvious" methods, with plenty
of room for adding others.
<br><br>

The primary modes of navigation are: 
<br><br>

o Selecting a line in the trace window, which will revert the debugger to
the time of the first time stamp recorded in that method. If that method
was not instrumented, then the debugger will revert to the line it was
called on.
You may request the debugger to always to the calling line by turning
<i>Trace->Go to first line in method</i> off.
<br><br>

o Selecting a line in the code window, which will revert the debugger to a
time stamp on that line (see below).
<br><br>

o Pushing one of the buttons, which will revert the debugger to a time stamp
depending on the currently selected object or line in that pane. (See Navigation
Buttons below.)
<br><br>

o Selecting a line in the Stack pane, which will not change the current time
stamp, but will show you where methods in the stack were called from
(reverting the code pane) and the values of the local variables. (See Stack Pane.)
<br><br>

o Selecting a thread, which will revert to the nearest time stamp in that
thread, earlier than the current time preferred. (All instrumentation methods
are synchronized, so time stamps are strictly sequential.)
<br><br>

o Selecting a line in the I/O window, which will revert to the time when the
newly selected line was printed.
<br><br>


The choice of which time stamp on a line in the code pane to select may be
controlled by a set of options in the Code menu. The normal direction is to
go backwards in time if the newly selected line is above the previously
selected line, and forwards if not. Going forwards, you will select the
first TS (of a contiguous set, e.g., call/return) on a line, going
backwards, you will select the last. Normally the time stamp has to be in
the current thread. You may also restrict the choice to being in the same
method call. (The best behavior here is hard to determine. I've played with
lots of ideas.)
<br><br>

<h4>Navigation Buttons</h4>

The buttons work as uniformly as possible for the different panes.  The four
arrow buttons revert to the first, previous, next, and last time for
selected object in that pane.  All of the buttons have tooltip messages.
<br><br>

You may bring up a list of all the values a variable ever had and
select one of them. The commands <i>Object->Select IV Value</i> selects a value
for an instance variable, and <i>Object->Select Local Value</i>
selects a value from a local variable. The ODB will revert to the time
when that variable first took that value.
<br><br>

<img align=center src="Select.jpg">
<br><br>

<h2>Now for the Details</h2>

If you are just starting, skip down to "Installing the Debugger" and spend some
time playing with the demo.  

<h4>Minibuffer</h4>

Modeled after EMACS, extended commands (such as Evaluate Expression and
Search) are displayed here, as are messages to the programmer.

<h5>Searches</h5>

You may do an incremental text search through
the trace pane.  &lt;Ctrl-S> will start the search and give focus to the
minibuffer. Typing characters will extend the search string. Another
&lt;Ctrl-S> will advance to the next match.  &lt;Ctrl-R> will reverse the search.
&lt;Return> will end the search. &lt;Ctrl-G> will abort any command.
<br><br>
The search is case-insensitive and looks for two things: the exact characters
on the screen (including the ".." for very long names), and the characters
in the individual objects (i.e., the complete name for long names and
strings). TABs are typed as themselves (not &lt;Ctrl-I>, not "\t"), newline
characters are entered as "\n". In the middle of a search, double-clicking
on an object will add that object to the search string.
<br><br>

"Event" searches are provide via the "Fget" commands (&lt;Ctrl-F>). They are
executed as incremental searches based on the pattern typed in (another
&lt;Ctrl-F> will find the next match and &lt;Ctrl-R> 
will search in the reverse direction). A typical pattern would be (note the
"Prolog style"):

<pre>port = call & to = THIS & arg0 = &lt;MyObj_5></pre>

which will search for method calls ("port = call") whose
"this" object ("to = THIS") will be assigned to the variable
THIS (NB: a sybmol in uppercase means it's a variable) and whose first argument
("arg0") is <code>&lt;MyObj_5></code>.  

<br><br>

Port must be one of: call, return, enter (first line in a
method), exit (last line in a method), civ
(change instance variable), clv (change local variable),
chgArray (change array element).  
<br>
o  All ports define "to" (this object), "toc" (this object
class), "mn" (method name), "thr" (thread), and "pN" (parameter 0-9). 
<br>
o  "Calls" also define "argN" (argument 0-9) and "cmn" (call method name).
<br>
o  "Returns" define "rv" (return value).
<br>
o  "Changes" define "nv" (new value).
<br><br>

Strings are entered with '"' and class objects are prefixed with
a '#'. All objects may be compared with '=' and '!='.
Integer values may be compared with '>' and '&lt;'.
Hence this pattern:
<pre>
port = enter & mn = "sort" & toc = #fr.insa.Thing & p0 > 3
</pre>
will match the first line in any method named "sort" in the
class fr.insa.Thing whose first parameter is an int greater
than 3.

<br><br>
There are two commands on the Trace menu which will create queries
for you, matching either the current trace line (i.e., the current method
call) or the current source line. These will  be stored as the 
"previous" FGET query so that typing &lt;Ctrl-F>&lt;Ctrl-F> will show them.
<br><br>
Typing &lt;Ctrl-T> in the midst of an FGET query will display the total
number of matches in the entire run of the program (starting with the
currently selected time).
<br><br>


<h4>Marking Points of Interest</h4>

The ODB maintains a ring of "marks" for time stamps. You may add a mark to the
ring with &lt;Ctrl-SPACE>. You may then revert to that mark with &lt;Ctrl-X>. The
ring is circular, and if there is more than one mark, &lt;Ctrl-X> will cycle through
the ring.


<h4>Locks and Wait Sets </h4>

When a thread blocks, waiting for a lock or calls <code>wait()</code> , the object on which it
blocks will be displayed next to it in the Threads Pane (e.g.,
<code>&lt;Thread-7></code>
is blocked on the class object <code>DemoWait</code>
below). A pseudo-instance variable <code>_blockedOn</code>
will be added to the thread and shown in the object pane if the thread is
displayed there. Three pseudo-instance variable ( <code>_waiters, _owner,
_sleepers</code>) will be added to the object.  If a thread blocks in
uninstrumented code, the ODB will not notice.  
<br><br>

<img align=center src="Threads.jpg">
<br>
<br>

<img align=center src="Blocked.jpg">
<br>
<br>

<h4>Filters </h4>

If you have a large number of trace lines which are not particularly
interesting, you may filter them out using the Filter menu options. You may
filter out individual methods (hiding them and their internals), filter out
method calls deeper than some limit, or filter in one method (hiding
everything else). And you may "unfilter" everything.  You may save the
filters to the <code>.debuggerDefaults</code> file (see options) and the next time you
instrument those classes, those methods will neither be instrumented nor
recorded.
<br><br>
Probably the most useful filter option is to "filter in" a method, hiding
all method calls outside of the selected one.
<br><br>


All panes reflect the current time stamp (except when navigating using the
stack). The code lines are those defined by the compiler, so there may be
many time stamps on one line of code. Each method call generates two time
stamps -- one for the call, one for the return. If it's instrumented, the
method then also records the first line executed and the last line. If
return is not called directly, the Sun compiler assigns the implicit return instruction
to the first line of the method (kinda weird). This line:
<br><br>

<code>t = first(tl.getNext());</code>
<br><br>

generates five time stamps, two calls, two returns, and an assignment. Logic,
flow of control,
and operators generate no time stamps. This:
<br><br>

<code>if ( ((a + 71) > (b * c)) || (this == that) ) break;</code>

<br><br>
generates no time stamps.
<br><br>

<h4>Evaluating Expressions Interactively</h4>

Arbitrary method calls may be made at any time. You may revert the debugger
to any time you wish and then evaluate a method call using the current
values of the recorded objects. So if you revert to time 1234, when
<code>&lt;Demo_0&gt.array</code> elements 16 and 17 are 454 and 123 respectively, calling
<code>&lt;Demo_0&gt.quick(16, 17)</code> will sort those two. If you advance to time 1330,
when 16 and 17 are already 123 and 454, then sorting them won't change
anything.
<br><br>

In order to maintain some sense of consistancy, the methods will be executed
in a different "timeline". Basically, the current state will be copied into
a new timeline which is initialized with only one time stamp. The method
will then be run (with recording turned on and off automatically),
populating this new timeline. You can switch back and forth between the two
timelines freely. Further method calls will clear the secondary timeline
before running.
<br><br>

In the secondary timeline you may change the value of instance variables
before running. The commands are all on the main menus, with keyboard
accelerators. &lt;Tab> will do an auto-complete on object and method
names. &lt;Return> will execute the command (running the method or setting the
instance variable value).
<br><br>

These commands are the least robust part of the debugger. The interactive
"Evaluate Expression" only accepts four arguments. The arguments must be
object or class print strings (e.g., <code>Demo</code>, <code>&lt;Demo_0></code>), strings, integers,
true, false, or null. You cannot change the value of final variables. One
space is required after each "," and nowhere else.  &lt;Return> and &lt;Tab> must
be typed at the end of the minibuffer. You can only change the values of
variables of type Object, int, and boolean. Lock states and wait sets cannot be
set, so their displays will be set to "unlocked" and "empty".
<br><br>

<h4>Garbage Collection</h4>

There is a limit to the number of time stamps that can be recorded. Each 
simple time stamp takes three words (12 bytes on a 32-bit system) and each
method call/return takes about 25 words and another 25 for the JList to display
it. 
And then your program probably needs to use some memory also.
<br><br>

The ODB looks at the value of the MEMORY environment variable (or
MaxTimeStamps in .debuggerDefaults) to determine
how much memory to devote to time stamps. Currently it allows a maximum
of MEMORY/200 time stamps. (You'll notice that the aliases default to 
400MB, allowing 2m time stamps.) When that number is exceeded, the
ODB's "garbage collector" kicks in and throws away some of the early time
stamps.


The defaults are set on the command line by the aliases such as 
this one (for <code>debug</code>):
<br><br>
<code>java -Xms400100100 -Xmx400100100 -DMEMORY=400100100 -cp $HOME/Debugger/debugger.jar:$CLASSPATH $USER_FLAGS com.lambda.Debugger.Debugger</code>
<br><br>
The flag <code>-Xmx</code> sets the maximum memory for the program, <code>-Xms</code>
sets the initial memory size. We want to avoid useless early JVM garbage collections,
so they are set to the same size. Depending on your program, you may wish to 
increase these numbers while keeping the value of MEMORY low. This would give
your program more room of its own.
<br><br>


Of course these are not really garbage. So the ODB attempts to retain the
time stamps which are most likely to be interesting (object changes), while throwing away
those that aren't. In the first mode, the ODB throws away all time stamps
for local variables and method calls that are older than 50% of the entries.
If this is insufficent (less than 10% are collected), then the second mode
kicks in and entries for objects are also collected.
<br><br>
On each invocation, the collector will print out a line similar to this:
<br><br>
<code>ODB:  Collected 244783 out of 326382 stamps and 47268 TraceLines</code>
<br><br>
Although very clever, it is not clear that the GC is actually useful. If you turn it
off, collection will simply stop when storage is filled up. (See the environment variable GC_OFF.)
<br><br>

<h4>Objective of the ODB</h4>

I originally wrote a paper describing this so that someone else would write
a proper debugger/IDE so that I could use it. Then I thought it would be
good if I could prove that this was actually useful. So I wrote a little
mock-up, which evolved into this full-fledged proof-of-concept. I claim that
the ODB is sufficient proof that the idea of "omniscient debugging" works
and works very well.
<br><br>

So, there are lots of guesses about what data would be most useful, how it
should be displayed, what navigation commands make sense, etc. Most of the
surrounding pieces of an IDE have already been done quite well by other
folks. So, for compilation, editing, class browsing, etc. I simply assume
that the ODB will be properly integrated into those bits which work best.
I wrote the ODB in Java because it was easy. Omniscient Debugging 
could equally well be applied to C, C++, Eifel, etc.  
<br><br>

<h4>Optimizations and Limitations</h4>

Other than any exceptions noted in the "bugs" section below,
the ODB shows data which is 100% correct and can be
confidently run on any Java 1.3 1.4, or 1.5 program. (ODB1.4 for
SDK 1.3 and 1.4, and ODB1.5 for SDK1.5.)

Unless there are bugs I don't know of (ha-ha). 
<br><br>

If you don't need to get fancy, stop now. You'll probably never use the
stuff below.
<br><br>

Although it is possible to record absolutely everything, it is rather
expensive and not very useful. For classes which are trusted (such as the
JCF classes, library classes, and your own well-tested classes), it is
possible not to instrument them. Indeed, the default is to instrument and
record only those classes in the package of the main method. The debugged code will
run faster and the debugger will not be cluttered with uninteresting
data. There are some limitations...
<br><br>

The limitations include: any direct changes to an instance variable (e.g.,
<code>obj.iv = value</code>;) performed in non-instrumented code will not be recorded
(it's bad OOP practice to set instance variables directly anyway! You should be
using getters and setters.), calls from non-instrumented code (e.g., from
the event loop) back into instrumented code will look funny (the call chain
won't be visible. These "unparented" calls are marked in the trace pane with
"**" preceding the call.), calls to <code>System.exit()</code> in non-instrumented code
will exit the process.
<br><br>

You may clear the history and record new data from a live application (e.g.,
when debugging a button push), or restart the same. You may select a line in
the program and tell the debugger to turn recording on/off there. You may 
select a line of output for the same purpose.
<br><br>

Objects which were created outside of instrumented code will not be
recognized until they are touched in instrumented code. The
actual creation time is thus not known and their instance variable values
will not be known. They will be marked with a "@" in the Objects Pane.
<br><br>

Some classes are just crying out for special, hand instrumentation. Vectors,
for example, really want to be displayed as if they were arrays which just
happen to be of the exact right size. This is done by replacing <code>Vector</code> with
<code>MyVector</code> during instrumentation.
<br><br>

Obviously, instrumenting classes which the ODB uses (e.g., Swing)
will not work. If you are working on such code, you will have to change the
package for the code you're working on. For example, to debug the ODB, a
complete copy of the source tree is made and the package
"com.lambda.Debugger" is replaced by "lambda.Debugger." I do the same thing
when debugging Swing itself.
<br><br>

Displaying very large objects is awkward. By default, objects with more than
1000 instance variables (i.e., arrays) will display only the first 1000, and
print (<i>Objects>Print</i>) only the first 10000.
<br><br>

<h4>Options</h4>

The ODB normally inserts instrumentation while loading classes via a
subclass of <code>java.lang.ClassLoader</code>. 
This is done by default for classes in the package
of the main method. Thus, if you use Apache's crimson package it won't
be instrumented unless you specifically request it (see .debuggerDefaults).
It doesn't matter where the files are loaded from.  
Normally the ODB will bring up the main debugger window as soon as the
target program's main thread returns or <code>System.exit()</code> is called from
instrumented code. (The <code>exit()</code> method gets replaced by a call to throw a
<code>DebuggerExit</code> exception.) Any other threads in the program will
be suspended on their next call to instrumented code.
<br><br>

For programs in packages, <code>CLASSPATH</code> must be set correctly (the aliases will
pick this up when they are run), so you must start the debugger from the
command line. 
This is also true
for code from jar files--you must put the jar file in the CLASSPATH
and start the debugger from the command line. (Microsoft details are noted below.)
<br><br>
<code>
% setenv CLASSPATH /home/your_directory/your_code.jar
<br>
% debug com.your_company.CoolProgram arg1 arg2
</code>
<br><br>

All of the options below may be selected from the controller window if the
debugger is started with the <code>DONT_START</code> flag (use <code>debugp</code>). Once you figure out
what setting you want, it's more convenient to use the appropriate alias.
<br><br>

For programs that spawn new threads which continue to run after <code>main()</code>
returns (e.g., any windowing program), you should set the <code>DONT_SHOW</code> flag (run
the alias <code>debugn</code> ). Recording will continue until you push the <i>Stop
Recording</i> button.
<br><br>

You may set the <code>PAUSED</code> flag to start the program without starting recording.
For example, when there's a bug when pushing a button, you can setup the windows
without recording.
<br><br>

You may set both
flags from the command line or the controller window, allowing you to start, stop, and
restart recording. You may see some odd looking traces if the program is not
quiescent when you start and stop it.
<br><br>

You may request that certain methods not be instrumented or not be recorded,
by placing their class and method names into the .debuggerDefaults file in the current
directory. 
<br><br>

For example, <code>StringBuffer</code> and <code>StringWriter</code> aren't be recorded by
default, thus ridding the trace pane of lines from code like this: 
<br>
<code>"Teach "+ n +"dogs new tricks!"</code>
<br>
 (which actually involves a new
<code>StringBuffer</code>, two appends, and a <code>valueOf()</code>
call). The methods not instrumented by default are: <code>toString()</code> and
<code>valueOf()</code>. Those not
recorded are: <code>toString()</code>, <code>valueOf()</code>,
and <code>&lt;cinit></code> (Class
constructors -- "initialize statics") and all <code>StringBuffer</code>
methods.  
<br><br>

If you really want to record/instrument everything (hint: you don't!), you
may remove them from the defaults file. You may use the Filter menu to add methods
to it or edit it by hand (see Defaults File below).
<br><br>

You may instrument specific classes in different directories by hand:
<br><br>

<code>% debugify MyObject1.class MyObject2.class ...</code>
<br><br>

If you want to run the debugger and have it not instrument any files, you
may pass the <code>DONT_INSTRUMENT</code> flag. (Presumably you have
debugified some files by hand.) This may be done with the alias/bat file:

<br><br>
<code>% debugdi MyProgram arg0 arg1...</code>
<br><br>

If you start the debugger with no arguments or from a file manager, a file
chooser will pop up and allow you to select a program and set these options.
This method is rather limited. The program in question must be in the default
package.
<br>
<h5>Microsoft Windows</h5>
Everything works the same way as above, except that instead of aliases,
batch files are used. It is assumed that you will put the debugger in the
directory <code>c:\Debugger</code> and that you will unpack the batch files
instead of the aliases. Include
them in your path and you'll be ready to go! If you choose a different location,
edit the batch files appropriately.
<br><br>
<code>
% jar xf debugger.jar Microsoft 
</code>
<br><br>

<h4>Defaults and the .debuggerDefaults File </h4>
Most of the flags and parameters which the ODB uses are defined in a single 
file, <code>.debuggerDefaults</code>. This file is located in the current
working directory. If no such file is found when the ODB is started, it will
create one. After that, the ODB will not change the file unless you specifically
request it to. You may edit the file by hand and the ODB will retain your changes.
(Thus you may run the debugger once and then edit the file it writes out.)
<br><br>
You may declare a class which you write to be the "special" format for
displaying objects of some type (typically classes, but primitives are 
allowable). Look at the example in the source for SpecialTimeStampFormatter.java.
These would be listed as:
<pre>
SpecialFormatter:      com.lambda.Debugger.SpecialTimeStampFormatter
</pre>

and would be so displayed only in the Objects window when you select the
object (or an array of such objects) and execute <i>Objects>Special Format</i>.
<br><br>

Here is a sample. All of the lines may have duplicate entires (e.g., the 
DontInstrumentMethod lines below), except for the  start/stop lines.
It is unlikely you will change anything other than "*Instrument/Record*"
and "UserSelectedField" by hand.

<pre>
# ODB Defaults -- You may edit by hand. See Manual for details
MaxTimeStamps:			400000
StartPattern:
StopPattern:
SourceDirectory:	
OnlyInstrument:			"org.apache"
OnlyInstrument:			"fr.emn.info.eaop.cps"
DontInstrument:			"recoder.java.declaration.TypeDeclaration"
DontInstrumentMethod:		"*	 toString"
DontInstrumentMethod:		"*	 valueOf"
DontRecordMethod:	
DontRecordMethod:		"*	 toString"
DontRecordMethod:		"*	 valueOf"
DontRecordMethod:		"java.lang.StringBuffer	 *"
DontRecordMethod:		"java.lang.Object	 new"
UserSelectedField:		"com.lambda.test.MyClass name"
</pre>

You may request some objects to include a field value in
their print string by setting the user selected field as
above. The field choosen must be a primitive and must not
change.

<br><br>

The ODB will write out a "<code>DidntInstrument/OnlyInstrument</code>" line for every single
class loaded. This makes it easier to change the set of classes being
instrumented. By default, only the classes in the package of the 
"main" method are instrumented.

<h4>Finding the Source Code </h4>

The ODB will look for source code in the same directory it loaded the class
files from. If there is no java file there, the ODB will then look in the
current directory. If it doesn't find anything there, it will popup a file
chooser so you can tell it where the source is.
Any directories you add will be written into the defaults file
in the current directory and loaded on subsequent runs.
<br><br>

<h4>Known Bugs</h4>

The "Heisenbug Principal" of software (observing a program changes its
behavior) will always apply, but there shouldn't be any particular
differences from running under a traditional debugger. The one exception
is when a program suffers from cache incoherency. If you fail to properly
lock a shared variable, it is possible for CPU 1 to write a value to
that variable and for CPU 2 <i>not</i> to see that new value for some 
time. The ODB will prevent this bug from occuring. There are static
lock analyzers. Use one. The substitute collection classes (MyArrayList,
Vector, Hashtable) don't record changing entries via enumerators.
The clock displayed on the menu bar is of very doubious value.

<br><br>
Java puts a "catch" around all code that uses locks. Normally you don't
notice this. It will appear as an "extra" catch in the ODB.
<br><br>

Calls to "super" methods will look like calls to static methods. (It's
not clear what kind of display would be the most "intutive".)
<pre>
&lt;MySubObj_4>.frob() -> void
  MySuperObj.frob() -> void
frob -> void
</pre>

Some char and
boolean values are recorded as int.
<br><br>

A few Swing bugs and error messages appear in JDK
1.3, which you can ignore (pressing "Stop" in the controller again will
force redisplay -- useful in JDK 1.3). Arrays only display the first 1,000 elements. The I/O pane
only displays calls to <code>println()</code> writing to standard output. 
<br><br>

For faster startup on larger projects (100+ files) where most
of the files don't get recompiled very often, you may
instrument your files by hand.
<br>

<h5>RMI and Serialization Problems</h5>

If you serialize an object, the program which unserializes it expects
to have the same class definition as the program which serialized it.
However, if one of the two programs is instrumented and the other isn't,
then you can get a variety of exceptions, such as below. One solution is
to instrument the files directly and simply run one side with collection
turned off.
<br>
<pre>
java.lang.LinkageError: duplicate class definition: ClientImpl_Skel
</pre>


<h5>Possible Verifier Bugs</h5>

Because the ODB works by instrumenting the class code, it is
possible that your javac compiler produces code in a form
that the debugifier won't work with. (It works with all the
code I've produced, but I have been surprised before.) The
only thing you can do is to: (a) put that class in the "don't
instrument" list, or (b) debugify all the other files by hand
and run the ODB with instrumentation turned off. And report
the bug to me.

<br><br>
A typical verifier error (and related linker error) looks like this:
<br>
<pre>
java.lang.VerifyError: (class: BadClass, method: bar signature: ()V) 
  Unable to pop operand off an empty stack

java.lang.LinkageError: Class org/xml/sax/EntityResolver violates loader constraints
</pre>


<h5>Classloader Issues</h5>

The default operation of the ODB is to use a classloader I wrote, which
is a simple subclass of java.lang.ClassLoader. However, if you use your
own classloader for some of your code, then that code won't be instrumented and
it might even cause a classloader conflict (classes mine loads won't recognize
classes yours loads). Methods that depend on a specific classloader will also
fail. The method <code>Class.getPackage()</code> for example, accesses the classloader
to determine the package (for some reason). It will return null for my classloader.
<br><br>
You may either instrument by hand (above) or 
build and install a system classloader:

<pre>
java com.lambda.Debugger.InstrumentorForCL ~/Debugger
or
java com.lambda.Debugger.InstrumentorForCL \Debugger
</pre>

<h4>Future Enhancements</h4>

One may imagine all sorts of things. High on the list are: The ability to
change
command line arguments, some more elaborate data display options, saving a
debugging session to disk, integration with IDEs, dynamic reloading of
recompiled code, hand instrumentation of collections, 
new navigation commands, a pane to display all I/O, another to display windows,
and of course, performance optimizations. Suggestions welcome.
<br><br>

<h4>Performance</h4>

This is an unoptimized proof-of-concept implementation. I know of an obvious
order of magnitude improvement in both time and space for the ODB. Other
implementations could do even better. So what can you expect? It works fine
for me when debugging the debugger on my 110MHz, 128MB SS4. On my SS4 with a
80MB maximum heap size, I can reasonably create and navigate
40,000 "typical" method calls (with 10 time stamps inside). When you exceed the
amount of physical memory available, performance will degrade somewhat.
The ODB does run fairly well even in very large virtual spaces -- until you
have to garbage collect. Your program may not.
<br><br>

It is possible for a program to create far more time stamps than the ODB can record. You can 
certainly expand the maximum heap up to the 31-bit limit (2GB). I have successfully
collected 100 million time stamps and been able to navigate through them, but it's
clearly slower than I'd like.
It is highly
unlikely that collecting so many time stamps will be useful, simply because
it is so difficult to find your way through that much data. Selective
instrumentation, judicious use of Start and Stop, and intelligent narrowing
of the code to the bug will almost certainly be sufficent.
<br><br>

<h4>Installing the Debugger</h4>
Download the jar file (on: <code>www.LambdaCS.com/debugger/debugger.html</code>)
into <code>$HOME/Debugger</code>, then
extract the aliases (see above for .BAT files). Define the latter. 
If you wish to add command line properties (e.g.,
<code>-DYOUR_PROPERTY</code>) you can set the variable <code>USER_FLAGS</code>.
The aliases (<code>debug, debugn, debugp</code>) will now work.
<br><br>

<pre>
% cd ~/Debugger
% jar xf debugger.jar aliases 
% source aliases
% setenv USER_FLAGS -DYOUR_PROPERTY
% debug YourProgram
</pre>
or
<pre>
% cd \Debugger
% jar xf debugger.jar Microsoft
<i>Place the files Microsoft/*.BAT where you will find them.</i>
% set USER_FLAGS=-DYOUR_PROPERTY
% debug YourProgram
</pre>
<br><br>

<h4>Running the Demo</h4>
Start the ODB by double-clicking. Select "Demo" (a multithreaded quicksort) from
the file chooser. It will run, the debugger will popup, and you will be able
to "explore" the program.
<br><br>


Bil Lewis
<br>
29 December 2006