Mercurial > libpst
annotate xml/libpst.in @ 41:183ae993b9ad
security fix for potential buffer overrun in lz decompress
author | carl |
---|---|
date | Tue, 02 Oct 2007 15:49:44 -0700 |
parents | 6fe121a971c9 |
children | f6db1f060a95 |
rev | line source |
---|---|
16 | 1 <reference> |
2 <title>@PACKAGE@ Utilities - Version @VERSION@</title> | |
3 <partintro> | |
4 <title>Packages</title> | |
31 | 5 |
6 <para>This is a fork of the libpst project at SourceForge. Another fork | |
7 is located at <ulink | |
8 url="http://alioth.debian.org/projects/libpst/">http://alioth.debian.org/projects/libpst/</ulink> | |
9 </para> | |
10 | |
11 <para>The various source and binary packages are available at <ulink | |
12 url="http://www.five-ten-sg.com/@PACKAGE@/packages/">http://www.five-ten-sg.com/@PACKAGE@/packages/</ulink> | |
13 The most recent documentation is available at <ulink | |
14 url="http://www.five-ten-sg.com/@PACKAGE@/">http://www.five-ten-sg.com/@PACKAGE@/</ulink> | |
15 </para> | |
16 | |
24 | 17 </partintro> |
16 | 18 |
19 | |
20 <refentry id="readpst.1"> | |
21 <refentryinfo> | |
31 | 22 <date>2007-07-10</date> |
16 | 23 </refentryinfo> |
24 | |
25 <refmeta> | |
26 <refentrytitle>readpst</refentrytitle> | |
27 <manvolnum>1</manvolnum> | |
28 <refmiscinfo>readpst @VERSION@</refmiscinfo> | |
29 </refmeta> | |
30 | |
20 | 31 <refnamediv id='readpst.name.1'> |
16 | 32 <refname>readpst</refname> |
33 <refpurpose>convert PST (MS Outlook Personal Folders) files to mbox format</refpurpose> | |
34 </refnamediv> | |
35 | |
20 | 36 <refsynopsisdiv id='readpst.synopsis.1'> |
16 | 37 <title>Synopsis</title> |
38 <cmdsynopsis> | |
39 <command>readpst</command> | |
31 | 40 <arg><option>-b</option></arg> |
16 | 41 <arg><option>-c <replaceable class="parameter">format</replaceable></option></arg> |
42 <arg><option>-d <replaceable class="parameter">debug-file</replaceable></option></arg> | |
43 <arg><option>-h</option></arg> | |
44 <arg><option>-k</option></arg> | |
45 <arg><option>-o <replaceable class="parameter">output-directory</replaceable></option></arg> | |
46 <arg><option>-q</option></arg> | |
47 <arg><option>-r</option></arg> | |
48 <arg><option>-S</option></arg> | |
25 | 49 <arg><option>-M</option></arg> |
16 | 50 <arg><option>-V</option></arg> |
51 <arg><option>-w</option></arg> | |
24 | 52 <arg rep='repeat' choice='plain'>files</arg> |
16 | 53 </cmdsynopsis> |
54 </refsynopsisdiv> | |
55 | |
20 | 56 <refsect1 id='readpst.description.1'> |
57 <title>Description</title> | |
28 | 58 <para><command>readpst</command> is a program that can read an Outlook |
59 PST (Personal Folders) file and convert it into an mbox file, a format | |
60 suitable for KMail, a recursive mbox structure, or separate emails. | |
20 | 61 </para> |
62 </refsect1> | |
63 | |
64 <refsect1 id='readpst.options.1'> | |
16 | 65 <title>Options</title> |
66 <variablelist> | |
67 <varlistentry> | |
31 | 68 <term>-b</term> |
69 <listitem><para> | |
70 Do not save the attachments for the RTF format of the email body. | |
71 </para></listitem> | |
72 </varlistentry> | |
73 <varlistentry> | |
16 | 74 <term>-c <replaceable class="parameter">format</replaceable></term> |
75 <listitem><para> | |
76 Set the Contact output mode. Use -cv for vcard format or -cl for an email list. | |
77 </para></listitem> | |
78 </varlistentry> | |
79 <varlistentry> | |
80 <term>-d <replaceable class="parameter">debug-file</replaceable></term> | |
81 <listitem><para> | |
33
12cac756bc05
enable -d option, but if not specified, don't generate a debug file
carl
parents:
32
diff
changeset
|
82 Specify name of debug log file. The |
28 | 83 log file is not an ascii file, it is a binary file readable |
84 by <command>readpstlog</command>. | |
16 | 85 </para></listitem> |
86 </varlistentry> | |
87 <varlistentry> | |
88 <term>-h</term> | |
89 <listitem><para> | |
31 | 90 Show summary of options and exit. |
16 | 91 </para></listitem> |
92 </varlistentry> | |
93 <varlistentry> | |
94 <term>-k</term> | |
95 <listitem><para> | |
96 Changes the output format to KMail. | |
97 </para></listitem> | |
98 </varlistentry> | |
99 <varlistentry> | |
20 | 100 <term>-o <replaceable class="parameter">output-directory</replaceable></term> |
16 | 101 <listitem><para> |
102 Specifies the output directory. The directory must already exist, and | |
103 is entered after the PST file is opened, but before any processing of | |
104 files commences. | |
105 </para></listitem> | |
106 </varlistentry> | |
107 <varlistentry> | |
108 <term>-q</term> | |
109 <listitem><para> | |
110 Changes to silent mode. No feedback is printed to the screen, except | |
111 for error messages. | |
112 </para></listitem> | |
113 </varlistentry> | |
114 <varlistentry> | |
115 <term>-r</term> | |
116 <listitem><para> | |
117 Changes the output format to Recursive. This will create folders as | |
21 | 118 named in the PST file, and will put all emails in a file called "mbox" |
16 | 119 inside each folder. These files are then compatible with all |
120 mbox-compatible email clients. | |
121 </para></listitem> | |
122 </varlistentry> | |
123 <varlistentry> | |
124 <term>-S</term> | |
125 <listitem><para> | |
25 | 126 Output messages into separate files. This will create folders as named |
127 in the PST file, and will put each email in its own file. These files | |
28 | 128 will be numbered from 1 increasing in intervals of 1 (ie 1, 2, 3, ...). |
129 Any attachments are saved alongside each email as XXXXXXXXX-attach1, | |
130 XXXXXXXXX-attach2 and so on, or with the name of the attachment if one | |
131 is present. | |
16 | 132 </para></listitem> |
133 </varlistentry> | |
134 <varlistentry> | |
25 | 135 <term>-M</term> |
136 <listitem><para> | |
137 Output messages in MH format as separate files. This will create | |
28 | 138 folders as named in the PST file, and will put each email together with |
139 any attachments into its own file. These files will be numbered from 1 | |
140 to n with no leading zeros. | |
25 | 141 </para></listitem> |
142 </varlistentry> | |
143 <varlistentry> | |
16 | 144 <term>-V</term> |
145 <listitem><para> | |
31 | 146 Show program version and exit. |
16 | 147 </para></listitem> |
148 </varlistentry> | |
149 <varlistentry> | |
150 <term>-w</term> | |
151 <listitem><para> | |
20 | 152 Overwrite any previous output files. Beware: When used with the -S |
16 | 153 switch, this will remove all files from the target folder before |
154 writing. This is to keep the count of emails and attachments correct. | |
155 </para></listitem> | |
156 </varlistentry> | |
157 </variablelist> | |
158 </refsect1> | |
159 | |
20 | 160 <refsect1 id='readpst.also.1'> |
16 | 161 <title>See Also</title> |
162 <para> | |
25 | 163 <citerefentry><refentrytitle>readpstlog</refentrytitle> <manvolnum>1</manvolnum> </citerefentry> |
16 | 164 </para> |
165 </refsect1> | |
166 | |
20 | 167 <refsect1 id='readpst.author.1'> |
16 | 168 <title>Author</title> |
169 <para> | |
170 This manual page was originally written by Dave Smith | |
171 <dave.s@earthcorp.com>, and updated by Joe Nahmias <joe@nahmias.net> | |
172 for the Debian GNU/Linux system (but may be used by others). It was | |
20 | 173 subsequently updated by Brad Hards <bradh@frogmouth.net>, and converted to |
16 | 174 xml format by Carl Byington <carl@five-ten-sg.com>. |
175 </para> | |
176 </refsect1> | |
177 | |
20 | 178 <refsect1 id='readpst.copyright.1'> |
16 | 179 <title>Copyright</title> |
180 <para> | |
181 Copyright (C) 2002 by David Smith <dave.s@earthcorp.com>. | |
28 | 182 XML version Copyright (C) 2006 by 510 Software Group <carl@five-ten-sg.com>. |
16 | 183 </para> |
184 <para> | |
185 This program is free software; you can redistribute it and/or modify it | |
186 under the terms of the GNU General Public License as published by the | |
187 Free Software Foundation; either version 2, or (at your option) any | |
188 later version. | |
189 </para> | |
190 <para> | |
191 You should have received a copy of the GNU General Public License along | |
192 with this program; see the file COPYING. If not, please write to the | |
193 Free Software Foundation, 675 Mass Ave, Cambridge, MA 02139, USA. | |
194 </para> | |
195 </refsect1> | |
196 | |
20 | 197 <refsect1 id='readpst.version.1'> |
16 | 198 <title>CVS Version</title> |
199 <para> | |
200 $Id$ | |
201 </para> | |
202 </refsect1> | |
203 </refentry> | |
204 | |
205 | |
206 <refentry id="readpstlog.1"> | |
207 <refentryinfo> | |
31 | 208 <date>2007-07-10</date> |
16 | 209 </refentryinfo> |
210 | |
211 <refmeta> | |
212 <refentrytitle>readpstlog</refentrytitle> | |
213 <manvolnum>1</manvolnum> | |
214 <refmiscinfo>readpstlog @VERSION@</refmiscinfo> | |
215 </refmeta> | |
216 | |
20 | 217 <refnamediv id='readpstlog.name.1'> |
16 | 218 <refname>readpstlog</refname> |
24 | 219 <refpurpose>convert a <command>readpst</command> logfile to text format</refpurpose> |
16 | 220 </refnamediv> |
221 | |
20 | 222 <refsynopsisdiv id='readpstlog.synopsis.1'> |
16 | 223 <title>Synopsis</title> |
224 <cmdsynopsis> | |
24 | 225 <command>readpstlog</command> |
16 | 226 <arg><option>-f <replaceable class="parameter">format</replaceable></option></arg> |
227 <arg><option>-t <replaceable class="parameter">include-types</replaceable></option></arg> | |
228 <arg><option>-x <replaceable class="parameter">exclude-types</replaceable></option></arg> | |
24 | 229 <arg choice='plain'>logfile</arg> |
16 | 230 </cmdsynopsis> |
231 </refsynopsisdiv> | |
232 | |
20 | 233 <refsect1 id='readpstlog.description.1'> |
234 <title>Description</title> | |
21 | 235 <para><command>readpstlog</command> |
24 | 236 is a program that converts the binary logfile generated |
237 by <command>readpst</command> to a more desirable text format. | |
20 | 238 </para> |
239 </refsect1> | |
240 | |
241 <refsect1 id='readpstlog.options.1'> | |
16 | 242 <title>Options</title> |
243 <variablelist> | |
244 <varlistentry> | |
245 <term>-f <replaceable class="parameter">format</replaceable></term> | |
246 <listitem><para> | |
247 Sets the format of the text log output. Currently, the only valid output | |
36 | 248 formats are T, for single line text, D for the default default multi line |
249 format, and I for an indented style with single line text. | |
16 | 250 </para></listitem> |
251 </varlistentry> | |
252 <varlistentry> | |
253 <term>-t <replaceable class="parameter">include-types</replaceable></term> | |
254 <listitem><para> | |
255 Print only the specified types of log messages. | |
256 Types are specified in a comma-delimited list (e.g. 3,10,5,6). | |
257 </para></listitem> | |
258 </varlistentry> | |
259 <varlistentry> | |
260 <term>-x <replaceable class="parameter">exclude-types</replaceable></term> | |
261 <listitem><para> | |
262 Exclude the specified types of log messages. | |
263 Types are specified in a comma-delimited list (e.g. 3,10,5,6). | |
264 </para></listitem> | |
265 </varlistentry> | |
266 </variablelist> | |
267 </refsect1> | |
268 | |
20 | 269 <refsect1 id='readpstlog.message.types.1'> |
16 | 270 <title>Message Types</title> |
24 | 271 <para><command>readpstlog</command> understands the following types of log |
272 messages: | |
16 | 273 </para> |
274 <variablelist> | |
20 | 275 <varlistentry> |
276 <term>1</term> | |
277 <listitem><para> | |
278 File accesses | |
279 </para></listitem> | |
280 </varlistentry> | |
281 <varlistentry> | |
282 <term>2</term> | |
283 <listitem><para> | |
284 Index accesses | |
285 </para></listitem> | |
286 </varlistentry> | |
287 <varlistentry> | |
288 <term>3</term> | |
289 <listitem><para> | |
290 New email found | |
291 </para></listitem> | |
292 </varlistentry> | |
293 <varlistentry> | |
294 <term>4</term> | |
295 <listitem><para> | |
296 Warnings | |
297 </para></listitem> | |
298 </varlistentry> | |
299 <varlistentry> | |
300 <term>5</term> | |
301 <listitem><para> | |
302 Read accesses | |
303 </para></listitem> | |
304 </varlistentry> | |
305 <varlistentry> | |
306 <term>6</term> | |
307 <listitem><para> | |
308 Informational messages | |
309 </para></listitem> | |
310 </varlistentry> | |
311 <varlistentry> | |
312 <term>7</term> | |
313 <listitem><para> | |
314 Main function calls | |
315 </para></listitem> | |
316 </varlistentry> | |
317 <varlistentry> | |
318 <term>8</term> | |
319 <listitem><para> | |
320 Decrypting calls | |
321 </para></listitem> | |
322 </varlistentry> | |
323 <varlistentry> | |
36 | 324 <term>9</term> |
325 <listitem><para> | |
326 Function entries | |
327 </para></listitem> | |
328 </varlistentry> | |
329 <varlistentry> | |
20 | 330 <term>10</term> |
331 <listitem><para> | |
36 | 332 Function exits |
20 | 333 </para></listitem> |
334 </varlistentry> | |
335 <varlistentry> | |
336 <term>11</term> | |
337 <listitem><para> | |
338 HexDump calls | |
339 </para></listitem> | |
340 </varlistentry> | |
16 | 341 </variablelist> |
342 </refsect1> | |
343 | |
20 | 344 <refsect1 id='readpstlog.author.1'> |
16 | 345 <title>Author</title> |
346 <para> | |
347 This manual page was written by Joe Nahmias <joe@nahmias.net> | |
348 for the Debian GNU/Linux system (but may be used by others). It was | |
349 converted to xml format by Carl Byington <carl@five-ten-sg.com>. | |
350 </para> | |
351 </refsect1> | |
352 | |
20 | 353 <refsect1 id='readpstlog.copyright.1'> |
16 | 354 <title>Copyright</title> |
355 <para> | |
356 Copyright (C) 2002 by David Smith <dave.s@earthcorp.com>. | |
357 XML version Copyright (C) 2005 by 510 Software Group <carl@five-ten-sg.com>. | |
358 </para> | |
359 <para> | |
360 This program is free software; you can redistribute it and/or modify it | |
361 under the terms of the GNU General Public License as published by the | |
362 Free Software Foundation; either version 2, or (at your option) any | |
363 later version. | |
364 </para> | |
365 <para> | |
366 You should have received a copy of the GNU General Public License along | |
367 with this program; see the file COPYING. If not, please write to the | |
368 Free Software Foundation, 675 Mass Ave, Cambridge, MA 02139, USA. | |
369 </para> | |
370 </refsect1> | |
371 | |
20 | 372 <refsect1 id='readpstlog.version.1'> |
16 | 373 <title>CVS Version</title> |
374 <para> | |
375 $Id$ | |
376 </para> | |
377 </refsect1> | |
378 </refentry> | |
24 | 379 |
380 | |
381 <refentry id="pst2ldif.1"> | |
382 <refentryinfo> | |
31 | 383 <date>2007-07-10</date> |
24 | 384 </refentryinfo> |
385 | |
386 <refmeta> | |
387 <refentrytitle>pst2ldif</refentrytitle> | |
388 <manvolnum>1</manvolnum> | |
389 <refmiscinfo>pst2ldif @VERSION@</refmiscinfo> | |
390 </refmeta> | |
391 | |
392 <refnamediv id='pst2ldif.name.1'> | |
393 <refname>pst2ldif</refname> | |
394 <refpurpose>extract contacts from a MS Outlook .pst file in .ldif format</refpurpose> | |
395 </refnamediv> | |
396 | |
397 <refsynopsisdiv id='pst2ldif.synopsis.1'> | |
398 <title>Synopsis</title> | |
399 <cmdsynopsis> | |
400 <command>pst2ldif</command> | |
401 <arg><option>-h</option></arg> | |
402 <arg><option>-V</option></arg> | |
403 <arg><option>-b <replaceable class="parameter">ldap-base</replaceable></option></arg> | |
404 <arg><option>-c <replaceable class="parameter">class</replaceable></option></arg> | |
33
12cac756bc05
enable -d option, but if not specified, don't generate a debug file
carl
parents:
32
diff
changeset
|
405 <arg><option>-d <replaceable class="parameter">debug-file</replaceable></option></arg> |
24 | 406 <arg choice='plain'>pstfilename</arg> |
407 </cmdsynopsis> | |
408 </refsynopsisdiv> | |
409 | |
410 <refsect1 id='pst2ldif.options.1'> | |
411 <title>Options</title> | |
412 <variablelist> | |
413 <varlistentry> | |
414 <term>-h</term> | |
415 <listitem><para> | |
416 Show summary of options. Subsequent options are then ignored. | |
417 </para></listitem> | |
418 </varlistentry> | |
419 <varlistentry> | |
420 <term>-V <replaceable class="parameter">include-types</replaceable></term> | |
421 <listitem><para> | |
422 Show program version. Subsequent options are then ignored. | |
423 </para></listitem> | |
424 </varlistentry> | |
425 <varlistentry> | |
426 <term>-b <replaceable class="parameter">ldap-base</replaceable></term> | |
427 <listitem><para> | |
428 Sets the ldap base value used in the dn records. You probably want to | |
429 use something like "o=organization, c=US". | |
430 </para></listitem> | |
431 </varlistentry> | |
432 <varlistentry> | |
433 <term>-c <replaceable class="parameter">class</replaceable></term> | |
434 <listitem><para> | |
435 Sets the objectClass values for the contact items. This class needs to be | |
436 defined in the schema used by your LDAP server, and at a minimum it must | |
437 contain the ldap attributes given below. | |
438 </para></listitem> | |
439 </varlistentry> | |
33
12cac756bc05
enable -d option, but if not specified, don't generate a debug file
carl
parents:
32
diff
changeset
|
440 <varlistentry> |
12cac756bc05
enable -d option, but if not specified, don't generate a debug file
carl
parents:
32
diff
changeset
|
441 <term>-d <replaceable class="parameter">debug-file</replaceable></term> |
12cac756bc05
enable -d option, but if not specified, don't generate a debug file
carl
parents:
32
diff
changeset
|
442 <listitem><para> |
12cac756bc05
enable -d option, but if not specified, don't generate a debug file
carl
parents:
32
diff
changeset
|
443 Specify name of debug log file. The |
12cac756bc05
enable -d option, but if not specified, don't generate a debug file
carl
parents:
32
diff
changeset
|
444 log file is not an ascii file, it is a binary file readable |
12cac756bc05
enable -d option, but if not specified, don't generate a debug file
carl
parents:
32
diff
changeset
|
445 by <command>readpstlog</command>. |
12cac756bc05
enable -d option, but if not specified, don't generate a debug file
carl
parents:
32
diff
changeset
|
446 </para></listitem> |
12cac756bc05
enable -d option, but if not specified, don't generate a debug file
carl
parents:
32
diff
changeset
|
447 </varlistentry> |
24 | 448 </variablelist> |
449 </refsect1> | |
450 | |
451 <refsect1 id='pst2ldif.description.1'> | |
452 <title>Description</title> | |
453 <para><command>pst2ldif</command> | |
454 reads the contact information from a MS Outlook .pst file | |
455 and produces a .ldif file that may be used to import those contacts | |
456 into an LDAP database. The following ldap attributes are generated: | |
457 <simplelist> | |
458 <member>cn </member> | |
459 <member>givenName </member> | |
460 <member>sn </member> | |
461 <member>personalTitle </member> | |
462 <member>company </member> | |
463 <member>mail </member> | |
464 <member>postalAddress </member> | |
465 <member>l </member> | |
466 <member>st </member> | |
467 <member>postalCode </member> | |
468 <member>c </member> | |
469 <member>homePhone </member> | |
470 <member>telephoneNumber </member> | |
471 <member>facsimileTelephoneNumber </member> | |
472 <member>mobile </member> | |
473 <member>description </member> | |
474 </simplelist> | |
475 </para> | |
476 </refsect1> | |
477 | |
478 <refsect1 id='pst2ldif.copyright.1'> | |
479 <title>Copyright</title> | |
480 <para> | |
481 Copyright (C) 2006 by 510 Software Group <carl@five-ten-sg.com> | |
482 </para> | |
483 <para> | |
484 This program is free software; you can redistribute it and/or modify it | |
485 under the terms of the GNU General Public License as published by the | |
486 Free Software Foundation; either version 2, or (at your option) any | |
487 later version. | |
488 </para> | |
489 <para> | |
490 You should have received a copy of the GNU General Public License along | |
491 with this program; see the file COPYING. If not, please write to the | |
492 Free Software Foundation, 675 Mass Ave, Cambridge, MA 02139, USA. | |
493 </para> | |
494 </refsect1> | |
495 | |
496 <refsect1 id='pst2ldif.version.1'> | |
497 <title>CVS Version</title> | |
498 <para> | |
499 $Id$ | |
500 </para> | |
501 </refsect1> | |
502 </refentry> | |
503 | |
504 | |
505 <refentry id="pst.5"> | |
506 <refentryinfo> | |
31 | 507 <date>2007-07-10</date> |
24 | 508 </refentryinfo> |
509 | |
510 <refmeta> | |
511 <refentrytitle>outlook.pst</refentrytitle> | |
512 <manvolnum>5</manvolnum> | |
513 </refmeta> | |
514 | |
515 <refnamediv id='pst.name.1'> | |
516 <refname>outlook.pst</refname> | |
517 <refpurpose>format of MS Outlook .pst file</refpurpose> | |
518 </refnamediv> | |
519 | |
520 <refsynopsisdiv id='pst.synopsis.1'> | |
521 <title>Synopsis</title> | |
522 <cmdsynopsis> | |
523 <command>outlook.pst</command> | |
524 </cmdsynopsis> | |
525 </refsynopsisdiv> | |
526 | |
527 <refsect1 id='pst.file.overview.5'> | |
528 <title>Overview</title> | |
529 <para> | |
530 Each item in a .pst file is identified by two id values ID1 and ID2. | |
531 There are two separate b-trees indexed by these ID1 and ID2 values. | |
532 </para> | |
533 </refsect1> | |
534 | |
535 <refsect1 id='pst.file.header.5'> | |
536 <title>File Header</title> | |
537 <para> | |
538 The file header is located at offset 0 in the .pst file. | |
539 </para> | |
540 <literallayout class="monospaced"><![CDATA[ | |
541 0000 21 42 44 4e 49 f8 64 d9 53 4d 0e 00 13 00 01 01 | |
542 0010 00 00 00 00 00 00 00 00 50 d6 03 00 bd 1e 02 00 | |
543 0020 08 4c 00 00 00 04 00 00 00 04 00 00 0f 04 00 00 | |
544 0030 0d 40 00 00 99 0a 01 00 18 04 00 00 0d 40 00 00 | |
545 0040 0d 40 00 00 11 80 00 00 02 04 00 00 0a 04 00 00 | |
546 0050 00 04 00 00 00 04 00 00 0f 04 00 00 0f 04 00 00 | |
547 0060 0f 04 00 00 0d 40 00 00 00 04 00 00 00 04 00 00 | |
548 0070 04 40 00 00 00 04 00 00 00 04 00 00 00 04 00 00 | |
549 0080 00 04 00 00 00 04 00 00 00 04 00 00 00 04 00 00 | |
550 0090 00 04 00 00 00 04 00 00 00 04 00 00 00 04 00 00 | |
551 00a0 0c 09 00 00 00 00 00 00 00 04 27 00 00 24 23 00 | |
552 00b0 c0 09 0a 00 00 c8 00 00 bc 1e 02 00 00 7e 0c 00 | |
553 00c0 b4 1e 02 00 00 54 00 00 01 00 00 00 23 55 44 d1 | |
554 00d0 5a 4f ce 6b 80 ff ff ff 00 00 00 00 00 00 00 00 | |
555 00e0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
556 00f0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
557 0100 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
558 0110 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
559 0120 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
560 0130 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
561 0140 00 00 00 00 00 00 00 00 00 00 00 00 3f ff ff ff | |
562 0150 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff | |
563 0160 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff | |
564 0170 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff | |
565 0180 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff | |
566 0190 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff | |
567 01a0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff | |
568 01b0 ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff | |
569 01c0 ff ff ff ff ff ff ff ff ff ff ff ff 80 01 00 00 | |
570 01d0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
571 01e0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
572 01f0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
573 | |
574 0000 signature [4 bytes] 0x4e444221 constant | |
28 | 575 000a indexType [1 byte] 0x0e constant |
576 01cd encryptionType [1 byte] 0x01 constant | |
24 | 577 00a8 total file size [4 bytes] 0x270400 in this case |
28 | 578 00c0 backPointer1 [4 bytes] 0x021eb4 in this case |
579 00c4 offsetIndex1 [4 bytes] 0x005400 in this case | |
580 00b8 backPointer2 [4 bytes] 0x021ebc in this case | |
581 00bc offsetIndex2 [4 bytes] 0x0c7e00 in this case | |
24 | 582 ]]></literallayout> |
583 <para> | |
584 We only support index type 0x0E and encryption type 0x01. | |
585 </para> | |
586 <para> | |
28 | 587 offsetIndex1 is the file offset of the root of the |
24 | 588 index1 b-tree, which contains (ID1, offset, size, unknown) tuples |
28 | 589 for each item in the file. backPointer1 is the value that should |
24 | 590 appear in the parent pointer of that root node. |
591 </para> | |
592 <para> | |
28 | 593 offsetIndex2 is the file offset of the root of the |
24 | 594 index2 b-tree, which contains (ID2, DESC-ID1, LIST-ID1, PARENT-ID2) |
28 | 595 tuples for each item in the file. backPointer2 is the value that should |
24 | 596 appear in the parent pointer of that root node. |
597 </para> | |
598 </refsect1> | |
599 | |
600 <refsect1 id='pst.file.node1.5'> | |
601 <title>Index 1 Node</title> | |
602 <para> | |
603 The index1 b-tree nodes are 516 byte blocks with the following format. | |
604 </para> | |
605 <literallayout class="monospaced"><![CDATA[ | |
606 0000 04 00 00 00 8a 1e 02 00 00 1c 0b 00 | |
607 000c 58 27 03 00 b3 1e 02 00 00 52 00 00 | |
608 0018 00 00 00 00 00 00 00 00 00 00 00 00 | |
609 0024 00 00 00 00 00 00 00 00 00 00 00 00 | |
610 0030 00 00 00 00 00 00 00 00 00 00 00 00 | |
611 003c 00 00 00 00 00 00 00 00 00 00 00 00 | |
612 0048 00 00 00 00 00 00 00 00 00 00 00 00 | |
613 0054 00 00 00 00 00 00 00 00 00 00 00 00 | |
614 0060 00 00 00 00 00 00 00 00 00 00 00 00 | |
615 006c 00 00 00 00 00 00 00 00 00 00 00 00 | |
616 0078 00 00 00 00 00 00 00 00 00 00 00 00 | |
617 0084 00 00 00 00 00 00 00 00 00 00 00 00 | |
618 0090 00 00 00 00 00 00 00 00 00 00 00 00 | |
619 009c 00 00 00 00 00 00 00 00 00 00 00 00 | |
620 00a8 00 00 00 00 00 00 00 00 00 00 00 00 | |
621 00b4 00 00 00 00 00 00 00 00 00 00 00 00 | |
622 00c0 00 00 00 00 00 00 00 00 00 00 00 00 | |
623 00cc 00 00 00 00 00 00 00 00 00 00 00 00 | |
624 00d8 00 00 00 00 00 00 00 00 00 00 00 00 | |
625 00e4 00 00 00 00 00 00 00 00 00 00 00 00 | |
626 00f0 00 00 00 00 00 00 00 00 00 00 00 00 | |
627 00fc 00 00 00 00 00 00 00 00 00 00 00 00 | |
628 0108 00 00 00 00 00 00 00 00 00 00 00 00 | |
629 0114 00 00 00 00 00 00 00 00 00 00 00 00 | |
630 0120 00 00 00 00 00 00 00 00 00 00 00 00 | |
631 012c 00 00 00 00 00 00 00 00 00 00 00 00 | |
632 0138 00 00 00 00 00 00 00 00 00 00 00 00 | |
633 0144 00 00 00 00 00 00 00 00 00 00 00 00 | |
634 0150 00 00 00 00 00 00 00 00 00 00 00 00 | |
635 015c 00 00 00 00 00 00 00 00 00 00 00 00 | |
636 0168 00 00 00 00 00 00 00 00 00 00 00 00 | |
637 0174 00 00 00 00 00 00 00 00 00 00 00 00 | |
638 0180 00 00 00 00 00 00 00 00 00 00 00 00 | |
639 018c 00 00 00 00 00 00 00 00 00 00 00 00 | |
640 0198 00 00 00 00 00 00 00 00 00 00 00 00 | |
641 01a4 00 00 00 00 00 00 00 00 00 00 00 00 | |
642 01b0 00 00 00 00 00 00 00 00 00 00 00 00 | |
643 01bc 00 00 00 00 00 00 00 00 00 00 00 00 | |
644 01c8 00 00 00 00 00 00 00 00 00 00 00 00 | |
645 01d4 00 00 00 00 00 00 00 00 00 00 00 00 | |
646 01e0 00 00 00 00 00 00 00 00 00 00 00 00 | |
647 01ec 00 00 00 00 02 29 0c 02 80 80 b6 4a | |
648 01f8 b4 1e 02 00 27 9c cc 56 58 27 03 00 | |
649 | |
28 | 650 01f0 itemCount [1 byte] 0x02 in this case |
651 01f1 maxItemCount [1 byte] 0x29 constant | |
652 01f3 nodeLevel [1 byte] 0x02 in this case | |
653 01f8 backPointer [4 bytes] 0x021eb4 in this case | |
24 | 654 ]]></literallayout> |
655 <para> | |
28 | 656 The itemCount specifies the number of 12 byte records that |
657 are active. The nodeLevel is non-zero for this style of nodes. | |
658 The leaf nodes have a different format. The backPointer must | |
659 match the backPointer from the triple that pointed to this node. | |
24 | 660 </para> |
661 <para> | |
35 | 662 Each item in this node is a triple of (ID1, backPointer, offset) |
24 | 663 where the offset points to the next deeper node in the tree, the |
28 | 664 backPointer value must match the backPointer in that deeper node, |
35 | 665 and ID1 is the lowest ID1 value in the subtree. |
24 | 666 </para> |
667 </refsect1> | |
668 | |
669 <refsect1 id='pst.file.leaf1.5'> | |
670 <title>Index 1 Leaf Node</title> | |
671 <para> | |
672 The index1 b-tree leaf nodes are 516 byte blocks with the following format. | |
673 </para> | |
674 <literallayout class="monospaced"><![CDATA[ | |
675 0000 04 00 00 00 00 58 00 00 64 00 0f 00 | |
676 000c 08 00 00 00 80 58 00 00 ac 00 06 00 | |
677 0018 0c 00 00 00 40 59 00 00 ac 00 06 00 | |
678 0024 10 00 00 00 00 5a 00 00 bc 00 03 00 | |
679 0030 14 00 00 00 00 5b 00 00 a4 00 02 00 | |
680 003c 18 00 00 00 c0 5b 00 00 64 00 02 00 | |
681 0048 1c 00 00 00 40 5c 00 00 5c 00 02 00 | |
682 0054 50 00 00 00 80 62 00 00 60 00 02 00 | |
683 0060 74 00 00 00 00 77 00 00 5e 00 02 00 | |
684 006c 7c 00 00 00 80 77 00 00 66 00 02 00 | |
685 0078 84 00 00 00 00 76 00 00 ca 00 02 00 | |
686 0084 88 00 00 00 00 63 00 00 52 00 02 00 | |
687 0090 90 00 00 00 00 79 00 00 58 00 02 00 | |
688 009c cc 00 00 00 c0 61 00 00 76 00 02 00 | |
689 00a8 e0 00 00 00 00 61 00 00 74 00 02 00 | |
690 00b4 f4 00 00 00 80 65 00 00 6e 00 02 00 | |
691 00c0 8c 01 00 00 40 60 00 00 70 00 02 00 | |
692 00cc ea 01 00 00 80 61 00 00 10 00 02 00 | |
693 00d8 ec 01 00 00 40 8a 00 00 f3 01 02 00 | |
694 00e4 f0 01 00 00 80 93 00 00 f4 1f 02 00 | |
695 00f0 fa 01 00 00 c0 7f 00 00 10 00 02 00 | |
696 00fc 00 02 00 00 00 89 00 00 34 01 02 00 | |
697 0108 1c 02 00 00 40 ec 00 00 12 06 02 00 | |
698 0114 22 02 00 00 00 84 00 00 10 00 02 00 | |
699 0120 24 02 00 00 c0 ea 00 00 3c 01 02 00 | |
700 012c 40 02 00 00 00 f4 00 00 0a 06 02 00 | |
701 0138 46 02 00 00 40 8c 00 00 10 00 02 00 | |
702 0144 48 02 00 00 80 f2 00 00 36 01 02 00 | |
703 0150 64 02 00 00 80 fb 00 00 bf 07 02 00 | |
704 015c 6a 02 00 00 80 63 00 00 10 00 02 00 | |
705 0168 6c 02 00 00 40 fa 00 00 2a 01 02 00 | |
706 0174 6c 02 00 00 40 fa 00 00 2a 01 02 00 | |
707 0180 6c 02 00 00 40 fa 00 00 2a 01 02 00 | |
708 018c 6c 02 00 00 40 fa 00 00 2a 01 02 00 | |
709 0198 6c 02 00 00 40 fa 00 00 2a 01 02 00 | |
710 01a4 6c 02 00 00 40 fa 00 00 2a 01 02 00 | |
711 01b0 64 02 00 00 80 fb 00 00 bf 07 02 00 | |
712 01bc 64 02 00 00 80 fb 00 00 bf 07 02 00 | |
713 01c8 64 02 00 00 80 fb 00 00 bf 07 02 00 | |
714 01d4 64 02 00 00 80 fb 00 00 bf 07 02 00 | |
715 01e0 64 02 00 00 80 fb 00 00 bf 07 02 00 | |
716 01ec 00 00 00 00 1f 29 0c 00 80 80 5b b3 | |
717 01f8 5a 67 01 00 4f ae 70 a7 92 06 00 00 | |
718 | |
28 | 719 01f0 itemCount [1 byte] 0x1f in this case |
720 01f1 maxItemCount [1 byte] 0x29 constant | |
721 01f3 nodeLevel [1 byte] 0x00 in this case | |
722 01f8 backPointer [4 bytes] 0x01675a in this case | |
24 | 723 ]]></literallayout> |
724 <para> | |
28 | 725 The itemCount specifies the number of 12 byte records that |
726 are active. The nodeLevel is zero for these leaf nodes. | |
727 The backPointer must match the backPointer from the triple | |
24 | 728 that pointed to this node. |
729 </para> | |
730 <para> | |
731 Each item in this node is a tuple of (ID1, offset, size, unknown) | |
35 | 732 The two low order bits of the ID1 value seem to be flags. I have |
733 never seen a case with bit zero set. Bit one indicates that the | |
734 item is <emphasis>not</emphasis> encrypted. Note that references | |
735 to these ID1 values elsewhere may have the low order bit set (and | |
736 I don't know what that means), but when we do the search in this | |
737 tree we need to clear that bit so that we can find the correct item. | |
24 | 738 </para> |
739 </refsect1> | |
740 | |
741 <refsect1 id='pst.file.node2.5'> | |
742 <title>Index 2 Node</title> | |
743 <para> | |
744 The index2 b-tree nodes are 516 byte blocks with the following format. | |
745 </para> | |
746 <literallayout class="monospaced"><![CDATA[ | |
747 0000 21 00 00 00 bb 1e 02 00 00 e2 0b 00 | |
748 000c 64 78 20 00 8c 1e 02 00 00 dc 0b 00 | |
749 0018 00 00 00 00 00 00 00 00 00 00 00 00 | |
750 0024 00 00 00 00 00 00 00 00 00 00 00 00 | |
751 0030 00 00 00 00 00 00 00 00 00 00 00 00 | |
752 003c 00 00 00 00 00 00 00 00 00 00 00 00 | |
753 0048 00 00 00 00 00 00 00 00 00 00 00 00 | |
754 0054 00 00 00 00 00 00 00 00 00 00 00 00 | |
755 0060 00 00 00 00 00 00 00 00 00 00 00 00 | |
756 006c 00 00 00 00 00 00 00 00 00 00 00 00 | |
757 0078 00 00 00 00 00 00 00 00 00 00 00 00 | |
758 0084 00 00 00 00 00 00 00 00 00 00 00 00 | |
759 0090 00 00 00 00 00 00 00 00 00 00 00 00 | |
760 009c 00 00 00 00 00 00 00 00 00 00 00 00 | |
761 00a8 00 00 00 00 00 00 00 00 00 00 00 00 | |
762 00b4 00 00 00 00 00 00 00 00 00 00 00 00 | |
763 00c0 00 00 00 00 00 00 00 00 00 00 00 00 | |
764 00cc 00 00 00 00 00 00 00 00 00 00 00 00 | |
765 00d8 00 00 00 00 00 00 00 00 00 00 00 00 | |
766 00e4 00 00 00 00 00 00 00 00 00 00 00 00 | |
767 00f0 00 00 00 00 00 00 00 00 00 00 00 00 | |
768 00fc 00 00 00 00 00 00 00 00 00 00 00 00 | |
769 0108 00 00 00 00 00 00 00 00 00 00 00 00 | |
770 0114 00 00 00 00 00 00 00 00 00 00 00 00 | |
771 0120 00 00 00 00 00 00 00 00 00 00 00 00 | |
772 012c 00 00 00 00 00 00 00 00 00 00 00 00 | |
773 0138 00 00 00 00 00 00 00 00 00 00 00 00 | |
774 0144 00 00 00 00 00 00 00 00 00 00 00 00 | |
775 0150 00 00 00 00 00 00 00 00 00 00 00 00 | |
776 015c 00 00 00 00 00 00 00 00 00 00 00 00 | |
777 0168 00 00 00 00 00 00 00 00 00 00 00 00 | |
778 0174 00 00 00 00 00 00 00 00 00 00 00 00 | |
779 0180 00 00 00 00 00 00 00 00 00 00 00 00 | |
780 018c 00 00 00 00 00 00 00 00 00 00 00 00 | |
781 0198 00 00 00 00 00 00 00 00 00 00 00 00 | |
782 01a4 00 00 00 00 00 00 00 00 00 00 00 00 | |
783 01b0 00 00 00 00 00 00 00 00 00 00 00 00 | |
784 01bc 00 00 00 00 00 00 00 00 00 00 00 00 | |
785 01c8 00 00 00 00 00 00 00 00 00 00 00 00 | |
786 01d4 00 00 00 00 00 00 00 00 00 00 00 00 | |
787 01e0 00 00 00 00 00 00 00 00 00 00 00 00 | |
788 01ec 00 00 00 00 02 29 0c 02 81 81 b2 60 | |
789 01f8 bc 1e 02 00 7e 70 dc e3 21 00 00 00 | |
790 | |
28 | 791 01f0 itemCount [1 byte] 0x02 in this case |
792 01f1 maxItemCount [1 byte] 0x29 constant | |
793 01f3 nodeLevel [1 byte] 0x02 in this case | |
794 01f8 backPointer [4 bytes] 0x021ebc in this case | |
24 | 795 ]]></literallayout> |
796 <para> | |
28 | 797 The itemCount specifies the number of 12 byte records that |
798 are active. The nodeLevel is non-zero for this style of nodes. | |
799 The leaf nodes have a different format. The backPointer must | |
800 match the backPointer from the triple that pointed to this node. | |
24 | 801 </para> |
802 <para> | |
28 | 803 Each item in this node is a triple of (ID2, backPointer, offset) |
24 | 804 where the offset points to the next deeper node in the tree, the |
28 | 805 backPointer value must match the backPointer in that deeper node, |
24 | 806 and ID2 is the lowest ID2 value in the subtree. |
807 </para> | |
808 </refsect1> | |
809 | |
810 <refsect1 id='pst.file.leaf2.5'> | |
811 <title>Index 2 Leaf Node</title> | |
812 <para> | |
813 The index2 b-tree leaf nodes are 516 byte blocks with the following format. | |
814 </para> | |
815 <literallayout class="monospaced"><![CDATA[ | |
816 0000 21 00 00 00 38 e6 00 00 00 00 00 00 00 00 00 00 | |
817 0010 61 00 00 00 2c a8 02 00 36 a8 02 00 00 00 00 00 | |
818 0020 22 01 00 00 20 a2 02 00 00 00 00 00 22 01 00 00 | |
819 0030 2d 01 00 00 88 7b 03 00 00 00 00 00 00 00 00 00 | |
820 0040 2e 01 00 00 08 00 00 00 00 00 00 00 00 00 00 00 | |
821 0050 2f 01 00 00 0c 00 00 00 00 00 00 00 00 00 00 00 | |
822 0060 e1 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
823 0070 01 02 00 00 b4 e4 02 00 00 00 00 00 00 00 00 00 | |
824 0080 61 02 00 00 a0 e4 02 00 00 00 00 00 00 00 00 00 | |
825 0090 0d 06 00 00 04 00 00 00 00 00 00 00 00 00 00 00 | |
826 00A0 0e 06 00 00 08 00 00 00 00 00 00 00 00 00 00 00 | |
827 00B0 0f 06 00 00 0c 00 00 00 00 00 00 00 00 00 00 00 | |
828 00C0 10 06 00 00 10 00 00 00 00 00 00 00 00 00 00 00 | |
829 00D0 2b 06 00 00 84 00 00 00 00 00 00 00 00 00 00 00 | |
830 00E0 4c 06 00 00 1c 00 00 00 00 00 00 00 00 00 00 00 | |
831 00F0 71 06 00 00 18 00 00 00 00 00 00 00 00 00 00 00 | |
832 0100 92 06 00 00 14 00 00 00 00 00 00 00 00 00 00 00 | |
833 0110 23 22 00 00 14 a0 02 00 00 00 00 00 22 01 00 00 | |
834 0120 26 22 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
835 0130 27 22 00 00 1c a0 02 00 00 00 00 00 00 00 00 00 | |
836 0140 22 80 00 00 50 00 00 00 00 00 00 00 22 01 00 00 | |
837 0150 2d 80 00 00 f8 9f 02 00 00 00 00 00 00 00 00 00 | |
838 0160 2e 80 00 00 08 00 00 00 00 00 00 00 00 00 00 00 | |
839 0170 2f 80 00 00 34 e6 00 00 00 00 00 00 00 00 00 00 | |
840 0180 42 80 00 00 3c 6d 02 00 00 00 00 00 22 80 00 00 | |
841 0190 4d 80 00 00 04 00 00 00 00 00 00 00 00 00 00 00 | |
842 01A0 4e 80 00 00 10 6d 02 00 00 00 00 00 00 00 00 00 | |
843 01B0 4f 80 00 00 ec 23 00 00 00 00 00 00 00 00 00 00 | |
844 01C0 62 80 00 00 38 78 02 00 00 00 00 00 22 01 00 00 | |
845 01D0 6d 80 00 00 34 78 02 00 00 00 00 00 00 00 00 00 | |
846 01E0 6e 80 00 00 08 00 00 00 00 00 00 00 00 00 00 00 | |
847 01F0 10 1f 10 00 81 81 a0 9a ae 1e 02 00 89 44 6a 0f | |
848 0200 b8 b1 03 00 | |
849 | |
28 | 850 01f0 itemCount [1 byte] 0x10 in this case |
851 01f1 maxItemCount [1 byte] 0x1f constant | |
852 01f3 nodeLevel [1 byte] 0x00 in this case | |
853 01f8 backPointer [4 bytes] 0x021eae in this case | |
24 | 854 ]]></literallayout> |
855 <para> | |
28 | 856 The itemCount specifies the number of 16 byte records that |
857 are active. The nodeLevel is zero for these leaf nodes. | |
858 The backPointer must match the backPointer from the triple | |
24 | 859 that pointed to this node. |
860 </para> | |
861 <para> | |
862 Each item in this node is a tuple of (ID2, DESC-ID1, LIST-ID1, PARENT-ID2) | |
863 </para> | |
864 </refsect1> | |
865 | |
866 <refsect1 id='pst.file.list.5'> | |
867 <title>Associated List Item</title> | |
868 <para> | |
869 Contains associations between id1 and id2 for the items controlled by the record. | |
870 In the above leaf node, we have a tuple of (0x61, 0x02a82c, 0x02a836, 0) | |
871 0x02a836 is the ID1 of the associated list, and we can lookup that ID1 value | |
872 in the index1 b-tree to find the (offset,size) of the data in the .pst file. | |
873 </para> | |
874 <literallayout class="monospaced"><![CDATA[ | |
875 0000 02 00 01 00 9f 81 00 00 30 a8 02 00 00 00 00 00 | |
876 | |
877 0000 unknown [2 bytes] 0x0002 constant | |
878 0002 count [2 bytes] 0x0001 in this case | |
879 repeating | |
880 0004 id2 [4 bytes] 0x00819f in this case | |
881 0008 id [4 bytes] 0x02a830 in this case | |
882 000c unknown [4 bytes] 0 in this case | |
883 ]]></literallayout> | |
884 </refsect1> | |
885 | |
886 <refsect1 id='pst.file.desc.5'> | |
28 | 887 <title>Associated Descriptor Item 0xbcec</title> |
24 | 888 <para> |
28 | 889 Contains information about the item, which may be email, contact, or |
890 other outlook types. In the above leaf node, we have a tuple of (0x21, | |
891 0x00e638, 0, 0) 0x00e638 is the ID1 of the associated descriptor, and we | |
892 can lookup that ID1 value in the index1 b-tree to find the (offset,size) | |
893 of the data in the .pst file. | |
24 | 894 </para> |
895 <literallayout class="monospaced"><![CDATA[ | |
896 0000 3c 01 ec bc 20 00 00 00 00 00 00 00 b5 02 06 00 | |
897 0010 40 00 00 00 f9 0f 02 01 60 00 00 00 01 30 1e 00 | |
898 0020 80 00 00 00 04 30 1e 00 00 00 00 00 df 35 03 00 | |
899 0030 ff 00 00 00 e0 35 02 01 a0 00 00 00 e2 35 02 01 | |
900 0040 e0 00 00 00 e3 35 02 01 c0 00 00 00 e4 35 02 01 | |
901 0050 00 01 00 00 e5 35 02 01 20 01 00 00 e6 35 02 01 | |
902 0060 40 01 00 00 e7 35 02 01 60 01 00 00 1e 66 0b 00 | |
903 0070 00 00 00 00 ff 67 03 00 00 00 00 00 d2 7f 17 d8 | |
904 0080 64 8c d5 11 83 24 00 50 04 86 95 45 53 74 61 6e | |
905 0090 6c 65 79 00 00 00 00 d2 7f 17 d8 64 8c d5 11 83 | |
906 00A0 24 00 50 04 86 95 45 22 80 00 00 00 00 00 00 d2 | |
907 00B0 7f 17 d8 64 8c d5 11 83 24 00 50 04 86 95 45 42 | |
908 00C0 80 00 00 00 00 00 00 d2 7f 17 d8 64 8c d5 11 83 | |
909 00D0 24 00 50 04 86 95 45 a2 80 00 00 00 00 00 00 d2 | |
910 00E0 7f 17 d8 64 8c d5 11 83 24 00 50 04 86 95 45 c2 | |
911 00F0 80 00 00 00 00 00 00 d2 7f 17 d8 64 8c d5 11 83 | |
912 0100 24 00 50 04 86 95 45 e2 80 00 00 00 00 00 00 d2 | |
913 0110 7f 17 d8 64 8c d5 11 83 24 00 50 04 86 95 45 02 | |
914 0120 81 00 00 00 00 00 00 d2 7f 17 d8 64 8c d5 11 83 | |
915 0130 24 00 50 04 86 95 45 62 80 00 00 00 0b 00 00 00 | |
916 0140 0c 00 14 00 7c 00 8c 00 93 00 ab 00 c3 00 db 00 | |
917 0150 f3 00 0b 01 23 01 3b 01 | |
918 | |
28 | 919 0000 indexOffset [2 bytes] 0x013c in this case |
24 | 920 0002 signature [2 bytes] 0xbcec constant |
35 | 921 0004 b5offset [4 bytes] 0x0020 index reference |
24 | 922 ]]></literallayout> |
923 <para> | |
35 | 924 Note the signature of 0xbcec. There are other descriptor block formats |
925 with other signatures. Note the indexOffset of 0x013c - starting at | |
926 that position in the descriptor block, we have an array of two byte | |
927 integers. The first integer (0x000b) is a (count-1) of the number of | |
928 overlapping pairs following the count. The first pair is (0, 0xc), the | |
929 next pair is (0xc, 0x14) and the last (12th) pair is (0x123, 0x13b). | |
930 These pairs are (start,end+1) offsets of items in this block. So we | |
931 have count+2 integers following the count value. | |
24 | 932 </para> |
933 <para> | |
35 | 934 Note the b5offset of 0x0020, which is a type that I will call an index |
935 reference. Such index references have at least two different forms, and | |
936 may point to data either in this block, or in some other block. | |
937 External pointer references have the low order 4 bits all set, and are | |
938 ID2 values that can be used to fetch data. This value of 0x0020 is an | |
939 internal pointer reference, which needs to be right shifted by 4 bits to | |
940 become 0x0002, which is then a byte offset to be added to the above | |
941 indexOffset plus two (to skip the count), so it points to the (0xc, | |
942 0x14) pair. | |
943 </para> | |
944 <para> | |
945 Finally, we have the offset and size of the "b5" block located at offset 0xc | |
24 | 946 with a size of 8 bytes in this descriptor block. The "b5" block has the |
947 following format: | |
948 </para> | |
949 <literallayout class="monospaced"><![CDATA[ | |
950 0000 signature [2 bytes] 0x02b5 constant | |
951 0002 unknown [2 bytes] 0x0006 in this case | |
35 | 952 0004 descoffset [4 bytes] 0x0040 index reference |
24 | 953 ]]></literallayout> |
954 <para> | |
35 | 955 Note the descoffset of 0x0040, which again is an index reference. In this |
956 case, it is an internal pointer reference, which needs to be right shifted by 4 bits | |
24 | 957 to become 0x0004, which is then a byte offset to be added to the above |
28 | 958 indexOffset plus two (to skip the count), so it points to the (0x14, 0x7c) |
24 | 959 pair. We now have the offset 0x14 of the descriptor array, composed of 8 byte |
960 entries. Each descriptor entry has the following format: | |
961 </para> | |
962 <literallayout class="monospaced"><![CDATA[ | |
28 | 963 0000 itemType [2 bytes] |
964 0002 referenceType [2 bytes] | |
24 | 965 0004 value [4 bytes] |
966 ]]></literallayout> | |
967 <para> | |
968 For some reference types (2, 3, 0xb) the value is used directly. Otherwise, | |
35 | 969 the value is an index reference, which is either an ID2 value, or an |
970 offset, to be right shifted by 4 bits and used to fetch a pair from the | |
971 index table to find the offset and size of the item in this descriptor block. | |
24 | 972 </para> |
973 <para> | |
974 The following reference types are known, but not all of these | |
975 are implemented in the code yet. | |
976 </para> | |
977 <literallayout class="monospaced"><![CDATA[ | |
978 0x0002 - Signed 16bit value | |
979 0x0003 - Signed 32bit value | |
980 0x0004 - 4-byte floating point | |
981 0x0005 - Floating point double | |
982 0x0006 - Signed 64-bit int | |
983 0x0007 - Application Time | |
984 0x000A - 32-bit error value | |
985 0x000B - Boolean (non-zero = true) | |
986 0x000D - Embedded Object | |
987 0x0014 - 8-byte signed integer (64-bit) | |
988 0x001E - Null terminated String | |
989 0x001F - Unicode string | |
990 0x0040 - Systime - Filetime structure | |
991 0x0048 - OLE Guid | |
992 0x0102 - Binary data | |
993 0x1003 - Array of 32bit values | |
994 0x1014 - Array of 64bit values | |
995 0x101E - Array of Strings | |
996 0x1102 - Array of Binary data | |
997 ]]></literallayout> | |
998 <para> | |
999 The following item types are known, but not all of these | |
1000 are implemented in the code yet. | |
1001 Note: it appears that some types can have a IPOS value or a ID2 value | |
1002 depending on the size of the field in question. It is safer to check | |
1003 every field than for me to say what the "usually" contain. Absolute | |
1004 values though, are generally going to be constant. | |
1005 </para> | |
1006 <literallayout class="monospaced"><![CDATA[ | |
1007 0002 AutoForward allowed | |
1008 0003 Extended Attributes Table | |
1009 0017 Importance Level | |
1010 001a IPM Context. What type of message is this | |
1011 0023 Global Delivery Report | |
1012 0026 Priority | |
1013 0029 Read Receipt | |
1014 002b Reassignment Prohibited | |
1015 002e Original Sensitivity | |
1016 0036 Sensitivity | |
1017 0037 Email Subject. The referenced item is of type "Subject Type" | |
1018 0039 Date. This is likely to be the arrival date | |
1019 003b Outlook Address of Sender | |
1020 003f Outlook structure describing the recipient | |
1021 0040 Name of the Outlook recipient structure | |
1022 0041 Outlook structure describing the sender | |
1023 0042 Name of the Outlook sender structure | |
1024 0043 Another structure describing the recipient | |
1025 0044 Name of the second recipient structure | |
1026 004f Reply-To Outlook Structure | |
1027 0050 Name of the Reply-To structure | |
1028 0051 Outlook Name of recipient | |
1029 0052 Second Outlook name of recipient | |
1030 0057 My address in TO field | |
1031 0058 My address in CC field | |
1032 0059 Message addressed to me | |
1033 0063 Response requested | |
1034 0064 Sender's Address access method (SMTP, EX) | |
1035 0065 Sender's Address | |
1036 0070 Processed Subject (with Fwd:, Re, ... removed) | |
1037 0071 Date. Another date | |
1038 0075 Recipient Address Access Method (SMTP, EX) | |
1039 0076 Recipient's Address | |
1040 0077 Second Recipient Access Method (SMTP, EX) | |
1041 0078 Second Recipient Address | |
1042 007d Email Header. This is the header that was attached to the email | |
1043 0c17 Reply Requested | |
1044 0c19 Second sender struct | |
1045 0c1a Name of second sender struct | |
1046 0c1d Second outlook name of sender | |
1047 0c1e Second sender access method (SMTP, EX) | |
1048 0c1f Second Sender Address | |
1049 0e01 Delete after submit | |
1050 0e03 CC Address? | |
1051 0e04 SentTo Address | |
1052 0e06 Date. | |
1053 0e07 Flag - contains IsSeen value | |
1054 0e08 Message Size | |
1055 0e0a Sentmail EntryID | |
1056 0e1f Compressed RTF in Sync | |
1057 0e20 Attachment Size | |
1058 0ff9 binary record header | |
1059 1000 Plain Text Email Body. Does not exist if the email doesn't have a plain text version | |
1060 1006 RTF Sync Body CRC | |
1061 1007 RTF Sync Body character count | |
1062 1008 RTF Sync body tag | |
1063 1009 RTF Compressed body | |
1064 1010 RTF whitespace prefix count | |
1065 1011 RTF whitespace tailing count | |
36 | 1066 1013 HTML Email Body. Does not exist if the email doesn't have an HTML version |
24 | 1067 1035 Message ID |
1068 1042 In-Reply-To or Parent's Message ID | |
1069 1046 Return Path | |
1070 3001 Folder Name? I have seen this value used for the contacts record aswell | |
1071 3002 Address Type | |
1072 3003 Contact Address | |
1073 3004 Comment | |
1074 3007 Date item creation | |
1075 3008 Date item modification | |
1076 300b binary record header | |
1077 35df Valid Folder Mask | |
1078 35e0 binary record found in first item. Contains the reference to "Top of Personal Folder" item | |
1079 35e3 binary record with a reference to "Deleted Items" item | |
36 | 1080 35e7 binary record with a reference to "Search Root" item |
24 | 1081 3602 the number of emails stored in a folder |
1082 3603 the number of unread emails in a folder | |
1083 360a Has Subfolders | |
1084 3613 the folder content description | |
1085 3617 Associate Content count | |
1086 3701 Binary Data attachment | |
1087 3704 Attachment Filename | |
1088 3705 Attachement method | |
1089 3707 Attachment Filename long | |
1090 370b Attachment Position | |
1091 370e Attachment mime encoding | |
1092 3710 Attachment Mime Sequence | |
1093 3a00 Contact's Account name | |
1094 3a01 Contact Alternate Recipient | |
1095 3a02 Callback telephone number | |
1096 3a03 Message Conversion Prohibited | |
1097 3a05 Contacts Suffix | |
1098 3a06 Contacts First Name | |
1099 3a07 Contacts Government ID Number | |
1100 3a08 Business Telephone Number | |
1101 3a09 Home Telephone Number | |
1102 3a0a Contacts Initials | |
1103 3a0b Keyword | |
1104 3a0c Contact's Language | |
1105 3a0d Contact's Location | |
1106 3a0e Mail Permission | |
1107 3a0f MHS Common Name | |
1108 3a10 Organizational ID # | |
1109 3a11 Contacts Surname | |
1110 3a12 original entry id | |
1111 3a13 original display name | |
1112 3a14 original search key | |
1113 3a15 Default Postal Address | |
1114 3a16 Company Name | |
1115 3a17 Job Title | |
1116 3a18 Department Name | |
1117 3a19 Office Location | |
1118 3a1a Primary Telephone | |
1119 3a1b Business Phone Number 2 | |
1120 3a1c Mobile Phone Number | |
1121 3a1d Radio Phone Number | |
1122 3a1e Car Phone Number | |
1123 3a1f Other Phone Number | |
1124 3a20 Transmittable Display Name | |
1125 3a21 Pager Phone Number | |
1126 3a22 user certificate | |
1127 3a23 Primary Fax Number | |
1128 3a24 Business Fax Number | |
1129 3a25 Home Fax Number | |
1130 3a26 Business Address Country | |
1131 3a27 Business Address City | |
1132 3a28 Business Address State | |
1133 3a29 Business Address Street | |
1134 3a2a Business Postal Code | |
1135 3a2b Business PO Box | |
1136 3a2c Telex Number | |
1137 3a2d ISDN Number | |
1138 3a2e Assistant Phone Number | |
1139 3a2f Home Phone 2 | |
1140 3a30 Assistant's Name | |
1141 3a40 Can receive Rich Text | |
1142 3a41 Wedding Anniversary | |
1143 3a42 Birthday | |
1144 3a43 Hobbies | |
1145 3a44 Middle Name | |
1146 3a45 Display Name Prefix (Title) | |
1147 3a46 Profession | |
1148 3a47 Preferred By Name | |
1149 3a48 Spouse's Name | |
1150 3a49 Computer Network Name | |
1151 3a4a Customer ID | |
1152 3a4b TTY/TDD Phone | |
1153 3a4c Ftp Site | |
1154 3a4d Gender | |
1155 3a4e Manager's Name | |
1156 3a4f Nickname | |
1157 3a50 Personal Home Page | |
1158 3a51 Business Home Page | |
1159 3a57 Company Main Phone | |
1160 3a58 childrens names | |
1161 3a59 Home Address City | |
1162 3a5a Home Address Country | |
1163 3a5b Home Address Postal Code | |
1164 3a5c Home Address State or Province | |
1165 3a5d Home Address Street | |
1166 3a5e Home Address Post Office Box | |
1167 3a5f Other Address City | |
1168 3a60 Other Address Country | |
1169 3a61 Other Address Postal Code | |
1170 3a62 Other Address State | |
1171 3a63 Other Address Street | |
1172 3a64 Other Address Post Office box | |
1173 65e3 Entry ID | |
1174 67f2 Attachment ID2 value | |
36 | 1175 67ff Password checksum |
24 | 1176 6f02 Secure HTML Body |
1177 6f04 Secure Text Body | |
36 | 1178 7c07 Top of folders RecID |
24 | 1179 8000 Contain extra bits of information that have been taken from the email's header. I call them extra lines |
1180 8005 Contact Fullname | |
1181 801a Home Address | |
1182 801b Business Address | |
1183 801c Other Address | |
1184 8082 Email Address 1 Transport | |
1185 8083 Email Address 1 Address | |
1186 8084 Email Address 1 Description | |
1187 8085 Email Address 1 Record | |
1188 8092 Email Address 2 Transport | |
1189 8093 Email Address 2 Address | |
36 | 1190 8094 Email Address 2 Description |
24 | 1191 8095 Email Address 2 Record |
36 | 1192 80a2 Email Address 3 Transport |
24 | 1193 80a3 Email Address 3 Address |
1194 80a4 Email Address 3 Description | |
1195 80a5 Email Address 3 Record | |
1196 80d8 Internet Free/Busy | |
1197 8205 Appointment shows as | |
1198 8208 Appointment Location | |
1199 8214 Label for appointment | |
32 | 1200 8215 All day appointment flag |
24 | 1201 8234 TimeZone of times |
1202 8235 Appointment Start Time | |
1203 8236 Appointment End Time | |
1204 8516 Duplicate Time Start | |
1205 8517 Duplicate Time End | |
1206 8530 Followup String | |
1207 8534 Mileage | |
1208 8535 Billing Information | |
1209 8554 Outlook Version | |
1210 8560 Appointment Reminder Time | |
1211 8700 Journal Entry Type | |
1212 8706 Start Timestamp | |
1213 8708 End Timestamp | |
1214 8712 Journal Entry Type | |
1215 ]]></literallayout> | |
1216 </refsect1> | |
1217 | |
28 | 1218 <refsect1 id='pst.file.desc2.5'> |
1219 <title>Associated Descriptor Item 0x7cec</title> | |
1220 <para> | |
35 | 1221 This style of descriptor block is similar to the 0xbcec format. |
28 | 1222 </para> |
1223 <literallayout class="monospaced"><![CDATA[ | |
1224 0000 7a 01 ec 7c 40 00 00 00 00 00 00 00 b5 04 02 00 | |
1225 0010 60 00 00 00 7c 18 60 00 60 00 62 00 65 00 20 00 | |
1226 0020 00 00 80 00 00 00 00 00 00 00 03 00 20 0e 0c 00 | |
1227 0030 04 03 1e 00 01 30 2c 00 04 0b 1e 00 03 37 28 00 | |
1228 0040 04 0a 1e 00 04 37 14 00 04 05 03 00 05 37 10 00 | |
1229 0050 04 04 1e 00 07 37 24 00 04 09 1e 00 08 37 20 00 | |
1230 0060 04 08 02 01 0a 37 18 00 04 06 03 00 0b 37 08 00 | |
1231 0070 04 02 1e 00 0d 37 1c 00 04 07 1e 00 0e 37 40 00 | |
1232 0080 04 10 02 01 0f 37 30 00 04 0c 1e 00 11 37 34 00 | |
1233 0090 04 0d 1e 00 12 37 3c 00 04 0f 1e 00 13 37 38 00 | |
1234 00A0 04 0e 03 00 f2 67 00 00 04 00 03 00 f3 67 04 00 | |
1235 00B0 04 01 03 00 09 69 44 00 04 11 03 00 fa 7f 5c 00 | |
1236 00C0 04 15 40 00 fb 7f 4c 00 08 13 40 00 fc 7f 54 00 | |
1237 00D0 08 14 03 00 fd 7f 48 00 04 12 0b 00 fe 7f 60 00 | |
1238 00E0 01 16 0b 00 ff 7f 61 00 01 17 45 82 00 00 00 00 | |
1239 00F0 45 82 00 00 78 3c 00 00 ff ff ff ff 49 1e 00 00 | |
1240 0100 06 00 00 00 00 00 00 00 a0 00 00 00 00 00 00 00 | |
1241 0110 00 00 00 00 00 00 00 00 00 00 00 00 c0 00 00 00 | |
1242 0120 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 | |
1243 0130 00 00 00 00 00 00 00 00 00 00 00 00 00 40 dd a3 | |
1244 0140 57 45 b3 0c 00 40 dd a3 57 45 b3 0c 02 00 00 00 | |
1245 0150 00 00 fa 10 3e 2a 86 48 86 f7 14 03 0a 03 02 01 | |
1246 0160 4a 2e 20 44 61 76 69 64 20 4b 61 72 61 6d 27 73 | |
1247 0170 20 42 69 72 74 68 64 61 79 00 06 00 00 00 0c 00 | |
1248 0180 14 00 ea 00 f0 00 55 01 60 01 79 01 | |
1249 | |
1250 0000 indexOffset [2 bytes] 0x017a in this case | |
1251 0002 signature [2 bytes] 0x7cec constant | |
35 | 1252 0004 7coffset [4 bytes] 0x0040 index reference |
28 | 1253 ]]></literallayout> |
1254 <para> | |
1255 Note the signature of 0x7cec. There are other descriptor block | |
1256 formats with other signatures. | |
1257 Note the indexOffset of 0x017a - starting at that position in the | |
1258 descriptor block, we have an array of two byte integers. The first | |
1259 integer (0x0006) is a (count-1) of the number of overlapping pairs | |
1260 following the count. The first pair is (0, 0xc), the next pair is (0xc, 0x14) | |
1261 and the last (7th) pair is (0x160, 0x179). These pairs are (start,end+1) | |
1262 offsets of items in this block. So we have count+2 integers following | |
1263 the count value. | |
1264 </para> | |
1265 <para> | |
35 | 1266 Note the 7coffset of 0x0040, which is an index reference. In this case, |
1267 it is an internal reference pointer, which needs to be right shifted by 4 bits | |
28 | 1268 to become 0x0004, which is then a byte offset to be added to the above |
1269 indexOffset plus two (to skip the count), so it points to the (0x14, 0xea) | |
1270 pair. We have the offset and size of the "7c" block located at offset 0x14 | |
1271 with a size of 214 bytes in this case. The "7c" block starts with | |
1272 a header with the following format: | |
1273 </para> | |
1274 <literallayout class="monospaced"><![CDATA[ | |
1275 0000 signature [1 bytes] 0x7c constant | |
1276 0001 itemCount [1 bytes] 0x18 in this case | |
1277 0002 unknown [2 bytes] 0x0060 in this case | |
1278 0004 unknown [2 bytes] 0x0060 in this case | |
1279 0006 unknown [2 bytes] 0x0062 in this case | |
1280 0008 recordSize [2 bytes] 0x0065 in this case | |
35 | 1281 000a b5Offset [4 bytes] 0x0020 index reference |
1282 000e index2Offset [4 bytes] 0x0080 index reference | |
28 | 1283 0010 unknown [2 bytes] 0x0000 in this case |
1284 0012 unknown [2 bytes] 0x0000 in this case | |
1285 0014 unknown [2 bytes] 0x0000 in this case | |
1286 ]]></literallayout> | |
1287 <para> | |
35 | 1288 Note the b5Offset of 0x0020, which is an index reference. In this case, |
1289 it is an internal reference pointer, which needs to be right shifted by 4 bits | |
28 | 1290 to become 0x0002, which is then a byte offset to be added to the above |
1291 indexOffset plus two (to skip the count), so it points to the (0xc, | |
1292 0x14) pair. Finally, we have the offset and size of the "b5" block | |
1293 located at offset 0xc with a size of 8 bytes in this descriptor block. | |
1294 The "b5" block has the following format: | |
1295 </para> | |
1296 <literallayout class="monospaced"><![CDATA[ | |
1297 0000 signature [2 bytes] 0x04b5 constant | |
1298 0002 unknown [2 bytes] 0x0002 in this case | |
35 | 1299 0004 descoffset [4 bytes] 0x0060 index reference |
28 | 1300 ]]></literallayout> |
1301 <para> | |
35 | 1302 Note the descoffset of 0x0060, which again is an index reference. In this |
1303 case, it is an internal pointer reference, which needs to be right shifted by 4 | |
28 | 1304 bits to become 0x0006, which is then a byte offset to be added to the |
1305 above indexOffset plus two (to skip the count), so it points to the | |
1306 (0xea, 0xf0) pair. That gives us (0xf0 - 0xea)/6 = 1, so we have a | |
1307 recordCount of one. The actual data between 0xea and 0xf0 is unknown | |
1308 and unused here. | |
1309 </para> | |
1310 <para> | |
35 | 1311 Note the index2Offset above of 0x0080, which again is an index reference. In this |
1312 case, it is an internal pointer reference, which needs to be right shifted | |
28 | 1313 by 4 bits to become 0x0008, which is then a byte offset to be added to |
1314 the above indexOffset plus two (to skip the count), so it points to the | |
1315 (0xf0, 0x155) pair. This is an array of tables of four byte integers. | |
1316 We will call these the IND2 tables. The size of each of these tables is | |
1317 specified by the recordSize field of the "7c" header. The number of | |
1318 these tables is the above recordCount value derived from the "b5" block. | |
1319 </para> | |
1320 <para> | |
1321 Now the remaining data in the "7c" block after the header starts at | |
1322 offset 0x2a. There should be itemCount 8 byte items here, with the | |
1323 following format: | |
1324 </para> | |
1325 <literallayout class="monospaced"><![CDATA[ | |
1326 0000 referenceType [2 bytes] | |
1327 0002 itemType [2 bytes] | |
1328 0004 ind2Offset [2 bytes] | |
35 | 1329 0006 size [1 byte] |
1330 0007 unknown [1 byte] | |
28 | 1331 ]]></literallayout> |
1332 <para> | |
35 | 1333 The ind2Offset is a byte offset into the current IND2 table of some value. |
1334 If that is a four byte integer value, then once we fetch that, we have | |
1335 the same triple (item type, reference type, value) as we find in the | |
1336 0xbcec style descriptor blocks. If not, then this value is used directly. | |
1337 These 8 byte descriptors are processed recordCount times, each | |
28 | 1338 time using the next IND2 table. The item and reference types are as |
1339 described above for the 0xbcec format descriptor block. | |
1340 </para> | |
1341 </refsect1> | |
1342 | |
35 | 1343 <refsect1 id='pst.file.desc3.5'> |
1344 <title>Associated Descriptor Item 0x0002</title> | |
1345 <para> | |
1346 This style of descriptor block is almost unknown here. | |
1347 It seems to contain a list of ID1 values. | |
1348 </para> | |
1349 <literallayout class="monospaced"><![CDATA[ | |
1350 0000 01 01 02 00 26 28 00 00 18 77 0c 00 b8 04 00 00 | |
1351 | |
1352 0000 signature [2 bytes] 0x0101 constant | |
1353 0002 count [2 bytes] 0x0002 in this case | |
1354 0004 unknown [4 bytes] 0x002826 in this case | |
1355 repeating | |
1356 0008 id [4 bytes] 0x0c7718 in this case | |
1357 000c id [4 bytes] 0x0004b8 in this case | |
1358 ]]></literallayout> | |
1359 </refsect1> | |
1360 | |
24 | 1361 </refentry> |
16 | 1362 </reference> |