[Dcmlib] Fwd: DICOM Part 3 available as XML

Mathieu Malaterre mathieu.malaterre at gmail.com
Sun Jan 14 22:26:34 CET 2007


Ok this time the link should be working...

---------- Forwarded message ----------
From: Mathieu Malaterre
Date: 14 Jan 2007 13:22:31 -0800
Subject: DICOM Part 3 available as XML
To:

  [What do you do for fun over the weekend when you are sick]

Hello,

  After playing with XML/XSL for the DICOM Part 6, I decided to give
Part 3 a shot. It was not as bad as I expected; my initial results
are here:

http://gdcm.svn.sourceforge.net/viewvc/*checkout*/gdcm/Source/InformationObjectDefinition/ModuleAttributes.xml

A couple of notes:
1. This is a partial XML representation of Part 3 due to a couple of
issues:
  * pdftotext seems to go crazy on some tables, which then require
manual editing (*)
  * my python/regex script is not very robust and fails when a table
starts at the bottom of a page
  * I did not handle a couple of tables, namely C.11-4, C.13-* and
C.22.1-1, simply because they do not come with the column `type'
(which is basically the only information I was looking for).
2. The XML elements/attributes might change in the future if I realize
this is a bottleneck. In particular, I am thinking the attribute
@description might change in the near future.
3. No XSL-FO for now...
4. Finally the HTML does not display the full content of the XML file
(in particular the <include> XML element).
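
For illustration only, here is a minimal sketch of the kind of
python/regex step described in note 1. It is not the actual script: the
row layout ("name (gggg,eeee) type description" on a single line), the
pattern ROW and the helper parse_row are all assumptions, and real
pdftotext output is often messier than this.

```python
import re

# Assumed (hypothetical) row layout: "<name> (gggg,eeee) <type> <description>".
# The type column is restricted to the DICOM attribute types 1, 1C, 2, 2C, 3.
ROW = re.compile(
    r"^(?P<name>.+?)\s+"
    r"\((?P<group>[0-9A-Fa-f]{4}),(?P<element>[0-9A-Fa-f]{4})\)\s+"
    r"(?P<type>1C|2C|1|2|3)\b\s*"
    r"(?P<description>.*)$"
)


def parse_row(line):
    """Return a dict for one attribute row, or None if the line does not match."""
    match = ROW.match(line.strip())
    return match.groupdict() if match else None


# A well-formed row is split into its columns...
row = parse_row("Patient's Name (0010,0010) 2 Patient's full name.")
# ...while anything else (page footers, wrapped text) is rejected.
junk = parse_row("- Standard - Page 42")
```

Lines that return None are the ones that would need manual editing, for
example rows that pdftotext split across several lines.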

  I have a couple of XSLT scripts that transform dicomlookup's HTML
into XML to allow comparison. I found a couple of typos (Type="xx",
`Sequenc`, `Transmi` ...). But their tables are from Part 3-2004, so
the comparison is pretty much useless. Has anyone else done similar
work, so that I can compare the quality of my XML against theirs?
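
As a rough sketch of such a comparison (done here in Python rather than
XSLT, purely for illustration): the <entry> element and its group,
element, name and type attributes are a hypothetical layout, not the
schema of ModuleAttributes.xml, and the sample rows are made up to mimic
the kind of typos mentioned above.

```python
import xml.etree.ElementTree as ET


def load_entries(xml_text):
    """Map (group, element) -> (name, type) for every <entry> in one table."""
    root = ET.fromstring(xml_text)
    return {
        (e.get("group"), e.get("element")): (e.get("name"), e.get("type"))
        for e in root.iter("entry")
    }


def diff_entries(mine, theirs):
    """Yield (tag, mine_value, theirs_value) for tags present in both but differing."""
    for tag in sorted(mine.keys() & theirs.keys()):
        if mine[tag] != theirs[tag]:
            yield tag, mine[tag], theirs[tag]


# Made-up sample tables (hypothetical layout, invented typos).
MINE = """<table>
  <entry group="0010" element="0010" name="Patient's Name" type="2"/>
  <entry group="0008" element="1115" name="Referenced Series Sequence" type="1"/>
</table>"""
THEIRS = """<table>
  <entry group="0010" element="0010" name="Patient's Name" type="xx"/>
  <entry group="0008" element="1115" name="Referenced Series Sequenc" type="1"/>
</table>"""

differences = list(diff_entries(load_entries(MINE), load_entries(THEIRS)))
for tag, ours, other in differences:
    print(tag, ours, other)
```

The comparison is meant to be run one table at a time, since the same
attribute can appear in several modules with a different type.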

As usual comments welcome,
Mathieu
(*) reported as bugs:
- http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=406256
- http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=406810



