[Dcmlib] Re: Some more private Dicom Dict

Mathieu Malaterre mathieu.malaterre at kitware.com
Fri Sep 2 15:39:37 CEST 2005


CCint dcmlib just in case someone would be interested.

The python script is available (same license as gdcm in fact) under:

gdcm/Dict/ParseDict.py

There is absolutely nothing fancy in this script (maybe the one for the 
huge dicomv3 pdf file ).

Otherwise the real trick is the pdftotext command.

$ pdftotext -v 
                                              pdftotext version 3.00
Copyright 1996-2004 Glyph & Cog, LLC

And then according to the comment in ParseDict.py you should run first a 
command like this:

$ pdftotext -f 9 -l 81 -raw -nopgbrk 04_06PU.PDF 04_06PU-3.txt

So
$ wget 
www.hitachi-medical.co.jp/english/product/ct/dicom/acrobat/E1E-BW0782-3.pdf
$ pdftotext -f 21 -l 21  -raw -nopgbrk E1E-BW0782-3.pdf E1E-BW0782-3.txt

You'll see that in this case it is not worse using the regex from the 
python script.

HTH
Mathieu

Jean-Pierre Roux wrote:
> Salut , Mathieu
> 
> Il n'y a pas grand chose, dans ceux-la ...
> 
> Hitachi Pronto
> www.hitachi-medical.co.jp/english/product/ct/*dicom*/acrobat/E1E-BW0782-3.pdf 
> 
> (pdf page 21)
> 
> Hitachi Caringo
> www.hitachi-medical.co.jp/english/product/ct/*dicom*/acrobat/E1E-BW0799-2.pdf 
> 
> 
> Tu avais fait un script d'enfer pour extraire 'proprement' le 
> dictionaire a partir d'un doc pdf, n'est-ce pas?
> Il est licence gpl ?
> 
> Pourrais-tu nous le faire passer.
> Thx
> JP
> 
> 




More information about the Dcmlib mailing list