|
From: | jemarch |
Subject: | Re: [pdf-devel] Re: Modifications on pdf_token_read to get token boundaries |
Date: | Mon, 15 Jun 2009 22:29:32 +0200 |
User-agent: | Wanderlust/2.14.0 (Africa) SEMI/1.14.6 (Maruoka) FLIM/1.14.8 (Shijō) APEL/10.6 Emacs/23.0.60 (i686-pc-linux-gnu) MULE/6.0 (HANACHIRUSATO) |
A possible solution for this problem could be to change/expand the stm public api in the following way: - Rename the current pdf_stm_seek/tell functions to pdf_stm_bseek/btell. The 'b' is for 'backend'. - Add a new function pdf_stm_tell that returns the number of octects since the last bseek (or since the creation of the stream if no bseek was made). Since the operation in the streams is inherently sequential, that schema makes sense in my opinion. The addition of a new function to the tokeniser returning the offset of the beginning of the last read token would then use this pdf_stm_tell function. What do you think? If you agree I will implement it as part of my type4+tokeniser patch. -- Jose E. Marchesi <address@hidden> http://www.jemarch.net GNU Project http://www.gnu.org
[Prev in Thread] | Current Thread | [Next in Thread] |