Re: Ghostscript/GhostPDL 9.22 Release Candidate 1

lilypond-devel


From:	Ken Sharp
Subject:	Re: Ghostscript/GhostPDL 9.22 Release Candidate 1
Date:	Tue, 19 Sep 2017 15:03:14 +0100

At 15:44 19/09/2017 +0200, David Kastrup wrote:

Are there any example documents with thousands of pages and ten
thousands of PDF inclusions one could look at?

I would suggest that the fact you want to 'include' tens of thousands of PDF files is the problem, really.

I appreciate you are trying to deal with an existing problem, but using Ghostscript to do something it wasn't intended for isn't really the best idea for solving the problem.

As I've said elsewhere, there is a genuine bug which can be exposed by doing what you want with Ghostscript, and it would not surprise me if in the long run it causes you another problem.

It would be possible to write a tool which could reliably detect identical fonts in a PDF file, remove the duplicates and alter the references so that the PDF continued to work. In all honesty, if the problem is as important as you say, this is probably a better solution. A tailored program, specifically designed to solve a specific problem, is much more likely to work reliably than trying to use a general purpose program designed for a different problem.

That said, it would be quite a big job, and I'm not actually offering to take it on.
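
The deduplication tool described above can be sketched as follows. This is a minimal illustration only, assuming a hypothetical in-memory model in which each embedded font is a numbered object holding raw bytes and each page lists the font objects it uses. The function name `dedupe_fonts` and both dictionaries are invented for the example and do not correspond to any Ghostscript or PDF library API. It treats two fonts as identical only when their bytes match exactly; a real tool would also have to parse the PDF, decompress streams, and compare subsets, encodings and font descriptors, which is where most of the work lies.

```python
import hashlib


def dedupe_fonts(fonts, page_refs):
    """Collapse byte-identical fonts and rewrite references to them.

    fonts:     dict mapping a (hypothetical) object id to the font bytes.
    page_refs: dict mapping a page number to the list of font object ids it uses.
    Returns (kept_fonts, new_page_refs, remap).
    """
    canonical = {}  # content digest -> first object id seen with that content
    remap = {}      # every object id -> the object id that survives

    # Visit ids in sorted order so the lowest id of each group is the one kept.
    for obj_id in sorted(fonts):
        digest = hashlib.sha256(fonts[obj_id]).hexdigest()
        remap[obj_id] = canonical.setdefault(digest, obj_id)

    # Keep only the surviving font objects.
    kept = {oid: data for oid, data in fonts.items() if remap[oid] == oid}

    # Point every page at the surviving copy of each font.
    new_refs = {
        page: [remap[oid] for oid in refs]
        for page, refs in page_refs.items()
    }
    return kept, new_refs, remap


if __name__ == "__main__":
    # Toy data: objects 4 and 7 hold the same font, object 9 a different one.
    fonts = {4: b"FONTDATA-A", 7: b"FONTDATA-A", 9: b"FONTDATA-B"}
    pages = {1: [4], 2: [7, 9]}
    kept, refs, remap = dedupe_fonts(fonts, pages)
    print(refs)  # {1: [4], 2: [4, 9]}
```

Hashing the content rather than comparing names is deliberate in this sketch: once the PDF has been produced, the font names are not a reliable guide to whether two fonts are the same.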

My suggestion, which may not be feasible, is to keep everything in an editable format until the last second.


This is extracted from an email I decided earlier not to send:
-----------------------------------------------------------------------------

While I can tell you a lot about PostScript and PDF, I can't help you at all with TeX. In general, however, my experience of working with large documents is that the content should be maintained in the layout application's native format until the last moment. Broadly speaking this is similar to keeping bitmap data in something like TIFF and only converting to JPEG at the last moment, and for similar reasons.

When you create a PDF you are discarding all the 'metadata' that describes the layout to the typesetting or layout application. It's all but impossible to recover that information once it's been lost.

Your problem with multiple fonts pretty much exhibits that; once you've got the PDF file, a layout engine can't tell that all the fonts are the same. Ghostscript can't either, which is why it now doesn't strip the duplicates out. While I appreciate this is a problem for your particular use case, it is actually a considerable improvement for users in general.

Assuming that you are using TeX throughout for your documentation, then it seems to me that you should be creating your final document by appending the various TeX documents together and then producing a final PDF, instead of appending multiple PDF files.

Presumably you want to show some parts of Lilypond as well, so I would create EPS figures for those. It will of course increase the number of font inclusions again, but in the case of Lilypond I don't think that you can be merging the fonts anyway, because Lilypond always uses glyphshow, and pdfwrite will create a uniquely named font for each usage. So you aren't gaining any benefit from exploiting the Ghostscript bug with the Lilypond output.

So by maintaining the text and layout in TeX, inserting EPS figures as required, and only producing PDF as the last step in the process, you would create a file which (as I understand it) would only contain a single instance of each font.

In short, I'm not really suggesting that you change anything except your working practices, and maintain your files as TeX files rather than as PDF. Because I don't have any knowledge of your workflow (or TeX) I cannot say if this is reasonable; it may well not be.

Ken
