SitePoint Sponsor

User Tag List

Results 1 to 9 of 9

Hybrid View

  1. #1
    SitePoint Enthusiast
    Join Date
    Feb 2012
    Posts
    56
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)

    ISO a tool that combines many HTML files into one

    I'm finishing up a documentation project which produced about three dozen HTML files. Now the client says he wants the content delivered in a single PDF.

    I'd like to do this by creating a file that says in effect, "include this HTML file, then this one, then this one,..." A tool will read this file, assemble all of HTML files into one big HTML file, and resolve all of the cross-links (links from one HTML file to another) into internal links (links from one part of the combined HTML to another). Then I can convert the combined HTML file to a PDF.

    My object is not so much to create the PDF file more efficiently, as to avoid having to update two parallel versions of the content once I've done so. If I had this tool I could regenerate the PDF in a couple of steps whenever an HTML file changes.

    Does such a tool exist?

  2. #2
    Avid Logophile silver trophy
    ParkinT's Avatar
    Join Date
    May 2006
    Location
    Central Florida
    Posts
    2,337
    Mentioned
    192 Post(s)
    Tagged
    4 Thread(s)
    Have you considered using Windows CHM (Help File) format?
    The [free, I think] tool provided by Microsoft does what you are describing.
    Don't be yourself. Be someone a little nicer. -Mignon McLaughlin, journalist and author (1913-1983)


    Git is for EVERYONE
    Literally, the best app for readers.
    Make Your P@ssw0rd Secure
    Leveraging SubDomains

  3. #3
    SitePoint Enthusiast
    Join Date
    Feb 2012
    Posts
    56
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    I appreciate your effort to be helpful, but this really doesn't address the question. My client asked me for a PDF. If I give him a CHM, he's just going to wonder why I don't follow instructions.

  4. #4
    SitePoint Mentor bronze trophy
    John_Betong's Avatar
    Join Date
    Aug 2005
    Location
    City of Angels
    Posts
    1,840
    Mentioned
    73 Post(s)
    Tagged
    6 Thread(s)
    http://www.php.net/manual/en/book.pdf.php

    Php to PDF has a library and a comment may save you some time:
    If you only have PDFLib Lite installed, I would not recommend bothering with this library, as you can really only output text and import an image, and that's about it. Forget about adding complexities such as color, blocks and other elements. Switch to an open source library such as FreePDF (
    Learn how to be ready for The New Move to Discourse

    How to make Make Money Now with a *NEW* look

    Be sure to congratulate Patche on earning Member of the Month for July 2014

  5. #5
    Avid Logophile silver trophy
    ParkinT's Avatar
    Join Date
    May 2006
    Location
    Central Florida
    Posts
    2,337
    Mentioned
    192 Post(s)
    Tagged
    4 Thread(s)
    Quote Originally Posted by Orthoducks View Post
    I appreciate your effort to be helpful, but this really doesn't address the question. My client asked me for a PDF. If I give him a CHM, he's just going to wonder why I don't follow instructions.
    However, the nature of a 'web' site is the interconnected relationship among pages. This cannot be represented in a serial document such as a PDF.
    Don't be yourself. Be someone a little nicer. -Mignon McLaughlin, journalist and author (1913-1983)


    Git is for EVERYONE
    Literally, the best app for readers.
    Make Your P@ssw0rd Secure
    Leveraging SubDomains

  6. #6
    SitePoint Member mamahadija's Avatar
    Join Date
    Apr 2014
    Location
    South Africa
    Posts
    13
    Mentioned
    1 Post(s)
    Tagged
    0 Thread(s)
    there are a lot of websites that offer free conversion of html pages to pdf i.e htmlpdf.com
    Last edited by Mittineague; Jul 24, 2014 at 17:22. Reason: removing unnecessary link

  7. #7
    SitePoint Enthusiast
    Join Date
    Feb 2012
    Posts
    56
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    Quote Originally Posted by mamahadija View Post
    there are a lot of websites that offer free conversion of html pages to pdf i.e htmlpdf.com
    Three out of four responders have misunderstood my question in different ways, soi I'm persuaded that I did a bad job of stating it, and I'd like to have another go.

    On one hand, I have a web site with several dozen pages. On the other, I have people who are effectively my clients (now at least two of them out of four) who tell me that their clients find it difficult to utilize information from web sites, and insist on a page-oriented format, customarily a PDF.

    To provide that I can convert each page to a PDF with Acrobat, or with a free PDF driver like CutePDF, and then combine the PDFs with Acrobat. But this approach has a serious disadvantage: the pages are heavily interlinked, and the links would continue to point to their original targets. From the reader's point of view, there would be all these links that obviously were supposed to point to other parts of the document, and they'd point to the same material on some web site instead. Dumb!

    So, I want a tool that lets me combine the HTML documents -- resolving the links from inter-document targets to intra-document targets, among other things -- and then convert to PDF. If the tool is really nice it can automatically add front matter, page separators, and a table of contents (although I'll probably have to add the page numbers by hand after the conversion).

    Returning briefly to the suggestions:

    The people making the request are asking specifically for a PDF. Thus the whole point is to produce a page-oriented rendition of the web site. Some other page oriented format would probably be OK, although I'd have to clear it with them. Another interactive format, e.g., CHM, would be completely off point.

    I don't get to tell them that they don't really want what they're asking for because a web site cannot be (faithfully) represented in a serial document. That's true, but irrelevant. This is one of those cases where the customer is always right; the customer's customer is right squared!

    Maybe some overriding constraint compels my clients' clients to use a serial format, and it just hasn't been explained to me. Maybe they're asking for this because they put their heads on backwards when they get out of bed. It doesn't matter.

    I'm looking for an HTML tool, not a software component that I could use to create my own tool. My "client" does not have time to wait while I engage in a bout of software development, nor does my boss pay me to do that. In any case the response didn't seem to imply that PDFLib can retarget links and do the other things required to solve this problem. I looked at its web site briefly, and got the impression that it \would just let me do those things myself by manipulating PDF rather than HTML. It's not clear what the percentage is in that... especially if it involves implementing the tool on a server, which would not be its natural environment.

    By the way, the problem is now solved to the extent it can be, because today is my last day in this job. Everything I can do for my "clients" is done. I encountered a similar problem once before, though, so I foresee encountering one again in the future, and I'd like to be ready when I do. On top of that, the problem is technically interesting.
    Last edited by Mittineague; Jul 24, 2014 at 17:23. Reason: fixing quote

  8. #8
    SitePoint Member
    Join Date
    Jul 2014
    Posts
    1
    Mentioned
    0 Post(s)
    Tagged
    0 Thread(s)
    There are many software & tools available online. You can convert your webpage to pdf very easily bu using these converters. I am using expert pdf. its good and fast. You can check detail here
    html-to-pdf.net/free-online-pdf-converter.aspx
    Last edited by Mittineague; Jul 24, 2014 at 17:21. Reason: removing unnecessary link

  9. #9
    Life is not a malfunction gold trophysilver trophybronze trophy
    TechnoBear's Avatar
    Join Date
    Jun 2011
    Location
    Argyll, Scotland
    Posts
    6,232
    Mentioned
    265 Post(s)
    Tagged
    5 Thread(s)
    Quote Originally Posted by shalini07 View Post
    There are many software & tools available online. You can convert your webpage to pdf very easily bu using these converters.
    Please read the whole thread before replying. Online converters have already been suggested, and Orthoducks has explained why they are not an appropriate solution.

    As Orthoducks has also said that the issue ceased to be his/her problem three months ago, there seems little point in reviving the discussion now.

    Thread closed.


Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •