1

I have tried many programs and solutions for saving web pages (HTML, MHT, DOC, PDF). My favorite was a browser add-on from OmniPage (OCR). What I like about it is that it prints the whole page continuously and doesn't put the URL or page numbers on the page, which I find annoying.

Does anyone know of software like this (freeware or not)?

I tried CutePDF and it didn't work for me.

I want this for my offline use and would prefer a PDF.

digitxp
Remus Rigo

5 Answers

1

If you want this simply to keep offline, there are a variety of options.

Chrome Scrapbook is a perfectly valid option for offline usage, although it doesn't do PDFs.

If you need PDFs, you can use either PDFDownload or Joliprint. Personally, I prefer Joliprint because it reformats the page to make it more readable, though you will probably want to try both yourself.

PDFs can be saved as PDFs, so there's no problem there...

And Word documents can be saved as PDF using Microsoft's Save as PDF addon.

digitxp
0

What is the purpose?

I use the following extension for saving pages:

Chrome Scrapbook

https://chrome.google.com/extensions/detail/gokffdfnlmampchciemmflgbckijpmlb

There is a similar one for Firefox.

Madhur Ahuja
0

Do you mean to save the individual files (.html, .css, .js, et al.) or do you mean to save a readable version for offline reading? Or do you mean printing? What web browser are you using?

If you are trying to read web pages offline in IE, choose File → Save As → Web Archive (.mht). I'm guessing that's what you meant because you mention .mht in your question. This allows offline reading of static pages, but does not update dynamic Flash or HTML5 pages.

You do not need to convert pages to PDF using CutePDF or anything else unless you want to read the pages on a non-Microsoft OS. If that's what you mean, please say so.

If you are trying to print the web page, just choose Print in IE. If you want to remove the file path and page number, choose File → Page Setup and remove the Header and Footer commands.

Dour High Arch
  • I want to keep an offline version (preferably PDF) – Remus Rigo Jan 02 '11 at 19:12
  • Every alternative I described will keep an offline version. If this is not adequate, please explain why. Why does CutePDF not work for you? The URL and page numbers? I explained how to remove them, what happens when you do that? – Dour High Arch Jan 02 '11 at 20:12
0

You can try Teleport Pro to download and browse web sites offline.

It has a great many features you can customize.

NT.
0

You could use Wget on Linux. It has options that crawl a page and save a local copy, with all the hyperlinks converted to their local counterparts.

For example (`-r` enables recursion, `-l 20` limits the depth to 20 levels, and `-k` converts the links):

wget -r -l 20 -k sitename.com
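
For reference, here is a minimal sketch of the same recursive download spelled out with GNU Wget's long option names, which makes it easier to see what each part does. The helper name `mirror_cmd`, the example URL, and the default depth of 1 are assumptions made for illustration; `--page-requisites` and `--no-parent` are additions that are not in the command above. The function only prints the command so you can inspect it before running it.

```shell
#!/bin/sh
# mirror_cmd is a hypothetical helper: it prints (does not run) a wget
# command that saves a page and the pages it links to for offline reading.
#   $1 = start URL
#   $2 = recursion depth (assumed default: 1)
mirror_cmd() {
    url="$1"
    depth="${2:-1}"
    # --recursive        follow links from the start page
    # --level=N          stop after N levels of links
    # --convert-links    rewrite links to point at the local copies
    # --page-requisites  also fetch images, CSS and scripts the pages need
    # --no-parent        never climb above the starting directory
    printf 'wget --recursive --level=%s --convert-links --page-requisites --no-parent %s\n' \
        "$depth" "$url"
}

# Example: show the command for a two-level mirror of a placeholder site.
mirror_cmd "https://example.com/docs/" 2
```

Copy the printed line into a terminal to start the download. A small depth keeps the crawl from pulling in far more of the site than you intended.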

Mike
  • I was looking for a Windows solution – Remus Rigo Jan 03 '11 at 10:14
  • @Remus Rigo: Wget works on Windows too (I once used it for ***serial*** downloading of videos and podcast episodes from a .bat script). There is a binary around, and it likely also works under [Cygwin](https://en.wikipedia.org/wiki/Cygwin), [Git Bash](https://superuser.com/questions/1053633/what-is-git-bash-for-windows-anyway), etc. – Peter Mortensen Jan 19 '23 at 13:43