How to save threads to your hard drive

TBone
Joined: 01/27/2016 - 16:42
Posts: 128
Location: Europe

I often find threads in internet forums so interesting that I want to keep them for later use. I want the threads exactly as I see them today, with all pictures included.
There are far too many threads out there that are unusable now because all pictures are missing. A lot of knowledge is lost this way.

I found a solution that works for me with Firefox and Chrome.

  • I use a plugin to load the complete thread as one page. The pages are simply appended one below the other, without reformatting.
  • Then optionally I use another plugin to delete stuff I do not want to save (like repetitive page headers/footers).
  • Finally I save the page as one single MHTML file.
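
For anyone curious what "loading the complete thread as one page" depends on: a tool has to find each page's link to the following page. Below is a minimal Python sketch of that idea, using only the standard library. It is not PageZipper's actual code. The names `NextLinkFinder` and `find_next_link` and the matching heuristic (a `rel="next"` attribute, or link text that starts with "next" or is a lone arrow) are my own assumptions for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class NextLinkFinder(HTMLParser):
    """Remember the href of the first anchor that looks like a 'Next' link."""

    def __init__(self):
        super().__init__()
        self.next_href = None   # result: href of the first match
        self._href = None       # href of the <a> currently being read
        self._text = []         # text pieces seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag != "a" or self.next_href is not None:
            return
        attrs = dict(attrs)
        href = attrs.get("href")
        if not href:
            return
        # rel="next" is an explicit marker that some forums provide.
        if "next" in (attrs.get("rel") or "").lower().split():
            self.next_href = href
            return
        self._href = href
        self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            label = "".join(self._text).strip().lower()
            # Assumed heuristic: text starts with "next" or is a lone arrow.
            if label.startswith("next") or label in {">", "›", "»"}:
                self.next_href = self._href
            self._href = None


def find_next_link(html, base_url):
    """Return the absolute URL of the next page, or None if no link is found."""
    finder = NextLinkFinder()
    finder.feed(html)
    finder.close()
    if finder.next_href is None:
        return None
    return urljoin(base_url, finder.next_href)
```

If `find_next_link` returns `None`, there is nothing to follow, so a tool built this way could not continue past the current page on that forum.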

Here is my toolchain for Mozilla Firefox:

  1. PageZipper (www.printwhatyoulike.com/pagezipper)
  2. Nuke Anything Enhanced – Old but still working.
  3. Mozilla Archive Format – Use the similar UnMHT plugin if it is broken.

Similar toolchain for Chrome:

  1. PageZipper
  2. (Still searching for the best Nuker addon.) Click to Remove Element – seems to work. Dynamite – did not work for me, may try again.
  3. Save As MHT (a different add-on from Save As MHTML, see below). Or activate the MHTML save feature in chrome://flags and use Save as Single File.
Edited by: TBone on 07/23/2017 - 14:37
TBone

Some tricks and traps:

  • Forums need a ‘Next’ link or PageZipper will not work. Many do, some do not.
  • Collecting pages is not completely automatic. You have to scroll down the page until the page counter (like 5/6) shows the correct number. Or click the PageZipper next button to step forward through the pages.
  • On a few forums PageZipper tends to ‘overshoot’ and show more pages than there are. I do not know why. CPF is one.
  • In very long threads (in my experience starting at around 40 pages) PageZipper (in Firefox) sometimes leaves individual pages empty. If that happens, try again the next day.
  • Switch off PageZipper when it is done so it will not show on the saved page.
  • Nuke Anything Enhanced shows a frame around the element that it will delete if you hover the mouse over it for a second. I found that feature accidentally after months of use. (Real men do not read manuals.)
  • Pictures that cannot be saved by hand (like the ones still working on Photobucket) cannot be saved with PageZipper in Firefox, but they can be in Chrome. Firefox tries to reload all pictures to save them, and Photobucket will not allow direct downloads (they send HTML code instead). Chrome saves pictures directly from its cache.
  • In MHTML files, media files are encoded as MIME Base64. I have not tried it, but there should be tools to extract single files or even modify the main HTML file.
  • Update: ‘Save As MHT’ works much better than ‘Save As MHTML’. It saves long filenames and avoids the security warning for this file extension by using capital letters.
  • Chrome is much faster than Firefox and rock solid. I tested it on the ‘What did you mod today?’ thread with almost 3250 posts. The saved file is 1.1 GB. I found no browser that can open it…
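
On the point about extracting single files from an MHTML file: since the format is a MIME multipart message, Python's standard `email` module should be able to read it. Here is a minimal sketch, untested on real saved threads. The function name `list_mhtml_parts` and the file name in the usage comment are placeholders I made up. I am also assuming that each part carries a `Content-Location` header with its original URL, which is what saved MHTML files normally contain.

```python
import email


def list_mhtml_parts(mhtml_bytes):
    """Return (content_type, location, payload_bytes) for each part of an MHTML file."""
    msg = email.message_from_bytes(mhtml_bytes)
    parts = []
    for part in msg.walk():
        if part.is_multipart():
            continue  # container part, holds no data of its own
        # decode=True undoes the Base64 / quoted-printable transfer encoding.
        payload = part.get_payload(decode=True)
        parts.append((part.get_content_type(), part.get("Content-Location"), payload))
    return parts


# Usage (the file name is a placeholder):
# with open("thread.mht", "rb") as f:
#     for ctype, location, data in list_mhtml_parts(f.read()):
#         print(ctype, location, len(data))
```

Each `payload_bytes` value can be written to disk as its own file. Note that this sketch reads the whole file into memory, so it may struggle with a file in the gigabyte range.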

Here is a tough test thread: http://budgetlightforum.com/node/37399

jf_smm
Joined: 05/30/2016 - 12:29
Posts: 291
Location: NorCal

This is very cool, TBone, thanks for sharing it.

Sean

The Miller
Joined: 12/14/2015 - 12:08
Posts: 7967
Location: Charente France

Wow nice
I sometimes print a page to PDF, but this is better because it can handle multiple pages of a thread.
Thanks!

Tinderbox UK
Joined: 12/26/2016 - 06:17
Posts: 220
Location: England, United Kingdom

Works Good Thanks

John.

SkyRC MC3000 – Maha C9000 – LaCrosse BC-900 – Floureon BT-C3100 V2.2 – Accumanger 10 and 20 -Thunder AC6 LiitoKala Lii-100 – Cozypony 24W Solar Panel – Soshine H4 – Soshine T2 – Soshine E4S