
Originally Posted by
weegillis
To #1. .txt or .html would be the format for the files. .DOC, .RTF or others will be bloated, and cannot be edited on the fly with Notepad or other plain text editor.
The pages are already tagged, so you could just copy the 'content div' and save it to the new destination with the name or id you want to use (as in, .html or .htm, or even .php). Keep the tags, and even the class names and ids. The style hooks will already be in your CSS, making restyling a simple task you can tweak as you go. If the site has DOM scripts, chances are some of those ids (or possibly classes) are sought out as well. Another reason to keep the tags and attributes.
You'll need to do this a thousand times, so it will take some work. It is possible to automate some of the process with JavaScript. Write a function that hooks the content div, copies it, then overwrites the existing page, leaving only the content div. Then save page as and Back to remote page and do it again with your next page. The saved pages will likely still have doctype and <html/html> tags which would need to be removed in a text editor (or Dreamweaver, in a couple passes). After that, you've got your content modules.
There is no opening or editing of server files, only creation of new ones, which are stored locally. For ease, you could even keep the URI, as they will never conflict with anything in their own folder.
===
Not that it's a problem having 1000 pages in one folder, but 1000 links in one index page is a tall order, and totally impractical (and not very usable). One would presume there is some order to the content now, such as categories or grouping of some nature. Might these become the root index of their own folder, with only those included pages alongside? 100 links in a main index is not so bad, 50 is better. Having around 5 for simple site-wide navigation in every page would allow quick traversal across large groups. Each of the 5 (or so) would be indexes of still more indexes.
The index.php at each level would share the template intended for its level, and would fork to the engine needed to generate a page at that level. The engine, by the way would be written in PHP on the server, and JavaScript on the client side (for DOM manipulation such as adding e-mail addresses, setting event handlers, etc.).
The manner you describe does not appear to include a database, so none is discussed in the above method. For my own purposes, I would just write the PHP and store the files, but there may be other opinions on this method, which favor the use of a database. You would still need to capture all the content, so the above copying/saving method still applies, but instead of eventually uploading them to a remote folder, you add them to the database. In this case, you may as well go out and buy Thesis, and forget all the other stuff.
But, if you don't want a blog, or a CMS, or to maintain a database, PHP can be made to manage your 1000 pages and 2000 photos very nicely, and the server will be the database (the relationships will all be hardcoded in your templates and index tables).
The Little Schools link in my sig is undergoing a rebuild in the manner I've described. The template is designed to handle two photos merged into tagged content that is retrieved from its respective content folder. The navigation is sensitive to the request URI permitting Previous/Next links to be dynamically created based upon adjacent rows in the folder table.
This may well be very old school, and may also have issues. Being a hobbyist, I cannot speak with a professional objective view, only what works for me in my own craft. I needed something that worked without a database, and now I have it in a handful of different forms. All with basic and simple PHP (and some logic).