documentation: the assumptions behind presentation
We were talking about documentation that's meant to be viewed online. My favored format there is HTML; he was advocating PDF. And when we had peeled away the surface issues, it turned out that our user models are fundamentally different.
To me, online documentation is all about the content, not the page layout. I find chunks of whitespace before the next page distracting (wasteful of my valuable screen real-estage). Ditto running headers and footers; they don't run in my viewer but rather on the page whose size probably doesn't match that viewer, so what good are they to me? (Obviously if I print the doc that's different.) And a PDF viewer, being page-centric, does not afford me the convenience of resizing and getting a fresh layout. The viewer does not flow the content to me in useful ways, so it is a hinderance.
In a web browser with HTML, if I change the font size the content re-renders to fit in the window; I don't get a horizontal scroll bar. Not necessarily so with PDF; if I try to make the font too big for the allotted window, I have to scroll sideways to read -- which is enough of a PITA that I might then just print the sucker even though I wanted to read it online. A common response to this -- and my coworker's assumption -- is that people will maximize the viewer window and that will be (1) good enough to avoid this problem and (2) worth it for aesthetically-pleasing layout.
No, no, no! Yes, if I maximize the window then most normal PDF documents will be readable for me without scroll bars, but that is not the point. Why on earth would I maximize the window? And why should I impose this burden on my users?
For some reading, sure, this makes sense. But if I am consulting the documentation for an application I'm trying to run, I want to see that documentation and my application window at the same time. I might need to compare what I'm seeing in the app with what's in the screen shot in the doc. And if I'm writing code while referring to some SDK documentation, you can bet that I want to see my code next to their example code or API specification. Flipping back and forth forces me to allocate more memory, and I have better uses for those brain cells. If doc presentation hinders the user, the presentation has to change.
My coworker, on the other hand, is used to toggling. And he just plain reads enough online (that is, reads without simultaneously doing) that giving over the entire screen to a reader makes sense. It never occurred to him that people use documentation the way I do. When we began the conversation he was pretty sure I was a mutant due to my vision issues, but by the end he saw that normal people have these issues sometimes too, and he agreed to go off and think about what this means for our formatting templates. It was a productive conversation -- and that without either of us realizing the disconnect in advance and thus preparing diplomatic turns of phrase. :-)
Of course, it would be nice if we had some actual user data for our documentation. That's remarkably hard to get. I can observe our developers using my SDK documentation (and I do), but they're not necessarily represenative either, nor are they at all representative of end users. Actual customers, however, are notoriously bad at responding to surveys, and controlled user studies are expensive.

no subject
Deeply nonscientific, but still a data point.
no subject
The worst, however, are documents not ameniable to cutting and pasting. This often happens with older scanned DoD documentation. It makes it harder to search for things.... and there's another drawback with HTML: I can't do the search without having to wander through multiple files.
no subject
no subject
But I disagree about HTML and printing -- the problems you mention are consequences of particular layout- and chunking-decisions, not inherent to the medium. An HTML document can be laid out in a way that prints rather well, if not with the same level of page-layout control as with PDF. Furthermore, should the user desire to resize the document for printing, the same advantages of HTML over PDF for screen display still pertain.
This is not to say that PDF doesn't have its uses -- it is definitely The Right Tool for certain situations -- but its superiority over HTML for printing isn't a slam-dunk.
I think the too-small-chunks issue is often a result of page designers worrying more about layout than content, but I'll have to go find some of the pages that pissed me off that way again, and pay more attention to what it looks like the designers/writers were trying to do when they chose their page/chapter/chunk sizes. I could be wrong.
HTML and printing
Re: HTML and printing
Widow-orphan control was in page-layout and document-editing software at least 20 years ago, but an astounding number of people who ought to know better never bother using it.
Re: HTML and printing
no subject
Apparently many PDF viewers make it difficult to cut-and-paste text.
(Today I had a customer ask me for a plaintext version of a PDF list of license keys because he didn't have an easy way to extract them.)
In fact, at one point that was a selling point for PDF over other document formats - that it was non-trivial to copy the text.
no subject
no subject
no subject
no subject
I'm very glad that you're offering pdf and html versions of the material.
no subject
I'm glad that, 4+ years ago, I won the source-format argument so that we can. :-) (I'm not interested in generation that can't be automated.) We have XML source and generate both HTML and PDF from that source -- independently and automatically. We've only started generating PDF in the last year, but I saw this possibility from the beginning and made sure we planned for it.
no subject
no subject
no subject
no subject
no subject
no subject
no subject
2. You're comparing apples to oranges here. Ease of search is a feature of a single document over a set of documents. A bunch of data organized as a group of PDF documents is no easier to search than that same data organized as a group of HTML documents.
no subject
Definitely. The proposal that was under discussion was replacing my collection of HTML documents with a single (mucking huge) PDF file. (Fortunately, we're not going to do that.)
no subject
no subject
no subject
-- Dagonell,
P.S. Does this count as geeking about modern software? :)
no subject
Here's another vote for HTML for onscreen and PDF for paper and scribbling. I purely hate when some $#)#($*#$ documenter has decided what size I like my pages, how well I can see, what happens to graphics when I scroll, and so on, without a clue about how users really use & perceive the docs. Growl.
no subject
(Anonymous) 2005-08-24 05:04 am (UTC)(link)no subject
(Anonymous) 2005-08-24 04:51 am (UTC)(link)If you consult a human performance engineer, they have tools to watch eye movement, mouse movement, key strokes and the like to track how the user actually interacts with on line documentation. Most do appreciate context sensitive documentation requiring hard wiring to code (with somethink like Robohelp or Microsoft help files) that displays in pop up windows. If you use help for processes/procedures with Word you will get an excellent example. As they say on the X files, the answer is out there. Good luck.
no subject
Can you suggest articles that I should look at? I am, in particular, developing documentation for programmers, so my users are trying to design systems and write code. Others in my organization are writing documentation for end users and system administrators; I suspect there's a larger body of work for them. Of course, some issues about presentation transcend usage, but some might be specific to documentation contexts. (And I'm sure that all are sensitive to the user's degree of skill.)
no subject
I (hate)^n horizontal scrolling. HTML isn't a cure-all for that though, unless you make sure to never incorporate images bigger than about 400 pixels per dimension in the same file as text. I had to tweak my LJ settings because jerks were posting 1000-pixel-wide images diretly to some communities in my friends list and making the layout wider than my screen.
PDF versus PDA in a fight to the death
This results in the same problems with PDFs that your attempts to share screen space cause. Even on my "high-resolution" Palm (320x320, I think), there's no way that the entire width of a page can be read, which means that I have to engage in both horizontal and vertical scrolling. It actually gets worse with multi-column layouts, because then I end up scrolling horizontally just a little bit each time I change the view.
Re: PDF versus PDA in a fight to the death
no subject
With well-indexed documentation, of course, you shouldn't have to do that a lot.
no subject
One of the places I worked did a user survey on this subject and found that the preference was almost 50-50 (this was 6-7 years ago and not a large enough sampling for any type of statistical accuracy) and that most people had a strong opinion. However, when we split out reference material from other docs, even the folks who wanted PDF saw the benefits of HTML.
no subject
My doc set contains developer guides (conceptual material plus key entry points into the API), a quick-start guide (very procedural, for people too lazy to read documentation first), extensive documented examples (text includes key code excerpts; full source code is linked), instructions for running the apps in the suite, the javadoc, and the usual extras (high-level overview, glossary, index). One big PDF file for everything minus javadoc would I think be unwieldy (it would be about 600 pages if you didn't include the full source for the examples).
By the way, I also brought up your two arguments against PDF. Internal linking has gotten much better recently (though I don't know about linking from outside), and my coworker showed me non-fuzzy PDF that convinced me that with proper configuration and good font choices that can be dealt with.
One tidbit I didn't know: generating PDF, at least using XSLTProc, involves loading your entire source into memory. It can't stream. I had always assumed that a file-transformation process would be disk-bound, not memory-bound. A coworker reported builds (from a previous job) that took hours to complete. Building the PDF for the individual developer guides and examples takes about five minutes.