Sunteți pe pagina 1din 3

the XML strategist

Casting a critical eye on the Next Big Thing in technical publishing.

XML, Growing Up Fast One problem is that beau- It’s also possible that we
BY SARAH S. O’KEEFE, Associate Fellow tiful print layouts almost will finally see some prog-
always require formatting ress in support for XSL-FO
exceptions. When fitting creation tools. However,

T his year marks the


ten-year anniversary of
XML. In 1998, technical
result, we forget to see the
bigger picture—the wad-
ing birds, the snake sun-
copy to a page, a production
editor might tighten line
spacing imperceptibly to
the software vendors do not
appear to be focusing in this
area. The vendors that make
content develop- ning itself, and the fit an extra line or two on a development tools are more
ment was gener- marsh flowers. So page, which then causes the focused on XML and XSL,
ally based on a in this column, I remaining pages to fall into which are used much more
desktop publish- want to set aside place nicely. The automated widely. Another possibility
ing paradigm. the pressing cur- tools cannot match this would be that a publish-
The best practice rent issues—how level of sophistication, so ing tools company, such
for efficiency in to convert badly automatically generated as Adobe, Quark, or PTC,
publishing was formatted legacy layouts have a less polished would produce a graphical
to single-source content to XML, look than a layout that was XSL-FO editor. However,
content—perhaps whether DITA is the fine-tuned by a production these companies appear
by developing in Frame- right choice or not—and editor. to perceive XSL-FO as a
Maker and outputting instead look forward five to Many organizations no threat to their existing print
HTML content through ten years. longer print documentation solutions—why produce
WebWorks Publisher. A few Note: STC provides a for their customers. Instead, something that competes
very large companies had forum (stcforum.org/view they provide PDF files, directly with InDesign/
expensive systems based on forum.php?id=51) where you which customers can print FrameMaker, QuarkXPress,
SGML, or Standard General- can discuss this and other themselves. Thus, I believe or the E3 publishing engine?
ized Markup Language (the Intercom articles. If (when) that the print problem will
precursor to XML). you disagree with what I be solved in two ways: Convergence Continues
Today, the state of the art write here, you can start a • Lower standards. Organ- As documentation moves
is an XML-based publishing discussion there. You can izations decide that the online and becomes inter-
environment, with push- also contact me directly printed output from an active, training is moving
button delivery of various using the information at the automated XML-based online and becoming less
output formats, often in end of this column. workflow, while imperfect, interactive. The distinction
multiple languages. XML is good enough, especially between e-learning and
has found its way into orga- Solving the Printing if Web-based help is the interactive documentation
nizations of all sizes. A lot Problem primary output. is blurring.
has changed in ten years. Producing attractive print- • XML invades the page de- Automation in this area
Nonetheless, I believe that ed output from XML is still scription business. Adobe is still limited because
the current impact of XML unbelievably difficult. In the is working on a project, graphic, sound, and video
is small compared to what’s original version of my “XML code-named Mars, that formats tend to be binary.
coming in the next decade. and Structured Authoring” will create a version of They require proprietary
The evolution of content white paper, published in PDF written in XML. This authoring tools, and the
creation from format-driven 2002, I blithely assumed that will eliminate the need for file formats are difficult (or
publishing to structured the challenges of creating processing XML through impossible) to manipulate
authoring is a paradigm- print layouts through the XSL-FO, InDesign, Frame- programmatically. The Scal-
shifting (and cliché-ridden) Extensible Stylesheet Lan- Maker, or other XML- able Vector Graphics (SVG)
transition. guage Formatting Objects aware publishing tools. format, which describes
While wading through (XSL-FO) would be solved Instead, we could use vector images in XML, can
a swamp, it’s important to “within five years.” It is now XSL transformation to provide a foundation for
focus on each step to avoid six years later, and the XSL- produce “PDF-ML” from vector graphics in XML. But
getting muddy. But as a FO challenges remain. XML source content. photographs and screen

July/August 2008 27
the XML strategist

Over time, XML will become as ubiquitous as HTML.


captures do not yet have an to handle it. When SGML The Dream of Better authoring tools are not ma-
equivalent format. was developed in 1979, it Authoring Tools ture yet—nobody offers the
Flash authoring is moving provided “tag minimization” Some of you may remem- elegance and clarity of pur-
(slightly) in this direction. options because this made ber the early days of HTML, pose for XML that Dream-
According to Colin Moock the text files smaller. At the when we wrote content in weaver does for HTML.
(a Flash consultant and time, bringing file size down Notepad. The first genera- So while I’m making
author), the next version of a few kilobytes made an tion of HTML tools wasn’t predictions that may or may
Flash (CS4) will use a new appreciable difference terribly good, either. But to- not come true, I’d like to
format, XFL, to write out in processing time. By day, we have Dreamweaver, outline the list of software
Flash files. Currently, it uses 1998, tag minimization which is the de facto stan- requirements for a great
Flash (.fla) format, which was omitted from XML dard for Web authoring and authoring tool. It should
can only be read by Flash because the file size was no design. Why is it so success- include the following:
authoring tools. XFL will longer a problem for the ful? The software reflects a • Features should address
be XML-based—at least for computer hardware. XML deep understanding of what not just authoring tasks
text within the file and for files are more verbose than Web content creators do, but also the publishing
the description of the assets SGML files, but easier to and it provides features to and production side.
(sound and video) that are parse. Over time, textual accomplish those tasks ef- This might require
included in the file. I assume and graphical content has ficiently and elegantly. integration with content
that this means that the evolved from binary to Unfortunately, we have management systems, or
sound and video files will compacted text formats nothing comparable for the ability to send a com-
still be in a binary format. to verbose text formats. I XML content creation. pleted document onward
The XFL format will al- expect that audio and video In one segment, we have in your workflow.
low us to use text stored in will follow the same path. developer tools tarted up to • There should be sev-
XML as the foundation for In addition to the techni- support XML-based content eral ways of viewing and
content delivered in Flash as cal aspect of convergence, in addition to XML data. editing content, including
well as Web and print deliv- where information is migrat- Other tools use a desktop plain text (code view),
erables. This opens up Flash ing into XML, we also have publishing foundation with tree view, word proces-
authoring to single-sourcing an interesting convergence structured authoring sup- sor view, and output view
techniques for the first time. of “professional” and user- port. A third group evolved for every format that you
Sound and video use num- generated content. I plan from SGML authoring to produce. Content should
erous standards, and stream- to address that in a future support XML authoring. be fully editable in all of
ing technology adds another column. But the purpose-built XML these views.
layer of complications.
Imagine a multimedia Figure 1. When Flash understands XML, we can link XML-based text into Flash files.
development environment
Not XML…yet
where all content is en-
coded in XML. We would
be able to apply the same XML
automation we’re develop-
ing for text today to graph- Sound
ics, sound, and video.
I can already hear the
protests about how XML is
a terrible, inefficient format
for encoding anything other
than text. Remember that Video
the same argument was
made that XML was horrible HTML PDF Flash
for anything including text.
Yes, it’s inefficient, but it’s
not going to matter because
the hardware will be able

28 July/August 2008
the XML strategist

Figure 2. One format to rule them all... requirements list. If highly


designed print layouts are
All content in XML
mandatory for your content,
XML may not be ready for
Text Graphics Sound Video
you. On the other hand, if
the convergence of docu-
mentation and training is
of interest to you, you may
want to consider XML
sooner rather than later.
The structured authoring
paradigm is challenging for
XML-to-XML transformation many authors, and if your
authors have had difficul-
ties mastering the easier
unstructured tools, the state
Web Print/PDF e-Learning Others of XML authoring tools
may present a significant
obstacle.
We live in interesting
times.

XML-based output formats Suggested Readings

• All of the editable views the editor should offer rich, elegant, non-XML ChemicalML discussion, cml
should support condi- intelligent suggestions on authoring environment .sourceforge.net/historical/position
tional content display, where reuse might be ap- and wonder why moving to .html
so that you can author propriate. (Interestingly, XML requires a huge step
in a preview mode for a AuthorIT and Flare are backward in usability for Colin Moock’s blog post
specific output or with all both moving in this direc- authors. about XFL, www.moock.org/
content showing. tion. Sadly, their author- It’s my opinion that over blog/archives/000269.html
• All of the editing views ing tools are not XML time, XML will become
should be able to display authoring tools.) as ubiquitous as HTML. MathML, www.w3.org/Math/
all supported document The structured authoring
components, including As I write my list of de- paradigm, which allows us to Project Mars, labs.adobe.com/
text, graphics, audio, and mands, it occurs to me that I enforce required document wiki/index.php/Mars
video. For example, a should defend the software structures, is compelling
hypothetical iPhone vendors. The reason that for technical documents. SVG, www.w3.org/TR/SVG/
content mode should nobody has built a tool like Software will eventually
(currently) not render this yet is because it’s hard. address the authoring and
any Flash video content. The technical documenta- publishing challenges, and Sarah O’Keefe (xmlstrategist@
PDF mode should show tion market is not known for more content types will scriptorium.com) is founder
all multimedia content. spending freely on software, migrate to XML formats. and president of Scriptorium
Print preview mode even when the productivity The implementation cost Publishing Services, Inc.
should replace live video gains are compelling. will decline and the value of (www.scriptorium.com), based
with static graphic images an XML-based environment in Research Triangle Park,
and disable hyperlinks. Living Up to Hype will increase, resulting in North Carolina. She is a senior
• Assigning metadata to a Many of you are probably more organizations migrat- member of the Carolina Chapter
particular content unit tired of hearing about the ing to XML. STC and of the Consulting and
should be fast and easy. glories of XML. Perhaps you If you are considering Independent Contracting and
• As you author content, like your current feature- XML, develop your own Management communities.

July/August 2008 29

S-ar putea să vă placă și