Document Imaging Technology Tames the Paper Tiger
Document Imaging Technology Tames the Paper Tiger By Heather B. Hayes Responding to Freedom of Information Act requests has historically been one of the most tedious and time-consuming manual tasks in government. Officials must comb through thousands of paper documents, pore over each word for sensitive or private information, and then resort to skills they learned in kindergarten like cutting, pasting and marking to cover up or redact any unreleasab
Document Imaging Technology Tames the Paper Tiger
By Heather B. Hayes
Responding to Freedom of Information Act requests has historically been one of the most tedious and time-consuming manual tasks in government. Officials must comb through thousands of paper documents, pore over each word for sensitive or private information, and then resort to skills they learned in kindergarten like cutting, pasting and marking to cover up or redact any unreleasable information.
Such a task wouldn't be so unbearable if it was rare, but FOIA requests in recent years have swelled to unmanageable numbers, due in part to an increase in lawsuits, the proliferation of conspiracy-minded activist groups, and the zealousness of post-Watergate reporters.
Vredenburg photoLarry Den, vice president, Vredenburg |
Although e-mail, word processing and other electronic documents that originated during the computer age are subject to FOIA requests, more than 90 percent of government information remains in paper form, according to David Lipstein, director of marketing and development at Eastman Software, McLean, Va.
To deal with this fact, many agencies have for some time relied on a simple scanner and a copier in their efforts to respond to requests electronically. The problem, though, is that this method doesn't begin to solve the most time-consuming and labor-intensive burden: redaction.
With each document, officers not only must scan for sensitive information and mark it up or tape it over, they also must note the reason for the redaction (there are nine exemptions under FOIA, including national security, and several others under the Privacy Act, such as address and Social Security number). The document must then be routed to subject experts, supervisors and legal counsel and can be sent back and forth several times as the agency attempts to make the document appeal- and suitproof. "It's not a one-shot deal," says Sloka. "We're constantly making copies of copies of copies. There is paper floating around everywhere."
In recent years, a number of software developers have come up with sophisticated electronic redaction tools that manage to take much of the labor out of this tedious task. These include the aforementioned Imagine FOIA; Imaging for Windows, Professional version, from Eastman Software; and HighView, a FOIA-enabled imaging and document management system from Highland Technologies Inc., Lanham, Md. Others are Pagis Pro, from the Xerox Desktop Document System Division, Peabody, Mass., and Redax, from Digital Applications Inc., Aldan, Pa.
Although not all of the programs work the same, or as effectively, they generally attack the process like this: The user scans in the document, cleans it up and even adds optical character recognition capability for later retrieval. The user then begins looking for information that is sensitive to national interests, proprietary business data or personal information. When a sentence or whole paragraph is found to be unreleasable, a highlight is placed over the words and the officer can choose from a list of FOIA and Privacy Act exemptions to place over the highlight.
If routing capability is inherent in the system, the image is sent via e-mail or across the local area network to a supervisor or subject matter expert for approval. That person also can highlight text or images before routing the document back to the FOIA officer. The officer then burns in the redaction and saves it as an entirely separate file, which is sent to the requester. The original image and the redacted version are saved for posterity and for legal reasons. All versions are then stored in separate repositories.
"Now the hard work is done. You have a legacy of documents that you don't have to travel upstairs or downstairs or go rifling through boxes to find, which saves a tremendous effort because many of these documents are asked for time and again," explains Colletta. "And recognizing this, many agencies are now starting to scan the documents in proactively. Then all the FOIA officer has to do is search the database for the documents that pertain to the specific subject matter, do the redaction electronically and send them out."
Many government offices, including MacDill Air Force Base in Florida, the Office of the Secretary of the Interior and the Defense Intelligence Agency, have started out with just a redaction tool. Others like the INS, the Department of Justice and some members of the intelligence community have opted for full-scale, FOIA-fronted document management programs.
"Everybody has a slightly different approach and it really depends on what the agency wants to achieve and how much money they have to spend," says Lipstein. "There's not an off-the-shelf application that does everything that everybody wants. What we have are good, solid pieces of technology that can handle different parts of the process depending on how you want to implement it."
There are three elements necessary for an effective total solution, Den notes. The first is a software infrastructure that accommodates the ability to capture the data, redact it, store it in a repository and route it to the appropriate personnel. "It's not just having an image viewer put on top of Windows 95 and being able to look at a few TIF [tagged image file format] files that have been scanned in," he says. "It's a much more complicated infrastructure than that, because you've got to be able to determine what the request is, if you've done the request before, and where that information is located."
The second aspect is the ability to publish in a number of different forms, including paper, disk, CD-ROM and over the Web. The third element is the provision of a database robust enough to decipher requests to see whether they have been done before and are stored in the repository.
In their fervor to get their FOIA problem under control, most agencies are opting for commercial off-the-shelf applications that can be integrated into their existing information infrastructures. And most are looking to purchase both products and services off the General Services Administration schedule. Only a few requests for proposal have been let, and most of those are for complementary case tracking systems that are useful in reporting FOIA data to Congress.
In the end, the burden will be on industry to lead the way.
"Although everyone is grumbling about the E-FOIA legislation not being funded, I think the one thing it has accomplished is that it has called attention to how this has all been tackled previously and shown that you really can apply technology to the problem," Lipstein concludes. "There [is] a tremendous amount of long-term savings in dollar and people costs that will come once the initial investment is made."
NEXT STORY: Washington Technology Bits & Bytes