form of trial and error, a system is given a "training set" consisting of documents
preclassified into two or more groups, along with a set of features that might be
potentially useful in classifying the sets. The system then "learns" rules that assign
weights to those features according to how well they work in classification, and assigns
each new document to a category with a certain probability.
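The training procedure described above can be illustrated with a minimal sketch. It is not the method of any filtering company discussed here. It assumes a simple logistic-regression learner, a hypothetical feature set (the words present in a document), and an invented four-document training set; the names `features`, `train`, `predict`, and `TRAINING_SET` are illustrative only.

```python
import math
from collections import defaultdict

# Invented training set: documents preclassified into two groups (1 or 0).
TRAINING_SET = [
    ("free casino poker bonus", 1),
    ("casino jackpot slots bonus", 1),
    ("library hours and book loans", 0),
    ("school science fair schedule", 0),
]

def features(text):
    # Hypothetical feature set: the distinct lowercase words in the document.
    return set(text.lower().split())

def predict(weights, bias, text):
    # Sum the learned weights of the features present, then squash the
    # score into a probability between 0 and 1 for membership in group 1.
    score = bias + sum(weights.get(f, 0.0) for f in features(text))
    return 1.0 / (1.0 + math.exp(-score))

def train(training_set, epochs=200, rate=0.5):
    # Trial and error: predict each training document, then nudge the
    # weight of every feature it contains in proportion to the error.
    weights = defaultdict(float)
    bias = 0.0
    for _ in range(epochs):
        for text, label in training_set:
            error = label - predict(weights, bias, text)
            bias += rate * error
            for f in features(text):
                weights[f] += rate * error
    return weights, bias

weights, bias = train(TRAINING_SET)
# A new document is assigned to a category only with a certain probability.
print(round(predict(weights, bias, "poker bonus tonight"), 3))
print(round(predict(weights, bias, "book fair schedule"), 3))
```

A system of this kind sees only which features are present; a word absent from the training set carries no weight at all, which is one reason such a classifier can miss distinctions that would be obvious to a human reader.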
Notwithstanding their "artificial intelligence" description, automated text
classification systems are unable to grasp many distinctions between types of content that
would be obvious to a human.  And of critical importance, no presently conceivable
technology can make the judgments necessary to determine whether a visual depiction fits
the legal definitions of obscenity, child pornography, or harmful to minors.  
Finally, all the filtering software companies deposed in this case use some form of
human review in their process of winnowing and categorizing Web pages, although one
company admitted to categorizing some Web pages without any human review.
SmartFilter states that "the final categorization of every Web site is done by a human
reviewer." Another filtering company asserts that of the 10,000 to 30,000 Web pages that
enter the "work queue" to be categorized each day, two to three percent of those are
automatically categorized by their PornByRef system (which only applies to materials
classified in the pornography category), and the remainder are categorized by human
review.   SurfControl also states that no URL is ever added to its database without human
review.  
Human review of Web pages has the advantage of allowing more nuanced, if not