bio

I am an Assistant Professor of Computer Science at the University of West Georgia. My personal page at West Georgia contains my current class schedule, office hours, and other teaching-related information. I received my Ph.D. from Georgia Tech's College of Computing, where I was part of the Distributed Data Intensive Systems Lab.

rockdj@cc.gatech.edu .::. College of Computing, Georgia Institute of Technology, Atlanta, GA 30332-0280 .::. 404.385.2585

research interests

My research interests lie in the areas of database systems and mobile computing. I am especially interested in the application of database technologies to new domains such as the Internet and bioinformatics.

LLNL Bioinformatics Project

The Lawrence Livermore National Laboratory DataFoundry Bioinformatics Project seeks to automatically unify the quickly proliferating BLAST data sources behind an integrated user interface. Research challenges include describing the data sources of interest, identifying relevant sources, and determining the input, output, and control-flow characteristics of the identified sources.

Page Digest

The Page Digest is a mechanism for efficient storage and processing of Web documents. The Page Digest design encourages a clean separation of the structural elements of Web documents from their content. Its encoding transformation is invertible without introducing significant additional cost or complexity to normal document parsing. Compared to standard DOM implementations, our initial experimental results show that Page Digest encoding can provide an order-of-magnitude speedup when traversing a Web document or comparing two arbitrary Web documents. We have examined the potential benefits of using Page Digest in other large-scale Web services such as Web search software, Web data extraction services, and automatic fragment detection for dynamic content caching.
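
To make the idea of separating structure from content concrete, here is a minimal sketch in Python. It is not the actual Page Digest encoding, whose format is not described on this page. The names `SplitParser`, `encode`, and `decode` are hypothetical, and the sketch assumes simple, well-formed markup with lowercase tags and no attributes, comments, or character entities.

```python
from html.parser import HTMLParser


class SplitParser(HTMLParser):
    """Split a document into a structure stream and a separate content list.

    Illustrative only: attributes, comments, and doctype declarations are
    ignored, so the round trip holds only for simple attribute-free markup.
    """

    def __init__(self):
        super().__init__()
        self.structure = []  # ("open", tag), ("close", tag), or ("text", index)
        self.content = []    # text runs, referenced by index from the structure

    def handle_starttag(self, tag, attrs):
        self.structure.append(("open", tag))

    def handle_endtag(self, tag):
        self.structure.append(("close", tag))

    def handle_data(self, data):
        # Store only a pointer in the structure; the text itself lives apart.
        self.structure.append(("text", len(self.content)))
        self.content.append(data)


def encode(html):
    """Return (structure, content) for a simple HTML string."""
    parser = SplitParser()
    parser.feed(html)
    parser.close()
    return parser.structure, parser.content


def decode(structure, content):
    """Invert encode() by walking the structure stream once."""
    parts = []
    for kind, value in structure:
        if kind == "open":
            parts.append(f"<{value}>")
        elif kind == "close":
            parts.append(f"</{value}>")
        else:
            parts.append(content[value])
    return "".join(parts)


doc = "<html><body><h1>Title</h1><p>Hello <b>world</b></p></body></html>"
structure, content = encode(doc)
restored = decode(structure, content)
```

In this sketch, two documents that differ only in their text produce identical structure streams, so a structural comparison never has to touch the content list.
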
Our experimental results show that change detection using the Page Digest operates in linear time, offering a 75% improvement in execution performance when compared with popular existing change detection and difference systems. In addition, the Page Digest format reduces the tag name redundancy found in Web documents, which provides up to a 50% reduction in document size without employing data compression techniques.

recent publications