Just-In-Time Analytics on Large File and Storage Systems

Case ID:
C11343
Disclosure Date:
12/6/2010

C11343: Just-In-Time Analytics on Large File and Storage Systems



Technical Details:

As file systems reach the petabytes scale, users and administrators are increasingly interested in acquiring highlevel analytical information for file management and analysis. Two particularly important tasks are the processing of aggregate and top-k queries which, unfortunately, cannot be quickly answered by hierarchical file systems such as ext3 and NTFS. Existing pre-processing based solutions, e.g., file system crawling and index building, consume a significant amount of time, energy and space (for generating and maintaining the indexes) which in many cases cannot be justified by the infrequent usage of such solutions. In this paper, we advocate that user interests can often be sufficiently satisfied by approximate -i.e., statistically accurate -answers. We develop Glance, a just-in-time sampling-based system which, after consuming a small number of disk accesses, is capable of producing extremely accurate answers for a broad class of aggregate and top-k queries over a file system without the requirement of any prior knowledge. We use a number of real-world file systems to demonstrate the efficiency, accuracy and scalability of Glance.




Patent Information:
Title App Type Country Serial No. Patent No. File Date Issued Date Expire Date Patent Status
Just-In-Time Analytics on Large File Systems ORD: Ordinary Utility United States 13/328,810 9,244,975 12/16/2011 1/26/2016 12/16/2031 Granted
Just-In-Time Analytics on Large File Systems CIP: Continuation-in-part United States 13/402,764 9,244,976 2/22/2012 1/26/2016 12/16/2031 Granted
Inventors:
Category(s):
Get custom alerts for techs in these categories/from these inventors:
For Information, Contact:
Mark Maloney
dmalon11@jhu.edu
410-614-0300
Save This Technology:
2017 - 2022 © Johns Hopkins Technology Ventures. All Rights Reserved. Powered by Inteum