×

Computer file system using content-dependent file identifiers

DC CAFC
  • US 8,001,096 B2
  • Filed: 10/31/2007
  • Issued: 08/16/2011
  • Est. Priority Date: 04/11/1995
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method operable in a file system comprising a plurality of servers, the method comprising the steps of:

  • (A) adding a data item to the file system, the data item consisting of a sequence of non-overlapping parts, each part consisting of a corresponding sequence of bits, by;

    (A1) for each part in said sequence of parts, determining, using hardware in combination with software, a corresponding digital part identifier, wherein each said digital part identifier for each said part is determined based at least in part on a first function of all of the bits in the sequence of bits comprising the corresponding part, the first function comprising a first hash function;

    (A2) determining, using a second function, a digital identifier for the data item, said digital data item identifier being based, at least in part, on the contents of the data item, wherein two identical data items in the file system will have the same digital data item identifier in the file system, said second function comprising a second hash function;

    (A3) storing each part in said sequence of parts on multiple servers of said plurality of servers in the file system;

    (A4) storing first mapping data that maps the digital data item identifier of the data item to the digital part identifiers of the parts comprising the data item;

    (A5) storing second mapping data that maps the digital part identifier of each part in said sequence of parts to corresponding location data that identifies which of the plurality of servers in the file system stores the corresponding part; and

    (B) repeating step (A) for each of a plurality of data items; and

    (C) attempting to access a particular data item in the file system by;

    (C1) obtaining a particular digital data item identifier of the particular data item, said particular digital data item identifier of said particular data item being included in an attempt to access said particular data item in said file system;

    (C2) attempting to match, using hardware in combination with software, said particular digital data item identifier of said particular data item with a digital data item identifier in said first mapping data; and

    (C3) based at least in part on said attempting to match in step (C2), when said particular digital data item identifier obtained in step (C1) corresponds to an identifier in said first mapping data, using said first mapping data to determine a digital part identifier of each part comprising the particular data item;

    (C4) using said second mapping data and at least one digital part identifier determined in step (C3) to determine location data that identifies which of the plurality of servers in the file system stores the corresponding at least one part of the particular data item;

    (C5) attempting to access at least one part of the particular data item at one or more servers identified in step (C4) as storing said at least one part.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×