Parallel data processing architecture

US 7,454,411 B2
Filed: 01/30/2004
Issued: 11/18/2008
Est. Priority Date: 09/28/1999
Status: Expired due to Term

Abstract

A tree-structured index to multidimensional data is created using naturally occurring patterns and clusters within the data which permit efficient search and retrieval strategies in a database of DNA profiles. A search engine utilizes hierarchical decomposition of the database by identifying clusters of similar DNA profiles and maps to parallel computer architecture, allowing scale up past previously feasible limits. Key benefits of the new method are logarithmic scale up and parallelization. These benefits are achieved by identification and utilization of naturally occurring patterns and clusters within stored data. The patterns and clusters enable the stored data to be partitioned into subsets of roughly equal size. The method can be applied recursively, resulting in a database tree that is balanced, meaning that all paths or branches through the tree have roughly the same length. The method achieves high performance by exploiting the natural structure of the data in a manner that maintains balanced trees. Implementation of the method maps naturally to parallel computer architectures, allowing scale up to very large databases.
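
The recursive partitioning described in the abstract can be illustrated with a small sketch. It is not the patented clustering method: a median split on one coordinate stands in for the cluster identification step, and the names `Node`, `build_tree`, `leaf_depths`, and `leaf_size` are hypothetical. The sketch only shows how splitting the data into subsets of roughly equal size at every level yields a tree whose paths have roughly the same length.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Profile = Tuple[float, ...]  # assumption: a profile is a fixed-length numeric vector


@dataclass
class Node:
    profiles: Optional[List[Profile]] = None  # set on leaf nodes only
    axis: int = 0                             # coordinate used for the split
    threshold: float = 0.0                    # largest split-coordinate value in the left subset
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build_tree(profiles: Sequence[Profile], leaf_size: int = 2, depth: int = 0) -> Node:
    """Recursively split the data into two subsets of roughly equal size.

    A median split is a stand-in for the cluster-based partitioning
    described in the abstract (assumption).
    """
    if len(profiles) <= leaf_size:
        return Node(profiles=list(profiles))
    axis = depth % len(profiles[0])            # cycle through the coordinates
    ordered = sorted(profiles, key=lambda p: p[axis])
    mid = len(ordered) // 2
    return Node(
        axis=axis,
        threshold=ordered[mid - 1][axis],
        left=build_tree(ordered[:mid], leaf_size, depth + 1),
        right=build_tree(ordered[mid:], leaf_size, depth + 1),
    )


def leaf_depths(node: Node, depth: int = 0) -> List[int]:
    """Return the depth of every leaf, to check how balanced the tree is."""
    if node.profiles is not None:
        return [depth]
    return leaf_depths(node.left, depth + 1) + leaf_depths(node.right, depth + 1)


# Made-up two-dimensional data, 16 records.
data = [(float(i * 7 % 16), float(i * 3 % 16)) for i in range(16)]
tree = build_tree(data, leaf_size=2)
depths = leaf_depths(tree)
```

Because each split halves the record count, the number of levels grows with the logarithm of the database size, which is the scaling behavior the abstract attributes to the balanced tree.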

46 Citations

20 Claims

1. A parallel data processing system for search, storage and retrieval of data of a database responsive to client queries for specific data of said database, said parallel data processing system comprising:
- a plurality of host processors including a root host processor, said root host processor being responsive to said client queries for said specific data of said database, wherein at least two host processors have a search engine and maintain information of a search queue of said client queries;
  
  at least two host processors having a queue of search requests for specific data of said database, each of said host processors executing a search engine, communicating capacity and load information between host processors and said at least two host processors exchanging at least one search request, the search engine removing at least one search request from a search queue and generating an additional search request, each of said host and root host processors maintaining a list of available host processors and information about the capacity and load for each available host processor in memory and broadcasting its capacity and load information to other host processors and bringing its search queue into balance with another host processor according to a time constant in response to receipt of said broadcast capacity and load information; and
  
  a communications system coupling said host and root processors, wherein at least two host processors communicate capacity and load information to other host processors;
  
  selected host processors storing a database index for said database comprising nodes of a database tree for said database and data accessible via said nodes of said database tree.
- Dependent Claims (4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
  - 4. The parallel data processing system of claim 1, each host processor reconfiguring information on available host processors in response to the receipt of broadcast search queue length load and gathered processor capacity information.
  - 5. The parallel data processing system of claim 4, wherein the information on available host processors at each available host processor changes in response to failure of a host processor.
  - 6. The parallel data processing system of claim 4, wherein the information on available host processors at each available host processor changes in response to the addition of a host processor.
  - 7. The parallel data processing system of claim 1, wherein said plurality of host processors comprises groups of host processors.
  - 8. The parallel data processing system of claim 7, all host processors in each group operating on the same database.
  - 9. The parallel data processing system of claim 7, each group being assigned a portion of the database.
  - 10. The parallel data processing system of claim 9, each group being assigned a different portion of the database.
  - 11. The parallel data processing system of claim 10, wherein each processor of a group of processors is assigned the same portion of the database.
  - 12. The parallel data processing system of claim 1, said database index being a database tree for said database, said host processors capable of executing a set of tests, associating one test to each non-terminal node of said database index for said database.
  - 13. The parallel data processing system of claim 1, said available host processors comprising groups of m processors where m is an integer greater than 1.
  - 14. The parallel data processing system of claim 1, wherein said communications system is proximately located to said root host processor.
  - 15. The parallel data processing system of claim 1, wherein the plurality of host processors comprises at least two host processors having search engines and maintaining information of a search queue of said client queries, one of said host processors processing a search request and generating a new search request.
  - 16. The parallel data processing system of claim 1, further comprising shared memory between host processors.
  - 17. The parallel data processing system of claim 1, further comprising distributed memory among each processor.
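
The queue balancing recited in claim 1 can be sketched as follows. The claim says only that a host brings its search queue into balance with another host "according to a time constant"; the first-order relaxation used here (moving a fraction `1 - exp(-dt / tau)` of the excess on each broadcast) and the capacity-proportional fair share are assumptions made for illustration, as are the names `Host`, `requests_to_send`, `balance`, `dt`, and `tau`.

```python
import math
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Host:
    name: str
    capacity: float                      # relative processing capacity
    queue: deque = field(default_factory=deque)

    def broadcast(self) -> dict:
        """Capacity and load (queue length) information sent to other hosts."""
        return {"name": self.name, "capacity": self.capacity, "load": len(self.queue)}


def requests_to_send(host: Host, peer_info: dict, dt: float, tau: float) -> int:
    """Number of queued requests to hand to the peer after one broadcast.

    Assumption: the fair share is proportional to capacity, and the excess
    decays with time constant tau (first-order relaxation).
    """
    total = len(host.queue) + peer_info["load"]
    fair_share = total * host.capacity / (host.capacity + peer_info["capacity"])
    excess = len(host.queue) - fair_share
    if excess <= 0:
        return 0
    fraction = 1.0 - math.exp(-dt / tau)
    return int(excess * fraction)        # round down to whole requests


def balance(host: Host, peer: Host, dt: float, tau: float) -> int:
    """Move requests from the tail of host's queue to peer; return the count moved."""
    count = requests_to_send(host, peer.broadcast(), dt, tau)
    for _ in range(count):
        peer.queue.append(host.queue.pop())
    return count


a = Host("a", capacity=1.0, queue=deque(range(12)))
b = Host("b", capacity=1.0)
moved = balance(a, b, dt=1.0, tau=2.0)       # a is overloaded, so it sends work to b
moved_back = balance(b, a, dt=1.0, tau=2.0)  # b is below its fair share, so nothing moves
```

A larger `tau` moves fewer requests per broadcast, which damps oscillation when load information is stale. Because the count is rounded down, this sketch stops transferring once the scaled excess falls below one request.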

2. A parallel data processing system for search, storage and retrieval of data of a database responsive to client queries for specific data of said database, said parallel data processing system comprising:
- a plurality of host processors including a root host processor, said root host processor being responsive to said client queries for said specific data of said database;
  
  each of said host and root host processors maintaining a list of available host processors and information about the capacity and load for each available host processor in memory;
  
  at least two host processors having a queue of search requests for specific data of said database, each of said host processors executing a search engine, communicating capacity and load information between host processors and said at least two host processors exchanging at least one search request, the search engine removing at least one search request from a search queue and generating an additional search request, and
  
  a communications system coupling said host and root processors, wherein at least two host processors communicate capacity and load information to other host processors and each have a search engine and each maintain load information of a search queue length of said client queries;
  
  each of said at least two host processors broadcasting its capacity and search queue length load information to other host processors and bringing its search queue of said client queries into balance according to a time constant with another host processor in response to receipt of said broadcast capacity and load information;
  
  selected host processors storing a database index for said database comprising nodes of a database tree for said database and data accessible via said nodes of said database tree wherein the plurality of host processors comprises three host processors, of which two host processors have search engines and maintain information of said search queue of said client queries and the third comprises said root host processor.
- Dependent Claims (18)
  - 18. The parallel data processing architecture of claim 2 wherein said plurality of host processors comprise groups of host processors, each group having at least one assigned host processor, and each group being assigned a portion of the database.
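
One way to picture the combination of claim 2 and claim 18 is a database tree whose nodes are divided among groups, where processing one search request at a non-terminal node generates an additional request for the group that holds the next node. The sketch below is an illustration under simplifying assumptions, not the claimed system: each group is modeled as a single queue, the test at a non-terminal node is a numeric threshold, and `TreeNode`, `SearchRequest`, `Group`, and `owner_of` are hypothetical names.

```python
from collections import deque
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class TreeNode:
    node_id: str
    threshold: Optional[float] = None      # test at a non-terminal node: key <= threshold goes left
    left: Optional[str] = None
    right: Optional[str] = None
    records: Optional[List[float]] = None  # data held at a terminal node


@dataclass
class SearchRequest:
    key: float
    node_id: str                           # node of the database tree to examine next


class Group:
    """A group of hosts, modeled as one queue, holding a portion of the tree."""

    def __init__(self, name: str, nodes: Dict[str, TreeNode]):
        self.name = name
        self.nodes = nodes
        self.queue: deque = deque()

    def step(self, owner_of: Dict[str, str], groups: Dict[str, "Group"]) -> Optional[bool]:
        """Remove one request from the queue and process it.

        Returns True or False at a terminal node. At a non-terminal node it
        generates an additional request for the owning group and returns None.
        """
        request = self.queue.popleft()
        node = self.nodes[request.node_id]
        if node.records is not None:
            return request.key in node.records
        child = node.left if request.key <= node.threshold else node.right
        groups[owner_of[child]].queue.append(SearchRequest(request.key, child))
        return None


# Made-up three-node tree: the root group holds the root, two groups hold the leaves.
groups = {
    "g0": Group("g0", {"root": TreeNode("root", threshold=10.0, left="A", right="B")}),
    "g1": Group("g1", {"A": TreeNode("A", records=[1.0, 5.0])}),
    "g2": Group("g2", {"B": TreeNode("B", records=[12.0, 20.0])}),
}
owner_of = {"root": "g0", "A": "g1", "B": "g2"}

groups["g0"].queue.append(SearchRequest(12.0, "root"))
first = groups["g0"].step(owner_of, groups)   # forwards a new request to g2
second = groups["g2"].step(owner_of, groups)  # resolves the request at a leaf
```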

3. A parallel data processing system for search, storage and retrieval of data of a database responsive to client queries for specific data of said database, said parallel data processing system comprising:
- a plurality of host processors including a root host processor, said root host processor being responsive to said client queries for said specific data of said database;
  
  each of said host and root host processors maintaining a list of available host processors and information about the capacity and load for each available host processor in memory;
  
  at least two host processors having a queue of search requests for specific data of said database, each of said host processors executing a search engine, communicating capacity and load information between host processors and said at least two host processors exchanging at least one search request, the search engine removing at least one search request from a search queue and generating an additional search request, and
  
  a communications system coupling said host and root processors, wherein at least two host processors communicate capacity and load information to other host processors and have a search engine and maintain load information of a search queue length of said client queries;
  
  each of said at least two host processors bringing its search queue of client queries into balance with another host processor according to a time constant in response to receipt of said broadcast capacity and load information;
  
  selected host processors storing a database index for said database comprising nodes of a database tree for said database and data accessible via said nodes of said database tree wherein the plurality of host processors comprises two host processors, of which one comprises said root host processor and both said host processors have search engines and maintain information of said search queue of said client queries.
- Dependent Claims (19, 20)
  - 19. The parallel data processing architecture of claim 3 wherein said plurality of host processors comprise groups of host processors, each group having at least one assigned host processor, and each group being assigned a portion of the database.
  - 20. The parallel data processing architecture of claim 3 wherein the information on available host processors at each available host processor changes in response to one of failure and addition of a host processor.
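
The list of available host processors recited in the independent claims, and its change on failure or addition of a host (claim 20), can be sketched as a table that is refreshed by each broadcast. The claims do not say how a failure is detected; treating a host as failed when no broadcast arrives within a timeout is an assumption, and `HostTable`, `HostInfo`, `on_broadcast`, `prune`, and `timeout` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class HostInfo:
    capacity: float
    load: int          # search queue length reported by the host
    last_heard: float  # time of the most recent broadcast


class HostTable:
    """Per-host list of available hosts with their capacity and load."""

    def __init__(self, timeout: float):
        self.timeout = timeout
        self.hosts: Dict[str, HostInfo] = {}

    def on_broadcast(self, name: str, capacity: float, load: int, now: float) -> None:
        """Add a newly seen host or refresh the entry of a known one."""
        self.hosts[name] = HostInfo(capacity, load, now)

    def prune(self, now: float) -> List[str]:
        """Drop hosts that have been silent longer than the timeout (assumed failure rule)."""
        failed = [n for n, info in self.hosts.items() if now - info.last_heard > self.timeout]
        for name in failed:
            del self.hosts[name]
        return failed

    def available(self) -> List[str]:
        return sorted(self.hosts)


table = HostTable(timeout=3.0)
table.on_broadcast("h1", capacity=1.0, load=4, now=0.0)
table.on_broadcast("h2", capacity=2.0, load=0, now=0.0)
table.on_broadcast("h1", capacity=1.0, load=3, now=2.0)  # h1 keeps broadcasting
failed = table.prune(now=4.0)                            # h2 has been silent too long
table.on_broadcast("h3", capacity=1.5, load=1, now=4.0)  # a host is added
```

Times are passed in explicitly rather than read from a clock so that the behavior is deterministic and easy to test.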

Current Assignee: University of Tennessee Research Foundation (University of Tennessee)
Original Assignee: University of Tennessee Research Foundation (University of Tennessee)
Inventors: Yadav, Puneet; Icove, David J.; Birdwell, John D.; Wang, Tse-Wei; Horn, Roger D.
Primary Examiner(s): Alam, Hosain T
Assistant Examiner(s): Ahluwalia, Navneet K
Application Number: US10/767,776
Publication Number: US 20040186920A1
Time in Patent Office: 1,754 Days
Field of Search: 707/1-10, 707/100-104.1, 707/200-205, 718/100-107
US Class Current: 1/1
CPC Class Codes

G06F 16/2246   Trees, e.g. B+trees

G06F 16/2264   Multidimensional index stru...

G06F 16/285   Clustering or classification

G16B 40/00   ICT specially adapted for b...

G16B 40/30   Unsupervised data analysis

G16B 50/00   ICT programming tools or da...

G16B 50/20   Heterogeneous data integration

Y10S 707/99932   Access augmentation or opti...

Y10S 707/99933   Query processing, i.e. sear...

Y10S 707/99935   Query augmenting and refini...

Y10S 707/99942   Manipulating data structure...

Y10S 707/99945   Object-oriented database st...
