An enterprise data processing module and method are described herein. The enterprise data processing module comprises at least one collector and at least one analyzer. The collectors may be operable to collect data pieces from a plurality of data sources. The analyzers may be operable to analyze the collected data pieces to determine cross-source relationships that exist between the data pieces collected from the plurality of sources. The analyzed data pieces may be stored in one or more big-data databases as blocks of data according to the cross-source relationships.
Inventors:
- Los Angeles CA, US
John Petrocik - Irvine CA, US
Alan Chaney - Simi Valley CA, US
Nirmisha Bollampalli - Irvine CA, US
Andrey Mogilev - Novosibirsk, RU
Kevin Watters - Boston MA, US
A computer-implemented method performed at a server system having one or more processors and memory, the method comprising: receiving a set of curated documents comprising one or more documents identified as being relevant to a sector; analyzing the set of curated documents to determine one or more words and a count of each of the one or more words across all documents of the set of curated documents; further analyzing the set of curated documents by analyzing one or more n-grams based on the one or more words; determining a first score based on a term frequency and a global document frequency of each of the one or more words of each of the one or more n-grams; determining a document vector based on averages of the first score, where the document vector comprises a perfect document for the sector; and storing the document vector in a data store.
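The claimed method is essentially a TF-IDF-style pipeline: count terms across the curated set, score each term by combining its term frequency with a global document frequency, and average the per-document scores into a single vector that stands in for the sector's "perfect document." Below is a minimal Python sketch of that flow; it scores single words only (the claim also covers n-grams built from those words), and the function and variable names are illustrative, not taken from the filing.

```python
import math
from collections import Counter

def tokenize(text):
    # Naive lowercase/whitespace tokenizer; the claim does not specify one.
    return text.lower().split()

def build_sector_vector(curated_docs):
    """Return a {term: score} vector averaged over a sector's curated documents."""
    doc_tokens = [tokenize(doc) for doc in curated_docs]
    n_docs = len(doc_tokens)

    # Global document frequency: in how many curated documents each term appears.
    doc_freq = Counter()
    for tokens in doc_tokens:
        doc_freq.update(set(tokens))

    # First score per document: term frequency weighted by inverse global document frequency.
    totals = Counter()
    for tokens in doc_tokens:
        counts = Counter(tokens)
        for term, count in counts.items():
            tf = count / len(tokens)
            idf = math.log(n_docs / doc_freq[term]) + 1.0
            totals[term] += tf * idf

    # Average the scores to obtain the sector's "perfect document" vector.
    return {term: score / n_docs for term, score in totals.items()}

# Usage with made-up curated documents for a hypothetical "logistics" sector.
sector_docs = [
    "freight routing and warehouse automation",
    "last mile delivery routing optimization",
    "warehouse inventory and freight scheduling",
]
top_terms = sorted(build_sector_vector(sector_docs).items(), key=lambda kv: -kv[1])[:5]
print(top_terms)
```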
Inventors:
- Los Angeles CA, US
Alan Chaney - Los Angeles CA, US
Clay Cover - Los Angeles CA, US
International Classification:
G06F 17/30, G06F 7/08
Abstract:
A data relationships storage platform for analysis of one or more data sources is described herein. A data processing system may be communicatively coupled to one or more data sources and one or more big-data databases. One or more collectors may collect data pieces from the one or more data sources. One or more analyzers may analyze the collected data pieces to determine whether one or more relationships exist between the collected data pieces. The analysis results in one or more data globs that include one or more of the data pieces and relationship information, such as tags. The tagged data globs may be communicated to and stored in one or more big-data databases.
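The abstract describes analyzers that bundle related data pieces into "data globs" carrying relationship information such as tags before the globs are sent to a big-data database. The sketch below illustrates one way that grouping-and-tagging step could look; the DataPiece/DataGlob names, the shared-key matching rule, and the in-memory stand-in for the big-data store are all assumptions, not details from the filing.

```python
from dataclasses import dataclass, field

@dataclass
class DataPiece:
    source: str    # which data source the piece was collected from
    key: str       # value used to relate pieces across sources (e.g. a customer id)
    payload: dict

@dataclass
class DataGlob:
    tags: set = field(default_factory=set)     # relationship information
    pieces: list = field(default_factory=list)

def analyze(pieces):
    """Group collected pieces whose keys match and tag the resulting globs."""
    globs = {}
    for piece in pieces:
        glob = globs.setdefault(piece.key, DataGlob())
        glob.pieces.append(piece)
        glob.tags.add(f"source:{piece.source}")
        if len({p.source for p in glob.pieces}) > 1:
            glob.tags.add("cross-source")
    return globs

# Stand-in for communicating tagged globs to a big-data database.
big_data_store = {}

pieces = [
    DataPiece("crm", "cust-42", {"name": "Acme"}),
    DataPiece("billing", "cust-42", {"balance": 120.0}),
    DataPiece("crm", "cust-7", {"name": "Globex"}),
]
for key, glob in analyze(pieces).items():
    big_data_store[key] = glob
print(big_data_store["cust-42"].tags)
```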
Inventors:
- Los Angeles CA, US
Alan Chaney - Los Angeles CA, US
Clay Cover - Los Angeles CA, US
International Classification:
G06F 17/30
Abstract:
An enterprise data processing module and method are described herein. The enterprise data processing module comprises at least one collector and at least one analyzer. The collectors may be operable to collect data pieces from a plurality of data sources. The analyzers may be operable to analyze the collected data pieces to determine cross-source relationships that exist between the data pieces collected from the plurality of sources. The analyzed data pieces may be stored in one or more big-data databases as blocks of data according to the cross-source relationships.
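This abstract covers the collector/analyzer module itself: collectors gather pieces from several kinds of sources, and the analyzed pieces are stored as blocks of data grouped by their cross-source relationships. A minimal sketch of the collector and block-grouping side follows, with invented sample data; the collector functions and the shared-identifier grouping rule are assumptions for illustration, not details from the filing.

```python
import csv
import io
import json

def collect_from_csv(source_name, csv_text, key_field):
    """Collector for a CSV-style source: yields uniformly shaped data pieces."""
    for row in csv.DictReader(io.StringIO(csv_text)):
        yield {"source": source_name, "key": row[key_field], "payload": row}

def collect_from_json(source_name, json_text, key_field):
    """Collector for a JSON-array source."""
    for record in json.loads(json_text):
        yield {"source": source_name, "key": record[key_field], "payload": record}

def store_as_blocks(pieces):
    """Group analyzed pieces into blocks keyed by the cross-source relationship (shared key)."""
    blocks = {}
    for piece in pieces:
        blocks.setdefault(piece["key"], []).append(piece)
    return blocks

# Two heterogeneous sources describing overlapping entities.
crm_csv = "id,name\ncust-42,Acme\ncust-7,Globex\n"
billing_json = '[{"id": "cust-42", "balance": 120.0}]'

pieces = list(collect_from_csv("crm", crm_csv, "id")) + \
         list(collect_from_json("billing", billing_json, "id"))
blocks = store_as_blocks(pieces)
print(len(blocks["cust-42"]))  # two pieces related across the CRM and billing sources
```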
Google+
Alan Chaney
Education:
Who Cares? - Various
About:
I loathe and detest the invasiveness of social media.
Bragging Rights:
Managed to not use social media and still enjoy life.