Mengmeng Chen - Belmont CA, US Yali Zhu - Foster City CA, US Thierry Cruanes - Foster City CA, US Alan Wood - San Jose CA, US
Assignee:
Oracle International Corporation - Redwood Shores CA
International Classification:
G06F 17/30
US Classification:
707687, 7079992
Abstract:
Parallel execution of SQL statements is made fault-tolerant. Specifically, during parallel query processing, a new row source is created that allows checkpoints to be stored at certain points in the query processing. If a failure occurs, then the query can be restarted from the most recent checkpoint (rather than from the very beginning of the query). The checkpoints are implemented at operation boundaries where rows need to be redistributed among processes. These are natural checkpoint locations because (1) each subsequent operation can proceed from the output of the previous operation, which is stored as a checkpoint, and (2) these operation completions tend to be points in the computation where the query state is minimized, thus minimizing checkpoint storage requirements.
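The restart-from-checkpoint idea in this abstract can be illustrated with a small sketch. It is not the patented row source: the stage list, the in-memory `store` dictionary, and the `fail_at` switch are hypothetical names introduced only for illustration, and a real system would persist checkpoints to durable storage.

```python
from typing import Callable, Dict, List, Optional

Rows = List[dict]
Stage = Callable[[Rows], Rows]


def run_with_checkpoints(
    stages: List[Stage],
    source: Rows,
    store: Dict[int, Rows],
    fail_at: Optional[int] = None,
) -> Rows:
    """Run stages in order, saving each stage's output as a checkpoint.

    `store` maps a stage index to that stage's saved output. On a rerun,
    execution resumes after the most recent checkpoint found in `store`.
    `fail_at` simulates a failure just before the given stage (assumption).
    """
    start, rows = 0, source
    # Look for the latest stage whose output was already checkpointed.
    for i in range(len(stages) - 1, -1, -1):
        if i in store:
            start, rows = i + 1, store[i]
            break
    for i in range(start, len(stages)):
        if fail_at == i:
            raise RuntimeError(f"simulated failure in stage {i}")
        rows = stages[i](rows)
        store[i] = list(rows)  # checkpoint at the operation boundary
    return rows
```

Calling the function once with `fail_at` set and then again with the same `store` reruns only the stages after the last saved output, which mirrors the abstract's point that each operation can proceed from the stored output of the previous one.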
Statement Based Migration For Adaptively Building And Updating A Column Store Database From A Row Store Database Based On Query Demands Using Disparate Database Systems
- Plano TX, US Mengmeng CHEN - San Jose CA, US Aniket ADNAIK - Sunnyvale CA, US Chi Yong KU - San Ramon CA, US
International Classification:
G06F 17/30
Abstract:
A method for updating a column store database includes establishing a row store database, wherein each row comprises a plurality of attributes. The method includes establishing a column store database including attribute vectors corresponding to at least one attribute in the row store, wherein each attribute vector includes data used to satisfy at least one of previously received analytic queries. The method includes collecting a plurality of SQL change statements beginning from a synchronization point indicating when the row store database and the column store database are synchronized, and continuing until an analytic query is received. The method includes sending the plurality of SQL change statements to the column store database upon receipt of the analytic query for updating the column store database for purposes of satisfying the query, wherein the analytic query is directed to a queried range of primary key attributes in the plurality of attributes.
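A minimal sketch of the buffering behavior described above, under loud assumptions: two in-memory SQLite databases stand in for the row store and the column store, the class name `StatementBuffer` and the table `t` are hypothetical, and the sketch replays every buffered statement instead of restricting the update to the queried primary-key range.

```python
import sqlite3


class StatementBuffer:
    """Collect SQL change statements and replay them when a query arrives."""

    def __init__(self):
        self.row_store = sqlite3.connect(":memory:")
        # Stand-in for a column store (assumption: plain SQLite, not columnar).
        self.col_store = sqlite3.connect(":memory:")
        for db in (self.row_store, self.col_store):
            db.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, amount INTEGER)")
        self.pending = []  # statements collected since the last sync point

    def change(self, sql, params=()):
        """Apply a change to the row store and remember the statement."""
        self.row_store.execute(sql, params)
        self.pending.append((sql, params))

    def analytic_query(self, sql, params=()):
        """Send buffered statements to the column store, then run the query."""
        for stmt, p in self.pending:
            self.col_store.execute(stmt, p)
        self.pending.clear()  # both stores are synchronized again
        return self.col_store.execute(sql, params).fetchall()
```

The design choice illustrated is laziness: the column store does no work for changes until an analytic query actually needs current data.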
Method And System For Adaptively Building A Column Store Database From A Temporal Row Store Database Based On Query Demands
- Plano TX, US Guangyu SHI - Cupertino CA, US Masood MORTAZAVI - San Jose CA, US Chi Yong KU - San Ramon CA, US Mengmeng CHEN - San Jose CA, US
International Classification:
G06F 17/30
Abstract:
A method for dynamically building a column store database from a row store database. The method includes establishing the row store database for storing data, wherein each row includes a plurality of attributes, and wherein data in the row store database is current to a temporal point in time. The method includes establishing the column store database including data structured to satisfy received analytic queries. The method includes, beginning from an initial state of the column store database, for each subsequently received analytic query, importing a targeted amount of data from a corresponding temporal state of the row store database into the column store database to satisfy the corresponding subsequently received analytic query.
System And Method For Column-Specific Materialization Scheduling
- Plano TX, US Ron-Chung Hu - Palo Alto CA, US Mengmeng Chen - San Jose CA, US
International Classification:
G06F 17/30
Abstract:
A method of dynamically computing an optimal materialization schedule for each column in a column oriented RDBMS. Dynamic column-specific materialization scheduling in a distributed column oriented RDBMS is optimized by choosing a materialization strategy based on execution cost including central processing unit (CPU), disk, and network costs for each individual exchange operator. The dynamic programming approach is computationally feasible because the optimal schedule for a sub-plan is path independent.
Pipelined Re-Shuffling For Distributed Column Store
- Plano TX, US Mengmeng Chen - San Jose CA, US Ron-Chung Hu - Palo Alto CA, US Masood Mortazavi - San Jose CA, US Fang Cao - San Jose CA, US
International Classification:
G06F 17/30
Abstract:
A method of pipelining re-shuffled data of a distributed column oriented relational database management system (RDBMS). A request is received from a consumer process that requires RDBMS column data to be shuffled in a specific order according to an order that each of a plurality of columns will be used by the consumer process. For each of the plurality of columns, the method re-shuffles the RDBMS column data according to the specific order to form re-shuffled RDBMS column data, and sends the re-shuffled RDBMS column data to the consumer process.
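One way to picture the pipelining described above is a generator that re-shuffles and hands over one column at a time, in the order the consumer will use the columns. This is only a sketch under assumptions: the function name, the in-memory dictionary of columns, and the `row_order` permutation (standing in for whatever re-shuffle the consumer needs) are all hypothetical.

```python
from typing import Dict, Iterator, List, Tuple


def reshuffle_pipeline(
    columns: Dict[str, list],
    use_order: List[str],
    row_order: List[int],
) -> Iterator[Tuple[str, list]]:
    """Yield re-shuffled columns one by one, in the consumer's usage order.

    columns:   column name -> column values (hypothetical in-memory layout)
    use_order: the order in which the consumer will use the columns
    row_order: target position -> source row index (assumed re-shuffle)
    """
    for name in use_order:
        values = columns[name]
        # Each column is sent as soon as it is re-shuffled, so the consumer
        # can start on the first column while later ones are still pending.
        yield name, [values[i] for i in row_order]
```

Because the generator is lazy, a consumer that iterates over it receives the first column before any work is done on the remaining ones.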
Managing Lock Or Latch In Concurrent Execution Of Queries
- Plano TX, US Mengmeng Chen - San Jose CA, US Aniket Adnaik - Sunnyvale CA, US
Assignee:
Futurewei Technologies Inc. - Plano TX
International Classification:
G06F 17/30 G06F 9/46
Abstract:
Various disclosed embodiments include methods and systems for managing lock or latch chains in concurrent execution of database queries. A method includes receiving a plurality of transactions, each transaction associated with one or more queuing requests. The method includes, for each transaction, determining one or more partition sets. Each partition set corresponds to one or more database partitions needed for the transaction. The one or more database partitions are included within a partitioned database. The method includes, for each transaction, determining one or more queues needed for the transaction and storing a bitmap representation of the one or more queues needed for the transaction. The one or more queues needed for the transaction correspond to the one or more database partitions needed for the transaction.
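The bitmap representation mentioned above can be sketched as one bit per queue. The partition-to-queue mapping and the overlap check below are assumptions for illustration; the abstract states only that a bitmap of the needed queues is stored, not how it is later used.

```python
from typing import Dict, Iterable


def queue_bitmap(partitions_needed: Iterable[int],
                 partition_to_queue: Dict[int, int]) -> int:
    """Return an integer whose set bits mark the queues a transaction needs."""
    bitmap = 0
    for partition in partitions_needed:
        bitmap |= 1 << partition_to_queue[partition]
    return bitmap


def shares_queue(bitmap_a: int, bitmap_b: int) -> bool:
    """True if two transactions need at least one queue in common.

    Assumed use of the bitmaps: a single AND replaces a set comparison.
    """
    return (bitmap_a & bitmap_b) != 0
```

With one queue per partition, a transaction touching partitions 0 and 2 gets the bitmap `0b0101`, and comparing two transactions costs a single bitwise operation.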
Mechanism For Optimizing Parallel Execution Of Queries On Symmetric Resources
- Plano TX, US Masood Mortazavi - Santa Clara CA, US Mengmeng Chen - San Jose CA, US Fang Cao - San Jose CA, US
International Classification:
G06F 17/30
US Classification:
707765
Abstract:
A method that comprises receiving a logical execution plan for a database query corresponding to a plurality of tables of the database, wherein the logical execution plan comprises one or more operators, receiving an operator cost for each of the operators in the logical execution plan, computing a first accumulated processing cost for a first of the tables based on the logical execution plan, operator selectivity, and operator costs corresponding to the first table, computing a second accumulated processing cost for a second of the tables based on the logical execution plan, operator selectivity, and operator costs corresponding to the second table, comparing the first accumulated processing cost and the second accumulated processing cost to determine a table with the highest accumulated processing cost, and responsive to comparing the accumulated processing costs, computing a physical execution plan that requires partitioning the table with the highest accumulated processing cost.
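The comparison step in this abstract can be sketched with a toy cost model. The model is an assumption, not the claimed formula: each operator is given as a hypothetical `(cost_per_row, selectivity)` pair, the cost of an operator is its per-row cost times the rows reaching it, and selectivity shrinks the row count for the next operator.

```python
from typing import Dict, List, Tuple

Operator = Tuple[float, float]  # (cost_per_row, selectivity), both assumed


def accumulated_cost(row_count: int, operators: List[Operator]) -> float:
    """Sum operator costs along one table's path through the logical plan."""
    rows = float(row_count)
    total = 0.0
    for cost_per_row, selectivity in operators:
        total += rows * cost_per_row  # cost of processing the incoming rows
        rows *= selectivity           # rows surviving to the next operator
    return total


def table_to_partition(plans: Dict[str, Tuple[int, List[Operator]]]) -> str:
    """Pick the table with the highest accumulated processing cost."""
    costs = {name: accumulated_cost(n, ops) for name, (n, ops) in plans.items()}
    return max(costs, key=costs.get)
```

For example, a 1000-row table passing through operators `(1.0, 0.5)` and `(2.0, 1.0)` accumulates 1000 + 500 × 2 = 2000 cost units under this model, so it would be chosen for partitioning over a 100-row table with a single `(1.0, 0.5)` operator.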
Boston College 2008 - 2009
Masters, Accounting
Fudan University 2004 - 2008
Bachelors, Bachelor of Arts, Economics
Skills:
Accounting Auditing Project Management Sarbanes-Oxley Act US GAAP Reporting Internal Controls SEC Filings Revenue Recognition Financial Analysis Certified Public Accountant Bilingual in English and Chinese Internal Control and SOX Regulations Technical Accounting Research Analytical and Written Skills Financial Accounting Financial Reporting CPA Financial Modeling US GAAP Management Microsoft Excel
Languages:
English Mandarin
Certifications:
Certified Public Accountant (CPA) California Board of Accountancy
Huawei since Feb 2012
Staff Software Research Engineer
Oracle Jun 2009 - Feb 2012
Software Engineer
Microsoft Corporation Jun 2008 - Aug 2008
Program Manager Intern
Wisconsin Group at CERN Oct 2006 - Dec 2007
Software Engineer
Education:
University of Wisconsin-Madison 2008 - 2009
Master, Computer Science
Tsinghua University 2003 - 2006
Master, Embedded System Application
Tsinghua University 1999 - 2003
Bachelor, Engineering Physics
Deloitte Consulting - Greater Minneapolis-St. Paul Area since Feb 2012
Human Capital Analyst
iJoin Social Innovation Consulting - Shanghai City, China Dec 2012 - Dec 2012
Project Manager
iJoin Social Innovation Consulting - Beijing City, China Dec 2011 - Dec 2011
Project Consultant
Deloitte Consulting May 2011 - Aug 2011
Human Capital Summer Scholar
UW-Math Department Sep 2009 - May 2011
Researcher
Education:
University of Wisconsin-Madison 2008 - 2012
Bachelor of Science, Mathematics with application in Actuarial Science, Economics, International Studies.
Skills:
Chinese Spanish Matlab Research Finance Microsoft Excel Program Management Statistics Leadership Economics Management Consulting
Interests:
Management consulting, Actuarial Science, Mathematics, Leadership Experience, Singing, Acting, Reading, Cooking, Travel and Adventures (skydiving, trekking to Mt. Everest Base Camp at an altitude of 17,400 ft)
Honor & Awards:
AP Scholar with Distinction
Dean’s list
First place in the state of Wisconsin in AIME (Math Competition)
First place in Wisconsin DECA competition in accounting series