Yves Lussier - New York NY, US Indra Sarkar - New York NY, US Michael Cantor - New York NY, US
International Classification:
G06F 17/00
US Classification:
707200000
Abstract:
A method for creating an amalgamated bioinformatics database from at least a first database and a second database is presented. Concepts are identified in a first field from the records of the first database. A second field from the records of the second database which has data related to the first field is also identified. A first set of concepts is identified by traversing a mediating database using terms associated with the first field and a second set of concepts is also identified by traversing the mediating database using terms associated with the second field. Either the first set of concepts or the second set of concepts, or both, is identified using non-trivial terminological mapping. The set of related concepts in the first set of concepts and the second set of concepts is identified and a record is generated in the amalgamated bioinformatics database.
System And Method For Generating An Amalgamated Database
A method for creating an amalgamated bioinformatics database from at least a first database and a second database is presented. Concepts are identified in a first field from the records of the first database. A second field from the records of the second database which has data related to the first field is also identified. A first set of concepts is identified by traversing a mediating database using terms associated with the first field and a second set of concepts is also identified by traversing the mediating database using terms associated with the second field. Either the first set of concepts or the second set of concepts, or both, is identified using non-trivial terminological mapping. The set of related concepts in the first set of concepts and the second set of concepts is identified and a record is generated in the amalgamated bioinformatics database.
Methods And Systems For Extracting Phenotypic Information From The Literature Via Natural Language Processing
Carol Friedman - New York NY, US Yves A. Lussier - Chicago IL, US Lyudmila Ena - Rego Park NY, US
Assignee:
THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK - New York NY
International Classification:
G06F 17/27
US Classification:
704 9
Abstract:
Systems and methods for extracting and encoding genotype-phenotype information from journal articles and other publications are provided. In some embodiments, the disclosed subject matter includes a preprocessor, boundary identifier, parser, phrase recognizer and an encoder to convert natural-language input text and parameters into structured text. The structured text can take the form of codes which account for genotype-phenotype information and are compatible with a controlled vocabulary.
Yves Lussier - New York NY, US Jianrong Li - Flushing NY, US
International Classification:
A01H001/00
US Classification:
800260000
Abstract:
The present invention relates to the systematic use of terminology and knowledge based technologies to enable high-throughput mapping between databases having different vocabularies. In particular embodiments, it may be used to map between a database having a phenotypic terminology descriptive of non-human animals and a database having a broad-coverage clinical (anthropocentric) terminology.
Pharmacogenomics Of Intergenic Single-Nucleotide Polymorphisms And In Silico Modeling For Precision Therapy
- Tucson AZ, US Ikbel ACHOUR - Washington DC, US Joanne BERGHOUT - Tucson AZ, US Yves A. LUSSIER - Tucson AZ, US
Assignee:
Arizona Board of Regents on Behalf of the University of Arizona - Tucson AZ
International Classification:
G16B 20/20 G16C 20/50 G16B 30/00 G16B 5/00
Abstract:
Functionally altered biological mechanisms arising from disease-associated polymorphisms remain difficult to characterize when those variants are intergenic, or, fall between genes. The present invention uses computational modelling of single-nucleotide polymorphisms (SNPs) drawn from genome-wide association studies (GWAS) or other databases to identify SNP pairs, including SNP pairs where at least one SNP is outside a protein-coding region of a gene, having convergent biological mechanisms. Additional databases, including genomic databases, biological regulatory databases, and databases related to molecular function, are used to further identify and validate the similarity of the biological mechanisms of the SNP pairs. Prioritized SNP pairs having increased similarity of biological mechanisms are then used to identify disease mechanisms, candidate therapeutic drugs, and candidate therapeutic targets among downstream effectors of intergenic SNPs.
System And Method Of Predicting Personal Therapeutic Response
A system and method of predicting the course of progression of a disease and determining a personalized therapeutic regime for treating the disease in a subject includes obtaining the subject's normal-tissue transcriptome. The normal-tissue transcriptome is statistically correlated with the subject's perturbed transcriptome to identify one or more deregulated mechanisms; and statistics derived from the identified deregulated mechanism are used to predict the course of progression of the disease and to a recommend a personalized therapeutic regime for treating the disease. The perturbed transcriptome may in some cases be determined from a cancer of the subject, and in other cases from viral infected tissue cultured from the subject.