Lingjia Tang

age ~43

from Ann Arbor, MI

Also known as:
  • Lingjia Pang
Phone and address:
5559 Great Hawk Cir, Ann Arbor, MI 48105

Lingjia Tang Phones & Addresses

  • 5559 Great Hawk Cir, Ann Arbor, MI 48105
  • South Haven, MI
  • San Diego, CA
  • Charlottesville, VA
  • Mountain View, CA

Resumes

Lingjia Tang Photo 1

Chief Technology Officer And Cofounder

view source
Location:
Ann Arbor, MI
Industry:
Computer Software
Work:
UC San Diego - La Jolla, CA since Aug 2012
Research Scientist

Department of Computer Science, University of Virginia Aug 2005 - May 2012
Ph.D

Google - Mountain View, California May 2011 - Aug 2011
Summer Internship in Datacenter Performance CSI Team

Google May 2010 - Aug 2010
Summer Internship in Compiler/Platform Group

Microsoft May 2008 - Aug 2008
Summer Internship
Education:
University of Virginia 2005 - 2011
Ph.D, Computer Science
Zhejiang University 1999 - 2003
BS, Computer Science
Skills:
Computer Science
Algorithms
C++
Computer Architecture
C
Compilers
Python
Distributed Systems
Image Processing
Latex
Research
Linux
Machine Learning
Java
Runtime Systems
Signal Processing
Datacentres
High Performance Computing
Runtimes
Matlab
General Awesomeness
Parallel Computing
Cloud and Datacenter Efficiency
Interests:
Massive Attack
Amazon
University of Virginia
Sheldon Cooper
The Big Bang Theory (Tv Series)
Radiohead (Band)
The Daily Show
Littlebigplanet
Source Code
Mad Men (Tv Series)
Google
Barack Obama
Burial (Dubstep Musician)
Languages:
English
Mandarin
Lingjia Tang Photo 2

Lingjia Tang

view source

Us Patents

  • Systems And Methods For Intelligently Configuring And Deploying A Control Structure Of A Machine Learning-Based Dialogue System

    view source
  • US Patent:
    20210241066, Aug 5, 2021
  • Filed:
    Jan 15, 2021
  • Appl. No.:
    17/149786
  • Inventors:
    - Ann Arbor MI, US
    Jason Mars - Ann Arbor MI, US
    Lingjia Tang - Ann Arbor MI, US
    Michael A. Laurenzano - Ann Arbor MI, US
    Johann Hauswald - Ann Arbor MI, US
    Yiping Kang - Ann Arbor MI, US
    Yunqi Zhang - Ann Arbor MI, US
  • International Classification:
    G06N 3/00
    G06N 20/00
    G06N 5/04
    G06F 40/30
    G06F 16/332
    G06K 9/62
  • Abstract:
    A system and method of configuring a graphical control structure for controlling a machine learning-based automated dialogue system includes configuring a root dialogue classification node that performs a dialogue intent classification task for utterance data input; configuring a plurality of distinct dialogue state classification nodes that are arranged downstream of the root dialogue classification node; configuring a graphical edge connection between the root dialogue classification node and the plurality of distinct state dialogue classification nodes that graphically connects each of the plurality of distinct state dialogue classification nodes to the root dialogue classification node, wherein (i) the root dialogue classification node, (ii) the plurality of distinct classification nodes, (iii) and the transition edge connections define a graphical dialogue system control structure that governs an active dialogue between a user and the machine learning-based automated dialogue system.
  • Systems And Methods For Automatically Detecting And Repairing Slot Errors In Machine Learning Training Data For A Machine Learning-Based Dialogue System

    view source
  • US Patent:
    20210166138, Jun 3, 2021
  • Filed:
    Jan 15, 2021
  • Appl. No.:
    17/149998
  • Inventors:
    - Ann Arbor MI, US
    Anish Mahendran - Ann Arbor MI, US
    Parker Hill - Ann Arbor MI, US
    Jonathan K. Kummerfeld - Ann Arbor MI, US
    Michael A. Laurenzano - Ann Arbor MI, US
    Lingjia Tang - Ann Arbor MI, US
    Jason Mars - Ann Arbor MI, US
  • International Classification:
    G06N 5/04
    G06N 20/00
  • Abstract:
    Systems and methods for automatically detecting annotation discrepancies in annotated training data samples and repairing the annotated training data samples for a machine learning-based automated dialogue system include evaluating a corpus of a plurality of distinct training data samples; identifying one or more of a slot span defect and a slot label defect of a target annotated slot span of a target training data sample of the corpus based on the evaluation; and automatically correcting one or more annotations of the target annotated slot span based on the identified one or more of the slot span defect and the slot label defect.
  • Systems And Methods For Constructing An Artificially Diverse Corpus Of Training Data Samples For Training A Contextually-Biased Model For A Machine Learning-Based Dialogue System

    view source
  • US Patent:
    20210004539, Jan 7, 2021
  • Filed:
    Sep 1, 2020
  • Appl. No.:
    17/009695
  • Inventors:
    - Ann Arbor MI, US
    Stefan Larson - Ann Arbor MI, US
    Christopher Clarke - Ann Arbor MI, US
    Kevin Leach - Ann Arbor MI, US
    Jonathan K. Kummerfeld - Ann Arbor MI, US
    Parker Hill - Ann Arbor MI, US
    Johann Hauswald - Ann Arbor MI, US
    Michael A. Laurenzano - Ann Arbor MI, US
    Lingjia Tang - Ann Arbor MI, US
    Jason Mars - Ann Arbor MI, US
  • International Classification:
    G06F 40/35
    G06N 5/04
    G06N 20/00
    G06F 40/284
  • Abstract:
    Systems and methods for constructing an artificially diverse corpus of training data includes evaluating a corpus of utterance-based training data samples, identifying a slot replacement candidate; deriving distinct skeleton utterances that include the slot replacement candidate, wherein deriving the distinct skeleton utterances includes replacing slots of each of the plurality of distinct utterance training samples with one of a special token and proper slot classification labels; selecting a subset of the distinct skeleton utterances; converting each of the distinct skeleton utterances of the subset back to distinct utterance training samples while still maintaining the special token at a position of the slot replacement candidate; altering a percentage of the distinct utterance training samples with a distinct randomly-generated slot token value at the position of the slot replacement candidate; and constructing the artificially diverse corpus of training samples based on a collection of the percentage of the distinct utterance training samples.
  • Systems And Methods For Automatically Detecting And Repairing Slot Errors In Machine Learning Training Data For A Machine Learning-Based Dialogue System

    view source
  • US Patent:
    20200401914, Dec 24, 2020
  • Filed:
    Jun 8, 2020
  • Appl. No.:
    16/895147
  • Inventors:
    - Ann Arbor MI, US
    Anish Mahendran - Ann Arbor MI, US
    Parker Hill - Ann Arbor MI, US
    Jonathan K. Kummerfeld - Ann Arbor MI, US
    Michael A. Laurenzano - Ann Arbor MI, US
    Lingjia Tang - Ann Arbor MI, US
    Jason Mars - Ann Arbor MI, US
  • International Classification:
    G06N 5/04
    G06N 20/00
  • Abstract:
    Systems and methods for automatically detecting annotation discrepancies in annotated training data samples and repairing the annotated training data samples for a machine learning-based automated dialogue system include evaluating a corpus of a plurality of distinct training data samples; identifying one or more of a slot span defect and a slot label defect of a target annotated slot span of a target training data sample of the corpus based on the evaluation; and automatically correcting one or more annotations of the target annotated slot span based on the identified one or more of the slot span defect and the slot label defect.
  • Systems And Methods For Automactically Categorizing Unstructured Data And Improving A Machine Learning-Based Dialogue System

    view source
  • US Patent:
    20200380964, Dec 3, 2020
  • Filed:
    May 18, 2020
  • Appl. No.:
    16/876198
  • Inventors:
    - Ann Arbor MI, US
    Michael A. Laurenzano - Ann Arbor MI, US
    Johann Hauswald - Ann Arbor MI, US
    Lingjia Tang - Ann Arbor MI, US
    Jason Mars - Ann Arbor MI, US
  • International Classification:
    G10L 15/18
    G06N 20/00
    G06N 5/04
    G10L 15/22
    G10L 15/04
  • Abstract:
    Systems and methods for building a response for a machine learning-based dialogue agent includes implementing machine learning classifiers that predict slot segments of the utterance data based on an input of the utterance data; predict a slot classification label for each of the slot segments of the utterance data; computing a semantic vector value for each of the slot segments of the utterance data; assessing the semantic vector value of the slot segments of the utterance data against a multi-dimensional vector space of structured categories of dialogue, wherein the assessment includes: for each of a distinct structured categories of dialogue computing a similarity metric value; selecting one structured category of dialogue from the distinct structured categories of dialogue based on the computed similarity metric value for each of distinct structured categories; and producing a response to the utterance data.
  • Systems And Methods For Intelligently Configuring And Deploying A Machine Learning-Based Dialogue System

    view source
  • US Patent:
    20200364410, Nov 19, 2020
  • Filed:
    Jul 30, 2020
  • Appl. No.:
    16/944124
  • Inventors:
    - Ann Arbor MI, US
    Lingjia Tang - Ann Arbor MI, US
    Michael A. Laurenzano - Ann Arbor MI, US
    Johann Hauswald - Ann Arbor, US
    Parker Hill - Ann Arbor MI, US
    Yiping Kang - Ann Arbor MI, US
    Yunqi Zhang - Ann Arbor MI, US
  • International Classification:
    G06F 40/30
    G06N 5/04
    G06N 20/00
  • Abstract:
    A system and method for intelligently configuring a machine learning-based dialogue system includes a conversational deficiency assessment of a target dialog system, wherein implementing the conversational deficiency assessment includes: (i) identifying distinct corpora of mishandled utterances based on an assessment of the distinct corpora of dialogue data; (ii) identifying candidate corpus of mishandled utterances from the distinct corpora of mishandled utterances as suitable candidates for building new dialogue competencies for the target dialogue system if candidate metrics of the candidate corpus of mishandled utterances satisfy a candidate threshold; building the new dialogue competencies for the target dialogue system for each of the candidate corpus of mishandled utterances having candidate metrics that satisfy the candidate threshold; and configuring a dialogue system control structure for the target dialogue system based on the new dialogue competencies, wherein the dialogue system control structure governs an operation of an automated dialogue agent.
  • Systems And Methods For Intelligently Curating Machine Learning Training Data And Improving Machine Learning Model Performance

    view source
  • US Patent:
    20200272855, Aug 27, 2020
  • Filed:
    Apr 30, 2020
  • Appl. No.:
    16/864120
  • Inventors:
    - Ann Arbor MI, US
    Yunqi Zhang - Ann Arbor MI, US
    Jonathan K. Kummerfeld - Ann Arbor MI, US
    Parker Hill - Ann Arbor MI, US
    Johann Hauswald - Ann Arbor MI, US
    Michael A. Laurenzano - Ann Arbor MI, US
    Lingjia Tang - Ann Arbor MI, US
    Jason Mars - Ann Arbor MI, US
  • International Classification:
    G06K 9/62
    G06N 5/04
    G06N 20/00
    G06F 16/332
  • Abstract:
    Systems and methods of intelligent formation and acquisition of machine learning training data for implementing an artificially intelligent dialogue system includes constructing a corpora of machine learning test corpus that comprise a plurality of historical queries and commands sampled from production logs of a deployed dialogue system; configuring training data sourcing parameters to source a corpora of raw machine learning training data from remote sources of machine learning training data; calculating efficacy metrics of the corpora of raw machine learning training data, wherein calculating the efficacy metrics includes calculating one or more of a coverage metric value and a diversity metric value of the corpora of raw machine learning training data; using the corpora of raw machine learning training data to train the at least one machine learning classifier if the calculated coverage metric value of the corpora of machine learning training data satisfies a minimum coverage metric threshold.
  • Systems And Methods For Machine Learning Based Multi Intent Segmentation And Classification

    view source
  • US Patent:
    20200257856, Aug 13, 2020
  • Filed:
    Feb 6, 2020
  • Appl. No.:
    16/783604
  • Inventors:
    - Ann Arbor MI, US
    Parker Hill - Ann Arbor MI, US
    Kevin Leach - Ann Arbor MI, US
    Sean Stapleton - Ann Arbor MI, US
    Jonathan K. Kummerfeld - Ann Arbor MI, US
    Johann Hauswald - Ann Arbor MI, US
    Michael A. Laurenzano - Ann Arbor MI, US
    Lingjia Tang - Ann Arbor MI, US
    Jason Mars - Ann Arbor MI, US
  • International Classification:
    G06F 40/30
    G06N 20/00
    G06N 7/00
    G06F 40/284
  • Abstract:
    Systems and methods for synthesizing training data for multi-intent utterance segmentation include identifying a first corpus of utterances comprising a plurality of distinct single-intent in-domain utterances; identifying a second corpus of utterances comprising a plurality of distinct single-intent out-of-domain utterances; identifying a third corpus comprising a plurality of distinct conjunction terms; forming a multi-intent training corpus comprising synthetic multi-intent utterances, wherein forming each distinct multi-intent utterance includes: selecting a first distinct in-domain utterance from the first corpus of utterances; probabilistically selecting one of a first out-of-domain utterance from the second corpus and a second in-domain utterance from the first corpus; probabilistically selecting or not selecting a distinct conjunction term from the third corpus; and forming a synthetic multi-intent utterance including appending the first in-domain utterance with one of the first out-of-domain utterance from the second corpus of utterances and the second in-domain utterance from the first corpus of utterances.

Googleplus

Lingjia Tang Photo 3

Lingjia Tang

Lived:
San Diego, CA, USA
Hefei, China
Hangzhou, China
Charlottesville, VA, USA
Beijing, China
Mountainview, CA, USA
Work:
University of Michigan (2013)
University of California, San Diego (2012-2013)
Relationship:
Married
About:
Ph.D, Computer Science
Lingjia Tang Photo 4

Lingjia Tang

Youtube

A4 Minutes Episode 5: Clinc co-founder Lingji...

A4 Minutes is a new series where Meredith Bruckner sits down with wome...

  • Duration:
    6m 53s

Programming-lang... Runtime Systems in Datac...

Video Chairs: David Darais, Bader AlBassam.

  • Duration:
    24m 8s

Why the best voice recognition tech is not Si...

Lingjia and her team have created voice recognition technology that is...

  • Duration:
    14m 45s

Lingjia introducing her workflow in myca

Dr. Lingjia Tang is a professor of Computer Science and an entrepreneu...

  • Duration:
    8m 53s

Protean Code: A Leap Forward in Compiler Tech...

Mike Laurenzano, Lingjia Tang, and Jason Mars present Protean Code.

  • Duration:
    2m 21s

Why you should stop using todo lists or other...

Dr. Lingjia Tang, co-founder of myca.ai is an expert on productivity. ...

  • Duration:
    1m 5s

Tang Resonating Pipa Concerto//

... ... ... 2019... 2019.07... ... ...

  • Duration:
    22m 18s

- TangYi | Live 04 Nov 2021 I IMG 0447

RELAX.

  • Duration:
    1h 37m 33s

Get Report for Lingjia Tang from Ann Arbor, MI, age ~43
Control profile