About
I am a research scientist at Google DeepMind, working on large pretrained models.
Previously I have studied a wide range of topics such as robotic soccer, computer system diagnosis, product search, and question answering.
I graduated from the Language Technologies Institute, School of Computer Science at Carnegie Mellon University. My thesis was advised by professor William W. Cohen. I worked at Google and Apple on language understanding and question answering, and was chief scientist at SayMosaic.
I TAed Machine Learning with Large Datasets (2012) and Machine Learning (2010). Here is my collection of interesting stuff.
Publications most recent first
Earlier publications — scroll for the full list
Patents 7
Manuscripts & Presentations
Ni Lao, Weakly Supervised Natural Language Understanding, AIFrontiers tutorial, 2018
Ni Lao, Do Androids Dream of Great Success?, 2018
Ni Lao, Neural Symbolic Language Understanding, 2017
Ni Lao, Text Generation Survey, 2017
Ni Lao, Xipeng Qiu, Knowledge Acquisition, 2017
Ni Lao, NIPS 2016 Overview, 2016
Ni Lao, Neural Symbolic Machines, 2016
Ni Lao, Elephant and AI, LTI Colloquium Report, Spring 2012
Ni Lao, Programming by Demonstrations and Verbal Commands, LTI Colloquium Report, Spring 2012
Ni Lao, Beyond Shallow Semantics, LTI Colloquium Report, Fall 2011
Ni Lao, CCG, Fractal, and Emergence, LTI Colloquium Report, Spring 2011
Ni Lao, Reinforcement Learning In An Unknown Domain (slides), 2011
Ni Lao, Probabilistic Ontology Model, LTI Colloquium Report, Fall 2010
Ni Lao, Split-Emit Process for Natural Language Generation, Advanced NLP seminar, 2009
Ni Lao, Jun Zhu, Contrastive Feature Induction for Efficient Structure Learning of Conditional Random Fields, 2009
Ni Lao, T. Mitamura, E. Nyberg, Tree Representations for Chinese Semantic Role Labeling, 2009
Ni Lao, Read The Web (slides), Advanced IR seminar, 2007
Ni Lao, Schema Extraction Model, Advanced IR seminar, 2007
Ni Lao, Knowledge Acquisition From Text — A Survey, Statistical NLP class, 2007
Thesis
PhD thesis, 2012. Efficient Random Walk Inference with Knowledge Bases (slides). Carnegie Mellon University
Master thesis, 2006. Data Mining Problems in Automatic Computer Diagnosis. Tsinghua University
Bachelor thesis, 2003. Mining Spatial-Temporal Data Using Constructive Induction. Tsinghua University
Code & Data
Code
2012 — Path Ranking Algorithm, a system for relational retrieval on heterogeneous graphs (github)
2006 — geoSVM, a predictive system for modeling species potential distributions based on SVM. See Wenyun's page
Data Sets
2012 — NELL v165, NELL knowledge graph in both triple format and PRA format
2010 — yeast2, updated yeast data with extra information about Mesh headings, chemicals and affiliations (321K entities, 6.1M links)
2010 — fly, a biological literature graph with 770K entities and 3.5M links
2010 — yeast, a biological literature graph with 164K entities and 2.8M links
Academic Services
2023: ACL*, EMNLP*, ICML, IJGIS, Neurips*, TALLIP, Computers & Security (*Area chair for large language models and reasoning)
2022: ACL, CoNLL, EMNLP, ICLR, KDD, Neurips, TGIS
2021: ACL, AAAI, CoNLL, EMNLP, ICLR, NAACL, SIGIR, TALLIP, GeoAI, NLP4ConvAI
2020: ACL, AAAI, COLING, CoNLL, EACL, EMNLP, ICLR, ICML, IJCAI, Neurips, SIGIR, TALLIP, TKDE
I co-organized the Deep RL Meets Structured Prediction workshop, 2019 — ICLR page, homepage, intro slides
2019: ACL, AAAI, CCL, CoNLL, EMNLP, ICLR, IJCAI, NAACL, SIGIR, TKDE
2018: ACL, CCKS, COLING, EMNLP, NAACL, NLPCC, NIPS, SIGIR
2017: ACL, CCKS, EMNLP, IJCAI, IJCNLP, SIGIR, TKDE, WSDM, Google Research Grants
2016: CIKM, COLING, IJCAI, NAACL, TKDE, WWW, Google Research Grants
2015: CIKM, ICML, IJCAI, MLJ, NIPS, TKDE
Since 2012 I have been the manager of the Machine Learning News Google Group, which seems to be quite popular in academia.
Interesting Stuff
At the age of eleven, I began Euclid, with my brother as my tutor. This was one of the great events of my life, as dazzling as first love.— Bertrand Russell
Books & Resources
Benoit Mandelbrot, The Fractal Geometry of Nature
L. S. Stavrianos, The World to 1500: A Global History
Dale Purves et al., Neuroscience (textbook)
A few intriguing facts about the retina, extracted from Masland RH (2001), The Fundamental Plan of the Retina, Nat. Neurosci. 4(9): 877–86
Edmund Rolls, Cerebral Cortex
Matt Mahoney, Data Compression Explained (tutorial)
Machine Learning class materials — Spring 2010
Homeworks and recitations I designed for the machine learning class.
HW1 · Decision Trees and Information Theory · solution
HW2 · Multiclass Classification · solution
HW3 · Linear Regression and Bias-Variance Trade-off · solution
HW4 · Learning Theory · solution
Recitation · Linear Algebra and Matlab
Recitation · Expectation-Maximization
Recitation · Computational Learning Theory
Recitation · Boosting