Page Not Found
Page not found. Your pixels are in another canvas.
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Page not found. Your pixels are in another canvas.
About me
This is a page not in th emain menu
Published:
In this blog, I would like to give a quick overview of different types of matrices and their transformations (e.g., Cholesky decomposition) in stan
. These matrices include variance-covaiance matrix ($\Sigma$), correlation matrix ($R$), Cholesky factor of covariance matrix ($L$) and Cholesky factor of correlation matrix ($L_{corr}$). Before we move to Cholesky decomposition, it is good to know the relationship between variance-covaiance matrix ($\Sigma$) and correlation matrix ($R$). Simply put, we can rewrite $\Sigma$ as the product of a diagonal matrix ($\sigma$) and a correlation matrix $(R)$ in the following way. To achieve this, you can use the quad_form_diag(R, sigma)
function in stan.
Published:
I have been thinking about building a web app for simulating data with given parameters and recovering the parameters with Bayesian MCMC samplers in JavaScript. This web app can not only make the procedures more transparent, but also help us understand the magic of the Bayesian MCMC approach. More importantly, I have benefited from this simulation-based way of thinking, so I would like to promote it in my blog.
Published:
Published:
To be honest, logistic regression (LR) can be quite confusing, since it involves too many new terms, including odds, odds ratio, log odds ratio, log-odds/logit and sigmoid function. People with various backgrounds often explain LR in different ways. In statistics, LR is defined as a regression model where odds and odds ratio are first introduced and log-odds/logit transformation is explained later. But in machine learning, LR is mostly used for classification tasks (e.g., spam vs. not spam) and they (only) highlight the sigmoid activation function plus binary cross-entropy loss. They do not even explain what is odds ratio and log odds ratio in machine learning tutorials of LR.
Published:
In this blog, I will introduce a new R package chkptstanr
, which allows checkpoints in stan
. This feature is useful when you want to cache the model outputs by certain time steps or resume a long run at specific points due to an interruption. In general, this package can be considered as a wrapper for cmdstan
, and thus you need to download and install cmdstan
on your machine. Here are the step-by-step instruction on how you can deploy the environment and run the stan model via chkptstanr.
Published:
In this blog, I will provide a step-by-step instruction of how we can generate data from a mixed effect model and recover the parameters of the model with the simulated dataset. This simulation-based experiment can help us better understand the structure and generative process of the multilevel model with correlated random intercepts and slopes. To proceed, I will first illustrate the general form of mixed effect models, and generate data based on a given set of design matrices and parameters ($X,\beta, Z, b$). In the end, I will set a Bayesian model to estimate the parameters on the simulated data via stan
.
Published:
Lewandowski-Kurowicka-Joe (LKJ) distribution is a very useful prior distribution for parameter estimation in correlation matrices, and is also tightly related to matrix factorizations such as Cholesky decomposition. For example, when you use Cholesky decomposition to decompose a variance-covariance matrix ($\Sigma$) into the multiplication of 3 matrices, you can set $\text{LKJCorr}$ prior for the correlation matrix $\text{R}$.
Published:
With the development of dplyr or its umbrella package tidyverse, it becomes quite straightforward to perform operations over columns or rows in R. These column- or row-wise methods can also be directly integrated with other dplyr verbs like select
, mutate
, filter
and summarise
, making them more comparable with other functions in apply
or map
families. In this blog, I will briefly cover some useful manipulations over rows or columns.
Published:
This post provides an example of simulating data in a Multivariate Normal distribution with given parameters, and estimating the parameters based on the simulated data via Cholesky decomposition in stan
. Multivariate Normal distribution is a commonly used distribution in various regression models that generalize the Normal distribution into multidimensional space. Its PDF can be expressed as:
Published:
PyTorch has gained great popularity among industrial and scientific projects, and it provides a backend for many other packages or modules. It is also accompanied with very good documentation, tutorials, and conferences. This blog attempts to use PyTorch to fit a simple linear regression via three optimisation algorithms:
Published in Depling, 2015
Recommended citation: Jing, Yingqi & Haitao Liu, 2015. "Mean Hierarchical Distance: Augmenting Mean Dependency Distance." In: Proceedings of the Third International Conference on Dependency Linguistics. Uppsala, Sweden, August 24-26, pp. 161-170. https://aclanthology.org/W15-2119.pdf
Published in Journal of Foreign Languages, 2016
Recommended citation: Liu, Haitao & Yingqi Jing, 2016. "Quantitative Analysis of English Hierarchical Structure." Journal of Foreign Languages (in Chinese). 6, pp. 2-11. http://rdbk1.ynlib.cn:6251/Qw/Paper/620069
Published in Motifs in Language and Text, 2017
Recommended citation: Jing, Yingqi & Haitao Liu, 2017. "Quantitative Analysis of English Hierarchical Structure." In: Haitao Liu & Junying Liang (eds.) Motifs in Language and Text. Berlin/Boston: Mouton de Gruyter. pp. 133-150. https://www.degruyter.com/document/doi/10.1515/9783110476637-008/html?lang=en
Published in Physics of Life Reviews, 2017
Recommended citation: Jing, Yingqi, 2017. "Coevolution of dependency distance, hierarchical structure and word order: Comment on “Dependency distance: a new perspective on syntactic patterns in natural languages” by Haitao Liu et al. " Physics of Life Reviews. 21, pp. 228-229. https://pubmed.ncbi.nlm.nih.gov/28602719/
Published in Cognitive Science, 2021
Recommended citation: Jing, Yingqi, Paul Widmer & Balthasar Bickel. 2021. "Word Order Variation is Partially Constrained by Syntactic Complexity." Cognitive Science. 45, e13056. https://onlinelibrary.wiley.com/doi/10.1111/cogs.13056
Published in Journal of Uralic Linguistics, 2022
Recommended citation: Norvik, Miina, Yingqi Jing, Michael Dunn, Robert Forkel, Terhi Honkola, Gerson Klumpp, Richard Kowalik, Helle Metslang, Karl Pajusalu, Minerva Piha, Eva Saar, Sirkka Saarinen and Outi Vesakoski. 2022. "Uralic typology in the light of new comprehensive data sets." Journal of Uralic Linguistics. 1, pp. 4-42. https://benjamins.com/catalog/jul.00002.nor
Published in Language, 2022
Recommended citation: Jing, Yingqi, Damian Blasi & Balthasar Bickel. 2022. "Dependency Length Minimization and its Limits: a Possible Role for a Probabilistic Version of the Final-Over-Final Condition." Language. 98(3). pp. 397-418. https://muse.jhu.edu/article/864631
Published in Diachronica, 2023
Recommended citation: Jing, Yingqi, Paul Widmer & Balthasar Bickel. 2023. "Word Order Evolves at Similar Rates in Main and Subordinate Clauses." Diachronica. 40(4). pp. 544-568. https://doi.org/10.1075/dia.20035.jin
Published:
2017/9: Yingqi Jing. Exploring harmonic word order: a complex network perspective. Talk at the Workshop of Perspectives on word order, University of Zurich, Zurich, Switzerland.
Published:
2017/12: Yingqi Jing, Damian Blasi & Balthasar Bickel. Harmony and dendrophilia in syntax. Talk at the 12th Conference of the Association for Linguistics Typology (ALT), Australian National University, Canberra, Australia.
Published:
2018/4: Yingqi Jing, Damian Blasi & Balthasar Bickel. Harmony and dendrophilia in syntax. Talk at the 12th International Conference on Language Evolution (EvoLang), Nicolas Copernicus University, Toruń, Poland.
Published:
2018/8: Yingqi Jing. Diachronic evidence of harmonic word order: A complex network approach. Talk at the 51st Annual Meeting of the Societas Linguistica Europaea (SLE), University of Tallinn, Tallinn, Estonia.
Published:
2018/11: Jing Yingqi. Harmony, locality and dependency direction in syntax. Talk at the Workshop of Perspectives on Word Order Evolution: Reconstruction, Typology, and Processing, University of Zurich, Zurich, Switzerland.
Published:
2019/7: Yingqi Jing, Paul Widmer and Balthasar Bickel. Evolutionary rates and directions of word order change across main and subordinate clauses in Indo-European. Talk at the 24th International Conference on Historical Linguistics (ICHL), Australian National University, Canberra, Australia.
Published:
2019/9: Manuel Widmer, Erik van Gijn, Alena Witzlack-Makarevich, Erika Just, Yingqi Jing & Nico Neureiter. On the evolution of so-called hierarchical person-marking systems in Tupian and Sino-Tibetan. Talk at the 13th Conference of the Association for Linguistic Typology (ALT), University of Pavia, Pavia, Italy.
Published:
2019/9: Yingqi Jing, Paul Widmer and Balthasar Bickel. Challenging the penthouse: word order variation in main vs subordinate clauses. Talk at the 13th Conference of the Association for Linguistic Typology (ALT), University of Pavia, Pavia, Italy.
Published:
2020/4: Yingqi Jing, Paul Widmer and Balthasar Bickel. Evolutionary rates and directions of word order change across main and subordinate clauses in Indo-European. Talk at the 13th International Conference on Language Evolution (EvoLang), Vrije Universiteit Brussel, Brussels, Belgium. (cancelled due to pandemic)
Published:
2021/6: Miina Norvik and Yingqi Jing. Uralic Typological Database. Talk at the Workshop of Uralic Historical Atlas, University of Turku, Finland.
Published:
2021/10: Yingqi Jing. A quantitative investigation of Uralic linguistic typology. Talk at the Workshop on Uralic Typology, University of Tartu, Estonia.
Published:
2022/08: Norvik, Miina, Yingqi Jing, Michael Dunn, Helle Metslang, Karl Pajusalu and Outi Vesakoski. Uralic typology in the light of a new comprehensive dataset. Talk at the 13th International Congress for Finno-Ugric Studies, University of Vienna, Austria.
Published:
2022/09: Yingqi Jing, Norvik, Miina, Outi Vesakoski and Michael Dunn. Phylogenetic multilevel models reveal a simplicity bias in Uralic. Poster at the Joint Conference on Language Evolution, Kanazawa, Japan.
Published:
2022/12: Yingqi Jing, Joakim Nivre, and Michael Dunn. Phylogenetic evidence against dependency locality in Indo-European. Talk at the 14th Conference of the Association for Linguistic Typology (ALT), University of Texas at Austin, Texas, USA.
Published:
2022/09: Yingqi Jing, Norvik, Miina, Outi Vesakoski and Michael Dunn. Phylogenetic multilevel models reveal a simplicity bias in Uralic. Talk at the 14th Conference of the Association for Linguistic Typology (ALT), University of Texas at Austin, Texas, USA.