Skip to content

2011

The Genetic and Lexical Stacks

Many complex phenomena may be decomposed using a stack. For example, one might decompose contemporary scientific theory into a stack as follows: physics -- chemistry -- biology -- psychology -- sociology.

Visualizing KNN Regression

K-nearest neighbor (KNN) regression is a popular machine learning algorithm. However, without visualization, one might not be aware of some quirks that are often present in the regression. Below I give a visualization of KNN regression which show this quirkiness.

Optimizing HSK Study with MaxRank

Short Version

I've put together a PDF containing the revised HSK vocab for levels 1--6, sorted in such a way to maximize word learning rate. The list was sourced from Lingomi, with sorting applied using the MaxRank method. I have found this particular presentation of the list to be especially useful; so I put it here in hopes that others can also benefit.

Thematic Chinese Vocabulary

Learning vocabulary in thematic groups is an effective way to learn. However, as is often the case, it is challenging to find good learning materials. For thematic vocabulary, we want sources which simultaneously do the following:

  1. contain a sufficient quantity of vocabulary in the desired fields (i.e., have both breadth and depth)
  2. organize words and phrases by theme (i.e., are thematic)
  3. give some example usages (i.e., provide context)

Specifically for Chinese, I've found two excellent resources thus far.

Automated Annotation Tool

The other day I picked up my Chinese copy of Alice in Wonderland that I picked up in Beijing last year. My intention was to lay in the sun by the lake until I had finished the first page, using the dictionary as needed to achieve basic comprehension. The result was a bad sunburn and only two of four paragraphs finished. What went wrong?