log in | about 
This a personal web page/blog of Leonid Boytsov. He currently works as a research scientist at M*Modal (on speech recognition).

Leonid was a graduate research assistant (aka PhD student) at the Language Technologies Institute at Carnegie Mellon University (under the supervision of Professor Eric Nyberg). In his thesis "Efficient and Accurate Non-Metric k-NN Search with Applications to Text Matching" he explored how various linguistic, neural, and lexical features can be incorporated directly into a candidate generation component (via k-NN search). He was assisting in teaching the following courses & seminars: Algorithms for NLP (11-711) in 2013, Software Engineering I (11-791) and Data Science Seminar (11-631) in 2014.

An important by-product of Leonid's research is an efficient and flexible library for k-NN search codenamed NMSLIB created in collaboration with several other folks. A brief description of this collaboration can be found on this LTI news page. Feel free to check our code on GitHub.

Leonid also co-authored an extremely efficient algorithm for light-weight compression of sorted integer numbers. We show that this algorithm can decompress at the speed of reading from memory. You can find software on GitHub. This software grew out of a now-popular library FastPFor. FastPFor has Python bindings.

Leonid likes collecting material related to search technologies and other AI-related topics (e.g., algorithms, software, interesting papers, and even historical anecdotes). Note that his opinions and views do not necessarily represent opinions of his employer, his dissertation advisor, or the Language Technologies Institute.

Featured blog posts:

Additional information: