Note

Nearest Neighbor methods are a very popular ML algorithm. We show how to implement k-Nearest Neighbors, weighted k-Nearest Neighbors, and k-Nearest Neighbors with mixed distance functions. In this chapter we also show how to use the Levenshtein distance (edit distance) in TensorFlow, and use it to calculate the distance between strings. We end this chapter with showing how to use k-Nearest Neighbors for categorical prediction with the MNIST handwritten digit recognition.

引言¶

We introduce the concepts and methods needed for performing k-Nearest Neighbors in TensorFlow.

文本距离函数¶

In order to use a distance function on text, we show how to use edit distances in TensorFlow.

下载本章 Jupyter Notebook

计算混合距离函数¶

Here we implement scaling of the distance function by the standard deviation of the input feature for k-Nearest Neighbors.

下载本章 Jupyter Notebook

地址匹配¶

We use a mixed distance function to match addresses. We use numerical distance for zip codes, and string edit distance for street names. The street names are allowed to have typos.

下载本章 Jupyter Notebook

图像处理的近邻法¶

The MNIST digit image collection is a great data set for illustration of how to perform k-Nearest Neighbors for an image classification task.

下载本章 Jupyter Notebook

本章学习模块¶

tensorflow.zeros¶

Creates a tensor with all elements set to zero.

This operation returns a tensor of type dtype with shape shape and all elements set to zero.

>>> tf.zeros([3, 4], tf.int32)
<tf.Tensor: shape=(3, 4), dtype=int32, numpy=
array([[0, 0, 0, 0],
       [0, 0, 0, 0],
       [0, 0, 0, 0]], dtype=int32)>

param shape: A list of integers, a tuple of integers, or a 1-D Tensor of type int32.
param dtype: The DType of an element in the resulting Tensor.
param name: Optional string. A name for the operation.
returns: A Tensor with all elements set to zero.

tensorflow.ones¶

Creates a tensor with all elements set to one (1).

See also tf.ones_like.

This operation returns a tensor of type dtype with shape shape and all elements set to one.

>>> tf.ones([3, 4], tf.int32)
<tf.Tensor: shape=(3, 4), dtype=int32, numpy=
array([[1, 1, 1, 1],
       [1, 1, 1, 1],
       [1, 1, 1, 1]], dtype=int32)>

param shape: A list of integers, a tuple of integers, or a 1-D Tensor of type int32.
param dtype: Optional DType of an element in the resulting Tensor. Default is tf.float32.
param name: Optional string. A name for the operation.
returns: A Tensor with all elements set to one (1).

引言¶

最近邻法的使用¶

文本距离函数¶

计算混合距离函数¶

地址匹配¶

图像处理的近邻法¶

本章学习模块¶

tensorflow.zeros¶

tensorflow.ones¶