Unknownpgr

Implementation of line sensor algorithm based on Bayes' theorem

2023-09-11 14:59:42 | English, Korean

Translated with the help of ChatGPT and Google Translator

A line sensor is a sensor used in a line tracer type robot. It is used to estimate the position of a line relative to the center of the sensor so that the robot can follow the line.

Line sensors can be implemented in various forms. Here, we deal with an algorithm that estimates the position of the line when using multiple Infrared (IR) sensors in a row, as shown in the example below.

IR0---IR1---IR2---IR3---IR4---IR5---IR6---IR7

Before discussion, let's define the position of IR0 as -1 and the position of IR7 as 1 for each IR sensor's position xnx_n. That is, let's say xn=2n/71x_n=2n/7-1.

Existing algorithm

The robot club I work in, [Zetin] (https://zetin.uos.ac.kr), estimated the position of the line using a weighted average as shown below.

Regarding the sensor value vnv_n, let's say that the sensor value is 0 when it detects a black background with nothing, and that the value is 1 when the sensor goes exactly above the white line. The value of the sensor located at the border of the line is between 0 and 1.

At this time, previously, the position of the line was estimated by giving weight to the sensor value as follows.

x^=n=07xnvnn=07vn\hat{x} = \frac{\sum_{n=0}^7 x_nv_n}{\sum_{n=0}^7 v_n}

Although this method is very simple, it can lead to greatly incorrect estimates of line position if noise is included. To improve this, I implemented an algorithm to estimate line position using Bayes' theorem.

Bayes' theorem

Bayes' theorem is as follows.

P(AB)=P(BA)P(A)P(B)P(A|B) = \frac{P(B|A)P(A)}{P(B)}

In this equation

At this time, if P(B)P(B) is an event that depends on P(A)P(A), it is organized as follows using the law of overall probability.

P(AB)=P(BA)P(A)AP(BA)P(A)P(A|B) = \frac{P(B|A)P(A)}{\sum_{A}P(B|A)P(A)}

If the distribution is a continuous probability distribution, it is expressed as follows.

P(AB)=P(BA)P(A)AP(BA)P(A)dAP(A|B) = \frac{P(B|A)P(A)}{\int_{A}P(B|A)P(A)dA}

Line estimation using Bayes’ theorem

What we want to know is the position pp of the line given the values v=(v0,v1,,v7)\vec v= (v_0, v_1, \cdots, v_7) of the 8 sensors. To do this, we will obtain the probability density function for the line position pp and select the pp value with the highest value. Since this probability density estimates the distribution of sensor positions given sensor values, it is expressed as the conditional probability P(pv)P(p|\vec v). However, since we do not know this conditional probability, we will obtain this probability distribution using Bayes' theorem of continuous probability distribution summarized earlier as shown below.

P(pv)=P(vp)P(p)pP(vp)P(p)dpP(p|\vec v) = \frac{P(\vec v|p)P(p)}{\int_{p}P(\vec v|p)P(p)dp}

In this equation, in order to determine the prior probability P(p)P(p), the distribution of sensor values must be measured. Since the line-following robot follows the line position to 0, the actual sensor value will have a distribution centered on 0. However, here, for general purpose, let us assume a uniform distribution in an appropriate range instead of this assumption. In this case, this range can be slightly larger than [1,1][-1,1], because the sensor can detect the line even if the line is slightly outside the outermost sensor. (This is also a difference from the existing algorithm. In the existing algorithm, the position of the line is always within [1,1][-1,1].)

That is, within that range, P(p)=kP(p)=k. Accordingly, the equation is transformed as follows.

P(pv)=P(vp)pP(vp)dpP(p|\vec v) = \frac{P(\vec v|p)}{\int_{p}P(\vec v|p)dp}

From this, you can immediately estimate the probability distribution by simply finding P(vp)P(\vec v|p). And since the denominator of the above equation is a constant, finding pp that maximizes the above equation is the same as finding pp that maximizes P(vp)P(\vec v|p).

As mentioned above, in Bayes' theorem, P(vp)P(\vec v|p) is called likelihood. Therefore, this method is called Maximum Likelihood Estimation (MLE).

However, the maximum likelihood method usually assumes that random variables are sampled from the same probability distribution. However, in this case, the probability distribution of values is different depending on the sensor location, so it is different from the general maximum likelihood method. However, since estimating the parameter - in this case the line position - is fundamentally the same as maximizing the likelihood, it would be safe to call this approach the maximum likelihood method.)

To find this, we will obtain the probability distribution of the sensor value according to the distance between the line and the sensor for one sensor, and then expand this to the probability distribution for the sensor value vector.

First, let's denote the probability distribution that the sensor value is vv for a sensor dd away, that is, the probability density function, as fd(v)f_d(v). At this time, the reason why it appears as a probability distribution rather than a single value is due to noise. Let us assume that these noises are independent of each other. (In reality, there are noise sources (external light, etc.) that have a similar effect on all sensors, so they will not be independent, but it is safe to assume that they are independent.) Then, the probability distribution P(vp)P(\vec v|p) for the sensor value vector. is expressed as follows:

P(vp)=nfxnp(vn)P(\vec v|p) = \prod_{n} f_{|x_n-p|}(v_n)\\

in other words,

  1. When the position of the line is given
  2. From this, the probability density function of each sensor value expected is
  3. Probability density obtained by inputting actual sensor values
  4. Expressed as a product.

To check whether this is the correct probability distribution, check whether the integral is 1. According to Pubini's theorem, it can be shown that it becomes 1 as follows.

vP(vp)dv=v1v2vnnfxnp(vn)dv1dv2dvn=nvnfxnp(vn)dvn=n1=1\begin{align} \int_v P(\vec v|p)dv &=\int_{v_1}\int_{v_2}\cdots\int_{v_n}\prod_{n} f_{|x_n-p|}(v_n)dv_1dv_2\cdots dv_n \\ &= \prod_{n} \int_{v_n} f_{|x_n-p|}(v_n)dv_n\\ &= \prod_{n} 1\\ &=1 \end{align}

Pubini's theorem is the theorem that in Riemann integral, if f(x,y)f(x,y) is continuous for a square area RR, the equation below holds true.

Rf(x,y)dA=abcdf(x,y)dxdy\int_R f(x,y)dA = \int_a^b\int_c^d f(x,y)dxdy

From this, if f(x,y)f(x,y) is expressed as the product of two functions f1(x)f2(y)f_1(x)f_2(y), the equation below is established.

Rf(x,y)dA=abcdf1(x)f2(y)dxdy=abf1(x)dxcdf2(y)dy\int_R f(x,y)dA = \int_a^b\int_c^d f_1(x)f_2(y)dxdy = \int_a^b f_1(x)dx\int_c^d f_2(y)dy

This can be easily extended to higher dimensions.

Therefore, if you substitute the obtained P(vp)P(\vec v|p) into the above equation, it is organized as follows.

P(pv)=nf(vn,xnp)pnf(vn,xnp)dpP(p|\vec v) = \frac{\prod_{n} f(v_n, |x_n-p|)}{\int_{p}\prod_{n} f(v_n, |x_n-p|)dp }

From this, what we want to know is pp that maximizes this value. At this time, the denominator of this equation is a constant, so maximizing it is the same as maximizing the numerator. Therefore, it is organized as follows.

p^=argmaxpnf(vn,xnp)\hat{p} = \arg\max_p \prod_{n} f(v_n, |x_n-p|)

The above equation is obtained by multiplying the probability density function several times, so the number may be very large or small. Therefore, log transformation is performed and summarized as follows.

p^=argmaxpnlogf(vn,xnp)\hat{p} = \arg\max_p \sum_{n} \log f(v_n, |x_n-p|)

Experiment

We implemented this in Python and experimented with it. For the experiment, the probability density function of the sensor value according to distance was defined as follows.

vN(μ,0.1)μ=max(13d,0)v\sim\mathcal{N}(\mu, 0.1)\\ \mu=\max(1-3|d|,0)

If the actual line position is set to 0.64, the sensor value is measured as follows. Alt text

However, since real sensors inevitably contain noise, normal distribution noise was added. Alt text

From this, the line position can be estimated as follows. Alt text

This shows that the new algorithm estimates the line position more accurately than the existing algorithm.

optimization

Since this uses multiple exponential and logarithmic operations, the amount of calculation may be too large to be calculated in an embedded system. And since the value may be very small, errors may occur due to floating point calculations.

At this time, this can be optimized by making the following assumptions.

  1. Assume that the probability density function of sensor values is normal distribution.
  2. The mean of the distribution is a function of line position.
  3. The standard deviation of the distribution is a constant.
  4. Sensor values are independent between sensors.

From these assumptions, let the mean of the sensor value distribution for the distance xx between the sensor and the line be μ(x)\mu(x) and the standard deviation be σ\sigma. Then, the probability density function of the sensor value is organized as follows.

f(v,x)=12πσexp((vμ(x))22σ2)f(v,x) = \frac{1}{\sqrt{2\pi}\sigma}\exp\left(-\frac{(v-\mu(x))^2}{2\sigma^2} \right)

Next, let the location of each sensor be pnp_n and the measured sensor value be vnv_n. Then p(vx)p(v|x) is organized as follows.

p(vx)=nf(vn,xpn)=n12πσexp((vnμ(xpn))22σ2)\begin{align} p(v|x) &= \prod_{n} f(v_n, |x-p_n|)\\ &= \prod_{n} \frac{1}{\sqrt{2\pi}\sigma}\exp\left(-\frac{(v_n-\mu(|x-p_n|))^2}{2 \sigma^2}\right) \end{align}

Therefore, the log likelihood obtained by taking the logarithm of this equation is summarized as follows.

logp(vx)=nlog12πσexp((vnμ(xpn))22σ2)=n(log12πσ(vnμ(xpn))22σ2)=n(log2πσ+(vnμ(xpn))22σ2)=nlog2πσ12σ2n(vnμ(xpn))2\begin{align} \log p(v|x) &= \sum_{n} \log \frac{1}{\sqrt{2\pi}\sigma}\exp\left(-\frac{(v_n-\mu(|x -p_n|))^2}{2\sigma^2}\right)\\ &= \sum_{n} \left(\log \frac{1}{\sqrt{2\pi}\sigma}-\frac{(v_n-\mu(|x-p_n|))^2}{2 \sigma^2}\right)\\ &= -\sum_{n} \left(\log \sqrt{2\pi}\sigma+\frac{(v_n-\mu(|x-p_n|))^2}{2\sigma^2}\right )\\ &= -n\log \sqrt{2\pi}\sigma-\frac{1}{2\sigma^2}\sum_{n} (v_n-\mu(|x-p_n|))^2 \end{align}

However, what we want to find is xx that maximizes this equation. Therefore, if we ignore the constant term and constant product, it can be summarized as follows. (Note that argmax is changed to argmin by changing the sign.)

x^=argminxn(vnμ(xpn))2\begin{align} \hat{x} &= \arg\min_x \sum_{n} (v_n-\mu(|x-p_n|))^2\\ \end{align}

From this, we can confirm that this equation is independent of the standard deviation of the sensor.

To summarize, it can be applied to actual robots as follows. First, perform the following in the tuning step:

  1. Measure the sensor value vv by changing the distance dd between the sensor and the line. At this time, measurements are made by applying various noises, such as providing external light at one location or slightly changing the angle of the line.
  2. Find the function μ(d)\mu(d) that represents the average of vv according to dd. The best thing is to obtain a physical model based on the distance between the sensor and the line and then tune the parameters according to the actual measured values. However, it would also be okay to simply linearly interpolate the actual measurements or curve fit an appropriate curve.

From this, the inference step performs the following:

  1. Find the measured sensor value vnv_n.
  2. Iterate through the possible xx values and calculate xx that maximizes n(vnμ(xpn))2\sum_{n} (v_n-\mu(|x-p_n|))^2. (pnp_n is the location of the sensor.)

Below is an implementation of this in Python.

import numpy as np

def mu(ds):
     ds = 1-np.abs(ds)*3
     ds = np.maximum(ds,0)
     return ds

def optimized(values,positions,mu,xs):
     n = len(values)
     result = np.zeros(len(xs))
     for i in range(n):
         result += (values[i]-mu(xs-positions[i]))**2
     return result

vs = [ ... ] # measured sensor values
ps = np.linspace(-1,1,8) # sensor positions
xs = np.linspace(-1,1,300) # candidate positions
ys = optimized(vs,ps,mu,xs) # log likelihood
x_hat = xs[np.argmin(ys)] # estimated position

Below is a visualization of the results of the above code. Likewise, the solid red line represents the actual line position, and the green dotted line represents the line position estimated from this. (The sensor values used are different from above.) Alt text

conclusion


- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -