In this project, I have detected lane lines on the road using Python and OpenCV.
I have developed a computer vision pipeline that processes a group of test images then applied this pipeline to two test video streams.
- Load test images
- Apply grayscale transform
- Smooth the image to suppress noise and any spurious gradients
- Apply Canny edge detection algorithm
- Cut-out any edges that are out of the lane lines region (region of interest)
- Get the lines located in the region of interest using hough transform
- Fitting and extrapolating these lines for both right and left lane
- Determining the x and y points of each of the right and left lane lines
- Extrapolating or fitting these points for both lane lines
- Draw the lane lines on the original image
- Drawing the lane lines on a blank image firstly
- Weighting the original image with the lane lines image
In the next, I will be explaining each step of the pipeline.
- Ubuntu 16.04 LTS
- Python 3.6.4
- OpenCV 3.1.0
- Anaconda 4.4.10
I have loaded the test images from the test_images directory.
These images are loaded to a list test_image
which will be used for feeding the pipeline with the test images for detecting the lane liens.
images_list = os.listdir("test_images/")
test_images = []
for img in images_list:
test_images.append(mpimg.imread("test_images/"+img))
The goal of this step is to convert the image under processing to grayscale level. For many applications of image processing, color information doesn't help us identify important edges or other features. So, it is preferable in many computer vision appplications to have the image in grayscale. Specifically, for our goal here to detect lane lines in an image, it is needed to convert the image to grayscale to get the best output of the next steps of our pipeline. In the following is my function to convert to grayscale:
def grayscale(image):
"""
Description:
Applies the gray-scale transform
Parameters:
image: A color input image
Output:
A gray-scaled image
"""
return cv2.cvtColor(image, cv2.COLOR_RGB2GRAY)
Here we apply the gaussian smoothing which is essentially a way of suppressing noise and spurious gradients by averaging.
We choose the kernel_size
of Gaussian smoothing to be any odd number.
A large kernel_size
implies averaging ,or smoothing, over a large area
In the following is my function that uses the cv2.GaussianBlur
function to apply the gaussian smoothing.
def gaussian_blur(image, kernel_size = 5):
"""
Description:
Smoothes the image by applying guassian filter to the image
Parameters:
image: A gray-scaled input image
kernel_size (Default = 5): This is the window the convolves the whole image applying the filter to the image
Output:
Smoothed (Averaged or Filtered) image
"""
return cv2.GaussianBlur(image, (kernel_size, kernel_size), 0)
Rapid changes in brightnesss is that we call edges.
From Wikipedia:
The Canny edge detector is an edge detection operator that uses a multi-stage algorithm to detect a wide range of edges in images.
Process of Canny edge detection algorithm
1. Apply Gaussian filter to smooth the image in order to remove the noise
2. Find the intensity gradients of the image
3. Apply non-maximum suppression to get rid of spurious response to edge detection
4. Apply double threshold to determine potential edges
5. Track edge by hysteresis: Finalize the detection of edges by suppressing all the other edges that are weak and not connected to strong edges.
The goal of Canny Edge Detection is to identify the boundaries of an object of an image (specifiacally here, our lane lines).
You can think of it as the strength an edge as being defined by the how different the values are in adjacent pixels in the image. As our image is a mathematical function of x and y "f(x,y)" so we can perfom mathematical operations on it. So taking the derivative of this function w.r.t x and y simultaneously is called the gradient.
The canny edge detection algorithm will firstly detect strong edges (strong gradients pixels) above the high threshold and rejects pixels below the low threshold. Pixels with values between low and high thresholds will be included as long as they are connected to strong edges. The ouputs is a binary image with pixels tracing out the detected edges and black everything else.
The following is the Canny Edge Detection function that uses cv2.Canny
def canny(image, low_threshold = 50, high_threshold = 150):
"""
Description:
Canny edge detector which detects strong edges (strong gradients pixels) above the high threshold and
rejects pixels below the low threshold. Pixels with values between low and high thresholds will be included
as long as they are connected to strong edges
Parameters:
image: The input gray-scaled smoothed image
low_thresold (Default = 50): The threshold value of rejecting pixel
high_threshold (Default = 150): The threshold of strong edges in the image
Output:
A binary image with pixels tracing out the detected edges and black everything else
"""
return cv2.Canny(image, low_threshold, high_threshold)
As our goal is to identify and detect the lane lines on the road so we are not interest in other shapes, objects, ... etc in the image. So, here we cut-out and focus on the region of the lane lines will be located assuming that there is a front-camera mounted at the center of the vehicle (say it is mounted on the ceter mirror). This region of interest is defined by a polygon in which the detected edges will appear and black everywhere else.
Here is the function that defines the polygon of region of interest and masks it:
def region_of_interest(image):
"""
Descrition:
Applies an image mask.Only keeps the region of the image defined by the polygon
formed from `vertices`. The rest of the image is set to black.
Paramteres:
image: The image from Canny edge detector
Output:
An image that has the edges in the defined polygon and black everywhere else
"""
image_shape = image.shape
# The following points are the vertices points of the polygon
bottom_left = (110,image_shape[0])
up_left = (450, image_shape[0]/1.65)
bottom_right = (image_shape[1]-450, image_shape[0]/1.65)
up_right = (image_shape[1]-20,image_shape[0])
vertices = np.array([[bottom_left, up_left, bottom_right, up_right]], dtype=np.int32)
#defining a blank mask to start with
mask = np.zeros_like(image)
#defining a 3 channel or 1 channel color to fill the mask with depending on the input image
if len(image.shape) > 2:
channel_count = img.shape[2] # i.e. 3 or 4 depending on your image
ignore_mask_color = (255,) * channel_count
else:
ignore_mask_color = 255
#filling pixels inside the polygon defined by "vertices" with the fill color
cv2.fillPoly(mask, vertices, ignore_mask_color)
#returning the image only where mask pixels are nonzero
masked_image = cv2.bitwise_and(image, mask)
return masked_image
From Wikipedia:
The Hough transform is a feature extraction technique used in image analysis, computer vision, and digital image processing.[1] The purpose of the technique is to find imperfect instances of objects within a certain class of shapes by a voting procedure. This voting procedure is carried out in a parameter space, from which object candidates are obtained as local maxima in a so-called accumulator space that is explicitly constructed by the algorithm for computing the Hough transform.
Simply speaking, the hough transfrom will take an image (canny edge detectoin and region of interes applied) which has the shape of lane lines scatered along it as white dots/edges/small or long lines. The hough transform officially gives you the parameters of that line that passes through each piece of lane line (based on how you define hough parameters). Using the cv2.HoughLinesP
, it will give a list having the whole lines of that image. Each item of this list is the start and end (x,y) coordinates of that line.
Here is the function for hough tranfrom:
def hough_transform(image):
"""
Descrition:
Hough Transfrom is used to indetify lines located in the region of interest
Parameters:
image: This is the output a Canny transform and after applying the region of interest mask.
Output:
Returns a list with hough lines.
"""
rho = 1 # distance resolution in pixels of the Hough grid
theta = (np.pi/180) # angular resolution in radians of the Hough grid
threshold = 10 # minimum number of votes (intersections in Hough grid cell)
min_line_length = 1 #minimum number of pixels making up a line
max_line_gap = 1 # maximum gap in pixels between connectable line segments
lines = cv2.HoughLinesP(image, rho, theta, threshold, np.array([]), minLineLength=min_line_length, maxLineGap=max_line_gap)
return lines
Now we have the lane lines but at least one of them is segmented and can not be drawn as a complete line on the image. Here, we develop a way of extrapolating the lane lines. This is done through two steps:
- Determining the x and y points of each the right and left lane lines
- Extrapolating or fitting these points for both lane lines
In order to be able to extrapolate the line, we need firstly to determine whether this line belongs to the right lane of the left lane. To do so, we will depend on the slope of the lines detected from the hough transfrom. If the slope is positive, so this indicates a line belonging to the left lane line. On the other hand, if the slope is negative, so the line is beloning to right lane line. For both lines, we collect the x and y points to be able to extrapolate the line in the next step.
Here is the function for determining the right lane line points:
def get_right_lane_points(lines_image):
"""
Descrition:
This function calculates the slope of each line detected from hough transform.
If the slope is +ve, then this line belongs to the right lane.
Note: 0.4 is selected for better filteration of the lines ouput
Parameters:
lines_image: This is the output of hough tranfrom (a list contains the hough lines)
Output
right_lane_x_points: a list contains the all x points of the right lane
right_lane_y_points: a list contains the all y points of the right lane
"""
right_lane_x_points = []
right_lane_y_points = []
for line in lines_image:
for x1,y1,x2,y2 in line:
slope = (y2 - y1) / (x2 - x1)
if(slope > 0.4): #+ve slope -> right lane
#print("Right Lane Detected\n")
right_lane_x_points.append(x1)
right_lane_x_points.append(x2)
right_lane_y_points.append(y1)
right_lane_y_points.append(y2)
return right_lane_x_points, right_lane_y_points
And this is the function for determining the left lane line points:
def get_left_lane_points(lines_image):
"""
Descrition:
This function calculates the slope of each line detected from hough transform.
If the slope is -ve, then this line belongs to the left lane.
Note: -0.6 is selected for better filteration of the lines ouput
Parameters:
lines_image: This is the output of hough tranfrom (a list contains the hough lines)
Output
left_lane_x_points: a list contains the all x points of the left lane
left_lane_y_points: a list contains the all y points of the left lane
"""
left_lane_x_points = []
left_lane_y_points = []
for line in lines_image:
for x1,y1,x2,y2 in line:
slope = (y2 - y1) / (x2 - x1)
if(slope < -0.6): #-ve slope -> left lane
#print("Left Lane Detected\n")
left_lane_x_points.append(x1)
left_lane_x_points.append(x2)
left_lane_y_points.append(y1)
left_lane_y_points.append(y2)
return left_lane_x_points, left_lane_y_points
Now, by knowing the point of the right and left lane lines separetely, we can extrapolate these point and get the whole line passing through these points all. IF: y = mx + b THEN: x = (y - b)/m Given that: y is defined from the region of interest.
Here is the function for fitting and extrapolating the lane lines:
def fit_lane_line(lane_line_x_points, lane_line_y_points, image_shape):
"""
Descrition:
The function extrapolates the detectioned points in each lane from the line equation given that
Ymin and Ymax are defined from the region of interest.
IF: y = mx + b
THEN: x = (y - b)/m
Parameters:
lane_line_x_points: This is a list for the x points of our intended lane line
lane_line_y_points: This is a list for the y points of our intended lane line
image_shape: The height and width of the image to get the Ymin and Ymax
Output
Xmin, Ymin: The start point of our intended lane line
Xmax, Ymax: The end point of our intended lane line
"""
#Getting the slop and intersect of the right lane line
fit_lane_line = np.polyfit(lane_line_x_points, lane_line_y_points, 1)
m = fit_lane_line[0] # Slope
b = fit_lane_line[1] # Intercept
#These are the y values defined at my region of interest
Ymax = image_shape[0]
Ymin = image_shape[0]/1.65
# If equation of a line: y = m*x + b
# Hence: x = (y - b)/m
Xmax = (Ymax - b) / m
Xmin = (Ymin - b) / m
#Converting the values from float to int to be suitable for cv2.line
Ymax = int(Ymax)
Ymin = int(Ymin)
Xmax = int(Xmax)
Xmin = int(Xmin)
return Xmax, Xmin, Ymax, Ymin
In this step, we are:
- Drawing the lane lines on a blank image firstly
- Weighting the original image with the lane lines image
We draw the lane line on a blank image firstly knowing the start and end point of each line
def draw_lines(image, Xmin, Xmax, Ymin, Ymax):
"""
Descrition:
This function is used to draw the lane line on a blank image
Parameters:
Xmin, Ymin: The start point of our intended lane line
Xmax, Ymax: The end point of our intended lane line
Output:
Image of the lane line drawn on it
"""
#Drawing the lane line on the blank image
return cv2.line(image,(Xmin,Ymin),(Xmax,Ymax),(255,0,0),10)
Then we draw the lane lines on the original image
def weighted_img(image, initial_img, α=0.8, β=1., γ=0.):
"""
Descrition:
This function is used to weight (combine) the lane lines image and the original one
Parameters:
image: The image which has the lane lines
initial_img: should be the image before any processing.
Output:
image is computed as follows: initial_img * α + image * β + γ
NOTE: initial_img and img must be the same shape!
"""
return cv2.addWeighted(initial_img, α, image, β, γ)
This is our algorithm function that wraps and calls the whole parts of our pipeline. It takes a color test image and returns an image in which the lane lines on the road should be annotated and detected.
def process_pipeline(image):
"""
Descrition:
This function is save an image to the specified path with the same origianal name
Parameters:
image: image under test to detect the lane lines in it
Output:
result: The output of our pipeline. It should be the original image with the lane lines annotated.
"""
image_shape = image.shape
gray_scale_image = grayscale(image)
smoothed_gray_scale_image = gaussian_blur(gray_scale_image)
edge_detected_image = canny(smoothed_gray_scale_image)
masked_edge_detected_image = region_of_interest(edge_detected_image)
lines_image = hough_transform(masked_edge_detected_image)
right_lane_x_points , right_lane_y_points = get_right_lane_points(lines_image)
left_lane_x_points , left_lane_y_points = get_left_lane_points(lines_image)
Xright_min , Xright_max , Yright_min , Yright_max = fit_lane_line(right_lane_x_points, right_lane_y_points, image_shape)
Xleft_min , Xleft_max , Yleft_min , Yleft_max = fit_lane_line(left_lane_x_points, left_lane_y_points, image_shape)
lane_lines_image = np.copy(image)*0 # creating a blank to draw lines on
lane_lines_image = draw_lines(lane_lines_image, Xright_min , Xright_max , Yright_min , Yright_max)
lane_lines_image = draw_lines(lane_lines_image, Xleft_min , Xleft_max , Yleft_min , Yleft_max)
result = weighted_img(lane_lines_image, image)
return result
This is the computer vision algorithm pipeline for detecting lane lines on the road of an image. The algorithm is working on straight lane lines of both test images and video streams. Curve lane liens are beyond the scope of this projet.
MIT