Optimizing the bike infrastructure system in SF using Machine Learning
City Bikes
Bike infrastructure system offer a method to rent bikes by utilizing an automated system of membership, rental, and return at kiosk locations across San Francisco. The primary issue with it is the expensive repair and maintenance of these stations. As a result, the service hubs often don't fully utilize their potential. Among the major inefficiencies of urban bike systems is the inability to satisfy increasing demand or properly capitalize on the stations' maximum capacity
Our project deals with optimizing the bike sharing systems in the city of SF to help distribute the resources effectively and efficiently. My role on the project was to clearly define the problem space, help in building the ML model and create the data visualization in Streamlit App for the project
Timeline
Oct-Dec 2021
Team
Samarth Gowda
Kevin Chian
Toolkit & techniques
Python
Pytorch
TensorFlow
Numpy
Streamlit
Deliverables
Streamlit
Final Movie
Github Repo
TL;DR
Bike service stations do not end up using to their full potential
Our internet research study informed us that the bike stations in SF have varying demands with certain stations popular than the other resulting in inefficient systems. We wanted to solve this problem by examining the concealed data layers within daily cyclists travel. To achieve this, various connections are identified, and essential EDA is performed to comprehend urban cyclist patterns
![](https://framerusercontent.com/images/2ZV8bEXDxzfPkZ77yidc5rgRA.jpg)
Bike station - Image credits - Flicker.com
Defining the problem space
How can we improve the existing bike-ride system efficiency in San Francisco to decongest the city ?
One of the biggest inefficiencies of the city bike system is either not being able to meet the growing demand or not utilizing the stations to their full potential. This demand is a hefty variable and it is almost impossible to know what will be the demand on a particular day
SOLUTION
Predict the usage and analyze the underutilized stations using Regression
Bike sharing systems function as a sensor network, which can be used for studying mobility in a city. Apart from interesting real-world applications of bike sharing systems, the characteristics of data being generated by these systems make them attractive for the research. Opposed to other transport services such as bus or subway, the duration of travel, departure, and arrival position is explicitly recorded in these systems. This feature turns the bike sharing system into a virtual sensor network that can be used for sensing mobility in the city. Hence, it is expected that most of the important events in the city could be detected via monitoring these data.
Final Dashboard on Streamlit showcasing live data visualizations
Process
Dashboards & Visualizations
![](https://framerusercontent.com/images/Kxbh4g41tSbAZFHPLmipvBFdwag.jpeg)
Visualizations showcasing the most popular stations
![](https://framerusercontent.com/images/1ejkdTD8dmIpZn7yIw1oYvQ8A2A.jpeg)
Regression model error
![](https://framerusercontent.com/images/eBF2YgX3EW90fa6g5wKujCCftY.jpeg)
Average usage in all stations over a year
![](https://framerusercontent.com/images/3yKURgKv1RzrLJX4y13Tl4FpeA.jpeg)