restaurant_webscraping

This project was not technically heavy, but it had a real impact and taught me a great deal. It was part of a social sciences digitization initiative in which I worked with university professors and research assistants to gather restaurant menus for a study of the culinary history of the Gulf region.

My role was to collect and digitize these menus. I proposed to the lead professor that we scrape a website listing over 5,000 restaurants. The task fit my full-stack web development experience and let me pick up Python web scraping with BeautifulSoup quickly. The main challenge was understanding and navigating the site's structure, because the listings were spread across many pages rather than available on a single page.
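A minimal sketch of how a multi-page listing can be walked, not the code in this repository. To stay free of third-party dependencies it uses the standard library's `html.parser` as a stand-in for BeautifulSoup. The `restaurant-link` class name, the page-numbering scheme, and the `fetch_page` callable are all assumptions made for illustration; the real site's markup will differ.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects href values of <a> tags that carry a given CSS class."""

    def __init__(self, link_class):
        super().__init__()
        self.link_class = link_class
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "a" and self.link_class in classes and attrs.get("href"):
            self.hrefs.append(attrs["href"])


def crawl_listing(fetch_page, base_url, link_class="restaurant-link", max_pages=500):
    """Walk numbered listing pages until a page yields no new links.

    fetch_page is a hypothetical callable: page number -> HTML string.
    """
    seen = {}  # absolute URL -> page it was first found on (keeps insertion order)
    for page in range(1, max_pages + 1):
        parser = LinkCollector(link_class)
        parser.feed(fetch_page(page))
        count_before = len(seen)
        for href in parser.hrefs:
            seen.setdefault(urljoin(base_url, href), page)
        if len(seen) == count_before:
            break  # nothing new on this page: assume the listing has ended
    return list(seen)
```

In practice `fetch_page` would wrap an HTTP request for the listing page with the given number. Stopping at the first page with no new links is one simple termination rule; a site that exposes a "next" link or a total page count could use that instead.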

A significant hurdle arose when only the first menu image would download. I discovered that the images were loaded dynamically, so they were missing from the static HTML that BeautifulSoup parses. After extensive debugging, I switched to Selenium, which drives a real browser, and successfully scraped the needed data. The result was a comprehensive collection of menu images for the research.
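The sketch below illustrates one common way to handle dynamically loaded images with Selenium: scroll the page until no further images appear, then read each image's `src`. It is an assumption about how such a page might behave, not a copy of `dynamic_webscraper.py`. The `img` selector, the scroll-to-bottom trigger, and the helper names are hypothetical.

```python
import time


def collect_image_urls(driver, selector="img", pause=1.0, max_rounds=20):
    """Scroll until the number of matching images stops growing,
    then return their unique src URLs in page order."""
    previous_count = -1
    for _ in range(max_rounds):
        # "css selector" is the string value of Selenium's By.CSS_SELECTOR.
        images = driver.find_elements("css selector", selector)
        if len(images) == previous_count:
            break  # a scroll produced no new images
        previous_count = len(images)
        driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")
        time.sleep(pause)  # give the page time to load more content

    urls = []
    for img in driver.find_elements("css selector", selector):
        src = img.get_attribute("src")
        if src and src not in urls:
            urls.append(src)
    return urls


def open_browser():
    """Start Chrome through Selenium (needs a matching ChromeDriver available)."""
    from selenium import webdriver  # third-party, imported only when used
    return webdriver.Chrome()
```

A typical session would call `driver = open_browser()`, load a restaurant page with `driver.get(...)`, pass the driver to `collect_image_urls`, and finish with `driver.quit()`. If the images are revealed by clicking through a gallery rather than by scrolling, the scroll step would be replaced by the corresponding clicks.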

Building on this work, I am now contributing to a machine-learning model that analyzes these images and turns them into structured data more efficiently. The project reflects my commitment to continuous learning and to applying my technical skills to diverse, real-world problems.

The main program is `dynamic_webscraper.py`. I have also uploaded part of the results under `Abu Dhabi Menus`.


AhmadSavaiz03/restaurant_webscraping
