UCI Catalogue Degree Plan Scraper

Overview

This tool is designed to scrape degree plans from the UCI Catalogue by extracting degree plan tables, including course sequences and links to specific course information, from the university’s catalogue. The data is used to assist the UCI Curricular Analytics project.

On a high level, it:

Navigates to specific degree program pages.
Extracts the degree plan tables, including the course sequences for different years.
Captures hyperlinks associated with each course.
Processes normal course listings and dynamically loaded content.

The scraper makes use of Python, primarily leveraging:

requests: To fetch the HTML content from UCI's Catalogue.
BeautifulSoup: To parse and extract the degree plan tables.
pandas: To store the scraped data in a structured format (DataFrame) for further analysis or export.

Procedure

Load URLs of UCI degree plans into urls.txt from the UCI Catalogue.
Run script.py and monitor progress through the console.
View scraped degree plans under the sample_dp_exports/ directory.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
sample_dp_scrapers		sample_dp_scrapers
.gitignore		.gitignore
README.md		README.md
concat_script.py		concat_script.py
script.py		script.py
test.py		test.py
urls.txt		urls.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

UCI Catalogue Degree Plan Scraper

Overview

Procedure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

UCI Catalogue Degree Plan Scraper

Overview

Procedure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages