URL: http://github.com/jairo8925/web-scraper


jairo8925/web-scraper

16 Commits

Web Scraper

A Python program that takes a website address, parses the page content, and saves the result to a text file. It sends an HTTP request to the website and uses BeautifulSoup to parse the response. Specifically, it scrapes articles from https://www.nature.com/nature/articles and saves each one in a separate .txt file.
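The request-and-parse step described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names `fetch_html` and `extract_articles` are hypothetical, the request is made with the standard library's `urllib` (the README does not say which HTTP client is used), and the tag and attribute selectors are assumptions about the page markup that may not match the live site.

```python
from urllib.request import Request, urlopen

from bs4 import BeautifulSoup


def fetch_html(url: str) -> str:
    """Send an HTTP GET request and return the response body as text."""
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request, timeout=10) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def extract_articles(html: str) -> list[dict]:
    """Return the type, title, and link of each <article> element.

    The selectors below are assumed for illustration only.
    """
    soup = BeautifulSoup(html, "html.parser")
    articles = []
    for node in soup.find_all("article"):
        type_node = node.find("span", attrs={"data-test": "article.type"})
        link_node = node.find("a", attrs={"data-track-action": "view article"})
        if type_node is None or link_node is None:
            continue  # skip entries that lack the assumed markup
        articles.append({
            "type": type_node.get_text(strip=True),
            "title": link_node.get_text(strip=True),
            "href": link_node.get("href"),
        })
    return articles
```

Keeping the parser separate from the network call means `extract_articles` can be tried on a saved HTML string without contacting the site.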

To start, enter the number of pages the program should search for articles. Next, enter the type of article to look for (e.g. News, Correspondence, Research Highlight). When the program finishes, the articles are saved in the directories Page_1 to Page_N (where N is the page number), according to the page on which each article was found.
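The filtering and saving workflow might look something like the sketch below. It is written under several assumptions: the helper names (`title_to_filename`, `save_page`, `scrape`) and the `type`/`title`/`body` dictionary keys are hypothetical, the file-naming rule (punctuation removed, words joined by underscores) is an illustrative choice, and a `Page_N` directory is created even when no article on that page matches.

```python
import string
from pathlib import Path


def title_to_filename(title: str) -> str:
    """Drop punctuation and join the remaining words with underscores."""
    cleaned = title.translate(str.maketrans("", "", string.punctuation))
    return "_".join(cleaned.split()) + ".txt"


def save_page(articles, page_number, wanted_type, root="."):
    """Write matching articles into Page_<page_number>; return the saved paths."""
    page_dir = Path(root) / f"Page_{page_number}"
    page_dir.mkdir(parents=True, exist_ok=True)  # created even if nothing matches
    saved = []
    for article in articles:
        if article["type"] != wanted_type:
            continue  # keep only the requested article type
        path = page_dir / title_to_filename(article["title"])
        path.write_text(article["body"], encoding="utf-8")
        saved.append(path)
    return saved


def scrape(page_count, wanted_type, load_page, root="."):
    """Process pages 1..page_count; load_page(n) returns that page's articles."""
    saved = []
    for page in range(1, page_count + 1):
        saved.extend(save_page(load_page(page), page, wanted_type, root))
    return saved
```

Here `load_page` stands in for whatever fetches and parses a single listing page, so the directory logic can be exercised with in-memory data before any network code is attached.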

My other project (Multilingual Online Translator) that uses web scraping can be found here

