Fornesus Web Scraping Project

Introduction

Buenas at Kumusta, I am Fornesus and this is a web scraping project that I am creating using the BS4 (BeautifulSoup4) library in Python.

I am scraping the content of my blog, Fornesus Blog, avaialble at https://fornesus.blog.

The content in the homepage of Fornesus Blog , are as follows:

Content Division
element #1:

Child #1 - a:

+ "https://fornesus.blog/"

Content Division
element #2:

Child #1 - a:

+ "https://fornesus.blog/login"

Content Division
element #3:

Child #1 - a:

+ "https://fornesus.blog/blog/"

Content Division
element #4:

Child #1 - a:

+ "https://fornesus.com"

Content Division
element #5:

Child #1 - a:

+ "https://fornes.us"

Content Division
element #6:

Child #1 - a:

+ "https://fornesusart.com"

Content Division
element #7:

Child #1 - a:

+ "https://blog.fornesusart.com"

Content Division
element #8:

Child #1 - a:

+ "https://chris.city"

Content Division
element #9:

Child #1 - h1:

+ "Latest Posts"

Content Division
element #10:

Child #1 - h2:

+ "Do we actually live this long?"

Content Division
element #11:

Child #1 - p:

+ "I don’t know if I’m going to reach 100 years old, but if I do, this is what I would tell myself, and ask myself. Read more"

Content Division
element #12:

Child #1 - h2:

+ "Dream jobs aren’t real, at least not to me."

Content Division
element #13:

Child #1 - p:

+ "It is pointless to have a dream job when you aren’t happy with the way that work is structured in the first place.  This applies, at least, to me. Because, for me, a job is literally a job. It is what it is. A job is something that you do, pursue, and even grow in… Read more"

Content Division
element #14:

Child #1 - h2:

+ "We should un-invent the clock!"

Content Division
element #15:

Child #1 - p:

+ "So, I mean, I have this goddamn alarm that I keep forgetting to unset but get angry when I don’t set it because I always want to know when it’s 11:11 for no reason other than this superstition that disillusioned millennials have that 11:11 is somehow a special time even though time is literally something… Read more"

Content Division
element #16:

Child #1 - h2:

+ "Loneliness is political. Solitude is a luxurious choice."

Content Division
element #17:

Child #1 - p:

+ "Loneliness has been considered a bad thing for a while. Of course, there is scientific data to prove just that. Loneliness is considered to be just as deadly as being idle for long periods of time. So, as a perpetually lonely person who, even in a crowded room, struggles to make connections. I know that… Read more"

Content Division
element #18:

Child #1 - h2:

+ "A long life might be torture, at least from my perspective"

Content Division
element #19:

Child #1 - p:

+ "I would much rather live a life well-lived, rather than this security blanket of safety that I currently have no choice but to stick to. I was a sheltered kid who became a reclusive adult, but if I had a choice, I wouldn’t live this way. I would live life to the fullest if only… Read more"

Content Division
element #20:

Child #1 - h2:

+ "I refuse to reduce my clutter, here’s why…"

Content Division
element #21:

Child #1 - p:

+ "I literally have ADHD and cleaning up disorients my OCD. That’s it, that’s literally all it is. I have pretty much stopped reducing physical clutter because it just does not feel right. And because I always end up placing my prior clutter out of sight to where I now have cluttered drawers. But, emotionally, I’m… Read more"

Content Division
element #22:

Child #1 - a:

+ "https://fornesus.blog/?query-16-page=2"

Child #2 - a:

+ "https://fornesus.blog/?query-16-page=3"

Child #3 - a:

+ "https://fornesus.blog/?query-16-page=7"

Content Division
element #23:

Child #1 - h2:

+ "About Fornesus"

Child #2 - p:

+ "Buenas at Kumusta everyone, I’m Chris, also known online as “Fornesus”, and here, I give my musings, thoughts, and rambles."

Child #3 - p:

+ "Sometimes, I will write about my own experiences, at other times, about my perspective, and on others, something totally random."

Child #4 - p:

+ "In any case, I will write, and keep writing."

Child #5 - p:

+ "I have managed and created several blogs over the years, but I have chosen to focus on this one for my general purpose writing, while my Art Blog and Chris City blogs are mainly for my budding art and design career ambitions."

Child #6 - h2:

+ "Musings, Thoughts & Rambles"

Child #7 - p:

+ "Everyone has a story to tell, and a perspective that’s unique to them."

Child #8 - p:

+ "My goal, here, is to empower you to tell your’s, and inspire those willing to listen as I tell mine."

Child #9 - p:

+ "NOTE: If you no longer wish to see ads, please use the “Login” link to sign in with your WordPress account (or make one if you don’t have one already)."

Content Division
element #24:

Child #1 - p:

+ "Connect with me (and I really mean connect):"

Content Division
element #25:

Child #1 - p:

+ "Subscription"

Content Division
element #26:

Child #1 - h1:

+ "Ready to subscribe?"

Content Division
element #27:

Child #1 - p:

+ "For as little as $1 a month or $10 a year, you can support me and subscribe to my newsletter. This, not only helps me financially, but will also inspire me to write more."

Child #2 - p:

+ "You can, of course, also subscribe for free."

Child #3 - p:

+ "Thank you so much!"

Child #4 - p:

+ "– Fornesus"

Content Division
element #28:

Child #1 - p:

+ "Type your email…"

Child #2 - p:

+ "Subscribe"

Content Division
element #29:

Child #1 - p:

+ "I write about my musings, thoughts, and rambles."

Content Division
element #30:

Child #1 - p:

+ "Fornesus, the Observer"

Content Division
element #31:

Child #1 - p:

+ "Blog at WordPress.com."

Conclusion

End of the output for the content in my blog, (Fornesus Blog - fornesus.blog).

I am Fornesus, owner of my blog and the programmer of this code. Thank you for visiting!