Web scraping from Indeed.com

(@sivaram-bandaru)
Posts: 43
Trusted Member Customer
Topic starter
 

Step 1: I ran a job search on Indeed.com and used the resulting link in Python, as below.

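from bs4 import BeautifulSoup
import requests

url = "https://www.indeed.com/jobs?q=IT+director&l=Remote&from=searchOnHP&vjk=84953521ad7c4774"
# Set a timeout so the call fails quickly instead of hanging
req = requests.get(url, timeout=10)

soup = BeautifulSoup(req.text, "html.parser")

# Find the description meta tags. "name" is also find_all's first
# parameter, so the attribute has to be passed through attrs.
meta_tags = soup.find_all("meta", attrs={"name": "description"})
# Print the content attribute of each tag (meta tags have no child <a>)
for meta_tag in meta_tags:
    print(meta_tag.get("content"))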
 
Step 2: I verified that the page's HTML contains the description meta tag, as below.
<meta http-equiv="content-type" content="text/html; charset=utf-8">

	<meta name="description" content="1,349 IT Director jobs available in Remote on Indeed.com. Apply to Director of Information Technology, Director of Partnerships, Director of Analytics and more!">

	<meta name="referrer" content="origin-when-cross-origin">
 
Step 3: Running the Python code from Step 1 gives many errors, including the following:
line 507, in send
raise ConnectTimeout(e, request=request)
 
What am I doing wrong?
 
Posted : 09/08/2023 4:29 pm
Hasan Aboul Hasan
(@admin)
Posts: 1276
Member Admin
 

Indeed.com, like many other websites, has terms of service that prohibit scraping.
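For practising the parsing part without sending requests to the site, the HTML pasted in Step 2 of the question can be parsed locally. The sketch below is only an illustration: it swaps BeautifulSoup for Python's standard-library html.parser so it runs with no extra installs, and the names MetaDescriptionParser, extract_meta_descriptions and SAMPLE_HTML are made up for this example.

```python
from html.parser import HTMLParser


class MetaDescriptionParser(HTMLParser):
    """Collect the content of every <meta name="description"> tag."""

    def __init__(self):
        super().__init__()
        self.descriptions = []

    def handle_starttag(self, tag, attrs):
        # attrs arrives as a list of (name, value) pairs.
        attr_map = dict(attrs)
        if tag == "meta" and attr_map.get("name") == "description":
            self.descriptions.append(attr_map.get("content", ""))


def extract_meta_descriptions(html):
    """Return the content attribute of each description meta tag in html."""
    parser = MetaDescriptionParser()
    parser.feed(html)
    parser.close()
    return parser.descriptions


# Sample input: the three meta tags quoted in Step 2 of the question.
SAMPLE_HTML = """
<meta http-equiv="content-type" content="text/html; charset=utf-8">
<meta name="description" content="1,349 IT Director jobs available in Remote on Indeed.com. Apply to Director of Information Technology, Director of Partnerships, Director of Analytics and more!">
<meta name="referrer" content="origin-when-cross-origin">
"""

if __name__ == "__main__":
    for description in extract_meta_descriptions(SAMPLE_HTML):
        print(description)
```

The same idea carries over to BeautifulSoup: select the tag by its name attribute, then read its content attribute rather than looking for a child element.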

 
Posted : 09/09/2023 7:09 am