Blog title: researching the XYZ
Short description of post.
Published: December 6, 2021
Reading Time: 5 minutes
Title
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nulla eu sem viverra, blandit justo in, ullamcorper nisl. Aliquam maximus lorem et molestie pretium. In luctus facilisis eros id finibus. Nam ac lectus lacus. Nullam quis egestas risus. Pellentesque habitant morbi tristique senectus et netus et malesuada fames ac turpis egestas. Fusce id elit gravida sem gravida sollicitudin. Sed congue nulla elit, id cursus nibh viverra vitae.
Title
In vel sapien a justo condimentum finibus. Fusce non magna a leo suscipit fringilla nec in magna. Vestibulum viverra dapibus magna in placerat. Pellentesque sed sodales ex. Vivamus blandit eu justo ac luctus. Etiam nec tellus molestie, suscipit velit ut, lacinia velit. Quisque libero tellus, hendrerit ac dignissim et, faucibus ac nulla. Vestibulum finibus posuere varius. Maecenas finibus est non egestas aliquet. Cras venenatis metus sed iaculis fermentum. In ipsum dui, sagittis ac augue id, feugiat bibendum dui. Sed pellentesque orci orci, quis mollis massa tempor mattis.
1#! /bin/python3.10
2import re
3import requests
4import os
5from os.path import exists
6from bs4 import BeautifulSoup
7from colorama import Fore, Style
8company_list = ['scythe','dragos']
9company_url = {'scythe':'https://www.scythe.io/about/careers','dragos':'https://jobs.lever.co/dragos'}
10
11# Define formatting
12reset = Style.RESET_ALL
13green = Fore.GREEN
14purple = Fore.MAGENTA
15sep = Fore.BLUE + "---------------------------" + reset
16
17def get_format(response,company_name):
18 '''Return the HTML attribute needed for each site'''
19 match company_name:
20 case "scythe":
21 soup = BeautifulSoup(response.text, 'html.parser').findAll("h3",attrs={"id": "w-node-_6a3848d7-bd9c-4061-be22-05d0c32b7a82-c32b7a81"})
22 return soup
23 case "dragos":
24 soup = BeautifulSoup(response.text, 'html.parser').findAll("h5",attrs={"data-qa": "posting-name"})
25 return soup
26
27def parse(posting_location,company_name):
28 '''Issue the HTML request for job posting page'''
29 # Issue request to scyhte careers page
30 response = requests.get(posting_location,company_name)
31
32 # Get the format of the specific site to parse
33 parsed_response = get_format(response,company_name)
34
35 return parsed_response
36
37def parse_html(html_response):
38 '''Takes list of HTML strings and parses them to contain just the job posting'''
39 # for each job posting
40 postings_list = []
41 for i in html_response:
42 # Remove HTML from job posting
43 postings_list.append(re.sub('<[^<]+?>', '', str(i)))
44 return postings_list
45
46def get_new(postings_list,old_list,company_name):
47 '''Get the difference in the new job postings and the old ones'''
48 new_post = list(set(postings_list) - set(old_list)) + list(set(old_list) - set(postings_list))
49 path = './data/'+company_name
50
51 # Save job posting to file
52 with open(path, 'w') as f:
53 for listing in postings_list:
54 f.write("%s \n" % listing)
55 if new_post:
56 return company_name,new_post
57
58def get_old(company_name):
59 '''Get the previous job postings and add them to a list'''
60 # If data file does not exist, create it
61 path = './data/'+company_name
62 if not exists(path):
63 open(path, 'a').close()
64 with open(path) as file:
65 lines = file.readlines()
66 old_list = []
67 for i in lines:
68 # for each job posting, strip out the characters we don't need
69 old_list.append(i.rstrip())
70 return old_list
71
72def print_results(company_name,new_post):
73 '''print the job postings'''
74 company_name = str(new_post[0]).upper()
75 print(f"{green}{company_name} has new job postings!")
76 print(sep)
77 new_post = new_post[1:]
78 job_postings = list(new_post)
79 for i in job_postings:
80 print(f"{purple}",*i,sep="\n")
81 print(sep)
82 pass
83
84def select_company():
85 '''Select the company passed in list and then call the function required for that specific job'''
86 # Get the company name from the index number passed in through the function
87 for company in range(len(company_list)):
88 company_name = list(company_url.keys())[company]
89 posting_location = list(company_url.values())[company]
90 html_response = parse(posting_location,company_name)
91 old_list = get_old(company_name)
92 postings_list = parse_html(html_response)
93 new_post = get_new(postings_list,old_list,company_name)
94 if new_post:
95 print_results(company_name,new_post)
96 else:
97 print(f"{green}{company_name}{purple} has no new job postings")
98
99print(f"{green}Ear2Ground:{purple} A Program to help you keep tabs on the job postings of infosec companies")
100print(sep)
101def main():
102 path = './data/'
103 if not os.path.exists(path):
104 os.makedirs(path)
105 select_company()
106
107if __name__ == "__main__":
108 main()
Pellentesque habitant morbi tristique senectus et netus et malesuada fames ac turpis egestas. Nunc aliquam, urna a ultricies dictum, mi arcu commodo eros, dictum vulputate leo magna ac orci. Quisque ultricies molestie nibh, eget sagittis est commodo ut. Cras fermentum, lectus eu interdum rhoncus, erat tortor aliquam velit, eu iaculis purus ex sed lorem. Nunc maximus nisi eu mauris ultricies, non placerat nisl fringilla. Quisque dignissim tellus enim, ut faucibus diam iaculis sit amet. Aenean vestibulum et nunc tempor elementum. Mauris vel augue id justo tempor euismod. Nunc elementum vulputate ante sit amet pulvinar. Orci varius natoque penatibus et magnis dis parturient montes, nascetur ridiculus mus. Fusce vitae faucibus lectus.
- This
- Is
- A
- List
Smaller Title 2
- This
- Is
- Also
- A
- List
Sed imperdiet metus at porta blandit. Proin aliquet fringilla fringilla. In eu mi tempus, condimentum leo in, fermentum purus. Fusce eget dignissim quam. Nulla faucibus elit vel ligula laoreet tempor. Phasellus sed magna velit. Donec at euismod mi. Cras suscipit interdum ligula.
This is a block quote. Groovy.
In ac euismod diam, quis vehicula tellus. Duis mollis, nulla quis egestas congue, eros enim tempor urna, ac pretium elit mi quis nulla. Nunc id vestibulum felis. Aliquam quis massa at dui posuere mattis. Curabitur fermentum rutrum nisl, nec hendrerit velit vestibulum ut. Donec varius euismod ex, eget lacinia odio scelerisque eget. Cras posuere, massa tincidunt tristique semper, tellus felis porta lorem, eu pulvinar velit lacus sit amet orci. Ut finibus dolor ac lectus tristique, non condimentum justo tristique. Pellentesque consectetur mollis tincidunt. Praesent dapibus, dui sed rhoncus luctus, erat ligula posuere eros, quis ullamcorper justo leo id tellus.