Commit 853c210

Release new version 1.0.5
1 parent 0d32905 commit 853c210

3 files changed: 31 additions and 15 deletions
README.md

Lines changed: 19 additions & 7 deletions
@@ -20,8 +20,10 @@ Built with ❤︎ and :coffee: by [Omkar Pathak](https://github.com/OmkarPathak
 - Extract mobile numbers
 - Extract skills
 - Extract total experience
-- Extract education (not very accurate as of now)
-- Extract experience (not very accurate as of now)
+- Extract college name
+- Extract degree
+- Extract designation
+- Extract company names
 
 # Installation
 
@@ -61,13 +63,15 @@ data = ResumeParser('/path/to/resume/file').get_extracted_data()
 For running the resume extractor you can also use the `cli` provided
 
 ```bash
-usage: pyresparser [-h] [-f FILE] [-d DIRECTORY]
+usage: pyresparser [-h] [-f FILE] [-d DIRECTORY] [-r REMOTEFILE]
+                   [-sf SKILLSFILE]
 
 optional arguments:
--h, --help show this help message and exit
--f FILE, --file FILE resume file to be extracted
--d DIRECTORY, --directory DIRECTORY directory containing all the resumes to be extracted
--r REMOTEFILE, --remotefile REMOTEFILE remote path for resume file to be extracted
+-h, --help show this help message and exit
+-f FILE, --file FILE resume file to be extracted
+-d DIRECTORY, --directory DIRECTORY directory containing all the resumes to be extracted
+-r REMOTEFILE, --remotefile REMOTEFILE remote path for resume file to be extracted
+-sf SKILLSFILE, --skillsfile SKILLSFILE custom skills CSV file against which skills are searched for
 ```
 
 For extracting data from a single resume file, use
@@ -88,6 +92,12 @@ For extracting data from remote resumes, execute
 pyresparser -r <path_to_remote_resume_file>
 ```
 
+For extracting data against your specified skills, create a CSV file with no headers. Sample file can be found [here](pyresparser/skills.csv)
+
+```bash
+pyresparser -sf <path_to_custom_skills_file>
+```
+
 # Notes:
 
 - If you are running the app on windows, then you can only extract .docs and .pdf files
@@ -136,6 +146,8 @@ The module would return a list of dictionary objects with result as follows:
 
 - [https://medium.com/@divalicious.priya/information-extraction-from-cv-acec216c3f48](https://medium.com/@divalicious.priya/information-extraction-from-cv-acec216c3f48)
 
+- **Special thanks** to dataturks for their [annotated dataset](https://dataturks.com/blog/named-entity-recognition-in-resumes.php)
+
 # Donation
 
 If you have found my softwares to be of any use to you, do consider helping me pay my internet bills. This would encourage me to create many such softwares :smile:
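
The README change above asks for a custom skills CSV "with no headers" but does not show how to produce one. The following is a minimal sketch, assuming the file is a single comma-separated row of skill names; that layout is an assumption, so check it against the bundled `pyresparser/skills.csv` before relying on it. The helper names `write_skills_file` and `read_skills_file` are hypothetical and not part of pyresparser, and the lower-casing and de-duplication are illustrative choices only.

```python
import csv
import os
import tempfile


def write_skills_file(path, skills):
    """Write the skills as one comma-separated row, with no header row."""
    with open(path, 'w', newline='') as handle:
        csv.writer(handle).writerow(skills)


def read_skills_file(path):
    """Read every cell back as a lower-cased skill list without duplicates."""
    with open(path, newline='') as handle:
        cells = [cell.strip().lower()
                 for row in csv.reader(handle)
                 for cell in row]
    # dict.fromkeys keeps first-seen order while dropping repeated entries;
    # the final filter discards empty cells.
    return [skill for skill in dict.fromkeys(cells) if skill]


# Usage: round-trip a small list through a temporary file.
with tempfile.TemporaryDirectory() as tmp_dir:
    skills_path = os.path.join(tmp_dir, 'my_skills.csv')
    write_skills_file(skills_path, ['Python', 'SQL', 'Docker', 'python'])
    loaded = read_skills_file(skills_path)

print(loaded)
```

A file written this way could then be passed to the `-sf` option documented in the hunk above.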

pyresparser/command_line.py

Lines changed: 11 additions & 7 deletions
@@ -6,6 +6,7 @@
 import io
 import csv
 import multiprocessing as mp
+import urllib
 from urllib.request import Request, urlopen
 from pyresparser import ResumeParser
 from itertools import product
@@ -71,13 +72,16 @@ def __extract_from_directory(self, directory, skills_file=None):
             return 'Directory not found. Please provide a valid directory.'
 
     def __extract_from_remote_file(self, remote_file):
-        print_cyan('Extracting data from: {}'.format(remote_file))
-        req = Request(remote_file, headers={'User-Agent': 'Mozilla/5.0'})
-        webpage = urlopen(req).read()
-        _file = io.BytesIO(webpage)
-        _file.name = remote_file.split('/')[-1]
-        resume_parser = ResumeParser(_file)
-        return [resume_parser.get_extracted_data()]
+        try:
+            print_cyan('Extracting data from: {}'.format(remote_file))
+            req = Request(remote_file, headers={'User-Agent': 'Mozilla/5.0'})
+            webpage = urlopen(req).read()
+            _file = io.BytesIO(webpage)
+            _file.name = remote_file.split('/')[-1]
+            resume_parser = ResumeParser(_file)
+            return [resume_parser.get_extracted_data()]
+        except urllib.error.HTTPError:
+            return 'File not found. Please provide correct URL for resume file.'
 
 def resume_result_wrapper(args):
     if len(args) == 2:
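
The change above catches only `urllib.error.HTTPError`, which covers error status codes such as 404 but not failures where no response arrives at all. As a hedged illustration, the sketch below separates the download step from parsing and also handles `urllib.error.URLError`, the parent class raised for problems such as DNS failures. The function name `fetch_remote_resume`, its `opener` parameter, and the tuple return value are hypothetical choices made here so the logic can be exercised without network access; they are not part of pyresparser.

```python
import io
from urllib.error import HTTPError, URLError
from urllib.request import Request, urlopen


def fetch_remote_resume(remote_file, opener=urlopen):
    """Download a remote resume into an in-memory file object.

    Returns (file_object, None) on success or (None, error_message) on
    failure. `opener` is a hypothetical injection point for testing.
    """
    req = Request(remote_file, headers={'User-Agent': 'Mozilla/5.0'})
    try:
        with opener(req) as response:
            payload = response.read()
    except HTTPError as err:
        # The server answered, but with an error status such as 404.
        # HTTPError subclasses URLError, so it must be caught first.
        return None, ('File not found (HTTP {}). Please provide correct '
                      'URL for resume file.'.format(err.code))
    except URLError as err:
        # No usable response at all, e.g. DNS failure or refused connection.
        return None, 'Could not reach {}: {}'.format(remote_file, err.reason)

    _file = io.BytesIO(payload)
    # As in the commit, name the buffer after the last URL path segment.
    _file.name = remote_file.split('/')[-1]
    return _file, None
```

With this shape, a caller would pass the returned file object to `ResumeParser` only when the error message is `None`, mirroring the success path in the diff.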

setup.py

Lines changed: 1 addition & 1 deletion
@@ -5,7 +5,7 @@
 
 setup(
     name='pyresparser',
-    version='1.0.4',
+    version='1.0.5',
     description='A simple resume parser used for extracting information from resumes',
     long_description=open('README.rst').read(),
     url='https://github.com/OmkarPathak/pyresparser',
