Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
crawlersNoticias
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
4
Issues
4
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
m3
crawlersNoticias
Commits
f7f6bcc7
Commit
f7f6bcc7
authored
7 years ago
by
Renán Sosa Guillen
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
date parser
parent
155c0d82
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
3 deletions
+4
-3
parse_date_files.py
parse_date_files.py
+4
-3
No files found.
parse_date_files.py
View file @
f7f6bcc7
...
...
@@ -4,10 +4,10 @@ from collections import OrderedDict
"""
Uso:
python parse_date_files.py <
nombre_del_crawler
>
python parse_date_files.py <
ruta_del_crawler> <nombre_archivo
>
Ej.
python parse_date_files.py descarga_hacia_atras/laJornadaBC2
python parse_date_files.py descarga_hacia_atras/laJornadaBC2
noticias.json
"""
def
dictRowGenerator
(
line
):
...
...
@@ -46,13 +46,14 @@ def dictRowGenerator(line):
info
=
sys
.
argv
[
1
]
news_file
=
sys
.
argv
[
2
]
media
=
info
[
info
.
rfind
(
"/"
)
+
1
:]
download_type
=
info
[:
info
.
rfind
(
"/"
)]
this_file_path
=
os
.
path
.
dirname
(
os
.
path
.
realpath
(
__file__
))
json_file_path
=
this_file_path
+
"/"
+
download_type
+
"/"
+
media
destination_path
=
this_file_path
+
"/"
+
media
json_file
=
json
.
loads
(
open
(
json_file_path
+
"/
noticias.json"
)
.
read
())
json_file
=
json
.
loads
(
open
(
json_file_path
+
"/
"
+
news_file
)
.
read
())
date_set
=
set
()
for
news
in
json_file
:
...
...
This diff is collapsed.
Click to expand it.
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment