Bulk export files to local directory

Bug #1619440 reported by Paul Everitt
This bug affects 1 person
Affects  Status        Importance  Assigned to           Milestone
KARL4    Fix Released  High        Carlos de la Guardia

Bug Description

OSF needs to download a large number of files (2,800) to investigate them. They'd like to give us a text file with filenames, one per line, and we then give them a ZIP file (or manually upload the files to Box). They can't give us the community and sub-folder; they only know the "filename".

- We can do this as a one-off console script

- Read the lines in the text file

- Iterate through each File in the catalog, making a mapping of filename -> docid

- Iterate through each line in the text file

- If the entry matches a File filename, "dump it" (see below)

- If not, write to a log file about entries that weren't found
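
The steps above can be sketched roughly as follows. This is only an illustration in plain Python, not the script that was actually written: `catalog_entries` is a hypothetical stand-in for walking the File objects in the catalog, and the mapping keeps a list of docids per filename on the assumption that the same filename can exist in more than one community.

```python
def build_filename_index(catalog_entries):
    """Map each filename to the list of docids that carry it.

    `catalog_entries` is a hypothetical iterable of (docid, filename)
    pairs standing in for a walk over File objects in the catalog.
    """
    index = {}
    for docid, filename in catalog_entries:
        index.setdefault(filename, []).append(docid)
    return index


def match_requested(lines, index):
    """Split requested filenames into found (name -> docids) and missing."""
    found, missing = {}, []
    for line in lines:
        name = line.strip()
        if not name:
            continue  # skip blank lines in the request file
        if name in index:
            found[name] = index[name]
        else:
            missing.append(name)
    return found, missing
```

The `lines` argument could simply be the open text file, since iterating over a file yields one line at a time.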

Dumping
===========

- Write the file contents to the current working directory

- Use a filename such as communityname-foldername-subfoldername-filename

- Also in the directory, keep an errors.log file with entries for missing filenames
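
A minimal sketch of the dumping step, under some assumptions: the path parts (community, folders, filename) and the file bytes are already in hand, and the helper names `dump_name`, `dump_file`, and `log_missing` are made up for illustration. How the parts and bytes are read out of KARL is not shown.

```python
from pathlib import Path


def dump_name(path_parts):
    """Join community, folder, and filename parts with hyphens."""
    # Replace path separators so each part stays a single filename component.
    cleaned = [part.replace("/", "_").replace("\\", "_") for part in path_parts]
    return "-".join(cleaned)


def dump_file(path_parts, data, out_dir="."):
    """Write `data` (bytes) under a flattened name and return the path."""
    target = Path(out_dir) / dump_name(path_parts)
    target.write_bytes(data)
    return target


def log_missing(missing, out_dir="."):
    """Append one line per missing filename to errors.log."""
    log_path = Path(out_dir) / "errors.log"
    with log_path.open("a", encoding="utf-8") as log:
        for name in missing:
            log.write(f"not found: {name}\n")
    return log_path
```

For example, `dump_file(["communityname", "foldername", "subfoldername", "filename"], data)` would write `communityname-foldername-subfoldername-filename` in the current working directory. Two files whose flattened names collide would overwrite each other, which a real script would need to handle.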

Tags: auth
Changed in karl4:
milestone: 023 → 024
Changed in karl4:
milestone: 024 → 025
Changed in karl4:
milestone: 025 → 026
Changed in karl4:
milestone: 026 → 028
Changed in karl4:
status: New → Won't Fix
Paul Everitt (paul-agendaless) wrote :

I believe this is going to come back, Oleg asked about it.

Changed in karl4:
milestone: 028 → 030
status: Won't Fix → New
Paul Everitt (paul-agendaless) wrote :

Hi Carlos, let's make this your one task for this week. Hopefully you can use Newt queries for getting to the filename. Two other points: matching should be case-insensitive, and it should only cover things that are file-like: File, attachments, etc.
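
One way to picture the two extra points, as a plain-Python sketch rather than a Newt query: the type labels in `FILE_LIKE_TYPES` and the `(docid, type_name, filename)` tuples are assumptions made for illustration, not KARL's actual content types or catalog layout.

```python
FILE_LIKE_TYPES = {"File", "Attachment"}  # hypothetical type labels


def build_casefold_index(entries):
    """Index file-like entries by case-folded filename.

    `entries` is a hypothetical iterable of (docid, type_name, filename).
    """
    index = {}
    for docid, type_name, filename in entries:
        if type_name not in FILE_LIKE_TYPES:
            continue  # ignore anything that is not file-like
        index.setdefault(filename.casefold(), []).append(docid)
    return index


def lookup(index, requested):
    """Return the docids whose filename matches, ignoring case."""
    return index.get(requested.strip().casefold(), [])
```

Folding both the indexed and the requested name with `str.casefold()` makes `Report.PDF` and `report.pdf` compare equal.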

Changed in karl4:
importance: Medium → High
Paul Everitt (paul-agendaless) wrote :

Also, run this on *staging*, not production (they want the state of the database as of a number of months ago). But make sure that staging has content from July 2016.

Carlos de la Guardia (cguardia) wrote :

Done. Waiting for Oleg's comments.

Changed in karl4:
status: New → In Progress
Changed in karl4:
status: In Progress → Fix Released