1
0
Fork 0
No description
Find a file
2024-02-09 23:03:22 -05:00
lib handle /search API after running /media requests 2024-02-09 22:28:41 -05:00
.editorconfig init 2024-02-09 17:27:28 -05:00
.gitignore init 2024-02-09 17:27:28 -05:00
config.json concurrent retrieval with user-limited threadcount 2024-02-09 20:31:21 -05:00
LICENSE.md docs 2024-02-09 23:03:22 -05:00
package-lock.json init 2024-02-09 17:27:28 -05:00
package.json init 2024-02-09 17:27:28 -05:00
README.md docs 2024-02-09 23:03:22 -05:00
run-downloadDb.js handle /search API after running /media requests 2024-02-09 22:28:41 -05:00
run-initDb.js docs 2024-02-09 21:13:23 -05:00

gallery-dl-archive-manager

Scripts to manage a (currently twitter only) archive using gallery-dl. Much of the code came from a need to augment pre-existing, outdated archives that were originally created from the twittermediadownloader browser extension.

Config

This repo uses its own config.json in order to save media in the same format as twittermediadownloader. The scripts depend on the media being saved in this format.

Scripts

Initializes a user database from existing folders. Useful if you have a pre-existing archive of users.

Args

Standard args:

--path={/path/to/your/archive}

The path to the archive. This is a parent directory with a list of child directories which correspond to users.

--threads={#}

Max number of concurrent download threads. Only this number of concurrent gallery-dl download threads will run at a given time, other remaining users will be queued.

Additional args to pass to gallery-dl. Note that these aren't currently checked for duplicates that may be used by this repo.

TODO

run-addUsers.js --path={/path/to/your/archive} --users={string array of user(s)} --threads={#}

Should add a new user to the db and initiate a full download similar to run-downloadDb.js

Should pull from the user database and update the archive without doing a full download.

  • --mode
    • =search: The DB should save with a lastUpdated field. This should be used as a date for the /search API. Preferred if it's been a long time since an update has happened for a user and/or the user has uploaded a significant amount of media since lastUpdated.
    • =media: This will run /media and stop after hitting skipped files, as skipped files indicate hitting lastUpdated. Preferred in normal circumstances.