1
0
Fork 0
No description
Find a file
2024-02-10 15:08:03 -05:00
lib better error handling 2024-02-10 15:07:54 -05:00
.editorconfig init 2024-02-09 17:27:28 -05:00
.gitignore init 2024-02-09 17:27:28 -05:00
config.json concurrent retrieval with user-limited threadcount 2024-02-09 20:31:21 -05:00
LICENSE.md docs 2024-02-09 23:03:22 -05:00
package-lock.json init 2024-02-09 17:27:28 -05:00
package.json init 2024-02-09 17:27:28 -05:00
README.md Update README.md 2024-02-10 15:08:03 -05:00
run-downloadDb.js better error handling 2024-02-10 15:07:54 -05:00
run-initDb.js docs 2024-02-09 21:13:23 -05:00

gallery-dl-archive-manager

Scripts to manage a (currently twitter only) archive using gallery-dl. Much of the code came from a need to augment pre-existing, outdated archives that were originally created from the twittermediadownloader browser extension.

Config

This repo uses its own config.json in order to save media in the same format as twittermediadownloader. The scripts depend on the media being saved in this format.

Scripts

node run-initDb.js

Initializes a user database from existing folders. Useful if you have a pre-existing archive of users.

Args:

  • --path={/path/to/your/archive}

Example:

  • node run-initDb.js --path=/mnt/data/archive will read all child directories in /mnt/data/archive (e.g. /mnt/data/archive/userA, /mnt/data/archive/userB, etc.) and create a db.json file in /mnt/data/archive listing the users.

node run-downloadDb.js

Runs a full download of all users listed in the db.json of the archive (the provided --path).

Args:

  • --path={/path/to/your/archive}
  • --threads={#}
  • --args={gallery-dl args}

Example:

  • node run-downloadDb.js --path=/mnt/data/archive --threads=3 --args="-r 2.5M" will run a full download (/media followed by /search starting from the oldest pulled file from /media) of all the users in the /mnt/data/archive/db.json file, limiting concurrent download threads to 3. It will pass the additional args -r 2.5M --no-skip to the gallery-dl bin being executed; -r 2.5M --no-skip corresponds to limiting the download rate to 2.5M and downloading all files without skipping (for the sake of example).

Args

Standard args:

--path={/path/to/your/archive}

The path to the archive. This is a parent directory with a list of child directories which correspond to users.

--threads={#}

Max number of concurrent download threads. Only this number of concurrent gallery-dl download threads will run at a given time, other remaining users will be queued.

Additional args to pass to gallery-dl. See gallery-dl CLI options for reference. Note that these aren't currently checked for duplicates that may be used by this repo.

TODO

run-addUsers.js

Should add a new user to the db and initiate a full download similar to run-downloadDb.js

Args:

  • --path={/path/to/your/archive}
  • --users={string array of user(s)}
  • --threads={#}

run-updateDb.js

Should pull from the user database and update the archive without doing a full download. The DB should save with a lastUpdated field. This should be used as a date for the /search API. Preferred if it's been a long time since an update has happened for a user and/or the user has uploaded a significant amount of media since lastUpdated. Note: if you've updated the DB recently, it may be more performant to run node run-downloadDb.js with --args="-A {#}" to simply run the /media check instead of the /search check, where -A will abort the user after {#} tries.

Args:

  • --path={/path/to/your/archive}
  • --threads={#}
  • --args={gallery-dl args}

run-renameUser.js

Should rename an existing user in the db, optionally renaming their existing archive and its contents if --full=true.

Args:

  • --from={'username'}
  • --to={'username'}
  • --full={true|false}
  • --path={/path/to/your/archive}
  • --args={gallery-dl args}