bluesky doesn't use regex really, so the implementation detail is moved to each section. |
||
|---|---|---|
| src | ||
| .gitignore | ||
| Cargo.toml | ||
| env.example | ||
| README.md | ||
pickpocket
rifle through the pockets of an upstream art station to retrieve the optimal image quality available.
installation
git clone, edit .env (or set the same environment variables), and cargo run --release.
usage
deviantart
standard deviantart post urls (e.g., https://www.deviantart.com/username/art/art-id-123123123123123) will retrieve the primary image from the post's srcset, which includes size variations.
this scraper uses regex, which is brittle but ultimately just as bad as trying to parse the xml doctree when it comes to deviantart.
bluesky
standard bluesky post urls (e.g., https://bsky.app/profile/username/post/post-id) will retrieve all available images in the post.
this scraper uses a combination of json parsing and regex. it uses the official bsky apis, so is unlikely to change.
troubleshooting
pickpocket uses tracing to provide deeper debugging information if required. simply run with cargo run --debug.
license
agpl v3.0