Darkdump is an OSINT interface for deep web investigations, written in Python. Given a search query, Darkdump retrieves matching .onion sites via Ahmia.fi and, when connected through the Tor network, scrapes those addresses to try to extract emails, metadata, keywords, images, social media references, and similar data.
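To make the extraction step concrete, here is a minimal sketch of pulling .onion hostnames and email addresses out of fetched page text. It is not taken from Darkdump's source: the function names `extract_onions` and `extract_emails` and both regular expressions are assumptions chosen for illustration, and only v3 onion addresses (56 base32 characters) are matched.

```python
import re

# v3 onion hostnames are 56 base32 characters (a-z, 2-7) followed by ".onion".
ONION_RE = re.compile(r"\b[a-z2-7]{56}\.onion\b")

# Deliberately loose email pattern: fine for triage, not for strict validation.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_onions(text):
    """Return unique v3 .onion hostnames in order of first appearance."""
    # Hostnames are case-insensitive, so normalise before matching.
    return list(dict.fromkeys(ONION_RE.findall(text.lower())))


def extract_emails(text):
    """Return the unique email-like strings found in text, sorted."""
    return sorted(set(EMAIL_RE.findall(text)))
```

Both helpers work on plain strings, so they can be applied to a search results page or to the body of a scraped site alike.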
git clone https://github.com/josh0xA/darkdump
cd darkdump
python3 -m pip install -r requirements.txt
python3 darkdump.py --help
To use Darkdump effectively, you need to configure Tor to allow your script to control it via the Tor control port. Here's how to set up your torrc file and verify that Tor is running:
If Tor is not already installed on your system, you need to install it. Here's how you can install Tor on various operating systems:
Debian/Kali/Ubuntu: sudo apt install tor
MacOS: brew install tor
Locate your torrc file. This file is usually found at /etc/tor/torrc on Linux and sometimes on macOS.
Add the following lines to your torrc to enable the control port and set a control port password:
ControlPort 9051
HashedControlPassword [YourHashedPasswordHere]
Replace [YourHashedPasswordHere] with a hashed password, which can be generated using the tor --hash-password command: tor --hash-password "my_password"
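After editing, it can help to confirm that both settings are really active and not commented out. The following is a rough sketch, not part of Darkdump: `read_torrc_options` and `control_port_configured` are hypothetical helper names, and the parser only handles the simple "Option value" lines and "#" comments used here.

```python
def read_torrc_options(text):
    """Map lower-cased option name -> list of values, ignoring '#' comments."""
    options = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        parts = line.split(None, 1)  # option name, then the rest of the line
        value = parts[1].strip() if len(parts) > 1 else ""
        options.setdefault(parts[0].lower(), []).append(value)
    return options


def control_port_configured(text):
    """True if both ControlPort and HashedControlPassword are set."""
    options = read_torrc_options(text)
    return "controlport" in options and "hashedcontrolpassword" in options
```

For example, `control_port_configured(open("/etc/tor/torrc").read())` should return True once the two lines above have been added.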
Linux: sudo systemctl start tor.service
MacOS: brew services start tor
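To verify that Tor is running and that the control password works, a small check like the one below can be used. This is a sketch under assumptions, not Darkdump code: `port_open` and `control_auth_ok` are hypothetical names, 9050 is Tor's usual default SOCKS port (your torrc may differ), and 9051 is the control port configured above. It speaks the plain-text control protocol directly, where a successful AUTHENTICATE is answered with a reply starting with 250.

```python
import socket


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def control_auth_ok(password, host="127.0.0.1", port=9051, timeout=3.0):
    """Send AUTHENTICATE to the control port; True if the reply starts with 250."""
    # Escape backslashes and quotes so the password is a valid quoted string.
    escaped = password.replace("\\", "\\\\").replace('"', '\\"')
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(f'AUTHENTICATE "{escaped}"\r\n'.encode())
        reply = sock.recv(1024).decode(errors="replace")
    return reply.startswith("250")


if __name__ == "__main__":
    print("SOCKS port 9050 open:  ", port_open("127.0.0.1", 9050))
    print("Control port 9051 open:", port_open("127.0.0.1", 9051))
```

Pass the plain password (the one given to tor --hash-password), not the hash, to `control_auth_ok`.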
python3 darkdump.py -q "hacking" -a 10 --scrape --proxy
- search for 10 links and scrape each site
python3 darkdump.py -q "free movies" -a 25
- don't scrape, just return 25 links for that query (does not require tor)
python3 darkdump.py -q "marketplaces" -a 15 --scrape --proxy -i
- search for 15 links and scrape each site as well as find and store images.
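If you want to run several queries in a row, the commands above can be driven from a short Python script. This is only a sketch: `darkdump_argv` and `run_darkdump` are hypothetical helpers, it assumes it is run from the cloned darkdump directory, and it uses only the flags documented in the help output.

```python
import subprocess
import sys


def darkdump_argv(query, amount, scrape=False, proxy=False, images=False):
    """Build the argument list for one darkdump.py invocation."""
    argv = [sys.executable, "darkdump.py", "-q", query, "-a", str(amount)]
    if scrape:
        argv.append("--scrape")
    if proxy:
        argv.append("--proxy")
    if images:
        argv.append("--images")
    return argv


def run_darkdump(query, amount, **flags):
    """Run darkdump.py in the current directory and return the process result."""
    return subprocess.run(darkdump_argv(query, amount, **flags), check=False)


if __name__ == "__main__":
    for term in ["hacking", "marketplaces"]:
        run_darkdump(term, 10, scrape=True, proxy=True)
```

Passing the arguments as a list avoids shell quoting problems with multi-word queries such as "free movies".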
_ _ _ __
__| | __ _ _ __| | ____| |_ _ _ __ ___ _ __ / /
/ _` |/ _` | '__| |/ / _` | | | | '_ ` _ \| '_ \ / /
| (_| | (_| | | | < (_| | |_| | | | | | | |_) | / /
\__,_|\__,_|_| |_|\_\__,_|\__,_|_| |_| |_| .__/ /_/ v3 by Josh Schiavone
|_|
usage: darkdump.py [-h] [-v] [-q QUERY] [-a AMOUNT] [-p] [-i] [-s]
Darkdump is an interface for scraping the deepweb through Ahmia. Made by yours truly.
options:
  -h, --help            show this help message and exit
  -v, --version         returns darkdump's version
  -q QUERY, --query QUERY
                        the keyword or string you want to search on the deepweb
  -a AMOUNT, --amount AMOUNT
                        the amount of results you want to retrieve
  -p, --proxy           use tor proxy for scraping
  -i, --images          scrape images and visual content from the site
  -s, --scrape          scrape the actual site for content and look for keywords
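For readers who want to see how such a menu maps onto code, the options above can be approximated with an argparse parser like the one below. This is a reconstruction from the help text only, not Darkdump's actual source: the function name `build_parser`, the integer type for the amount, and its default of 10 are all assumptions.

```python
import argparse


def build_parser():
    """Approximate the documented darkdump.py options (reconstruction only)."""
    parser = argparse.ArgumentParser(
        prog="darkdump.py",
        description="Darkdump is an interface for scraping the deepweb through Ahmia.",
    )
    parser.add_argument("-v", "--version", action="store_true",
                        help="returns darkdump's version")
    parser.add_argument("-q", "--query", type=str,
                        help="the keyword or string you want to search on the deepweb")
    # The int type and the default value are assumptions, not from the source.
    parser.add_argument("-a", "--amount", type=int, default=10,
                        help="the amount of results you want to retrieve")
    parser.add_argument("-p", "--proxy", action="store_true",
                        help="use tor proxy for scraping")
    parser.add_argument("-i", "--images", action="store_true",
                        help="scrape images and visual content from the site")
    parser.add_argument("-s", "--scrape", action="store_true",
                        help="scrape the actual site for content and look for keywords")
    return parser
```

Calling `build_parser().parse_args(["-q", "hacking", "-a", "10", "--scrape", "--proxy"])` yields a namespace with `query`, `amount`, `scrape`, `proxy`, and `images` attributes.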
The developer of this program, Josh Schiavone, is not responsible for misuse of this data gathering tool. Do not use Darkdump to navigate websites that take part in any activity that is identified as illegal under the laws and regulations of your government. May God bless you all.
MIT License
Copyright (c) Josh Schiavone