Transparency about what we collect, store, and share.
MapTheNet records domain-level link relationships only. When our crawler visits a page, it extracts outbound links and records the relationship as a pair of domain names (e.g., example.com links to example.org). We also record:
MapTheNet does not collect any of the following:
Our crawler identifies itself with the user-agent string MapTheNetBot/1.0. It checks each domain's /robots.txt file before crawling and fully complies with any Disallow directives that apply to our user-agent or to all crawlers.
If you wish to block our crawler, add the following to your robots.txt:
User-agent: MapTheNetBot
Disallow: /
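To verify that a robots.txt file blocks the crawler as intended, the rules can be checked locally. The following is a minimal sketch using Python's standard urllib.robotparser module. It is not MapTheNet's actual crawler code; the helper name is_allowed and the example URLs are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url.

    Hypothetical helper: it parses the rules from a string, so no
    network request is made.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# The blocking rules shown above.
BLOCK_RULES = """\
User-agent: MapTheNetBot
Disallow: /
"""

# The crawler's user-agent is matched by its name token, so it is blocked.
print(is_allowed(BLOCK_RULES, "MapTheNetBot/1.0", "https://example.com/page"))  # False

# A crawler not named in the file is unaffected by this rule.
print(is_allowed(BLOCK_RULES, "OtherBot/2.0", "https://example.com/page"))  # True
```

A rule under User-agent: * is applied the same way to any crawler that has no more specific entry, which is why directives addressed to all crawlers also apply to MapTheNetBot.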
Crawl data is retained indefinitely in aggregated form to enable historical analysis of web structure over time. Raw crawl logs (which contain timestamped link observations) are retained for 12 months and then deleted.
Opted-out domains are removed from all current and future public exports within 30 days of the opt-out request.
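The two time windows above (12 months for raw crawl logs, 30 days for opt-out removal) can be expressed as simple date arithmetic. This is an illustrative sketch only, assuming timestamps are timezone-aware datetime values and that "12 months" means one calendar year; the function names are hypothetical and do not describe MapTheNet's internal systems.

```python
from datetime import datetime, timedelta, timezone

RAW_LOG_RETENTION_YEARS = 1   # 12 months, treated here as one calendar year
OPT_OUT_WINDOW_DAYS = 30      # removal deadline after an opt-out request


def raw_log_cutoff(now: datetime) -> datetime:
    """Return the timestamp before which raw crawl logs are due for deletion."""
    target_year = now.year - RAW_LOG_RETENTION_YEARS
    try:
        return now.replace(year=target_year)
    except ValueError:
        # now is Feb 29 and the target year is not a leap year.
        return now.replace(year=target_year, day=28)


def is_expired(observed_at: datetime, now: datetime) -> bool:
    """True if a raw log entry is older than the retention window."""
    return observed_at < raw_log_cutoff(now)


def opt_out_deadline(requested_at: datetime) -> datetime:
    """Latest date by which an opted-out domain should leave public exports."""
    return requested_at + timedelta(days=OPT_OUT_WINDOW_DAYS)


now = datetime(2024, 6, 1, tzinfo=timezone.utc)
print(is_expired(datetime(2023, 5, 31, tzinfo=timezone.utc), now))  # True
print(is_expired(datetime(2023, 6, 2, tzinfo=timezone.utc), now))   # False
```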
MapTheNet processes only publicly available information (domain names and their link relationships) for purposes of academic research and public interest. No personal data is processed.
Our activities are analogous to those of search engine crawlers and of internet measurement and archiving projects such as the Internet Archive and Common Crawl.
If you have questions about our data practices, use the contact form. To request removal of your domain, visit the Opt-Out page.