- Directory Listing: Finding open directories using Google Dork on your target organisation helps one to understand the directory structure on the webserver. It may reveal sensitive information or it may lead to information disclosure.
- Configuration Files: Often times configuration files contains sensitive information such as hardcoded passwords, sensitive drive locations or API tokens which can help you gain privilege access to the internal resources.
- Database Files: Database Files are data files that are used to store the contents of the database in a structured format into a file in separate tables and fields. Depending on the nature of the web application these files could provide access to sensitive information.
- WordPress: WordPress is an open-source CMS written in PHP. WordPress has thousands of plugins to build, customise and enhance the websites. There are numerous vulnerabilities in these plugins. Finding WordPress related
- Log Files: Log files sometimes provide detailed information of the users’ activities in a particular application. These files are good to look at session cookies or other types of tokens.
- Backup and Old Files: Backup files are original copies of the critical systems. These provide access to PII or access to sensitive records.
- Login Pages: It is extremely important to identify login pages of your target organisation to perform bruteforce attempts or trying default credentials to gain further access to organisation resources.
- SQL Errors: SQL errors leaks sensitive information about the backend systems. This can help one to perform enumeration on the database types and see if the application is vulnerable to input validation related attacks such as SQL Injection.
- Apache Config Files: Apache HTTP Server is configured by placing directives in plain text configuration files. The main configuration file is usually called httpd.conf. In addition, other configuration files may be added using the Include directive, and wildcards can be used to include many configuration files. Any directive may be placed in any of these configuration files. Depending on the entries in these config files it may reveal database connection strings, username and passwords, the internal workings, used and referenced libraries and business logic of application.
- Robots.txt File: Robots.txt file instructs web robots how to crawl pages on their website. Depending on the content of the file, an attacker might discover hidden directories and files.
- DomainEye: DomainEye is a domain/host investigation tool that has the largest domain databases. They provide services such as reverse Whois, reverse IP lookup, as well as reverse NS and MX.
- Publicly Exposed Documents: Such documents can be used to extract metadata information.
- phpinfo(): Exposing phpinfo() on its own isn’t necessarily a risk, but in combination with other vulnerabilities could lead to your site becoming compromised. Additionally, module versions could make attackers life easier when targeting application using newly discovered exploits.
- Finding Backdoors: This can help one to identify website defacements or server hijacking related issues. By exploiting the open redirect vulnerability on the trusted web application, the attacker can redirect victims to a phishing page.
- Install/Setup Files: Such files allows an attacker to perform enumeration on the target organisation. Information gathered using these files can help discover version details which can then be used to perform the targeted exploit.
- Open Redirects: With these, we look at various known parameters vulnerable to open redirect related issues.
- Apache Struts RCE: Successfully exploiting an RCE vulnerability could allow the attacker to run arbitrary programs. Here, we are looking for files with extensions of “.action” or “.do”.
- 3rd Party Exposure: Here we are looking for exposure of information on third party sites such as Codebeautify, Codeshare and Codepen.
- Check Security Headers: Identify quickly if the target site is using security related headers in the server response.
- GitLab: Quickly look for sensitive information on the GitLab.
- Find Pastebin Entries: Shows you the results related to the target organisation on the Pastebin site. This could be passwords or any other sensitive information related to the target organisation.
- Employees on LINKEDIN: Identifying employee names on LinkedIn can help you build a username list when it comes to password spraying attack.
- .HTACCESS / Sensitive Files: Look for sensitive file exposure. This may indicate a server misconfiguration.
- Find Subdomains: Subdomain helps you expand the attack surface on the target organisation. There are numerous tools available to automate the process of subdomain enumeration.
- Find Sub-Subdomains: Identify sub-sub domains on the target organisation using Google Dork,
- Find WordPress related exposure: WordPress related exposure helps you gain access to sensitive files and folders.
- BitBucket & Atlassian: Source code leakage, hardcoded credentials and access to cloud infrastructure.
- PassiveTotal: PassiveTotal is a great tool to perform threat investigation. Using BigBountyRecon we will use PassiveTotal to identify subdomains on the target information.
- Stackoverflow: Source code exposure or any technology-specific questions mentioned on the Stackoverflow.
- Find WordPress related exposure using Wayback Machine: Look for archieved WordPress files using WaybackMachine.
- GitHub: Quickly look for sensitive information on the GitHub.
- OpenBugBounty: Look for publicly exposed security issues on the OpenBugBounty website.
- Reddit: Information about the particular organisation on the Reddit platform.
- Crossdomain.xml: Look for misconfigured crossdomain.xml files on the target organisation.
- ThreatCrowd: Search engine for threats, however, we are going to use this to identify additional sub-domains.
- .git Folder: Source code exposure. it’s possible to download the entire repository content if accessible.
- YouTube: Look for any recent news on Youtube.
- Digitalocean Spaces: Spaces is an S3-compatible object storage service that lets you store and serve large amounts of data. We will look for any data exposures.
- .SWF File (Google): Flash is dead. We are going to use Google Dorks to look for older versions of flash .swf’s which contain vulnerabilities.
- .SWF File (Yandex): Flash is dead. We are going to use Yandex to look for older versions of flash .swf’s which contain vulnerabilities.
- .SWF File (Wayback Machine): Flash is dead. We are going to use WaybackMachine to look for older versions of flash .swf’s which contain vulnerabilities.
- Wayback Machine: Look for archived files to access old files.
- Reverse IP Lookup: Reverse IP Lookup lets you discover all the domain names hosted on any given IP address. This will help you to explore the attack surface for a target organisation.
- Traefik: Look for an open-source Edge Router for an unauthenticated interface which exposes internal services.
- Cloud Storage and Buckets: Google CSE for various cloud storages - aws, digitalocean, backblaze, wasabi, rackspace, dropbox, ibm, azure, dreamhost, linode, gcp, box, mailru
- s3 Buckets: Open s3 buckets.
- Censys (IPv4, Domains & Certs): Search engine for finding internet devices. We will use this to look for additional sub-domains using various endpoints on Censys.
- Shodan: Search engine for Internet-connected devices
- SharePoint RCE: Look for CVE-2020-0646 SharePoint RCE related endpoint.
- API Endpoints: Find WSDL files.
- Gist Searches: Quickly look for sensitive information on the Gist pastes.
- CT Logs: Certificate Transparency (CT) is an Internet security standard and open-source framework for monitoring and auditing digital certificates. We will use to look for additional sub-domains for a targeted organisation.
- Password Leak: Look for plaintext passwords of internal employees exposed in various leaks.
- What CMS: Identify the version and type of CMS used by a target organisation for targeted enumeration and exploit research.
Search for plaintext passwords for a target organisation:
Looking for subdomains and other interesting information on the target organisation:
Finding Apache Struts related assets:
Verifying if the URL contains extenstion of “.do”:
Step1: Download the file from Release section: https://github.com/Viralmaniar/BigBountyRecon/releases/download/v0.1/BigBountyRecon.exe
Step2: Run the EXE file
Step3: Enter the target domain
Step4: Click on different buttons in the tool to find information
Step5: In case of Google Captcha simply click on the puzzle and move ahead
Table obtained from: https://exposingtheinvisible.org/guides/google-dorking/
Here is a table with possible dorks for various search engines.
|cache:[url]||Shows the version of the web page from the search engine’s cache.||✓|
|related:[url]||Finds web pages that are similar to the specified web page.||✓|
|info:[url]||Presents some information that Google has about a web page, including similar pages, the cached version of the page, and sites linking to the page.||✓|
|site:[url]||Finds pages only within a particular domain and all its subdomains.||✓||✓||✓||✓|
|intitle:[text] or allintitle:[text]||Finds pages that include a specific keyword as part of the indexed title tag. You must include a space between the colon and the query for the operator to work in Bing.||✓||✓||✓||✓|
|allinurl:[text]||Finds pages that include a specific keyword as part of their indexed URLs.||✓|
|meta:[text]||Finds pages that contain the specific keyword in the meta tags.|
|filetype:[file extension]||Searches for specific file types.||✓||✓||✓|
|intext:[text], allintext:[text], inbody:[text]||Searches text of page. For Bing and Yahoo the query is inbody:[text]. For DuckDuckGo the query is intext:[text]. For Google either intext:[text] or allintext:[text] can be used.||✓||✓||✓|
|inanchor:[text]||Search link anchor text||✓|
|location:[iso code] or loc:[iso code], region:[region code]||Search for specific region. For Bing use location:[iso code] or loc:[iso code] and for DuckDuckGo use region:[iso code].An iso location code is a short code for a country for example, Egypt is eg and USA is us. ISO 3166-1 - Wikipedia||✓||✓|
|contains:[text]||Identifies sites that contain links to filetypes specified (i.e. contains:pdf)||✓|
|altloc:[iso code]||Searches for location in addition to one specified by language of site (i.e. pt-us or en-us)||✓|
|feed:[feed type, i.e. rss]||Find RSS feed related to search term||✓||✓||✓|
|hasfeed:[url]||Finds webpages that contain both the term or terms for which you are querying and one or more RSS or Atom feeds.||✓||✓||✓|
|ip:[ip address]||Find sites hosted by a specific ip address||✓||✓|
|language:[language code]||Returns websites that match the search term in a specified language||✓||✓|
|book:[title]||Searches for book titles related to keywords||✓|
|maps:[location]||Searches for maps related to keywords||✓|
|linkfromdomain:[url]||Shows websites whose links are mentioned in the specified url (with errors)|