Reports detailing the active measurement needs and desired infrastructure capabilities as identified by the community, along with potential technical solutions we could explore.
This document compiles the datasets we've identified as having an impact (or potential impact) on enhancing the security stance of the foundational layers of Internet infrastructure. We encourage public feedback.
The goal of this Annotated Schema (AS), is to provide a limited ontology of annotations for dataset metadata that inform a prospective user of the classes, properties, and identifiers contained in the data.
Hoiho: Hostname-based Geolocation of IP addresses is an open-source tool released as a part of scamper. It uses CAIDA's Macroscopic Internet Topology Data Kit (ITDK) and observed round trip times to infer regular expressions that extract apparent geolocation hints from hostnames. The ITDK contains a large dataset of routers with annotated hostnames, which are used as input.
Spoofer is a suite of open-source software tools to assess and report on the deployment of source address validation (SAV) best anti-spoofing practices. This client-server system periodically tests a network's ability to both send and receive packets with forged source IP addresses (spoofed packets). The CAIDA Spoofer Data API provides a public data interface to the publicly shareable data collected by the Spoofer service.
DNS Zone Database (DZDB) is a platform providing access to time-series data derived from current and historical zone files provided by generic Top-Level Domains (gTLDs) participating in the Central Zone Data Service (CZDS) or directly by Registries Operators in compliance with corresponding license agreements.
BGP2GO: To facilitate security research and analysis, we study the feasibility of indexing numeric identifiers over time: We index BGP prefixes, ASNs, communities, and IP addresses to data sets in which they occur. Currently, we process all BGP update files from RouteViews' route collectors. We prototyped BGP2Go, a web application that assists in selecting and obtaining relevant MRT data sets for further analysis. We hope to extend it to include other types of data, e.g., RIR allocation files, DNS (OpenIntel), DNS data. Indexing more data will facilitate correlation of activities of an identifier across data sets.
Facilitating Advances in Network Topology Analysis (FANTAIL) system was developed to enable discovery of the full potential value of massive raw Internet end-to-end path measurement data sets, allowing researchers to use high-level queries to perform data processing and analysis tasks on matching traces without owning/operating a cluster, and without learning big data programming.
BGPStream is an open-source software for live and historical BGP data analysis, supporting scientific research, operational monitoring, and post-event analysis. It provides access to real-time and historical Routviews and RIPE RIS BGP data.
The RouteViews API is designed for network operators and researchers who require regular access to current RouteViews data for monitoring the global routing system. Traditionally, RouteViews collectors provided command-line access, enabling network operators to perform quick checks on BGP announcements and general reachability information. However, with the Internet's continued growth and the expanding size of both IPv4 and IPv6 routing tables, this direct command-line access has increasingly burdened the RouteViews collector infrastructure.
The RV API replaces the regular automated access that many operators and researchers rely on, alleviating the strain on collector resources. Additionally, it complements the BGP updates and RIB dumps available in the RouteViews BGP data archive, which can be accessed at https://archive.routeviews.org. By using the API, users can efficiently obtain the necessary routing data without impacting the performance of the RouteViews infrastructure.