Pageviews - Product Overview

About

This proprietary clickstream dataset captures granular, zero-party browsing data from our panel of opted-in users, with a stable 3-year history. The data is obtained through our own desktop browser and mobile app, and an extension to Chrome and Firefox.

Gener8's clickstream data product stands out due to its unique data acquisition method. The data is primarily sourced through the active consent of a panel of users who opt to share their data, by using Gener8's B2C apps. This approach ensures the data's authenticity and quality, setting it apart from traditional data collection methods.

This rich dataset caters to various use-cases and verticals within the digital economy. It's particularly valuable for businesses, market researchers, and analysts seeking insights into consumer behavior, market trends, and competitor analysis. The primary use-cases encompass company performance, customer profiling, market research, sentiment and engagement analysis, and product development.

Whether utilised on its own or in conjunction with Gener8's other datasets, e-receipt data plays a pivotal role in helping businesses make data-driven decisions and gain a competitive edge in the digital landscape.

Schema

NameTypeOptionalDescription
idSTRINGNPermanent and unique ID for each pageview.
received_atTIMESTAMPNUTC timestamp of when Gener8 received the pageview
timestampTIMESTAMPNUTC timestamp of when the pageview occurred
url_protocolSTRINGNThe protocol component of the pageview URL
url_domainSTRINGNThe domain component of the pageview URL
url_pathSTRINGYThe path component of the pageview URL
url_querySTRINGYThe query component of the pageview URL, if there was one
titleSTRINGYWebpage title, taken from the head element
referrer_protocolSTRINGYThe protocol component of the referring URL, if there was one
referrer_domainSTRINGYThe domain component of the referring URL, if there was one
referrer_pathSTRINGYThe path component of the referring URL, if there was one
referrer_querySTRINGYThe query component of the referring URL, if there was one
active_durationINTEGERYTime spent by the user actively browsing the webpage, reported in seconds.
deviceSTRINGYThe type of device the pageview came from. Either Desktop or Mobile.
key_phraseSTRINGYThe extracted search term from the request, available for a select number of domains, such as Google Search.
countrySTRINGYGeocoded two letter country code the pageview occurred in, based on client IP address at the time.
regionSTRINGYGeocoded region the pageview occurred in, based on client IP address at the time.
citySTRINGYGeocoded city this pageview occured in, based on client IP address at the time.
postal_codeSTRINGYGeocoded postal code this pageview occured in, based on client IP address at the time. Not available for all regions.
latitudeFLOATYGeocoded latitude, based on client IP address at the time
longitudeFLOATYGeocoded longitude, based on client IP address at the time
user_idSTRINGNPermanent and unique user ID.
tab_idINTEGERYThe tab identifier provided by the user's browser
timezoneSTRINGYThe timezone the pageview was made from
user_agentSTRINGYThe user agent of the browser from which the pageview was received

Delivery

Method

Frequency

Format

Parquet + Gzip

Sample

Available on request