Pageviews - Product Overview
About
This proprietary clickstream dataset captures granular, zero-party browsing data from our panel of opted-in users, with a stable 3-year history. The data is collected through our own desktop browser and mobile app, and through extensions for Chrome and Firefox.
Gener8's clickstream data product stands out for how its data is acquired. The data is sourced primarily from a panel of users who actively consent to share it by using Gener8's B2C apps. This approach supports the data's authenticity and quality, setting it apart from traditional data collection methods.
This rich dataset caters to various use-cases and verticals within the digital economy. It's particularly valuable for businesses, market researchers, and analysts seeking insights into consumer behavior, market trends, and competitor analysis. The primary use-cases encompass company performance, customer profiling, market research, sentiment and engagement analysis, and product development.
Whether utilised on its own or in conjunction with Gener8's other datasets, this clickstream data plays a pivotal role in helping businesses make data-driven decisions and gain a competitive edge in the digital landscape.
Schema
| Name | Type | Optional | Description |
|---|---|---|---|
| id | STRING | N | Permanent and unique ID for each pageview. |
| received_at | TIMESTAMP | N | UTC timestamp of when Gener8 received the pageview |
| timestamp | TIMESTAMP | N | UTC timestamp of when the pageview occurred |
| url_protocol | STRING | N | The protocol component of the pageview URL |
| url_domain | STRING | N | The domain component of the pageview URL |
| url_path | STRING | Y | The path component of the pageview URL |
| url_query | STRING | Y | The query component of the pageview URL, if there was one |
| title | STRING | Y | Webpage title, taken from the head element |
| referrer_protocol | STRING | Y | The protocol component of the referring URL, if there was one |
| referrer_domain | STRING | Y | The domain component of the referring URL, if there was one |
| referrer_path | STRING | Y | The path component of the referring URL, if there was one |
| referrer_query | STRING | Y | The query component of the referring URL, if there was one |
| active_duration | INTEGER | Y | Time spent by the user actively browsing the webpage, reported in seconds. |
| device | STRING | Y | The type of device the pageview came from. Either Desktop or Mobile. |
| key_phrase | STRING | Y | The extracted search term from the request, available for a select number of domains, such as Google Search. |
| country | STRING | Y | Geocoded two-letter country code the pageview occurred in, based on client IP address at the time. |
| region | STRING | Y | Geocoded region the pageview occurred in, based on client IP address at the time. |
| city | STRING | Y | Geocoded city this pageview occurred in, based on client IP address at the time. |
| postal_code | STRING | Y | Geocoded postal code this pageview occurred in, based on client IP address at the time. Not available for all regions. |
| latitude | FLOAT | Y | Geocoded latitude, based on client IP address at the time |
| longitude | FLOAT | Y | Geocoded longitude, based on client IP address at the time |
| user_id | STRING | N | Permanent and unique user ID. |
| tab_id | INTEGER | Y | The tab identifier provided by the user's browser |
| timezone | STRING | Y | The timezone the pageview was made from |
| user_agent | STRING | Y | The user agent of the browser from which the pageview was received |
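As an illustration of how the URL component columns and active_duration might be used together, here is a minimal Python sketch. It assumes that url_protocol is stored without a trailing "://" and that url_query is stored without a leading "?"; neither detail is stated in the schema, so check them against a sample. The helper names rebuild_url and active_seconds_by_domain are hypothetical, and the rows are made up for the example.

```python
from collections import defaultdict
from typing import Optional
from urllib.parse import urlunsplit


def rebuild_url(protocol: str, domain: str,
                path: Optional[str], query: Optional[str]) -> str:
    """Join the url_* columns back into a single URL string."""
    # Assumption: protocol has no "://" suffix and query has no leading "?".
    # Optional columns (url_path, url_query) may be null, so default to "".
    return urlunsplit((protocol, domain, path or "", query or "", ""))


def active_seconds_by_domain(rows):
    """Sum active_duration per url_domain, skipping rows where it is null."""
    totals = defaultdict(int)
    for row in rows:
        duration = row.get("active_duration")
        if duration is not None:
            totals[row["url_domain"]] += duration
    return dict(totals)


# Made-up rows for illustration only; not real panel data.
rows = [
    {"url_protocol": "https", "url_domain": "example.com",
     "url_path": "/shoes", "url_query": "size=9", "active_duration": 42},
    {"url_protocol": "https", "url_domain": "example.com",
     "url_path": None, "url_query": None, "active_duration": None},
    {"url_protocol": "https", "url_domain": "example.org",
     "url_path": "/", "url_query": None, "active_duration": 8},
]

first = rows[0]
print(rebuild_url(first["url_protocol"], first["url_domain"],
                  first["url_path"], first["url_query"]))
print(active_seconds_by_domain(rows))
```

Because url_path, url_query and active_duration are marked optional, the sketch treats nulls explicitly rather than assuming every row is fully populated.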
Delivery
Method
- Amazon S3
- Google Cloud Storage (GCS)
- Azure Blob Storage
Frequency
- Hourly
- Daily
- Weekly
- Monthly
- Quarterly
- On-Demand
Format
Parquet + Gzip
Sample
Available on request