Skip to contents

Data set for more than 400 US counties containing CLI (COVID-19-like-illness) incidence derived from two surveys, and a reference signal as reported by JHU CSSE. This example data is a snapshot as of September 21, 2020, and ranges from April 11, 2020 to September 01, 2020.

The reference signal case is based on reports made available by the Center for Systems Science and Engineering at Johns Hopkins University.

One survey was run by Google, in partnership with Delphi.

The other survey, the COVID-19 Trends and Impact Survey, was run by Delphi in collaboration with Facebook.

Data is reported for counties that had at least 200 cumulative COVID-19 cases on May 14, 2020, according to JHU CSSE.

Usage

county_smoothed_cli_comparison

Format

An epiprocess::epi_df (object of class c("epi_df", "tbl_df", "tbl", "data.frame")) with 63840 rows and 5 columns.

Source

This object contains a modified part of the COVID-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University as republished in the COVIDcast Epidata API. This data set is licensed under the terms of the Creative Commons Attribution 4.0 International license by Johns Hopkins University on behalf of its Center for Systems Science in Engineering. Copyright Johns Hopkins University 2020.

Modifications:

  • From the COVIDcast Epidata API: The signal confirmed_cumulative_num was used to determine eligibility for inclusion. The signal confirmed_7dav_incidence_prop was computed by Delphi from the original JHU-CSSE data by calculating moving averages of the preceding 7 days, so the signal for June 7 is the average of the underlying data for June 1 through 7, inclusive.

  • Furthermore, the data has been limited to a specific time range, the signal names slightly altered, and formatted into an epi_df.

This object contains a modified part of the data aggregations in the API that are prepared from the COVID-19 Trends and Impact Survey; see the first link for more information on citing in publications. The data is made available via the Delphi Epidata API.

These aggregations are licensed under the terms of the Creative Commons Attribution license.

Modifications:

  • The data has been limited to a very small number of rows, the signal names slightly altered, and formatted into an epi_df.

This object contains a modified part of the Google symptom surveys. Aggregations based on the survey are licensed under the terms of the Creative Commons Attribution license.

Modifications:

  • The data has been limited to a very small number of rows, the signal names slightly altered, and formatted into an epi_df.

Data dictionary

The data has columns:

geo_value

The 5-digit county FIPS code associated with each row of measurements.

time_value

The time value, a date in YYYY-MM-DD format

goog

Seven-day average of CLI (covid-like-illness) cases from the Google survey

fb

Seven-day average of CLI (covid-like-illness) cases from CTIS

case

Reference signal. Seven-day average of CLI (covid-like-illness) cases