A dataset containing demographic information of individuals with various details such as sex, age in days, weight in kilograms, and other related information.
Format
A data frame with 15 variables:
- primaryid
Unique identifier for each individual (
numeric).- sex
Sex of the individual (
factor), with levelsMfor male andFfor female.- age_in_days
Age of the individual in days (
numeric).- wt_in_kgs
Weight of the individual in kilograms (
numeric).- occr_country
Country where the event occurred (
factor).- event_dt
Date of the event (
integer), in the format YYYYMMDD.- occp_cod
Occupation of the reporter (
factor), with levelsCNfor consumers/patients,MDfor medical doctors,PHfor pharmacist,RNfor registered nurse,HPfor other healthcare professional,LWfor lawyer,OTfor other occupation.- reporter_country
Country of the reporter (
factor).- rept_cod
Report code (
factor), with levelsEXPfor expedited (i.e., a serious and unexpected reaction which should be reported within 30 days from the first reception by the pharmaceutical company),PERfor periodic (i.e., expected non-serious event, sent by the pharmaceutical company to the regulatory agency on a yearly basis),DIRfor directed reports (i.e., submitted directly by the reporter through the MedWatch form, without going through pharmaceutical companies).- init_fda_dt
First date of reception by the FDA (
integer), in the format YYYYMMDD.- fda_dt
Date of reception of the last update by the FDA (
integer), in the format YYYYMMDD.- premarketing
Logical indicator if the case is premarketing (i.e., from clinical trials) (
logical).- literature
Logical indicator if the case is from literature (
logical).- RB_duplicates
Logical indicator if the case is a duplicate based on the Rule Based algorithm (
logical).- RB_duplicates_only_susp
Logical indicator if the case is a duplicate based on the Rule Based algorithm considering only suspected drugs (
logical).
