Skip directly to search Skip directly to A to Z list Skip directly to site content Skip directly to page options
CDC Home

Linking up with the NCHS Research Data Center

A collective blog to foster communication among RDC

Share
Compartir

Select Month: January 2013

RDC Best Practices: Original versus Derived Variables

As an RDC analyst, I would like to share with you some advice I give to all researchers whose proposals are assigned to me.

As you know, it is a researchers’ responsibility to extract and send public-use NCHS and non-NCHS data to the RDC to be merged with restricted variables by their analyst. It is recommended that you familiarize yourselves with NCHS data by doing preliminary analysis with the public use data. Often you may need to rename, recode or re-categorize variables for your analysis. It may seem like a good idea to send us a public use dataset with derived variables instead of the original ones. I strongly urge you not to do this.

While it is not against RDC rules to send us recodes instead of original variables, doing so may lead to extra work for your RDC analyst as well as delays and extra charges. There are two examples that come to mind.  On both proposals, researchers sent in derived variables instead of the original ones. Researchers working on the first proposal made a mistake while creating derived variables. With regard to the second proposal, the Student advisor changed her mind about the grouping of analytic variables and the researchers needed to categorize them in a different way. Since the original variables were not included in the data sets the researchers sent to the RDC, the researchers had to resend the public use datasets with original variables and I had to redo the merge. This resulted in delays for both projects and additional data setup fees.

The conclusion is: send your analyst the original variables, instead of the derived ones! Following this simple rule will save RDC analysts time as well as your time and money. If you want to create derived variables and keep them on your permanent analytic dataset, just send us your programming code to create such variables. Your RDC analyst can either run the code while creating your analytic dataset or put your code into your folder along with the data so that you can create the derived variables yourselves.

Signed RDC Analyst

 
Contact Us:
  • Centers for Disease Control and Prevention
    1600 Clifton Rd
    Atlanta, GA 30333
  • 800-CDC-INFO
    (800-232-4636)
    TTY: (888) 232-6348
  • Contact CDC–INFO
USA.gov: The U.S. Government's Official Web PortalDepartment of Health and Human Services
Centers for Disease Control and Prevention   1600 Clifton Rd. Atlanta, GA 30333, USA
800-CDC-INFO (800-232-4636) TTY: (888) 232-6348 - Contact CDC–INFO
A-Z Index
  1. A
  2. B
  3. C
  4. D
  5. E
  6. F
  7. G
  8. H
  9. I
  10. J
  11. K
  12. L
  13. M
  14. N
  15. O
  16. P
  17. Q
  18. R
  19. S
  20. T
  21. U
  22. V
  23. W
  24. X
  25. Y
  26. Z
  27. #