Computing Tools
 Data Resources    
 Staff Services  
 Contacts   
 Home

Data Management

Data management includes the procurement, preservation, manipulation and documentation of data used in research. The support goals in this area include:

  • Assistance with Data Acquistion - this includes ICPSR data orders and other acquistions that can be undertaken through the Social Science Data Center or Perkins Library.
     
  • Archival Support - assisting with data perservation or in making data publicly available.
     
  • Instruction - identifying experts in the use of particular data management tools or techniques and facilitating opportunities for them to teach short courses or workshops.
     
  • Tutorials - on data management technique, locally prepared or derived from pointers to other sites. Data management technique is conditioned by the choice of tool (statistical package or programming language). But within each programming environment, there are general sets of issues that come up:
     
    • Input of raw data
      • Rectangular files
      • Hierarchical files
      • Numeric versus character data input
      • Input formats - free field, fixed field, specialized informat
      • Record formats - fixed, variable
    • Derivation of new measures (variables)
      • Assigning new variables
        • Numeric data manipulations
        • Character data manipulations
        • Date and time manipulations
      • Conditional variable assignments
      • Handling missing data
    • Data set transformations
      • Subsetting
      • Sorting and BY group processing
      • Concatenating
      • Merging
      • Aggregating
      • Transposing
      • Sampling

       



Webmaster:socsciweb@aas.duke.edu