DIGITAL GOVERNMENT PROJECT Progress Report: April 24, 2000 The Research Team NISS: Alan Karr, Ashish Sanil, Jaeyong Lee [, James CMU: Adrian Dobro, George Duncan, Stephen LANL: Sallie Keller–McNulty MCNC: Bonnie Parrish, Karen Litwin, Syam Sun- Build a Web-based query system that
1. Is dynamic and history-dependent
2. Dispenses statistical analyses rather than
3. Uses statistical technology to preserve con-
Implement the system on “live” Federal agency Understand how the system is used and performs Evaluate disclosure risk models and risk reduction
strategies at realistic scales, using the systemas testbed
Summary of Progress to Date
• Algorithms for geographic (or other) aggregation (Sanil,
• Statistical implications of aggregation (Lee, Sanil, Karr)• Prototype table server design (Karr, Sanil, Hilden–Minton)• QHDB schema for table server (Sanil, Karr, Hilden–Minton)• NASS prototype under construction (Karr, Lee, Sanil,
• Scalability of methods to compute bounds (Fienberg,
• Bayesian framework for confidentiality protection (Dun-
• Confidentiality Reading Group, involving NISS, RTI, other
• Interactions with other DG projects (Columbia, UNC)
Table Server Prototype Data: Sample Census data set with
• 8 (after trimming) categorical variables: Age,
Education, Employer type, Marital status,
Query: Sub-table of full 8-way table Response: Requested sub-table (FTP, character dis-
play, visualization) or statement that it cannot
Problem Conceptualization New Release ≡ Movement of Frontier Risk Criteria
• Predictive capability for sensitive variable
• Accuracy of IPF reconstruction of full table
• [Accuracy of LP bounds on cell entries]
System Design
• Visualization as a means of risk reduction
• Visual interfaces incorporating association
Fienberg/Dobro Progress: Formal results on bounds for tables and their rela-
tionship to log-linear model and graphical structures. New theorems for the "decomposable case" and ex-tensions that reduce the bounding problem to smallerdimensional components.
With Duncan, exploration of formal structures requiredto weight the tradeoff between disclosure risk and so-cietal gains from data release, using a formal Bayesianinformation theoretic approach. Current Challenges: Scaling up the results so that they are
computationally feasible for actual government sur-vey settings. Products: Papers (PNAS); code to be incorporated in table Duncan/Keller–McNulty Progress: Initial steps toward formal Bayesian decision–the-
oretic framework for confidentiality protection throughdisclosure limitation. The framework explicitly incorpo-rates disclosure risk and data utility. It also permits thecomparison of disclosure limitation through matrix mask-ing and generation of synthetic data.
With Fienberg, exploration of formal structures requiredto weight the tradeoff between disclosure risk and so-cietal gains from data release, using a formal Bayesianinformation theoretic approach. Current Challenges: Formally analyze the impact on dis-
closure risk and data utility of data swapping. Bet-ter understand synthetic data as a disclosure limitationtool. Develop associated procedures for disclosure riskestimation and disclosure limitation that scale. Products: New algorithms. Review paper on confidential-
ity and disclosure limitation, to be published in the In-ternational Encyclopedia of the Social and BehavioralSciences (Duncan). The Next Six Months
• Complete NASS prototype; write associated
• Functional table server prototype with dynamic
risk estimation and visualizations. Major scal-ability questions will remain.
• Initial concepts of query, risk, response for re-
• [Initial consideration of longitudinal data]
Journal of Nutritional & Environmental MedicineMay 2007; 16(2): 149–166MARGARET MOSS, MA (CANTAB), UCTD (MANCHESTER), DIPION, CBIOL,MIBIOL, Director of the Nutrition and Allergy Clinic11 Mauldeth Close, Heaton Mersey, Stockport, Cheshire SK4 3NPAbstractPurpose: To collate evidence on nutrient deficiencies caused by drugs. Design: Search of Medline and other databases, and published litera
Use of Life Cycle Assessment in Evaluating Solvent Recovery Alternatives in Pharmaceutical Manufacture William A. Carole, C. Stewart Slater, Mariano J. Savelski*, Timothy Moroz, Anthony Furiato, Kyle Lynch Rowan University, Dept. of Chemical Engineering 201 Mullica Hill Rd., Glassboro, NJ 08028, USA Keywords: pharmaceutical manufacture, solvent recovery, pervaporation, life cycle asse