Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Chemometrics for Process Analytics: Data Alignment
CPAC Summer Institute June 2011
Brian G. Rohrback Infometrix, Inc., Bothell, WA
2011 CPAC Summer Institute
History of a CPAC initiative
April 2000: Analect users meeting Started with focus on on-line
Matlab computing standards March and May 2001: Pittcon
and CPAC meetings Broadened vendor interest,
formalized as a CPAC initiative 2002 - 2004: IFPAC meetings
End-users, vendors, academics: focus on accessibility, FDA
2005 - 2007: IFPAC meetings Product introductions, integration,
feedback 2008 - 2011+: IFPAC meetings
Second generation COPA products, services
• Model formats are more accessible
• Software has been created to allow use of chemometric models in more flexible and more seamless ways
2011 CPAC Summer Institute
Poll of Process Users
1. Analytical failure prediction 2. Result validation 3. More process-specific information (timely, higher quality,
more focused) 4. Simplification of procedures 5. Elimination of analytical discrepancies 6. Reduction in the lifecycle cost (cost of ownership)
Driven in part by dwindling manpower, skill sets and capabilities !
3
2011 CPAC Summer Institute
The Data Processing Bottleneck
Ultimately, the task will be to quantitate or classify samples Identifying the origin of a sample Assessing some physical property or proportion Unfolding a mixture into the relative proportion of ingredients
(which can be mixtures themselves)
Accurate peak identification is undermined by retention time variability
All chromatography is plagued by peak shifting Chemometrics provides a means for significantly reducing
retention time inconsistency
The goal: simplified and objective quality control
4
2011 CPAC Summer Institute
The key is multivariate instrumentation
You can’t control what you don't measure All measurements are inferentials Multiple measurements allow tighter specifications –
multivariate instruments Timeliness is key
Turnaround speed of the instrument On-line versus near-line versus off-line
Elimination of the analyst Real-Time Release – real time data analysis and automated, application-specific interpretation
5
2011 CPAC Summer Institute
Chemometrics for instrumentation: the value proposition
Anything you can do to improve precision of the multivariate measurements collected by the instrument will allow you to tighten the control – essentially for free.
One way is to construct an application-specific, objective evaluation system: Experimental design Exploratory data analysis
Leading to Multivariate modeling (qualitative and quantitative analysis)
Just as key is the signal processing aspect of chemometrics to reduce instrument-derived variability Within an instrument (e.g., noise reduction) Between instruments (i.e., transfer of calibration)
6
2011 CPAC Summer Institute
Alignment Technology
What if we can make all of our instruments look as much alike as possible?
Interchangeability Common interpretive base
7
2011 CPAC Summer Institute
Raw process chromatograms
Full Data;2
0 10 20 30 40Time Index (E +03)
0.0
0.1
0.2
Res
pons
e
8
2011 CPAC Summer Institute
The same chromatograms after alignment
Full Data;2
0 10 20 30 40Time Index (E +03)
0.0
0.1
0.2
Res
pons
e
9
2011 CPAC Summer Institute
Product Development Trail
Voice Recognition Technology (academic articles) Translation into Chromatography (again academic)
“Did not work”, use of traditional technology (version 1)
Interaction with End User Group Version 2, start to deploy Identification of high value targets Recognition of barriers to large-scale deployment
Enter CPAC Contacts, expertise (publications) Low learning curve
2011 CPAC Summer Institute
38 samples of M. simiae (5 labs)
11
2011 CPAC Summer Institute 12
Area Summation
2011 CPAC Summer Institute 13
Area Summation
aligned
2011 CPAC Summer Institute
1H NMR in the Aromatic Region
NMR Spectra
14
2011 CPAC Summer Institute
1H NMR in the Aromatic Region
NMR Spectra - Aligned
15
2011 CPAC Summer Institute
QC of x-ray contrast agents Original HPLC data
Aligned HPLC data
16
2011 CPAC Summer Institute
QC of x-ray contrast agents
17
2011 CPAC Summer Institute
QC of x-ray contrast agents
18
2011 CPAC Summer Institute
PCA as a basis for interpretation
PC1
PC2
PC3
19
2011 CPAC Summer Institute
PC1
PC2
PC3
PCA as a basis for interpretation
20
2011 CPAC Summer Institute
On-line Simulated Distillation
21
400 Samples Un-Aligned
Same Samples Aligned
2011 CPAC Summer Institute
Comparison of PCA scores
85% of all of the variation in the raw data is due to the misaligned peaks.
Correcting for this shows us that there are three different production regimes in these data.
Before alignment After alignment
22
2011 CPAC Summer Institute
Automated alignment works
5 year period, 6 GCs
23
2011 CPAC Summer Institute
Automated alignment works
aligned
5 year period, 6 GCs
24
2011 CPAC Summer Institute
Same Sample run on multiple instruments (runs 7 weeks apart)
aligned
2011 CPAC Summer Institute
200
300
400
500
600
700
800
0 10 20 30 40 50 60 70 80 90 100Yield, wt%
BP,
F
2 minute run
Yield Curves Match
µGC, Lab, and Process
26
2011 CPAC Summer Institute
Continuous validation of a multivariate instrument We can correct retention times to match an application-
specific relevant sample This eliminates the transfer of calibration problem in
chromatography Common regression and classification algorithms can be
applied automatically to infer physical properties or characteristics
This allows us to bring more complex analyses into on-line use
27
Chemometrics for Process Analytics: Data AlignmentHistory of a CPAC initiativePoll of Process UsersThe Data Processing BottleneckThe key is multivariate instrumentationChemometrics for instrumentation: �the value propositionAlignment TechnologyRaw process chromatogramsThe same chromatograms after alignmentProduct Development Trail38 samples of M. simiae (5 labs)Area SummationArea SummationNMR SpectraNMR Spectra - AlignedQC of x-ray contrast agentsQC of x-ray contrast agentsQC of x-ray contrast agentsPCA as a basis for interpretationPCA as a basis for interpretationOn-line Simulated DistillationComparison of PCA scoresAutomated alignment worksAutomated alignment worksSame Sample run on multiple instruments� (runs 7 weeks apart)Yield Curves MatchContinuous validation of a multivariate instrument