Cpact home page CPACT Logo  
-
welcome
Plant Feature Detection and Performance Monitoring
Project Objectives
  • To demonstrate the beneficial application of Multivariate Statistical Process Control (MSPC) and Plant Performance Monitoring in process manufacturing and in pilot plant developments.
  • To provide a contribution to a better understanding of process operations.
  • To predict the final quality of the product from the more rapidly monitored process variables.
  • To establish a protocol for the implementation of multivariate statistical process performance monitoring.
  • To provide showcase demonstrators that will illustrate the power of the techniques across a range of application sectors.

Achievements

The project objectives were met and exceeded both in terms of undertaking internationally recognised research, and catalysing the introduction of innovative manufacturing methodologies within Quest International, BP, Pfizer, SmithKline Beecham and Zeneca. The research work focused on meeting the industrial manufacturing challenges defined by the end-user companies. This necessitated extending the state-of-the-art in process performance monitoring and process diagnostics. Achievements with individual CPACT companies, in collaboration with the research team at Newcastle, are summarised below. The Research Associates and Masters students employed on Project 6 have all attained good positions in industry or else have gone on to study for a higher degree.

1 University of Newcastle (Centre for Process Analytics and Control Technology)

New developments in multivariate data analysis, modelling and process performance monitoring for both batch and continuous processes were all initially evaluated on benchmark simulations prior to their being applied to industrial challenges and data sets.

  • Different approaches for batch performance monitoring were evaluated - individual batch analysis, end of batch analysis and observation level batch analysis. It was concluded that the different techniques were complimentary and that no one methodology outperformed the other approaches.

  • A comparison between the tri-linear technique of parallel factor analysis (PARAFAC) and the bi-linear approach of multi-way performance monitoring was undertaken on a benchmark simulation of a batch polymerisation process. Advantages and limitations were identified with both approaches.

  • A new on-line batch-monitoring algorithm was developed for the tri-linear technique of (PARAFAC). This is based on a weighted scores in-filling procedure. Modified confidence bounds were developed for the PARAFAC algorithm. The methodology developed was shown to be equally applicable to bi-linear (multi-way) approaches. Considerable benefits have been identified with the revised algorithm in terms of the reduction in number of false alarms and clarity of identifying the source of the process change through contribution plots with associated bounds.
  • PARAFAC models are known to suffer from degeneracy, a comparative study that focused on the various PARAFAC algorithms (ALS, ASD and ACOVER) has been carried out and the advantages and disadvantages of the different algorithms assessed. Based on the results of this study, a solution to the problem of degeneracy has been researched.
  • The issue of sensitivity analysis for variable selection, for both predictive and performance monitoring models, has been investigated through use of principal component analysis. This approach complements that of process understanding that is essential in model building. This was applied to the Pfizer fermentation demonstration process.
  • Investigations into the modelling of non-linear processes using a number of local linear models are on going. Local models are combined through either through Bayesian validity or fuzzy membership functions. The aim is to be able to use linear approaches to model and monitor non-linear processes where the overall global non-linear model comprises several local linear models. The methodology is being evaluated on simulations of semi-batch processes including fermentation and polymerisation.
  • Plant wide, dynamic multivariate statistical process control is being progressed through simulation studies on the benchmark Tennessee Eastman problem. Tools investigated included dynamic state space models. Comparisons between dynamic PCA, dynamic PLS and canonical variate state space approaches is being carried out.

  • The dynamic modelling of complex non-linear processes through non-linear dynamic partial least squares (PLS) has been achieved. The methodology was demonstrated by application to a benchmark simulation of a pH neutralisation process. A new model-based predictive control approach based on the use of a dynamic PLS model has also been proposed.

  • The development and application of dynamic partial least squares (PLS) and dynamic multi-block PLS modelling methodologies for the prediction of important parameters in a biological waste-water treatment process has been progressed. Complimentary to this was the application of multivariate performance monitoring techniques to the SmithKline Beecham activated sludge treatment plant. This study demonstrated the power of the techniques for enhanced monitoring of such processes. CPACT Phase II will progress these initial studies through to a full plant application with GSK.
  • Comparison of a range of clustering techniques on shake flask and fermenter data in new drug development with Zeneca Pharmaceuticals. This analysis demonstrated the power of multivariate statistical data analysis and visualisation techniques in the discovery of new bio-therapeutics. A new algorithm for the equalisation of batch lengths based on dynamic time warping was developed and applied to the analysis of the Zeneca drug development data where the batches differed considerably in length and data was only available on a limited number of batches.
  • A methodology previously proposed to evaluate prediction intervals for neural network models has been modified and extended to linear and non-linear algorithms. The intervals are computed using a first-order Taylor series expansion and the Jacobian matrix of the functional mapping provided by the PLS algorithms.
  • The continued development of MATLAB© based Multivariate Statistical Process Control software, PreScreen, MultiDAT and BatchDAT. The software was issued to those partners actively involved in the process performance monitoring project.

2 BP (Grangemouth)

  • Data from three plants were analysed using multivariate statistical data analysis techniques - Innovex (low density polymer); Rigidex (high density polymer); and PIB (high viscosity polymer).
  • The standard process performance monitoring techniques of principal component analysis (PCA) and partial least squares were applied to data collected during the manufacture of high-density polyethylene (Rigidex) for the early detection of deviations from specification of melt index and density. A similar set of objectives was identified with respect to the Innovex process, i.e. early warning of process malfunctions. This study has also demonstrated how final product quality can be efficiently monitored using partial least squares (PLS). PLS also provides a technology for the on-line estimation of difficult, or expensive, to measure process variables, such as melt flow index.
  • For the PIB process, PCA models for each polymer grade were built, corresponding to the normal operating conditions for different production grades. Process control and other disturbances as well as grade transitions were identified as data points that exceeded the confidence bounds in the squared prediction error and T2 plots. Contribution plots were used to identify the variables responsible for different operating conditions and the sources of the disturbances. For modelling the polymer viscosity, PLS models were developed based on each grade separately as well as using a combination of grades to form generalised models.

3 BP (Hull)

  • The aim of this analysis was to identify the potential of multivariate statistical process control for optimising both production and control parameter values for the A4 reformer process at the BP Salt End Site. The main objective of this work was to stabilise and optimise the production of CO by providing a deeper understanding of the relationship between the NG composition and the CO produced. PCA and PLS were initially applied to the data. A comparison was then made between these individual monitoring models and those based on the novel generic modelling approach using a pooled-covariance matrix. The comparison was based on how quickly the disturbance (N2 and CH4 in the NG) could be detected. The success of this work is reflected in that a working process performance model has been developed for the A4 reformer process and is to be validated at-line using a real-time, on-line MATLAB based performance monitoring tool-kit and subsequently using MDC’s MSPC+® as part of CPACT Phase II.

Overall the work in Project 6 with BP has shown that multivariate statistical projection based data analysis and data visualisation techniques are powerful tools that can provide additional insight into the manufacturing process and manufacturing issues. These complement the experience of the plant engineers, at the same time enhancing process supervisors and operators involvement in the drive towards manufacturing excellence.

4 Pfizer

  • With increasing global competition, it is of critical importance that each fermentation produces high quality product at maximum yield. The Pfizer challenge was to improve the consistency and yield on a large-scale fermentation process. Initially a sensitivity analysis was carried out to identify the key variables that determine process behaviour. The results were confirmed from process understanding. Two strategies to the monitoring of a batch type process were then investigated, multi-way principal component analysis (MPCA) and batch observation level analysis. Both methods identified those batches that had low yields. Of particular interest was the fact that the two approaches identified different batches as non-conforming. This is not surprising as the process is particularly complex as well as being highly regulated. The analysis showed that MPCA and batch observation level analysis was complementary in terms of process performance monitoring. A further outcome of this work was the conclusion that the data visualisation methods developed are appropriate for use by fermentation personnel as process deviations can be highlighted and dealt with immediately, thereby ensuring that the process is returned to good operational conditions with minimum loss of yield. The success of this work is reflected in the development of an on-line performance monitoring scheme in MATLAB for evaluation on the production process.

  • A second study addressed the Pfizer waste water plant with the aim of demonstrating the potential of multivariate statistical data analysis and visualisation techniques to provide an enhanced understanding of an activated sludge wastewater treatment plant.

5 Zeneca (Alderly Edge)

  • A major challenge in the development of new bio-therapeutics is the extraction of hidden information in data collected during the innovation process. A number of statistical feature extraction techniques were investigated including linear and non-linear Principal Component Analysis, Factor Analysis, K-Means Clustering, Fuzzy C-Means, Projection Pursuit and Hierarchical Clustering and the Kohonen Self-Organising Map. The data clustering methods were applied in an exploratory manner to data gathered from a series of experiments under known conditions on shake-flasks and laboratory scale fermentations. None of the methods investigated were found to significantly outperform the others and the cluster results were consistent with one another. In terms of ease and clarity of interpretation, Factor Analysis, slightly outperformed the other approaches. It is believed that these tools, used in conjunction with biologists’ knowledge, can offer major benefits to laboratory scientists in terms of improved understanding of experiments during the design of new bio-therapeutics. The information extracted was shown to enhance the understanding of these complex processes and help biologists optimise process development.

6 SmithKline Beecham

  • The early detection and monitoring of the presence of impurities that can form in reacting mixtures is of key importance in the bio-chemical, fine chemical, polymer and pharmaceutical industries. The study with SB identified a number of challenges that require to be addressed to isolate the key factors that potentially contribute to the presence of process impurities. Multi-way PCA was applied to batch data collected on a drug intermediate. A number of interesting features were identified and addressed leading to an enhanced understanding as to the cause of increased impurity levels. The inclusion of agitator speed resulted in clustering related to different speeds causing the monitoring model to be insensitive to the detection of subtle process events. By applying a recently developed multi-product modelling approach, this issue was addressed. Through the application of a novel data augmentation approach more subtle features in the data were extracted, allowing a more representative model of the process to be constructed. The study succeeded in providing an enhanced understanding of the process and the identification of the key parameters and operational procedures related to reactant impurity levels. The success of this work has lead to the award of an EPSRC Mathematics CASE PhD studentship with SB.

  • Biological wastewater treatment plants are increasingly being used for the on-site processing of industrial effluent streams prior to discharge. The application of multivariate statistical techniques to these processes is an attractive solution to achieving a deeper understanding of their complex operation, thereby achieving enhanced operational performance. The aim of this work was to demonstrate the applicability of multivariate statistical methods for the modelling of the process outputs and monitoring of the performance of an industrial activated sludge wastewater treatment plant. Nominal models based on normal operating conditions were built. These successfully detected faulty process operation. Satisfactory one-step ahead predictions of important process output variables were obtained through dynamic PLS models. Improved modelling and the identification of those particular plant areas (blocks) responsible for faulty operation were achieved using multi-block PLS. Such results can be used as a feedback to process engineers to enhance process knowledge, reduce the time taken to identify the causes of process problems and abnormal operations as well as identifying important variables that might be additionally monitored. The success of this work is reflected in SB wishing to move towards monitoring the process on-line in real time as part of CPACT Phase II.

7 Quest International

  • Quest International operates a fed-batch yeast propagation process. This involves growing Saccharomyces cerevisiae on a complex substrate using a relatively sophisticated control system. A high degree of process knowledge is required to understand and control the interactive response of the yeast to the substrate matrix such that the desired yeast characteristics match required quality parameters. Current understanding by Quest personnel is extensive but due to the complex interaction of matrices, problems with quality parameters or yield results can take too long to track to source. The multivariate statistical projection techniques of principal component analysis and projection to latent structures were applied to extract the information inherent within the data. The benefits of the technology were seen in the recognition of a number of previously unidentified features. For example differences were identified between the two fermenter houses, these were traced initially to capacity differences and then more subtly, to their respective control systems; clustering occurred between recipes which was attributable to recipe variability and for certain recipes, differences in recipe adjustments; and fermenter vessel and molasses effects were identified for individual recipes. Overall the potential of the methodology to explain process variability was realised and new insights into variable interactions was achieved. This study resulted in increased process knowledge and understanding.

8 Software Development

  • MATLAB software developments in data pre-screening (PreScreen) and multivariate statistical process modelling and control (MultiDAT) to Project 6 members under a University Evaluation Agreement which does not allow their use for commercial gain. BatchDAT, a Batch Process MSPC package is in the final stages of validation prior to being available to CPACT companies.

Manufacturing Excellence and CPACT

At recent meetings of Projects 5 and 6, views were expressed that there is a strong interdependence between Projects 5 and 6 and parts of Project 2. Through their conjunction, it is possible to tackle "plant-wide manufacturing problems". The "3P" issues of Product development, Process development and Product manufacturing (vertical integration) combined with academic research and in association with analytical and control systems providers provide access to new technologies across the "3P" industrial problems (horizontal integration).

In this respect the CPACT contributions come from:

  • The importance of using multivariate statistical methods for data exploration, feature extraction and to obtain an enhanced understanding of the process (Project 6).
  • The development of advanced linear and non-linear multivariate statistical methodologies (Project 6).
  • The application of plant-wide process control and optimisation, as opposed to single unit optimisation (Project 5).
  • The need to make real use of more detailed information on the chemical and physical properties of the material being processed (Project 2).
  • The need to be able to develop process models and calibration models from small data sets (Project 2).
  • The importance of being able to monitor the whole manufacturing route from product inception and development, through pilot studies to final product manufacturing and customer feedback (Project 6).

It was suggested the CPACT company members all had these issues in one form or another – i.e. they were generic and CPACT was unique in being able to address these with a range of tools capable of being used to provide generic solutions.

The Japanese manufacturing engineer will first look for a statistical tool to identify the cause of the problem. The European Process Engineer will attempt to solve the problem by modeling from first principles. CPACT is unique in the UK, and possible Europe, in that it embraces both of these approaches

Projects 5 and 6 recommend that CPACT embraces the best of the Eastern and Western approaches to problem solving. This can be achieved through the establishment of an industrially led generic programme focussing on contributions to "Manufacturing Excellence".

diagram

Project Objectives and Deliverables

  • To demonstrate the beneficial application of MSPC in process manufacturing and in pilot plant developments. (Delivered).
  • To provide a contribution to the better understanding of process operations. (Delivered)
  • To predict the final quality of the product from the more rapidly monitored process variables. (Delivered).
  • To provide showcase demonstrators that will illustrate the power of the techniques across a range of application sectors. (On-going).
  • To establish a protocol for the implementation of MSPC. (Delivered outside CPACT).
  • To explore the need for, and potential applications of, non-linear multivariate statistical process monitoring. (Delivered outside CPACT).

Document Ref: 01/P5/P6/Execsum
Issued: 26 February 2001

Contact us
Introduction
Partners
Research areas
News
Events
Vacancies
Search this site
Members only area
CPACT Logo
CPACT home
CPACT home
-