DETECTION OF DE-CHALLENGE IN SPONTANEOUS REPORTING SYSTEMS


INTRODUCTION

Causality assessment (CA) is a method used in pharmacovigilance to evaluate the relationship between drugs and reported adverse drug reactions (ADR). It involves identifying the drugs causing ADR and evaluating the reaction of dechallenge. Dechallenge is a response observed in a patient, such as a reduction or disappearance of ADR after drug withdrawal. There are two types: positive dechallenge, which resolves with drug withdrawal, and negative dechallenge, which follows its own course. Rechallenge is essential to confirm the cause and relationship of ADR. Prescribers and patients may not follow typical procedures for rechallenge, and prescribers prefer dechallenge over rechallenge.

Data mining is a statistical approach used to discover useful patterns from large amounts of data, including algorithms for classification, prediction, clustering, and association. The Multi-item Gamma Position Shrinker is used by the Food and Drug Administration (FDA) in the US for signalling potential ADRs. Naive Bayes (NB) is a statistical classifier used for classification, but it assumes mutually independent variables. This method can lead to a simple prediction framework, but there is a possibility of zero probability. To overcome this issue, a standard technique like Laplacian correction was used.

 

MATERIALS AND METHODS:

The Medical Dictionary for Regulatory Activities (MedDRA) is an international medical terminology used to describe adverse events at five levels: System Organ Class (SOC), High Level Group Term (HLGT), Higher Level Term (HLT), Preferred Term (PT), and Lowest Level Term (LLT). In a study involving FDA ADRs from 2011 and 2012, data was extracted from the FDA's SRS, and duplicate reports were deleted. The FDA stored diseases category at the Preferred Terms (PT) level. Researchers suggested using a coarser-grained adverse event representation, SOC, than the PT level for data mining.

Data were loaded from the FDA's text file into an Oracle database using Extract, Transform, and Load (ETL) tools, and indications were constructed using patient identifiers. Records with SOC were considered for dechallenge classification. The attributes considered for dechallenge included disease categories, valid trade and verbatim names, outcomes like life-threatening (LT), death (DE), congenital anomaly (CA), hospitalization-initial or prolonged (HO), disability (DS), and required intervention (RI) to prevent permanent impairment.

 

The performance of the data mining algorithm was estimated using parameters like accuracy, error, precision, and Receiver Operating Characteristic (ROC) curve. Accuracy, precision, and error were calculated using Formulas 1, 2, and 3. The ROC curve was used to display the graphical representation of the perfect, liberal, random, and conservative performance of an algorithm.

                                     Formula 1:

                                    

                                     Formula 2:

 

                                     Formula 3:

 

RELATED WORKS:

Statistical data mining techniques have been applied in post-marketing surveillance, with various studies examining safety signal detection problems, data sources, and methods used in safety signal classification. Decision tree algorithms have been applied for drug safety classification. The Naive Credal Classifier, an extension of NB, is used for high uncertain information domains. Denis et al. propose a classifier for eliminating noise in datasets. Chen and Shengrui propose classification methods for high-dimensional data. Kotsiantis and Pintelas modified NB classifiers using bagging and boosting procedures. Zhang et al. proposed a novel model called Hidden Naive Bayes to avoid computational complexity.

 

DATA MINING MODEL:

The data mining model identified patients by unique numbers and analyzed drug and disease category outcomes. FDA classified drugs as 1 for valid trade and 2 for verbatim name. Disease categories were denoted at the SOC level using FDA's PT and MedDRA's PT using xml mapping.

 

THE ALGORITHMS USED:

Naive bayas

The fundamental assumption to attribute independence was considered in this study. Dechallenge attribute presented in FDA records were taken as class label for detection. NB theorem given in Formula 4 had been used to calculate the probability of an outcome. The class label attribute dechallenge had two distinct values (Yes, No) represented by hypothesis (H).

                                   

 

P(H/X) is the posterior probability where hypothesis (H) represents the presence of dechallenge with X as known disease category, drug code and outcome. P (X/H) is the posterior probability of X about H. P (H) is the prior probability of H regardless of disease category, drug code and outcome. P(X) is the prior probability of X.

For calculating the prior probability P (X), dechallenge record sets with 'unknown' category were filtered. Then the posterior probability was calculated based on outcome, disease category and drug code. The data set of 2011 and 2012 records contained the constraints mentioned for the failure of Naive Bayes classifier.

 

IMPROVED NAIVE BAYES:

The algorithm calculates the Influence Factor for attributes like drug code, disease category, and outcome on the label dechallenge, using Formula 5 to determine the dependability of an attribute value on the class attribute.

 

 

Where I(X/Ci) =Influence Factor

N(X\C i ) =Number of records in which attribute value X had the class label C i and N(Ci)=Total Number of records in which the class label were C i .The dataset was divided based on the class label dechallenge 'Yes' and 'No'. Influence factor for attributes with high values were taken and others ignored.

 

CONCLUSION:

The FDA and WHO's data lacks health science education and training for patient care. Post-marketing surveillance techniques like detecting dechallenge can help practitioners and prescribers understand drugs with reactions. Unknown samples should be properly classified for data analysis. FDA suggests dechallenge evidence as the most important criteria for causality reviews. Classification algorithms show that NB+ outperforms NB in accuracy and minimal error.

 

Student Name: Mohamed Salih Aslam

Student ID: 188/092023

Qualification: B.Pharmacy

e-Mail ID: msalihaslam@gmail.com

Comments

Popular Posts