Review Article | Clinics in Laboratory Medicine, Volume 28, Issue 1, P1-7, March 2008

Introduction to the Mining of Clinical Data

      The increasing volume of medical data online, including laboratory data, represents a substantial resource that can provide a foundation for improved understanding of disease presentation, response to therapy, and health care delivery processes. Data mining supports these goals by providing a set of techniques designed to discover similarities and relationships between data elements in large data sets. Currently, medical data have several characteristics that increase the difficulty of applying these techniques, although there have been notable medical data mining successes. Future developments in integrated medical data repositories, standardized data representation, and guidelines for the appropriate research use of medical data will decrease the barriers to mining projects.

