ISDS 2001 Chapter : Course Outline Chapter 5 Text And Web Mining SPR13
Document Summary
Chapter 5, Text and Web Mining
Opening vignette: mining text for security and counterterrorism (problem, solution, results, what we can learn from this vignette).
2: Definition: text mining is the semiautomatic process of extracting patterns from large amounts of unstructured data sources. Search engines use known relationships to find documents, whereas text mining aims to discover new patterns. Information can be gained by "sifting" through court orders (law), discharge summaries (medicine), patent files (technology), customer comments (marketing), and quarterly reports (finance), to name a few.
3: Email: text mining can be applied to messages or e-mails to route them to the most appropriate party to process that message. Spam filters examine words in the subject line. A good subject line encourages people to open an email, but care must be taken to keep the message from being filtered. According to SiteSell.com, maker of the free email marketing tool SpamCheck, words such as.
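The routing idea in point 3 can be illustrated with a minimal sketch. It assumes a simple keyword-count approach, which is only one of many possible techniques and is not taken from the chapter. The party names, keyword sets, and the `route_message` function are all hypothetical, chosen purely for illustration.

```python
import re
from collections import Counter

# Hypothetical parties and keywords (assumptions, not from the chapter).
ROUTES = {
    "billing": {"invoice", "refund", "payment", "charge"},
    "support": {"error", "crash", "password", "login"},
    "sales": {"quote", "pricing", "demo", "purchase"},
}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def route_message(text, routes=ROUTES, default="general"):
    """Return the party whose keywords occur most often in the message.

    Falls back to `default` when no keyword matches at all.
    """
    counts = Counter(tokenize(text))
    scores = {
        party: sum(counts[word] for word in keywords)
        for party, keywords in routes.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else default


if __name__ == "__main__":
    print(route_message("Please send a refund for this invoice"))
    print(route_message("Hello there"))
```

A real system would likely go beyond exact keyword matches, for example by stemming words or training a classifier on previously routed messages, but the sketch shows the basic step of turning unstructured message text into a routing decision.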