Explore various approaches to organize and extract useful text from unstructured data using Java
About This Book
- Integrate basic tasks to tackle more complex NLP problems
- Train NLP models to address domain-specific problem areas
- Learn to use a variety of core NLP techniques with this pragmatic guide
Who This Book Is For
If you are a Java programmer who wants to learn about the fundamental tasks underlying natural language processing, this book is for you. You will be able to identify and use NLP tasks for many common problems, and integrate them in your applications to solve more difficult problems. Readers should be familiar/experienced with Java software development.
What You Will Learn
- Develop a deep understanding of the basic NLP tasks and how they relate to each other
- Discover and use the available tokenization engines
- Implement techniques for end of sentence detection
- Apply search techniques to find people and things within a document
- Construct solutions to identify parts of speech within sentences
- Use parsers to extract relationships between elements of a document
- Integrate basic tasks to tackle more complex NLP problems
In Detail
Natural Language Processing (NLP) is an important area of application development and its relevance in addressing contemporary problems will only increase in the future. There has been a significant increase in the demand for natural language-accessible applications supported by NLP tasks.
Natural Language Processing with Java will explore how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. It covers concepts of NLP that even those of you without a background in statistics or natural language processing can understand.
Richard M Reese
Richard M Reese has worked in both industry and academics. For 17 years, he worked in the telephone and aerospace industries, serving in several capacities, including research and development, software development, supervision, and training. He currently teaches at Tarleton State University, where he is able to apply his years of industry experience to enhance his classes. Richard has written several Java and C books. He uses a concise and easy-to-follow approach to topics at hand. His books include EJB 3.1 Cookbook; books about new features of Java 7 and 8, Java Certification, and jMonkey Engine; and a book on C pointers.