Chunking
Textbooks
1. Speech and Language Processing - An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition
BY: Daniel Jurafsky and James H. Martin - Chapter 12
Comprehensive coverage of syntactic parsing and phrase chunking techniques.
2. Foundations of Statistical Natural Language Processing
BY: Christopher D. Manning and Hinrich Schütze - Chapter 11
Statistical approaches to shallow parsing and chunk identification.
3. Natural Language Processing with Python
BY: Steven Bird, Ewan Klein and Edward Loper - Chapter 7
Practical implementation of chunking using NLTK with regex patterns.
4. Introduction to Information Retrieval
BY: Christopher Manning, Prabhakar Raghavan, and Hinrich Schütze
Text processing fundamentals including phrase extraction and chunking.
Video Lectures and Online Courses
NPTEL Courses:
- Natural Language Processing - IIT Bombay
- Computational Linguistics - IIT Kharagpur
Stanford University:
- CS224N: Natural Language Processing - Stanford
- CS224U: Natural Language Understanding - Stanford
YouTube EDU Channels:
- Natural Language Processing Lectures - Chunking episodes
- NLTK Tutorials - Practical chunking examples
Research Papers and Articles
1. Noun Phrase Chunking using Regular Expression Patterns
BY: Steven Abney
Computational Linguistics, Vol. 17, No. 2, 1991
2. Text Chunking using Transformation-Based Learning
BY: Lance Ramshaw and Mitchell Marcus
ACL 1995
3. Memory-Based Shallow Parsing
BY: Walter Daelemans, Sabine Buchholz, Jorn Veenstra
CoNLL 1999
Online Resources and Tools
Academic Resources:
- Penn Treebank - Standard parsing and chunking dataset
- CoNLL Shared Tasks - Chunking evaluation datasets
- Universal Dependencies - Multi-language syntactic annotation
Interactive Tools:
- NLTK Chunking Demo - Interactive chunking patterns
- spaCy Phrase Matcher - Modern chunking techniques
- Stanford Parser Demo - Online parsing and chunking
Practice Resources:
- NLTK Book Chapter 7 - Hands-on chunking exercises
- Python NLTK Tutorials - Step-by-step chunking guide
- Regular Expression Tester - Test chunking patterns
Additional Reading
For Advanced Study:
- "Syntactic Pattern Recognition for Information Extraction" by Mary Elaine Califf
- "Shallow Parsing with Conditional Random Fields" by Fei Sha and Fernando Pereira
- "Introduction to the CoNLL-2000 Shared Task: Chunking" by Erik Tjong Kim Sang
For Indian Languages:
- "Hindi Chunking using CRFs" by various authors from LTRC, IIIT Hyderabad
- "Statistical Approach to Hindi Shallow Parsing" by Bharati et al.
- "Indian Language Text Processing" - Multiple authors, CDAC publications
Practice Resources
Corpora and Datasets:
- CoNLL-2000 Chunking Data - Standard evaluation corpus
- Brown Corpus - Classic English text corpus with chunking annotations
- Hindi Treebank - Hindi syntactic analysis resources
Tools and Libraries:
- NLTK Chunking - Python chunking library
- spaCy Rule-based Matching - Modern pattern matching
- OpenNLP Chunker - Java-based chunking tools