What is the primary problem in Information Retrieval (IR) and what is the objective of an IR system? For each of the “Ingest” steps discuss how decisions on it can affect the primary problem (e.g., can it reduce the problem or have no effect) and the primary objective if it effects that. Ingest steps are: locating and getting an item to ingest (discuss crawling web pages); duplicate detection; normalization; zoning; stemming; entity identification; and categorization

What is the primary problem in Information Retrieval (IR) and what is the objective of an IR system? For each of the “Ingest” steps discuss how decisions on it can affect the primary problem (e.g., can it reduce the problem or have no effect) and the primary objective if it effects that. Ingest steps are: locating and getting an item to ingest (discuss crawling web pages); duplicate detection; normalization; zoning; stemming; entity identification; and categorization