<<O>>  Difference Topic IRseminar08-0502 (r1.15 - 02 May 2008 - ManjunathaJagalur)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 14 to 14

  • Matthew W. Bilotti, Paul Ogilvie, Jamie Callan and Eric Nyberg, Structured retrieval for question answering, Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, pp 351-358, 2007.

Required Browsing:

Changed:
<
<
  • true knowledge (Usually there is a few days waiting period for a beta account. If you are seeing this on thursday evening you might as well use my account. Username: First five letters of my first name, password: Username+asdf)
>
>

 <<O>>  Difference Topic IRseminar08-0502 (r1.14 - 02 May 2008 - ElifAktolga)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 23 to 23

Changed:
<
<

Questions

>
>

Questions


  • Although most past research is quite skeptical on this, wouldn't there be any opportunity to improve ad-hoc retrieval using linguistic processing? If little evidence in query becomes a problem, how about using query expansion to extract named entities and phrase regarding given query? Also, linguistic structs in longer queries(TREC description and narration) doesn't seem to be utilized to its full potential - JinyoungKim
  • The Voorhees paper seemed to imply that anything other than simple NLP is effectively useless in improving IR unless its perfect, or the task is limited (i.e. only QA). Should we even bother worrying about developments in NLP if this is the case? Will we ever find a natural dovetailing of deep NLP and IR? - Marc Cartright
  • Now IR has begun to consider organizing the information in the corpora so that users can easily access or discover some new information, e.g. building links between events or incident threading. NLP may be more helpful for these complicated new IR tasks? Ideas from some typical NLP tasks like coreference resolution can be borrowed? - Xing Yi
  • My question is one of usability. After browsing several NLP search engines, they are all centered around QA task. However, it seems that most users are content with short keyword queries, and have no will to explicitly write natural language queries. Can we instead use NLP to implicitly infer (when possible) the possible question that underlies the short query? Google/Yahoo/Live already do this to some extent. E.g., typing "time amherst ma" gives an immediate answer: "12:29pm Friday (EDT) - Time in Amherst, Massachusetts". Or "capital germany" - "Germany — Capital: Berlin". It would be interesting to see how this can be done for more complicated question types. --- Michael
  • Some advanced NLP techniques in IR seem useful under restrictions. I think the case to maximize the utility is that target text is short and the query is long. It is not easy to discover plausible linguistic structures from too long text. Further, it is so to analyze user's context or intension from short queries. The claim that NLP is useful for Q&A or approaches of NLP search engines seem to be driven from such limitations. -- Jangwon
  • In terms of generic web retrieval (i.e., not QA), I believe that very low level NLP could aid in the process. For example, after determining that the user is posing a "what-is" query, more weight could be given to hits from Wikipedia or some other source of "what-is"-like definitions. "How-far-is" questions could trigger Google/Yahoo/Microsoft maps to be the first result. Such NLP parsing is cheap and I think, pending user studies, could be very useful. -- HenryFeild
Changed:
<
<

>
>
  • `The more complex the question to answer becomes, the more NLP we need.' – What do you think about this? Is there a way to get around this in an IR-like fashion? Example complex QA questions: `When was America discovered and how?' or `Who is the wife of a former president who is a candidate for this year’s elections?' Inference is indispensable for these kinds of queries. Maybe incorporating NLP features in addition to keywords for determining ranking might be a good idea to tackle this. --Elif

 <<O>>  Difference Topic IRseminar08-0502 (r1.13 - 02 May 2008 - XingYi)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 26 to 26

Questions

  • Although most past research is quite skeptical on this, wouldn't there be any opportunity to improve ad-hoc retrieval using linguistic processing? If little evidence in query becomes a problem, how about using query expansion to extract named entities and phrase regarding given query? Also, linguistic structs in longer queries(TREC description and narration) doesn't seem to be utilized to its full potential - JinyoungKim
  • The Voorhees paper seemed to imply that anything other than simple NLP is effectively useless in improving IR unless its perfect, or the task is limited (i.e. only QA). Should we even bother worrying about developments in NLP if this is the case? Will we ever find a natural dovetailing of deep NLP and IR? - Marc Cartright
Added:
>
>
  • Now IR has begun to consider organizing the information in the corpora so that users can easily access or discover some new information, e.g. building links between events or incident threading. NLP may be more helpful for these complicated new IR tasks? Ideas from some typical NLP tasks like coreference resolution can be borrowed? - Xing Yi

  • My question is one of usability. After browsing several NLP search engines, they are all centered around QA task. However, it seems that most users are content with short keyword queries, and have no will to explicitly write natural language queries. Can we instead use NLP to implicitly infer (when possible) the possible question that underlies the short query? Google/Yahoo/Live already do this to some extent. E.g., typing "time amherst ma" gives an immediate answer: "12:29pm Friday (EDT) - Time in Amherst, Massachusetts". Or "capital germany" - "Germany — Capital: Berlin". It would be interesting to see how this can be done for more complicated question types. --- Michael
  • Some advanced NLP techniques in IR seem useful under restrictions. I think the case to maximize the utility is that target text is short and the query is long. It is not easy to discover plausible linguistic structures from too long text. Further, it is so to analyze user's context or intension from short queries. The claim that NLP is useful for Q&A or approaches of NLP search engines seem to be driven from such limitations. -- Jangwon
  • In terms of generic web retrieval (i.e., not QA), I believe that very low level NLP could aid in the process. For example, after determining that the user is posing a "what-is" query, more weight could be given to hits from Wikipedia or some other source of "what-is"-like definitions. "How-far-is" questions could trigger Google/Yahoo/Microsoft maps to be the first result. Such NLP parsing is cheap and I think, pending user studies, could be very useful. -- HenryFeild
Added:
>
>


 <<O>>  Difference Topic IRseminar08-0502 (r1.12 - 02 May 2008 - HenryFeild)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 28 to 28

  • The Voorhees paper seemed to imply that anything other than simple NLP is effectively useless in improving IR unless its perfect, or the task is limited (i.e. only QA). Should we even bother worrying about developments in NLP if this is the case? Will we ever find a natural dovetailing of deep NLP and IR? - Marc Cartright
  • My question is one of usability. After browsing several NLP search engines, they are all centered around QA task. However, it seems that most users are content with short keyword queries, and have no will to explicitly write natural language queries. Can we instead use NLP to implicitly infer (when possible) the possible question that underlies the short query? Google/Yahoo/Live already do this to some extent. E.g., typing "time amherst ma" gives an immediate answer: "12:29pm Friday (EDT) - Time in Amherst, Massachusetts". Or "capital germany" - "Germany — Capital: Berlin". It would be interesting to see how this can be done for more complicated question types. --- Michael
  • Some advanced NLP techniques in IR seem useful under restrictions. I think the case to maximize the utility is that target text is short and the query is long. It is not easy to discover plausible linguistic structures from too long text. Further, it is so to analyze user's context or intension from short queries. The claim that NLP is useful for Q&A or approaches of NLP search engines seem to be driven from such limitations. -- Jangwon
Added:
>
>
  • In terms of generic web retrieval (i.e., not QA), I believe that very low level NLP could aid in the process. For example, after determining that the user is posing a "what-is" query, more weight could be given to hits from Wikipedia or some other source of "what-is"-like definitions. "How-far-is" questions could trigger Google/Yahoo/Microsoft maps to be the first result. Such NLP parsing is cheap and I think, pending user studies, could be very useful. -- HenryFeild

 <<O>>  Difference Topic IRseminar08-0502 (r1.11 - 02 May 2008 - JangwonSeo)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 27 to 27

  • Although most past research is quite skeptical on this, wouldn't there be any opportunity to improve ad-hoc retrieval using linguistic processing? If little evidence in query becomes a problem, how about using query expansion to extract named entities and phrase regarding given query? Also, linguistic structs in longer queries(TREC description and narration) doesn't seem to be utilized to its full potential - JinyoungKim
  • The Voorhees paper seemed to imply that anything other than simple NLP is effectively useless in improving IR unless its perfect, or the task is limited (i.e. only QA). Should we even bother worrying about developments in NLP if this is the case? Will we ever find a natural dovetailing of deep NLP and IR? - Marc Cartright
  • My question is one of usability. After browsing several NLP search engines, they are all centered around QA task. However, it seems that most users are content with short keyword queries, and have no will to explicitly write natural language queries. Can we instead use NLP to implicitly infer (when possible) the possible question that underlies the short query? Google/Yahoo/Live already do this to some extent. E.g., typing "time amherst ma" gives an immediate answer: "12:29pm Friday (EDT) - Time in Amherst, Massachusetts". Or "capital germany" - "Germany — Capital: Berlin". It would be interesting to see how this can be done for more complicated question types. --- Michael
Added:
>
>
  • Some advanced NLP techniques in IR seem useful under restrictions. I think the case to maximize the utility is that target text is short and the query is long. It is not easy to discover plausible linguistic structures from too long text. Further, it is so to analyze user's context or intension from short queries. The claim that NLP is useful for Q&A or approaches of NLP search engines seem to be driven from such limitations. -- Jangwon

 <<O>>  Difference Topic IRseminar08-0502 (r1.10 - 02 May 2008 - MichaelBendersky)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 26 to 26

Questions

  • Although most past research is quite skeptical on this, wouldn't there be any opportunity to improve ad-hoc retrieval using linguistic processing? If little evidence in query becomes a problem, how about using query expansion to extract named entities and phrase regarding given query? Also, linguistic structs in longer queries(TREC description and narration) doesn't seem to be utilized to its full potential - JinyoungKim
  • The Voorhees paper seemed to imply that anything other than simple NLP is effectively useless in improving IR unless its perfect, or the task is limited (i.e. only QA). Should we even bother worrying about developments in NLP if this is the case? Will we ever find a natural dovetailing of deep NLP and IR? - Marc Cartright
Changed:
<
<
  • Now IR has begun to consider organizing the information in the corpora so that users can easily access or discover some new information, e.g. building links between events or incident threading. NLP may be more helpful for these complicated new IR tasks? Ideas from some typical NLP tasks like coreference resolution can be borrowed? - Xing Yi
>
>
  • My question is one of usability. After browsing several NLP search engines, they are all centered around QA task. However, it seems that most users are content with short keyword queries, and have no will to explicitly write natural language queries. Can we instead use NLP to implicitly infer (when possible) the possible question that underlies the short query? Google/Yahoo/Live already do this to some extent. E.g., typing "time amherst ma" gives an immediate answer: "12:29pm Friday (EDT) - Time in Amherst, Massachusetts". Or "capital germany" - "Germany — Capital: Berlin". It would be interesting to see how this can be done for more complicated question types. --- Michael

 <<O>>  Difference Topic IRseminar08-0502 (r1.9 - 02 May 2008 - XingYi)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 26 to 26

Questions

  • Although most past research is quite skeptical on this, wouldn't there be any opportunity to improve ad-hoc retrieval using linguistic processing? If little evidence in query becomes a problem, how about using query expansion to extract named entities and phrase regarding given query? Also, linguistic structs in longer queries(TREC description and narration) doesn't seem to be utilized to its full potential - JinyoungKim
  • The Voorhees paper seemed to imply that anything other than simple NLP is effectively useless in improving IR unless its perfect, or the task is limited (i.e. only QA). Should we even bother worrying about developments in NLP if this is the case? Will we ever find a natural dovetailing of deep NLP and IR? - Marc Cartright
Added:
>
>
  • Now IR has begun to consider organizing the information in the corpora so that users can easily access or discover some new information, e.g. building links between events or incident threading. NLP may be more helpful for these complicated new IR tasks? Ideas from some typical NLP tasks like coreference resolution can be borrowed? - Xing Yi

 <<O>>  Difference Topic IRseminar08-0502 (r1.8 - 02 May 2008 - ManjunathaJagalur)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 25 to 25

Questions

  • Although most past research is quite skeptical on this, wouldn't there be any opportunity to improve ad-hoc retrieval using linguistic processing? If little evidence in query becomes a problem, how about using query expansion to extract named entities and phrase regarding given query? Also, linguistic structs in longer queries(TREC description and narration) doesn't seem to be utilized to its full potential - JinyoungKim
Added:
>
>
  • The Voorhees paper seemed to imply that anything other than simple NLP is effectively useless in improving IR unless its perfect, or the task is limited (i.e. only QA). Should we even bother worrying about developments in NLP if this is the case? Will we ever find a natural dovetailing of deep NLP and IR? - Marc Cartright

 <<O>>  Difference Topic IRseminar08-0502 (r1.7 - 02 May 2008 - JinyoungKim)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 24 to 24

Questions

Changed:
<
<
*
>
>
  • Although most past research is quite skeptical on this, wouldn't there be any opportunity to improve ad-hoc retrieval using linguistic processing? If little evidence in query becomes a problem, how about using query expansion to extract named entities and phrase regarding given query? Also, linguistic structs in longer queries(TREC description and narration) doesn't seem to be utilized to its full potential - JinyoungKim

 <<O>>  Difference Topic IRseminar08-0502 (r1.6 - 01 May 2008 - ManjunathaJagalur)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 18 to 18

Changed:
<
<

Required Skimming:

>
>

Recommended:


 <<O>>  Difference Topic IRseminar08-0502 (r1.5 - 29 Apr 2008 - ManjunathaJagalur)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

People leading: Manjunatha Jagalur

Deleted:
<
<

UNDER CONSTRUCTION


Background:

Given that most of the text documents are in natural languages and user needs can be easily encapsulated as a natural language query, intuition suggests that IR(Information Retrieval) systems built upon NLP (Natural Language Processing) systems should have enormous advantage over other statistical method based systems. But this intuition has not yet translated into a successful general search engine. However numerous studies have shown that concepts from NLP can be incorporated successfully into systems that address particular problems from IR like QA(Question Answering). In this seminar we shall discuss the utility of NLP in IR, application of NLP in QA systems, and some of the NLP based search engines that are still in development.
 <<O>>  Difference Topic IRseminar08-0502 (r1.4 - 28 Apr 2008 - ManjunathaJagalur)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

Line: 7 to 7

UNDER CONSTRUCTION

Background:

Changed:
<
<
Although NLP seems to be the most natural method for Information Retrieval, until now there is successful search engine using it. In this seminar we shall discuss the utility of NLP in IR, QA systems that use IR and some of the NLP based search engines that are still in development.
>
>
Given that most of the text documents are in natural languages and user needs can be easily encapsulated as a natural language query, intuition suggests that IR(Information Retrieval) systems built upon NLP (Natural Language Processing) systems should have enormous advantage over other statistical method based systems. But this intuition has not yet translated into a successful general search engine. However numerous studies have shown that concepts from NLP can be incorporated successfully into systems that address particular problems from IR like QA(Question Answering). In this seminar we shall discuss the utility of NLP in IR, application of NLP in QA systems, and some of the NLP based search engines that are still in development.

Required papers:

Changed:
<
<
  • Voorhees 1999
  • Flank
  • Ephyra
>
>

Required Browsing:

Changed:
<
<
  • true knowledge
  • brain boost
  • spock
>
>
  • true knowledge (Usually there is a few days waiting period for a beta account. If you are seeing this on thursday evening you might as well use my account. Username: First five letters of my first name, password: Username+asdf)
  • Brain Boost
  • spock

Required Skimming:

Changed:
<
<
  • true knowledge patent
  • Powerset paper
  • Lin Evaluation

Additional:

  • Advertising
  • New event detection
  • NLP in DB
>
>

Questions

Added:
>
>
*

 <<O>>  Difference Topic IRseminar08-0502 (r1.3 - 28 Apr 2008 - ManjunathaJagalur)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic: NLP in IR

People leading: Manjunatha Jagalur

Added:
>
>

UNDER CONSTRUCTION


Background:

Changed:
<
<
Focus on: QA, Summarization and Machine Translation.
>
>
Although NLP seems to be the most natural method for Information Retrieval, until now there is successful search engine using it. In this seminar we shall discuss the utility of NLP in IR, QA systems that use IR and some of the NLP based search engines that are still in development.

Required papers:

Changed:
<
<
>
>
  • Voorhees 1999
  • Flank
  • Ephyra

Required Browsing:

  • true knowledge
  • brain boost
  • spock

Changed:
<
<

Recommended:

>
>

Required Skimming:

  • true knowledge patent
  • Powerset paper
  • Lin Evaluation

Changed:
<
<

Tangential but interesting:

>
>

Additional:

  • Advertising
  • New event detection
  • NLP in DB

Changed:
<
<

Related Resources

>
>

Questions


 <<O>>  Difference Topic IRseminar08-0502 (r1.2 - 18 Apr 2008 - ManjunathaJagalur)

META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Changed:
<
<

Topic, people leading.

>
>

Topic: NLP in IR

People leading: Manjunatha Jagalur


Background:

Changed:
<
<
>
>
Focus on: QA, Summarization and Machine Translation.

Required papers:

 <<O>>  Difference Topic IRseminar08-0502 (r1.1 - 25 Feb 2008 - JamesAllan)
Line: 1 to 1
Added:
>
>
META TOPICPARENT IRseminarS08

IR seminar, May 2, 2008

Topic, people leading.

Background:

Required papers:

Recommended:

Tangential but interesting:

Related Resources

Revision r1.1 - 25 Feb 2008 - 19:23 - JamesAllan
Revision r1.15 - 02 May 2008 - 20:41 - ManjunathaJagalur