Documente Academic
Documente Profesional
Documente Cultură
Natural Language in AI
• dialogue-based
– NL interfaces
– spoken and written communication
– uses natural language understanding
• discourse (any string more than 1 Sentence
long)
• text-based
– text categorization, text generation, information
extraction, machine translation
• Information extraction
• Machine translation
• Drafting
• Text summarization
• Knowledge-based Information
Extraction
• Machine learning IE
• Template-based, Wrappers
• Template Mining
Template-based, Wrappers
• IE’s output is a populated database, which can be
used as a case base
• The values for the slots are strings from the source
text
• The resulting database works as a template
Template Mining
• well suited for areas, “where the text is terse and
sentences are unambiguous and declarative in
nature”.
• Information extraction
• Machine translation
• Drafting
• Text summarization
By Tom Mitchell
•
http://babelfish.altavista.com/babelfish/tr
http://world.altavista.com/tr
http://www.systransoft.com/
• Information extraction
• Machine translation
• Drafting
• Text summarization
3. Body: in its conclusion it is usually the court decision and its foundations.
This is where the search for illocutionary expressions takes place. Upper
paragraphs describe details of the situation, indicating the laws that categorize
the subject, and points to values to foundation.
:
:
:
foundation
:
:
conclusion
4. Closing: Starts with one paragraph about votes followed by date, place and
names of attorneys.
one paragraph about votes followed by date, place and names of attorneys
Text-based Natural Language
Natural Language
Topics
• Information extraction
• Machine translation
• Drafting
• Text summarization
NL Understanding
• Speech recognition
– intonation, pronunciation, speed
• Natural Language Processing
– syntactic , semantic , pragmatic analysis
Natural Language Generation
– intention, generation, speech synthesis
Ice cream
I scream
- Syntactic Analysis
- Parsing
- Semantics
- Pragmatics
NP VP
• uses context
• uses partial representation
• includes purpose and performs
disambiguation
• Where, when, by whom an utterance was
said
• speech synthesis
• from words into speech signal
• applications of neural networks
• templates with recordings from humans
• record every word in a dictionary
• record every phoneme (worst choice!)
• JUPITER uses a commercial speech
synthesizer
– http://www.isi.edu/natural-language/mt/nitrogen/
http://www.mri.mq.edu.au/~peba/MLPeba/system.html
http://cslu.cse.ogi.edu/HLTsurvey/ch4node3.html#SECTIO
N4
"What will the weather be like in Boston tomorrow?" Jupiter invokes the following
procedure:
- Speech recognition: SUMMIT converts the spoken sentence into text
- Language understanding: TINA parses the text into a semantic frame -- a
grammatical structure containing the basic terms needed to query the Jupiter
database
- Language generation: GENESIS uses the semantic frame's basic terms to
build a Structured Query Language (SQL) query for the database
- Information retrieval: Jupiter executes the SQL query and retrieves the
requested information from the database
- Language generation: TINA and GENESIS convert the query result into a
natural language sentence
- Information delivery: Jupiter delivers the generated sentence to the user via
voice (using a speech synthesizer) and/or display