Tuesday, August 12, 2008

1

This week: I'll gather information about informal communication, colloquialisms and their impact on processing Chemistry data. Also I'll explore the multilingual issues that seems to be pertinent in this regard.

Next week: This week first, then next week.

Issues: Forms of colloquialism, variations in colloquialism across several languages, methods of extraction and expansion, possible applications. A very important thing is CONTRACTION, i.e., generation of colloquial forms from expanded (and may be somewhat formal and rigmarolic) ideas and discussions. So basically we need to explore BOTH directions - how to expand and then how to contract (appropriately).

Achievements:

http://en.wikipedia.org/wiki/Wiki

http://www.cs.umass.edu/~culotta/pubs/culotta04dependency.pdf

studying on kernels, dependency trees, etc. Need to know about methods...

No comments: