From Unstructured to Structured Web Content
Date: Thursday, May 17 2007 - 7:00PM
Location: themechanism|eEmerge, 440 9th Avenue 8th Floor, New York NY 10001
Join us for an overview and discussion of UIMA (Unstructured Information Management Architecture.) This is a mature, large, well-documented code base to:
- "analyze large volumes of unstructured information in order to discover knowledge that is relevant to an end user."
- "UIMA is a component framework for analyzing unstructured content such as text, audio and video. It comprises an SDK and tooling for composing and running analytic components written in Java and C++, with some support for Perl, Python and TCL."
UIMA will become increasingly useful, as more and more data mining techniques will have to work together to maximize results. There is already much talk in the commercial, academic, and standards community to support this framework in the near future.
Come join us for an interesting, informative meeting focusing on this semantic building and standards compliant tool.
Attendees: Nicolas Vandenberghe (ITerating), Tyler, Marvin, Bertalan Danko, David Siegel, Jesse Kirshbaum, Gabriel Friedman, Alex, Ken, Marco Neumann (KONA, GS), William Surowiec, Daniel Eldridge, Michael, David Rogers, Michel Dufresne, Dave Fletcher (The Mechanism)