National Research Council CanadaSkip all menusSkip first menu Menu
National Research Council Canada Government of Canada
NRC-IIT - Institute for Information Technology
NRC-IIT - Institute for Information Technology
Research Programs
3D Technologies
Artificial Intelligence Technologies
Broadband Visual Communication
Computational Video
e-Learning
Health Initiative
High Performance Computing
Human-Computer Interaction
Information Analysis and Retrieval
Adverb
EurekaSeek
Extractor4Speech
Uqausiit: Inuktitut Language Technologies
Lexical Semantics from Web Mining
LitMiner
Interactive Language Technologies
Internet Logic
People-Centred Technologies
Security and Privacy
Software Engineering
Research in NRC-IIT Locations
Research Success Stories
Printable version Printable
version
Home | Research | Research Programs | Information Analysis and Retrieval | LitMiner

Information Analysis and Retrieval

Scientific Literature Mining (LitMiner)

How can scientific researchers hope to know about all of the latest advances and new discoveries in their field, given that more than 40,000 scholarly articles are published in the scientific literature every month? How can they be sure of finding all of the relevant knowledge “hidden” in journal articles?

Even with the advent of massive numerical and structural databases, the scientific literature still holds the newest information and the intelligence surrounding the data. The problem is that researchers cannot hope to read all the articles relevant to their field of study if they are also to conduct research.

In response to this pressing challenge, the NRC Institute for Information Technology (NRC-IIT), in collaboration with the NRC Institute for Biological Sciences (NRC-IBS), the Canada Institute for Scientific and Technical Information (CISTI), the Samuel Lunenfeld Institute and Blueprint International, is developing a unified collection of text and language processing tools to solve the real information needs of genomic and proteomic scientists.

In the short-term, the goal is to save researchers time by letting computers assume some of the tasks. In the longer term, the goal is to support hypothesis formation in ways that are not possible with the current organization of the literature.

The LitMiner project is currently in its first stage, which is to integrate several existing text tools into a proof-of-concept prototype. More elaborate scenarios of use will be possible from that prototype, which will ultimately lead to more useful systems.

The research is being conducted in both in text processing and bio-informatics. Most of the tools being combined in LitMiner are machine learning, information retrieval or text mining algorithms, either new or based on novel modifications of existing algorithms.

While the application to the scientific literature is driving the further development of these algorithms, the research is important in its own right.

Related NRC-IIT Publications

Research Contact

Dr. Joel Martin
Group Leader
Interactive Information

NRC Institute for Information Technology
1200 Montreal Road
Building M-50, Room C-335
Ottawa, ON K1A 0R6
Telephone: +1 (613) 990-0113
Fax: +1 (613) 952-7151
E-mail: Joel.Martin@nrc-cnrc.gc.ca

Business Contact

Randall Milburn
Business Development Officer
Business Development Office, NCR

NRC Institute for Information Technology
1200 Montreal Road
Building M-50, Room 201
Ottawa, ON K1A 0R6
Telephone: +1 (613) 990-6590
Fax: +1 (613) 952-0074
E-mail: Randall.Milburn@nrc-cnrc.gc.ca


Date Modified: 2003-04-03
Top of Page