Open-sourcing OpCit software for processing reference metadata
OpCit is making available a series of Perl modules
to enable OAi archives to extract and structure reference data from full-text
papers for use by OAi service providers:
- Markup_TeX.pm: inserts the mark-up 'xxxOpCit' at the beginning of each reference in the source file. Parse_DVI.pm uses this mark-up to identify each reference;
- TeX2DVI.pm: converts the TeX/LaTeX source to a DVI file, then converts the DVI file to a text file with 'dvitype' (a Unix command);
- Parse_DVI.pm: parses the text file created by 'dvitype' to produce a list of references;
- Citation.pm: parses each reference (citation) string to discover its metadata (authors, journal, volume, issue, etc.).
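The marking, splitting and citation-parsing steps above can be sketched roughly as follows. This is an illustration in Python, not the interface or the logic of the OpCit Perl modules. The function names, the assumption that references appear as \bibitem entries, the "authors, journal volume (year) page" citation layout, and the sample strings in the usage note are all hypothetical. Only the marker 'xxxOpCit' comes from the text.

```python
import re
from typing import Dict, List, Optional

MARKER = "xxxOpCit"  # marker string named in the module description

# Assumption: each reference in the TeX source starts with \bibitem[label]{key}.
BIBITEM_RE = re.compile(r"(\\bibitem(?:\[[^\]]*\])?\{[^}]*\})\s*")

# Assumption: citations look like "Authors, Journal Volume (Year) Page".
CITATION_RE = re.compile(
    r"^(?P<authors>.+?),\s+"
    r"(?P<journal>[A-Z][A-Za-z. ]+?)\s+"
    r"(?P<volume>[A-Z]?\d+)\s+"
    r"\((?P<year>\d{4})\)\s+"
    r"(?P<page>\d+)"
)


def mark_references(tex_source: str) -> str:
    """Insert MARKER directly after each \\bibitem command."""
    return BIBITEM_RE.sub(lambda m: m.group(1) + " " + MARKER + " ", tex_source)


def split_references(text: str) -> List[str]:
    """Split extracted text on MARKER; drop whatever precedes the first marker."""
    parts = text.split(MARKER)[1:]
    # Collapse line breaks and repeated spaces left over from text extraction.
    return [" ".join(part.split()) for part in parts if part.strip()]


def parse_citation(reference: str) -> Optional[Dict[str, str]]:
    """Return the metadata fields of one reference string, or None if the
    string does not follow the assumed layout."""
    match = CITATION_RE.match(reference)
    return match.groupdict() if match else None
```

With an invented reference such as "A. Author, J. Example Phys. 12 (1999) 345.", `parse_citation` would return the authors, journal, volume, year and page as separate fields. A string that does not fit the assumed layout returns `None` rather than a guess. A real parser for physics references would need many more patterns than this single expression.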
Initially these modules will be offered
to the physics archive maintainers at Los Alamos and CERN, for which they
are optimised, and may then be open-sourced.
Perl scripts which make calls to the above modules
can be developed locally.