Open Citation Project
OpCit tech seminar, July 2001, Southampton
A joint NSF - JISC
International Digital Libraries
Project
Index 
of 
presentation
Open-sourcing OpCit software for processing reference metadata

OpCit is making available a series of Perl modules to enable OAi archives to extract and structure reference data from full-text papers for use by OAi service providers:

  • Markup_TeX.pm: insert mark-up 'xxxOpCit' at the beginning of each reference in the source file. This mark-up is used by 'Parser_DVI.pm to identify each reference;
  • TeX2DVI.pm:  convert TeX/LaTeX to DVI file; and then DVI to text file by 'dvitype' (Unix command);
  • Parse_DVI.pm: parses the text file created by 'dvitype' to produce a list of references;
  • Citation.pm: parses each reference (citation) string to discover its metadata (authors, journal, volume, issue, etc.)
Initially these modules will be offered to the physics archive maintainers at Los Alamos and CERN, for which they are optimised, and may then be open-sourced. 
Perl scripts which make calls to the above modules can be developed locally.
 
 
 Previous
 Hitchcock and Jiao, Opcit data input: reference parsing
 Next