ScholarVista¶
ScholarVista is a tool that extracts and plots information from a set of Academic Research Papers in PDF / TEI XML format. To process PDFs, it utilizes Grobid to generate the TEI XML files, then ScholarVista extracts the relevant information from the TEI XML files and generates the following data:
- Keyword Cloud for each of the paper's abstract and for the total of all abstracts.
- Links List for each one of the links found in the paper.
- Figures Histogram comparing the number of figures per paper.