Delve
DELVE DATA SETS

  • Citation Network
  • Preprocessed Dataset
  • ml data
  • Bi data

This is the full citation data set which includes the documents record in an xml format, the citation network in an edgelist format, the full body text and the normalized text information (porter stemmer) Click here to download

This contains all the preprocessed data set. This includes the node2vec( with default values and p=4, q = 1), tfidf, LSI (d=100), doc2vec, etc Click here to download

This contains the multi-Labeled document data set and mappings. Click here to download

This contains the binary Labeled edge data set and mappings. Click here to download