Reuters-21578

Description

Currently the most widely used test collection for text categorization research, though likely to be superceded over the next few years by RCV1. The data was originally collected and labeled by Carnegie Group, Inc. and Reuters, Ltd. in the course of developing the CONSTRUE text categorization system. Further details, including discussion of previous versions of the collection (e.g. Reuters-22173), are available in the README file.

Resource Fields

Resource Type:

dataset

Submitted By:

Matthew Lavin

Date Submitted:

2016-12-30 17:30:17


Project Open Data Required Fields (version 1.1)

Modified

[No data]

Publisher

[No data]

Contact Name

[No data]

Unique Identifier

[No data]

Public Access Level

[No data]

Project Open Data Additional Fields (version 1.0)

Contact email

[No Data]

Endpoint

[No Data]

Format

[No Data]

Project Open Data Required-if-Applicable Fields (version 1.1)

Access Level Comment

[No Data]

Bureau Code

[No Data]

Program Code

[No Data]

License

Research use

Rights

[No Data]

Spatial

[No Data]

Temporal

[No Data]