datasets : harmonize Netflix parsers with the rest #26
Labels
enhancement
New feature or request
good first issue
Good for newcomers
help wanted
Extra attention is needed
R&D: library
Research and (re-)design a library component
The Netflix Prize dataset uses a custom parser because one data example does not fit into a single dataset row (such as CSV data) but has a custom "stanza-based" format. For example, these are two stanzas of the "qualifying.txt" data file :
It would be nice to upgrade the library such that it can deal with these cases
Solution sketch:
The text was updated successfully, but these errors were encountered: