New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
How to use similarity settings other then "BM25" in Elasticsearch index ?
#709
opened Oct 3, 2020 by
nsankar
Datasets performance slow? - 6.4x slower than in memory dataset
#708
opened Oct 3, 2020 by
eugeneware
TypeError: '<' not supported between instances of 'NamedSplit' and 'NamedSplit'
#705
opened Oct 2, 2020 by
pvcastro
Dataset browser url is still https://huggingface.co/nlp/viewer/
#686
opened Sep 29, 2020 by
jarednielsen
Does both 'bookcorpus' and 'wikipedia' belong to the same datasets which Google used for pretraining BERT?
#666
opened Sep 23, 2020 by
wahab4114
runing dataset.map, it raises TypeError: can't pickle Tokenizer objects
#665
opened Sep 23, 2020 by
xixiaoyao
load_dataset from local squad.py, raise error: TypeError: 'NoneType' object is not callable
#664
opened Sep 23, 2020 by
xixiaoyao
map/filter multiprocessing raises errors and corrupts datasets
bug
#620
opened Sep 11, 2020 by
timothyjlaurent
UserWarning: The given NumPy array is not writeable, and PyTorch does not support non-writeable tensors
#616
opened Sep 11, 2020 by
BramVanroy
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.