dataset

Here are 2,817 public repositories matching this topic...

Zylatis commented Jun 20, 2019
  • Faker version: 1.0.7
  • OS: Ubuntu 16.04

I was wondering whether it is possible to generate a list of 'n' unique company names. I saw some PRs that added a unique keyword for 'words', but it doesn't seem to extend to other providers. I understand I could just keep regenerating and dropping duplicates until I got a unique set of length n, but it would be nice to just have a keyword for that (plus this m
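The "keep regenerating and dropping duplicates" workaround mentioned above could look roughly like the sketch below. The helper `unique_values` and the stand-in generator `stand_in_company` are hypothetical names introduced for illustration, not part of Faker. With Faker installed, a provider method such as `Faker().company` could be passed in place of the stand-in.

```python
import random
from typing import Callable, Hashable, List, TypeVar

T = TypeVar("T", bound=Hashable)


def unique_values(generate: Callable[[], T], n: int, max_attempts: int = 10_000) -> List[T]:
    """Call `generate` repeatedly until `n` distinct values have been collected.

    Hypothetical helper: raises ValueError if `max_attempts` calls are not
    enough, so a generator with too few possible values cannot loop forever.
    """
    seen = set()
    result: List[T] = []
    for _ in range(max_attempts):
        if len(result) == n:
            break
        value = generate()
        if value not in seen:  # drop duplicates, keep first-seen order
            seen.add(value)
            result.append(value)
    if len(result) < n:
        raise ValueError(
            f"only {len(result)} unique values after {max_attempts} attempts"
        )
    return result


def stand_in_company() -> str:
    """Stand-in for a provider call; the word pools here are made up."""
    prefixes = ["Acme", "Globex", "Initech", "Umbrella", "Hooli", "Stark"]
    suffixes = ["Ltd", "Inc", "Group", "LLC"]
    return f"{random.choice(prefixes)} {random.choice(suffixes)}"


if __name__ == "__main__":
    print(unique_values(stand_in_company, 5))
```

The attempt cap is a design choice: the stand-in can only produce 24 distinct names, so asking for more would otherwise never terminate.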

vinyldarkscratch commented Jan 31, 2020

Found in #5551: it appears that the sorting linter may not be functioning properly. The author of the PR had placed a feature in the wrong location, but the linter did not complain about the alphabetical sorting. Furthermore, when the author tried placing it in the correct location, the linter didn't complain either. We should either double-check the linter, or the file that was modif

Guitaricet commented Sep 11, 2019

📚 Documentation

Description

The documentation does not make clear how (and when) to use SubwordField, and usage examples are hard to find. It would be great if someone who has used it could add at least a few lines to its doc.

For example, if I am using the github.com/VKCOM/YouTokenToMe tokenizer, should I create a SubwordField or a Field? And what is the difference between them?

bloodwass commented Jun 17, 2019

Expected Behavior

I want to convert the torch.nn.Linear modules in my (possibly big) model to weight-drop linear modules, and I want to train the model on multiple GPUs. However, my sample code raises a RuntimeError. First, I have _weight_drop(), which drops some part of the weights in torch.nn.Linear (see the code below).

Actual Behavior

RuntimeError: arguments are located on different GPUs at /

addisonlynch commented Apr 14, 2018

It would be nice to have some general developer documentation for potential contributors to help in cases such as #510, etc.

What are the best steps to take towards accomplishing this? Maybe something similar to the Pandas developer docs (although not all of that detail is needed)?

I've begun an implementation of this on my fork, basicall

andrewjw1995 commented May 16, 2018

The documentation file appears to have been generated with no space between the hashes and the header text. This causes the headers to display incorrectly and makes the file difficult to read. See below for an example with and without the space:

##

Mobius API Documentation


###Microsoft.Spark.CSharp.Core.Accumulator</
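One possible way to repair such a file after generation is to insert the missing space with a regular expression. This is only a sketch: the function name `add_heading_space` is hypothetical, and the pattern is deliberately naive, so it would also touch lines inside code blocks that start with a hash (for example `#include`).

```python
import re

# One to six hashes at the start of a line, followed directly by a character
# that is neither another hash nor whitespace (i.e. the space is missing).
_HEADING_WITHOUT_SPACE = re.compile(r"^(#{1,6})(?=[^#\s])", flags=re.MULTILINE)


def add_heading_space(markdown: str) -> str:
    """Insert a single space between the heading hashes and the heading text."""
    return _HEADING_WITHOUT_SPACE.sub(r"\1 ", markdown)


if __name__ == "__main__":
    sample = "##Overview\n### Already fine\n###Details"
    print(add_heading_space(sample))
```

Headings that already have the space are left unchanged, so the function can be run repeatedly on the same file.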
