tabular-data

Here are 121 public repositories matching this topic...

patrickdevivo commented Mar 10, 2020

Hi there! I wanted to propose adding the following badge to the README to indicate how many TODO comments are in this codebase:

TODOs

The badge links to tickgit.com which is a free service that indexes and displays TODO comme

argenisleon commented Feb 22, 2020

Hi,
I am trying to load a CSV with no header using

df = vaex.open('data/star0000-1.csv',sep=",", header=None, error_bad_lines=False)

but I get

could not convert column 0, error: TypeError('getattr(): attribute name must be string'), will try to convert it to string
Giving up column 0, error: TypeError('getattr(): attribute name must be string')
could not convert column 
trantor commented Jan 23, 2020

Hello.
I've come across what (to me) seems to be a problem with the FILENAME and FILENUM variables.

# mlr --version
Miller v5.6.2

# cat /tmp/csv1
A,B,C
_2GB,255,2
_4GB,120,4
_6GB,50,6
_10GB,10,10

# cat /tmp/csv2
FIRST,SECOND,THIRD,FOURTH
1,2,3,4
5,6,7,8
9,10,11,12
13,14,15,16

# mlr --icsv cat then put 'print FILENAME'   /tmp/csv1 /tmp/csv2
/tmp/csv1
A=_2GB,B=255,C=2
/
js87zz commented Jan 21, 2020

If you cannot find documentation or an example, refer here first to resolve the issue.
We plan to build an official documentation site; until it is complete, please refer to the link below.

Jemi commented Feb 18, 2020

I'm using tsv-utils from the Arch Linux AUR, trying to format some word frequency data from the New General Service List dataset. tsv-utils makes at least two errors that I can see when I run this command line:

tsv-select -f 1,7 NGSL+1.01+with+SFI.tsv | tsv-pretty | less

adding -s 5 to tsv-pretty works around this problem. The tsv file was converted from the file NGSL+1.01+with

bernardoccordeiro commented Dec 4, 2019

While using rows for a project, I could not use the import_from_pdf function after adding rows as a dependency. The solution, ultimately, was to also include PyMuPDF and cached_property as dependencies of my project, so as to enable the PDF plugin for rows. This information, however, is not clear from the docs alone. It would be desirable to list the dependencies for each plugi

premadh commented Sep 17, 2019

This is a suggested code or documentation change, improvement to the code, or feature request

The package is great and works in most conditions (many thanks for this), but it has also made me lazy: I don't want to wrangle misread PDF pages. Hence, I'd like to request the following.

Provide a parameter/method to specify the number of columns and the start and end coordinates of each column so that table is extracted
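
To make the request concrete, here is a minimal sketch of what column extraction by explicit coordinates could look like. It is not the package's API: the function name `split_into_columns`, the `(x, text)` fragment format, and the `column_bounds` parameter are all hypothetical, chosen only to illustrate the idea of assigning text fragments to columns by their horizontal position.

```python
def split_into_columns(fragments, column_bounds):
    """Assign (x, text) fragments of one row to columns by x-coordinate.

    fragments:     list of (x, text) tuples (hypothetical input format)
    column_bounds: list of (start, end) x-ranges, one per column (hypothetical)
    """
    cells = [[] for _ in column_bounds]
    # Process fragments left to right so text within a cell keeps reading order.
    for x, text in sorted(fragments):
        for i, (start, end) in enumerate(column_bounds):
            if start <= x < end:
                cells[i].append(text)
                break  # a fragment belongs to at most one column
    return [" ".join(cell) for cell in cells]


# Example row: four fragments, three user-specified columns.
fragments = [(12.0, "Item"), (150.5, "42"), (40.0, "name"), (300.0, "EUR")]
column_bounds = [(0, 100), (100, 250), (250, 400)]
cells = split_into_columns(fragments, column_bounds)
print(cells)
```

With explicit bounds, a fragment that would otherwise be misread as its own column (here "name") is merged into the cell whose range contains it.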

danfowler commented Feb 24, 2017

Overview

This issue contains CLI improvements

Tasks

  • support headers options
  • support granular check options
  • support providing validation config instead of options (like stringified json)

My guess would be:

goodtables --checks "{'duplicate-row': True}" datapackage datapackage.json 

But it returns:

goodtables.exceptions.GoodtablesE
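
One thing worth checking with the guess above: `{'duplicate-row': True}` is a Python literal, not JSON, so any option that expects stringified JSON would fail to parse it. The sketch below only demonstrates that point with the standard `json` module; whether the goodtables CLI accepts JSON for `--checks` is an assumption that the issue text does not confirm.

```python
import json

# The string from the guess above: single quotes and a capitalised True
# are valid Python, but not valid JSON.
python_style = "{'duplicate-row': True}"
try:
    json.loads(python_style)
    python_style_is_json = True
except json.JSONDecodeError:
    python_style_is_json = False

# Serialising the same configuration with json.dumps yields valid JSON.
checks_json = json.dumps({"duplicate-row": True})

print(python_style_is_json)
print(checks_json)
```

If the CLI does take JSON here, the value of `checks_json` is the form to pass, quoted appropriately for the shell.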
tirthajyoti commented Sep 9, 2019

Hi,

Thanks for this excellent implementation. I am trying it out now.
Your homepage docs and the Jupyter notebook example say max_epoch=100 but in fact, in the latest install, it is set to 5.

Please reconcile the descriptions. The model is computationally heavy, so it is good to have a realistic expectation about the number of epochs to run before one starts fitting :-)

sno
craigds commented Apr 23, 2020

At present sno reset exists, but it actually does something totally unrelated to git reset: it throws away uncommitted changes in your working tree. That's the equivalent of git checkout .

git reset actually doesn't touch your working tree; it modifies the repository head and possibly the index. So it's a totally different command.

In testing sno, I find myself needing a more gittish re

csvpack library / gem - tools 'n' scripts for working with tabular data packages using comma-separated values (CSV) datafiles in text with meta info (that is, schema, datatypes, ..) in datapackage.json; download, read into and query CSV datafiles with your SQL database (e.g. SQLite, PostgreSQL, ...) of choice and much more

  • Updated Jan 26, 2019
  • Ruby
tidycells
