# dedupe

Here are 48 public repositories matching this topic...

ThomasWaldmann commented Apr 13, 2020

If somebody has some time for FUSE benchmarking:

diff --git a/src/borg/fuse.py b/src/borg/fuse.py
index 429790e4..27ab1c1a 100644
--- a/src/borg/fuse.py
+++ b/src/borg/fuse.py
@@ -644,12 +644,13 @@ def read(self, fh, offset, size):
                 data = self.data_cache[id]
                 if offset + n == len(data):
                     # evict fully read chunk from cache
-     
jlovison commented Apr 27, 2017

Looking at the source, setting either the size or content flag to -1 causes potential changes in size or content to be ignored completely.

The help text implies that setting to -1 is equivalent to setting to zero (i.e. no difference in size/content allowed), whereas it's actually the opposite (any difference in size/content allowed).
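To make the difference concrete, here is a minimal sketch of the two behaviours. The function name `within_tolerance` and its parameters are hypothetical and are not taken from the tool's source; the sketch only assumes that a negative threshold disables the check, as described above.

```python
def within_tolerance(difference: float, threshold: float) -> bool:
    """Return True if `difference` is acceptable under `threshold`.

    Hypothetical helper, not the tool's actual code.
    A negative threshold (e.g. -1) disables the check entirely,
    so any difference passes. A threshold of 0 allows no difference.
    """
    if threshold < 0:
        # Check disabled: changes in size/content are ignored.
        return True
    return difference <= threshold


# -1 accepts any difference, which is the opposite of 0.
print(within_tolerance(50, -1))  # any difference allowed
print(within_tolerance(50, 0))   # no difference allowed
```

If the help text is meant to describe the 0 case, it would need to say that -1 turns the comparison off rather than making it strict.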

ecsalomon commented Sep 28, 2017

Labeled training example pairs should be stored in a table for selection and reuse. Data stored for examples should include:

  • Source
  • Source ids
  • Label
  • Label date
  • Comment to store labeling rules applied by labeler
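As a rough sketch of what such a table might look like, the fields listed above could map onto a SQLite table as follows. The table name `labeled_pairs`, the column names, the split of "source ids" into two columns, and the label values `match` / `distinct` are all assumptions made for illustration, not an agreed design.

```python
import sqlite3


def create_examples_table(conn: sqlite3.Connection) -> None:
    """Create a hypothetical table for labeled training pairs."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS labeled_pairs (
            id          INTEGER PRIMARY KEY,
            source      TEXT NOT NULL,   -- where the records came from
            source_id_a TEXT NOT NULL,   -- id of the first record in the pair
            source_id_b TEXT NOT NULL,   -- id of the second record in the pair
            label       TEXT NOT NULL,   -- assumed values: 'match' or 'distinct'
            label_date  TEXT NOT NULL,   -- ISO 8601 date string
            comment     TEXT             -- labeling rules applied by the labeler
        )
        """
    )


def add_example(conn, source, id_a, id_b, label, label_date, comment=None):
    """Insert one labeled pair using a parameterized query."""
    conn.execute(
        "INSERT INTO labeled_pairs "
        "(source, source_id_a, source_id_b, label, label_date, comment) "
        "VALUES (?, ?, ?, ?, ?, ?)",
        (source, id_a, id_b, label, label_date, comment),
    )


def select_examples(conn, source):
    """Return (id_a, id_b, label) tuples for one source, in insertion order."""
    rows = conn.execute(
        "SELECT source_id_a, source_id_b, label FROM labeled_pairs "
        "WHERE source = ? ORDER BY id",
        (source,),
    )
    return rows.fetchall()


conn = sqlite3.connect(":memory:")
create_examples_table(conn)
add_example(conn, "crm", "17", "42", "match", "2017-09-28", "same email")
add_example(conn, "crm", "17", "99", "distinct", "2017-09-28")
add_example(conn, "billing", "3", "4", "match", "2017-09-28")
print(select_examples(conn, "crm"))
```

Keeping the label date and comment alongside each pair is what would make it possible to filter examples by when or how they were labeled.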

Storing examples like this allows them to be reused in the following ways:

  • Select specific subsets of labeled pairs to build models from
  • Store multiple labels f
