borg create without a cache – Prototype, Do Not Merge #2350

enkore · 2017-03-28T13:36:39Z

enkore · 2017-03-28T13:44:13Z

src/borg/cache.py

+            # All chunks from the repository have a refcount of MAX_VALUE, which is sticky,
+            # therefore we can't/won't delete them. Chunks we added ourselves in this transaction
+            # (e.g. checkpoint archives) are tracked correctly.
+            init_entry = ChunkIndexEntry(refcount=ChunkIndex.MAX_VALUE, size=0, csize=ChunkIndex.MAX_VALUE)


csize=0 is another safe choice and has a more compact representation (1 byte vs 5 bytes).
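The comment does not say which serialization the byte counts refer to. A minimal sketch, assuming they refer to msgpack's integer encoding and that `ChunkIndex.MAX_VALUE` is `2**32 - 1025` (both are assumptions, and `msgpack_uint_size` is a hypothetical helper, not part of borg):

```python
def msgpack_uint_size(value: int) -> int:
    """Return how many bytes msgpack needs for a non-negative integer,
    assuming the packer picks the smallest encoding."""
    if value < 0:
        raise ValueError("only non-negative integers are handled here")
    if value <= 0x7f:
        return 1  # positive fixint: the value is the single byte
    if value <= 0xff:
        return 2  # uint 8: type byte + 1 payload byte
    if value <= 0xffff:
        return 3  # uint 16: type byte + 2 payload bytes
    if value <= 0xffffffff:
        return 5  # uint 32: type byte + 4 payload bytes
    if value <= 0xffffffffffffffff:
        return 9  # uint 64: type byte + 8 payload bytes
    raise ValueError("value does not fit into a msgpack uint 64")


# Assumed value of ChunkIndex.MAX_VALUE; not taken from this diff.
ASSUMED_MAX_VALUE = 2**32 - 1025

size_for_zero = msgpack_uint_size(0)
size_for_max = msgpack_uint_size(ASSUMED_MAX_VALUE)
```

Under these assumptions, `size_for_zero` and `size_for_max` correspond to the "1 byte vs 5 bytes" in the comment.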

enkore · 2017-03-28T13:44:24Z

src/borg/cache.py

-        self.files[path_hash] = msgpack.packb(entry)
-        self._newest_mtime = max(self._newest_mtime or 0, st.st_mtime_ns)
+    def commit(self, config):
+        config.set('cache', 'manifest', 'not in sync')


this must be a hex string

ThomasWaldmann

I did not review all the size/csize stuff.

ThomasWaldmann · 2017-03-28T13:58:09Z

src/borg/archiver.py

@@ -2329,6 +2331,8 @@ def process_epilog(epilog):
                               help='only display items with the given status characters')
        subparser.add_argument('--json', action='store_true',
                               help='output stats as JSON (implies --stats)')
+        subparser.add_argument('--avoid-cache-sync', dest='avoid_cache_sync', action='store_true',
+                               help='Avoid synchronizing the local cache')


"avoid" sounds a bit like it would still do it for some cases, just not for most.

--no-chunks-cache-sync ?

ThomasWaldmann · 2017-03-28T13:59:42Z

src/borg/repository.py

@@ -32,7 +32,7 @@
 TAG_DELETE = 1
 TAG_COMMIT = 2

-LIST_SCAN_LIMIT = 10000  # repo.list() / .scan() result count limit the borg client uses
+LIST_SCAN_LIMIT = 100000  # repo.list() / .scan() result count limit the borg client uses


did you check the other places where this is used, whether it could have some negative impact?

ThomasWaldmann · 2017-03-28T14:01:01Z

src/borg/cache.py

@@ -180,6 +182,7 @@ def __init__(self, repository, key, manifest, path=None, sync=True, do_files=Fal
        self.timestamp = None
        self.lock = None
        self.txn_active = False
+        self.txn_set = ['config']


nitpick: naming a list a set is a bit strange. also, I usually avoid having a data type in the name.

txn_files maybe?

ThomasWaldmann · 2017-03-28T14:02:36Z

src/borg/cache.py

+        if do_files:
+            self.files = FilesCache(self)
+        else:
+            self.files = DummyFilesCache()


Shorter: self.files = FilesCache(self) if do_files else DummyFilesCache()

ThomasWaldmann · 2017-03-28T14:05:53Z

src/borg/cache.py

+        return None
+
+    def memorize_file(self, path_hash, st, ids):
+        return None


has no return value, so just use "pass"? (see same below in commit())

ThomasWaldmann · 2017-03-28T14:07:53Z

src/borg/cache.py

+        """
+        Return whether a chunk with *id* was seen. Optionally verify *size* for
+        enhanced collision resistance.
+        """


Currently, this seems to either return a boolean (seen / not seen) or an int (refcount, >0 = seen, 0 = not seen).
Can we avoid the type confusion?
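One way to avoid the mixed return type, sketched under assumptions: keep the membership test strictly boolean and expose the refcount through a separate method. `ChunkTracker` and its methods are hypothetical names for illustration, not the classes in `src/borg/cache.py`.

```python
class ChunkTracker:
    """Toy stand-in for a chunk index: maps chunk id -> (refcount, size)."""

    def __init__(self):
        self.chunks = {}

    def add(self, id, size):
        # Increment the refcount, creating the entry on first sight.
        refcount, _ = self.chunks.get(id, (0, size))
        self.chunks[id] = (refcount + 1, size)

    def refcount(self, id) -> int:
        """Always an int: 0 means the chunk was never seen."""
        return self.chunks.get(id, (0, 0))[0]

    def seen_chunk(self, id, size=None) -> bool:
        """Always a bool. Optionally verify *size* against the stored size."""
        entry = self.chunks.get(id)
        if entry is None:
            return False
        _, stored_size = entry
        if size is not None and stored_size != size:
            raise ValueError(
                "chunk has same id [%r], but different size (stored: %d new: %d)!"
                % (id, stored_size, size))
        return True


tracker = ChunkTracker()
tracker.add(b"chunk-a", 10)
```

Callers that only need "seen or not" use `seen_chunk()`, and callers that need the count use `refcount()`, so neither has to guess the type.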

ThomasWaldmann · 2017-03-28T14:11:48Z

src/borg/cache.py

+            result = self.cache.repository.list(limit=LIST_SCAN_LIMIT, marker=marker)
+            if not result:
+                break
+            pi.show(len(result))


Guess this should be:

pi.show(increase=len(result))

btw, if you have rather huge LIST_SCAN_LIMIT, this will be a rather jumpy progress indicator.
ofc, it depends on repo size...
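A self-contained sketch of the paging loop with an incremental progress update. `Progress` and `FakeRepository` are hypothetical stand-ins written for this example; they only mimic the shape of the calls in the diff and are not borg's classes.

```python
class Progress:
    """Toy progress indicator: show() either sets or increases the counter."""

    def __init__(self, total):
        self.total = total
        self.counter = 0

    def show(self, current=None, increase=1):
        if current is not None:
            self.counter = current    # absolute position
        else:
            self.counter += increase  # relative step
        return self.counter * 100 // self.total


class FakeRepository:
    """Toy repository whose list() returns up to *limit* ids after *marker*."""

    def __init__(self, ids):
        self.ids = sorted(ids)

    def list(self, limit, marker=None):
        start = 0 if marker is None else self.ids.index(marker) + 1
        return self.ids[start:start + limit]


def scan_all(repository, limit):
    pi = Progress(total=len(repository.ids))
    seen, percentages = [], []
    marker = None
    while True:
        result = repository.list(limit=limit, marker=marker)
        if not result:
            break
        # Pass the batch size as an increase; a positional argument would
        # be taken as the absolute position in this toy class.
        percentages.append(pi.show(increase=len(result)))
        seen.extend(result)
        marker = result[-1]
    return seen, percentages


seen, percentages = scan_all(FakeRepository(range(10)), limit=4)
```

With 10 ids and a limit of 4 the indicator moves in three steps, which illustrates the jumpiness: the larger the limit relative to the repository size, the fewer and coarser the updates.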

ThomasWaldmann · 2017-03-28T14:13:39Z

src/borg/cache.py

    def add_chunk(self, id, chunk, stats, overwrite=False):
-        if not self.txn_active:
-            self.begin_txn()
+        assert not overwrite, 'Logic Bug'


maybe a few more words than just 'Logic Bug'?
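A sketch of what a more descriptive assertion message could look like. The wording is illustrative only, the reason it gives is an assumption about this code path, and the function is a simplified stand-in for the real `add_chunk`.

```python
def add_chunk(chunks, id, data, overwrite=False):
    """Store *data* under *id* in the dict *chunks*; overwriting is unsupported."""
    # Hypothetical message: states what was called and which precondition
    # was violated, instead of only 'Logic Bug'.
    assert not overwrite, (
        "add_chunk() was called with overwrite=True, "
        "but this cache variant never overwrites chunks (logic bug in the caller)"
    )
    chunks.setdefault(id, data)
    return len(data)


store = {}
stored_size = add_chunk(store, b"id-1", b"data")
```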

ThomasWaldmann · 2017-03-28T14:14:24Z

src/borg/cache.py

-            raise Exception("chunk has same id [%r], but different size (stored: %d new: %d)!" % (
-                            id, stored_size, size))
-        return refcount
+        return id in self.chunks


bool vs. int, see above.

enkore added 4 commits March 11, 2017 23:45

create: --avoid-cache-sync (10029bb)
restructuring (05f1688)
structuring (dcc56ae)
stuff works (69c7523)

enkore changed the title ~~borg create without a cache~~ borg create without a cache – Protype, Do Not Merge Mar 28, 2017

enkore changed the title ~~borg create without a cache – Protype, Do Not Merge~~ borg create without a cache – Prototype, Do Not Merge Mar 28, 2017

enkore commented Mar 28, 2017

ThomasWaldmann reviewed Mar 28, 2017

ThomasWaldmann mentioned this pull request Mar 30, 2017

the new (c)size ticket #2357

Closed

enkore closed this Apr 28, 2017

shadowlmd mentioned this pull request Sep 22, 2020

borg is incompatible with setuptools 50.0.0 #5312

Closed
