github/jgit - jgit - 帆软第三方插件仓库

Commit Graph

Author	SHA1	Message	Date
Jonathan Nieder	31d79ae0af	Remove unused logger from RecursiveMerger JGit doesn't currently use java.util.logging.Logger. Remove this never-used Logger introduced in `ab99b78ca0` (Implement recursive merge strategy, 2013-02-21) to make that easier to see. Change-Id: I92c578e7f3617085a667de7c992174057be3eb71	12 years ago
Robin Rosenberg	9dcd8c2c90	Make the Reflog a public API again Change-Id: I8ced7098da5b345fd9af2fdfafd1ef6a44ccee0d	12 years ago
Robin Rosenberg	2af9a4c7b1	Merge "status: Print conflict description for unmerged paths"	12 years ago
Robin Stocker	3699ea648e	Document RevTag#getObject() that returned object is unparsed Change-Id: I238d388e40362721eecf37f64ad7d48a399ff129	12 years ago
Robin Stocker	60f0eb748c	Improve class documentation of TagCommand Change-Id: I9c636b927fa2d7cfbe1eb5535a9e702b2209f51d	12 years ago
Shawn Pearce	1b4320f1fa	Revert "Add tests for FileUtils.delete and EMPTY_DIREECTORIES_ONLY" This reverts commit `7aa54967a2`. The unit test dependend upon the specific order of names that listFiles() returned members in. The order is completely undefined and may differ even on different versions of Linux based systems. A proper unit test for this code would have considered both cases, where the deletion function was able to remove an empty subdirectory, or fail to remove a subdirectory because a file was still present within. This is not such a test. Change-Id: Ib0a706fea01e4b1ed8c8e859247d247a1279b4bc	12 years ago
Robin Stocker	a50ed5666f	status: Print conflict description for unmerged paths Prefix unmerged paths with conflict description (e.g. "both modified:"), the same way C Git does. Change-Id: I083cd191ae2ad3e2460aa4052774aed6e36c2699	12 years ago
Robin Rosenberg	ee222a3be1	Create constants in ConfigConstants for the "diff" section Change-Id: I5cf5fe60374d1e94eb031488e4f92c8e521f41a6 Signed-off-by: Chris Aniszczyk <zx@twitter.com>	12 years ago
Robin Stocker	2396bab339	Fix examples with refs/heads/ in RefSpec Javadoc Change-Id: I06c1c7242a1b4c8f499c27a598cca714803799b7 Signed-off-by: Chris Aniszczyk <zx@twitter.com>	12 years ago
Robin Stocker	1080cc5a0d	IndexDiff: Provide stage state for conflicting entries Adds a new method getConflictingStageStates() which returns a Map<String, StageState> (path to stage state). StageState is an enum for all possible stage combinations (BOTH_DELETED, ADDED_BY_US, ...). This can be used to implement the conflict text for unmerged paths in output of "git status" or in EGit for decorations/hints. Bug: 403697 Change-Id: Ib461640a43111b7df4a0debe92ff69b82171329c Signed-off-by: Chris Aniszczyk <zx@twitter.com>	12 years ago
Robin Rosenberg	1c40d83f52	Merge "A deleted work tree file is not a conflict when merge wants to delete it"	12 years ago
Robin Rosenberg	f37e25e2c3	Merge "Untracked files should not be included in stash"	12 years ago
Matthias Sohn	427db940ca	Do not export package org.eclipse.jgit from jgit tests Commit `3344b93c` erroneously exported the package org.eclipse.jgit.lib from the org.eclipse.jgit.test bundle which made this a split package since the bundle org.eclipse.jgit exports the same package. Split packages are evil in general and most probably caused the build cycle errors observed recently when importing the jgit projects in Eclipse [1]. [1] http://dev.eclipse.org/mhonarc/lists/jgit-dev/msg02012.html Change-Id: I89919e56b928acdbff0b90e3919808025a8562c6 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	12 years ago
Robin Rosenberg	7a42b7fb95	Untracked files should not be included in stash The previous code stashed untracked files and left them in the work tree. Bug: 403282 Change-Id: I71727addb2b55fb8e409cae2b6af8138b1ff7ef1	12 years ago
Robin Rosenberg	526b6266a5	Remove some unnecessary dependencies on FileRepostory Change-Id: Ib6ee3a2874a7e2240aa68f4ac32d00c4d1fab5ae Signed-off-by: Chris Aniszczyk <zx@twitter.com>	12 years ago
Robin Rosenberg	6e90671a51	Merge "Remove unused dependencies"	12 years ago
Robin Stocker	78fca8a099	Improve test coverage of AutoCRLF(In\|Out)putStream Bug: 405672 Change-Id: I3894e98617fcee16dc2ac9853c203c62eb30c3ab Signed-off-by: Chris Aniszczyk <zx@twitter.com>	12 years ago
Shawn Pearce	fa1bc6abb7	Merge changes Id2848c16,I7621c434 * changes: Rescale "Compressing objects" progress meter by size Split delta search buckets by byte weight	12 years ago
Shawn Pearce	5d8a9f6f3f	Rescale "Compressing objects" progress meter by size Instead of counting objects processed, count number of bytes added into the window. This should rescale the progress meter so that 30% complete means 30% of the total uncompressed content size has been inflated and fed into the window. In theory the progress meter should be more accurate about its percentage complete/remaining fraction than with objects. When counting objects small objects move the progress meter more rapidly than large objects, but demand a smaller amount of work than large objects being compressed. Change-Id: Id2848c16a2148b5ca51e0ca1e29c5be97eefeb48	12 years ago
Shawn Pearce	21e4aa2b9e	Split delta search buckets by byte weight Instead of assuming all objects cost the same amount of time to delta compress, aggregate the byte size of objects in the list and partition threads with roughly equal total bytes. Before splitting the list select the N largest paths and assign each one to its own thread. This allows threads to get through the worst cases in parallel before attempting smaller paths that are more likely to be splittable. By running the largest path buckets first on each thread the likely slowest part of compression is done early, while progress is still reporting a low percentage. This gives users a better impression of how fast the phase will run. On very complex inputs the slow part is more likely to happen first, making a user realize its time to go grab lunch, or even run it overnight. If the worst sections are earlier, memory overruns may show up earlier, giving the user a chance to correct the configuration and try again before wasting large amounts of time. It also makes it less likely the delta compression phase reaches 92% in 30 minutes and then crawls for 10 hours through the remaining 8%. Change-Id: I7621c4349b99e40098825c4966b8411079992e5f	12 years ago
Shawn Pearce	e74263e743	Merge "Support excluding objects during DFS compaction"	12 years ago
Shawn Pearce	3c27ee1a91	Support excluding objects during DFS compaction By excluding objects the compactor can avoid storing objects that are already well packed in the base GC packs, or any other pack not being replaced by the current compaction operation. For deltas the base object is still included even if the base exists in another exclusion set. This favors keeping deltas for recent history, to support faster fetch operations for clients. Change-Id: Ie822fe075fe5072fe3171450fda2f0ca507796a1	12 years ago
Matthias Sohn	aa7be667bc	Make recursive merge strategy the default merge strategy Use recursive merge as the default strategy since it can successfully merge more cases than the resolve strategy can. This is also the default in native Git. Change-Id: I38fd522edb2791f15d83e99038185edb09fed8e1 Signed-off-by: Matthias Sohn <matthias.sohn@sap.com>	12 years ago
Colby Ranger	eaa52b12f5	Update PackBitmapIndexRemapper to handle mappings not in the new pack. Previously, the code assumed all commits in the old pack would also be present in the new pack. This assumption caused an ArrayIndexOutOfBoundsException during remapping of ids. Fix the iterator to only return entries that may be remapped. Furthermore, update getBitmap() to return null if commit does not exist in the new pack. Change-Id: I065babe8cd39a7654c916bd01c7012135733dddf	12 years ago
Robin Rosenberg	4c638be79f	Fix boundary conditions in AutoCRLFOutputStream This fixes some problems with inputs around the size of the internal buffer in AutoCRLFOutputStream (8000). Tests supplied by Robin Stocker. Bug: 405672 Change-Id: I6147897290392b3bfd4040e8006da39c302a3d49	12 years ago
Robin Rosenberg	a6ed390ea7	NLS warning cleanup Change-Id: Ia76aa02dd330a1f88096c2b059b363aa38d653e9	12 years ago
Robin Rosenberg	5db307a695	Merge "Fix a possible NPE"	12 years ago
Shawn Pearce	5f03dc61b4	Merge changes I845caede,Ie25c6d3a,I5caec313,Ib11ff99f,I9ccf20c3,Ic7826f29,I1bdd8b58,Idb84c1d7,I078841f9 * changes: Always attempt delta compression when reuseDeltas is false Avoid TemporaryBuffer.Heap on very small deltas Correct distribution of allowed delta size along chain length Split remaining delta work on path boundaries Replace DeltaWindow array with circularly linked list Micro-optimize copy instructions in DeltaEncoder Micro-optimize DeltaWindow primary loop Micro-optimize DeltaWindow maxMemory test to be != 0 Mark DeltaWindowEntry methods final	12 years ago
Shawn Pearce	c9707e6353	Always attempt delta compression when reuseDeltas is false If reuseObjects=true but reuseDeltas=false the caller wants attempt a delta for every object in the input list. Test for reuseDeltas to ensure every object passes through the searchInWindow() method. If no delta is possible for an object and it will be stored whole (non-delta format), PackWriter may still reuse its content from any source pack. This avoids an inflate()-deflate() cycle to recompress the object contents. Change-Id: I845caeded419ef4551ef1c85787dd5ffd73235d9	12 years ago
Shawn Pearce	a5c6aac76c	Avoid TemporaryBuffer.Heap on very small deltas TemporaryBuffer is great when the output size is not known, but must be bound by a relatively large upper limit that fits in memory, e.g. 64 KiB or 20 MiB. The buffer gracefully supports growing storage by allocating 8 KiB blocks and storing them in an ArrayList. In a Git repository many deltas are less than 8 KiB. Typical tree objects are well below this threshold, and their deltas must be encoded even smaller. For these much smaller cases avoid the 8 KiB minimum allocation used by TemporaryBuffer. Instead allocate a very small OutputStream writing to an array that is sized at the limit. Change-Id: Ie25c6d3a8cf4604e0f8cd9a3b5b701a592d6ffca	12 years ago
Shawn Pearce	8a7c2f97d0	Correct distribution of allowed delta size along chain length Nicolas Pitre discovered a very simple rule for selecting between two different delta base candidates: - if based whole object, must be <= 50% of target - if at end of a chain, must be <= 1/depth * 50% of target The rule penalizes deltas near the end of the chain, requiring them to be very small in order to be kept by the packer. This favors deltas that are based on a shorter chain, where the read-time unpack cost is much lower. Fewer bytes need to be consulted from the source pack file, and less copying is required in memory to rebuild the object. Junio Hamano explained Nico's rule to me today, and this commit fixes DeltaWindow to implement it as described. When no base has been chosen the computation is simply the statements denoted above. However once a base with depth of 9 has been chosen (e.g. when pack.depth is limited to 10), a non-delta source may create a new delta that is up to 10x larger than the already selected base. This reflects the intent of Nico's size distribution rule no matter what order objects are visited in the DeltaWindow. With this patch and my other patches applied, repacking JGit with: [pack] reuseObjects = false reuseDeltas = false depth = 50 window = 250 threads = 4 compression = 9 CGit (all) 5,711,735 bytes; real 0m13.942s user 0m47.722s [1] JGit heads 5,718,295 bytes; real 0m11.880s user 0m38.177s [2] rest 9,809 bytes The improved JGit result for the head pack is only 6.4 KiB larger than CGit's resulting pack. This patch allowed JGit to find an additional 39.7 KiB worth of space savings. JGit now also often runs 2s faster than CGit, despite also creating bitmaps and pruning objects after the head pack creation. [1] time git repack -a -d -F --window=250 --depth=50 [2] time java -Xmx128m -jar jgit debug-gc Change-Id: I5caec31359bf7248cabdd2a3254c84d4ee3cd96b	12 years ago
Shawn Pearce	3b7924f403	Split remaining delta work on path boundaries When an idle thread tries to steal work from a sibling's remaining toSearch queue, always try to split along a path boundary. This avoids missing delta opportunities in the current window of the thread whose work is being taken. The search order is reversed to walk further down the chain from current position, avoiding the risk of splitting the list within the path the thread is currently processing. When selecting which thread to split from use an accurate estimate of the size to be taken. This avoids selecting a thread that has only one path remaining but may contain more pending entries than another thread with several paths remaining. As there is now a race condition where the straggling thread can start the next path before the split can finish, the stealWork() loop spins until it is able to acquire a split or there is only one path remaining in the siblings. Change-Id: Ib11ff99f90a4d9efab24bf4a85342cc63203dba5	12 years ago
Shawn Pearce	65f44bef23	Remove DFS locality ordering during packing PackWriter generally chooses the order for objects when it builds the object lists. This ordering already depends on history information to guide placing more recent objects first and historical objects last. Allow PackWriter to make the basic ordering decisions, instead of trying to override them. The old approach of sorting the list caused DfsReader to override any ordering change PackWriter might have tried to make when repacking a repository. This now better matches with WindowCursor's implementation, where PackWriter solely determines the object ordering. Change-Id: Ic17ab5631ec539f0758b962966c3a1823735b814	12 years ago
Shawn Pearce	af33a911d0	Replace DeltaWindow array with circularly linked list Typical window sizes are 10 and 250 (although others are accepted). In either case the pointer overhead of 1 pointer in an array or 2 pointers for a double linked list is trivial. A doubly linked list as used here for window=250 is only another 1024 bytes on a 32 bit machine, or 2048 bytes on a 64 bit machine. The critical search loops scan through the array in either the previous direction or the next direction until the cycle is finished, or some other scan abort condition is reached. Loading the next object's pointer from a field in the current object avoids the branch required to test for wrapping around the edge of the array. It also saves the array bounds check on each access. When a delta is chosen the window is shuffled to hoist the currently selected base as an earlier candidate for the next object. Moving the window entry is easier in a double-linked list than sliding a group of array entries. Change-Id: I9ccf20c3362a78678aede0f0f2cda165e509adff	12 years ago
Shawn Pearce	0f32901ab7	Micro-optimize copy instructions in DeltaEncoder The copy instruction formatter should not to compute the shifts and masks twice. Instead compute them once and assume there is a register available to store the temporary "b" for compare with 0. Change-Id: Ic7826f29dca67b16903d8f790bdf785eb478c10d	12 years ago
Shawn Pearce	1db50c9d91	Micro-optimize DeltaWindow primary loop javac and the JIT are more likely to understand a boolean being used as a branch conditional than comparing int against 0 and 1. Rewrite NEXT_RES and NEXT_SRC constants to be booleans so the code is clarified for the JIT. Change-Id: I1bdd8b587a69572975a84609c779b9ebf877b85d	12 years ago
Shawn Pearce	6903fa4a34	Micro-optimize DeltaWindow maxMemory test to be != 0 Instead of using a compare-with-0 use a does not equal 0. javac bytecode has a special instruction for this, as it is very common in software. We can assume the JIT knows how to efficiently translate the opcode to machine code, and processors can do != 0 very quickly. Change-Id: Idb84c1d744d2874517fd4bfa1db390e2dbf64eac	12 years ago
Robin Rosenberg	4955301fac	Merge "Consider working tree changes when stashing newly added files"	12 years ago
Shawn Pearce	4db695c1c6	Mark DeltaWindowEntry methods final This class and all of its methods are only package visible. Clarify the methods as final for the benefit of the JIT to inline trivial code. Change-Id: I078841f9900dbf299fbe6abf2599f0208ae96856	12 years ago
Shawn Pearce	b5cbfa0146	Merge changes Ideecc472,I2b12788a,I6cb9382d,I12cd3326,I200baa0b,I05626f2e,I65e45422 * changes: Increase PackOutputStream copy buffer to 64 KiB Tighten object header writing in PackOutuptStream Skip main thread test in ThreadSafeProgressMonitor Declare members of PackOutputStream final Always allocate the PackOutputStream copyBuffer Disable CRC32 computation when no PackIndex will be created Steal work from delta threads to rebalance CPU load	12 years ago
Robin Rosenberg	8272f65730	Merge "LogCommand.all(): filter out refs that do not refer to commit objects"	12 years ago
Robin Rosenberg	ad2ffc576b	Merge "LogCommand.all(), peel references before using them"	12 years ago
Shawn Pearce	6c0bb4351d	Increase PackOutputStream copy buffer to 64 KiB Colby just pointed out to me the buffer was 16 KiB. This may be very small for common objects. Increase to 64 KiB. Change-Id: Ideecc4720655a57673252f7adb8eebdf2fda230d	12 years ago
Shawn Pearce	46ef61a702	Tighten object header writing in PackOutuptStream Most objects are written as OFS_DELTA with the base in the pack, that is why this case comes first in writeHeader(). Rewrite the condition to always examine this first and cache the PackWriter's formatting flag for use of OFS_DELTA headers, in modern Git networks this is true more often then it it is false. Assume the cost of write() is high, especially due to entering the MessageDigest to update the pack footer SHA-1 computation. Combine the OFS_DELTA information as part of the header buffer so that the entire burst is a single write call, rather than two relatively small ones. Most OFS_DELTA headers are <= 6 bytes, so this rewrite tranforms 2 writes of 3 bytes each into 1 write of ~6 bytes. Try to simplify the objectHeader code to reduce branches and use more local registers. This shouldn't really be necessary if the compiler is well optimized, but it isn't very hard to clarify data usage to either javac or the JIT, which may make it easier for the JIT to produce better machine code for this method. Change-Id: I2b12788ad6866076fabbf7fa11f8cce44e963f35	12 years ago
Shawn Pearce	d01fe32795	Skip main thread test in ThreadSafeProgressMonitor update(int) is only invoked from a worker thread, in JGit's case this is DeltaTask. The Javadoc of TSPM suggests update should only ever be used by a worker thread. Skip the main thread check, saving some cycles on each run of the progress monitor. Change-Id: I6cb9382d71b4cb3f8e8981c7ac382da25304dfcb	12 years ago
Shawn Pearce	66192817cd	Declare members of PackOutputStream final These methods cannot be sanely overridden anywhere. Most methods are package visible only, or are private. A few public methods do exist but there is no useful way to override them since creation of PackOutputStream is managed by PackWriter and cannot be delegated. Change-Id: I12cd3326b78d497c1f9751014d04d1460b46e0b0	12 years ago
Shawn Pearce	2be6927d8e	Always allocate the PackOutputStream copyBuffer The getCopyBuffer() is almost always used during output. All known implementations of ObjectReuseAsIs rely on the buffer to be present, and the only sane way to get good performance from PackWriter is to reuse objects during packing. Avoid a branch and test when obtaining this buffer by making sure it is always populated. Change-Id: I200baa0bde5dcdd11bab7787291ad64535c9f7fb	12 years ago
Shawn Pearce	eb17495ca4	Disable CRC32 computation when no PackIndex will be created If a server is streaming 3GiB worth of pack data to a client there is no reason to compute the CRC32 checksum on the objects. The CRC32 code computed by PackWriter is used only in the new index created by writeIndex(), which is never invoked for the native Git network protocols. Object reuse may still compute its own CRC32 to verify the data being copied from an existing pack has not been corrupted. This check is done by the ObjectReader that implements ObjectReuseAsIs and has no relationship to the CRC32 being skipped during output. Change-Id: I05626f2e0d6ce19119b57d8a27193922636d60a7	12 years ago
Shawn Pearce	d0a5337625	Steal work from delta threads to rebalance CPU load If the configuration wants to run 4 threads the delta search work is initially split somewhat evenly across the 4 threads. During execution some threads will finish early due to the work not being split fairly, as the initial partitions were based on object count and not cost to inflate or size of DeltaIndex. When a thread finishes early it now tries to take 50% of the work remaining on a sibling thread, and executes that before exiting. This repeats as each thread completes until a thread has only 1 object remaining. Repacking Blink, Chromium's new fork of WebKit (2.2M objects 3.9G): [pack] reuseDeltas = false reuseObjects = false depth = 50 threads = 8 window = 250 windowMemory = 800m before: ~105% CPU after 80% after: >780% CPU to 100% Change-Id: I65e45422edd96778aba4b6e5a0fd489ea48e8ca3	12 years ago
Robin Rosenberg	1bede91db2	Consider working tree changes when stashing newly added files Bug: 402396 Change-Id: I50ff707c0c9abcab3f98eea21aaa6e824f7af63a	12 years ago

1 2 3 4 5 ...

2658 Commits (b83c269369cb83cea10bd5b32891a4bb42793230) All Branches Search

2658 Commits (b83c269369cb83cea10bd5b32891a4bb42793230)

All Branches