oracular (7) dgit.7.gz

Provided by: dgit_11.10_all bug

NAME

       dgit - principles of operation

SUMMARY

       dgit  treats  the Debian archive as a version control system, and bidirectionally gateways
       between the archive and git.  The git view of the package can contain the  usual  upstream
       git  history,  and  will  be  augmented  by  commits  representing  uploads  done by other
       developers not using dgit.  This git history is stored in a canonical  location  known  as
       dgit-repos which lives on a dedicated git server.

       git  branches  suitable for use with dgit can be edited directly in git, and used directly
       for building binary packages.  They can be shared using all conventional means for sharing
       git  branches.   It  is  not  necessary  to  use  dgit  to work with dgitish git branches.
       However, dgit is (usually) needed in order to convert  to  or  from  Debian-format  source
       packages.

SEE ALSO

       dgit(1)
              Reference manual and documentation catalogue.

       dgit-*(7)
              Tutorials and workflow guides.  See dgit(1) for a list.

MODEL

       You may use any suitable git workflow with dgit, provided you satisfy dgit's requirements:

       dgit maintains a pseudo-remote called dgit, with one branch per suite.  This remote cannot
       be used with plain git.

       The  dgit-repos  repository  for  each  package  contains  one   ref   per   suite   named
       refs/dgit/suite.  These should be pushed to only by dgit.  They are fast forwarding.  Each
       push on this branch corresponds to an upload (or attempted upload).

       However, it is perfectly fine to have other branches in dgit-repos;  the  dgit-repos  repo
       for  the  package  can  even  be  the  same  as  the `origin' remote, although this is not
       generally the case in Debian.

       dgit push-* will also make signed tags called archive/debian/version (with version encoded
       a  la  DEP-14)  and push them to dgit-repos.  These are used at the server to authenticate
       pushes.

       Uploads made by dgit contain an additional field Dgit in the source package  .dsc.   (This
       is added by dgit push-*.)  This specifies: a commit (an ancestor of the dgit/suite branch)
       whose tree is identical to the unpacked source upload; the distro to which the upload  was
       made;  a  tag  name which can be used to fetch the git commits; and a url to use as a hint
       for the dgit git server for that distro.

       Uploads not made by dgit are represented in git by commits which are synthesised by  dgit.
       The  tree  of  each such commit corresponds to the unpacked source; there is a commit with
       the contents, and a pseudo-merge from last known upload - that is, from  the  contents  of
       the  dgit/suite  branch.   Depending on the source package format, the contents commit may
       have a more complex structure, but ultimately it will be a convergence of stubby  branches
       from origin commits representing the components of the source package.

       dgit  expects trees that it works with to have a dgit (pseudo) remote.  This refers to the
       dgit-created git view of the corresponding archive.

       The dgit archive tracking view is synthesised locally, on demand, by each  copy  of  dgit.
       The  tracking  view is always a descendant of the dgit-repos suite branch (if one exists),
       but may be ahead of it if uploads have been done without dgit.  The archive tracking  view
       is always fast forwarding within each suite.

       dgit push-* can operate on any commit which is a descendant of the suite tracking branch.

       dgit  does not make a systematic record of its imports of orig tarball(s).  So it does not
       work by finding git tags or branches referring to orig tarball(s).  The orig tarballs  are
       downloaded  (by  dgit clone) into the parent directory, as with a traditional (non-gitish)
       dpkg-source workflow.  You need to retain these tarballs in the parent directory for  dgit
       build and dgit push-*.  (They are not needed for purely-git-based workflows.)

       dgit  repositories  could  be  cloned  with  standard  (git)  methods.   However, the dgit
       repositories do not contain uploads not made  with  dgit.   And  for  sourceful  builds  /
       uploads the orig tarball(s) will need to be present in the parent directory.

       To  a  user  looking  at  the archive, changes pushed in a simple NMU using dgit look like
       reasonable changes made in an NMU: in a `3.0 (quilt)' package the delta from the  previous
       upload is recorded in new patch(es) constructed by dpkg-source.

COMBINED SUITES

       dgit  can  synthesize  a combined view of several underlying suites.  This is requested by
       specifying, for suite, a comma-separated list:

              mainsuite,subsuite...

       This facility is available with dgit clone, fetch and pull, only.

       dgit will fetch the same package from each specified underlying suite, separately  (as  if
       with  dgit  fetch).   dgit  will then generate a pseudomerge commit on the tracking branch
       remotes/dgit/dgit/suite which has the tip of each of the underlying suites as an ancestor,
       and which contains the same as the suite which has the highest version of the package.

       The package must exist in mainsuite, but need not exist in the subsuites.

       If a specified subsuite starts with - then mainsuite is prepended.

       So,  for  example,  stable,-security  means to look for the package in stable, and stable-
       security, taking whichever is newer.  If stable is  currently  jessie,  dgit  clone  would
       leave you on the branch dgit/jessie,-security.

       Combined  suites  are  not  supported by the dgit build operations.  This is because those
       options are intended for building for uploading source packages, and look in the changelog
       to  find  the  relevant suite.  It does not make sense to name a dgit-synthesised combined
       suite in a changelog, or to try to upload to it.

       When using this facility, it is important to always specify the same suites  in  the  same
       order: dgit will not make a coherent fast-forwarding history view otherwise.

       The  history  generated  by  this  feature  is not normally suitable for merging back into
       upstreams, as it necessarily contains unattractive pseudomerges.

LIMITATIONS

       Because the synthesis of the suite tracking branches is done locally  based  only  on  the
       current archive state, it will not necessarily see every upload not done with dgit.  Also,
       different versions of dgit (or  the  software  it  calls)  might  import  the  same  .dscs
       differently (although we try to minimise this).  As a consequence, the dgit tracking views
       of the same suite, made by different instances of dgit, may vary.  They will have the same
       contents, but may have different history.

       There  is  no  uniform  linkage  between  the tracking branches for different suites.  The
       Debian infrastructure does not do any automatic import of uploads made without  dgit.   It
       would  be  possible for a distro's infrastructure to do this; in that case, different dgit
       client instances would see exactly the same history.

       There has been no bulk import of historical uploads into Debian's dgit infrastructure.  To
       do  this  it  would  be  necessary to decide whether to import existing vcs history (which
       might not be faithful to dgit's invariants) or previous non-Dgit uploads (which would  not
       provide a very rich history).

       git  represents only file executability.  git does not represent empty directories, or any
       leaf objects other than plain files and symlinks.  The behaviour of Debian source  package
       formats  on  objects  with  unusual  permissions is complicated.  Some pathological Debian
       source packages will no longer build if empty directories are pruned (or if  other  things
       not  reproduced  by git are changed).  Such sources cannot be worked with properly in git,
       and therefore not with dgit either.

READ-ONLY DISTROS

       Distros which do not maintain a set of dgit history git repositories can still be used  in
       a read-only mode with dgit.  Currently Ubuntu is configured this way.

GITATTRIBUTES

       git  has features which can automatically transform files as they are being copied between
       the working tree and the git history.  The attributes can be specified in the source  tree
       itself, in .gitattributes.  See gitattributes(5).

       These  transformations  are  context-sensitive  and  not,  in general, reversible, so dgit
       operates on the principle that the dgit git history contains the actual  contents  of  the
       package.   (When  dgit  is  manipulating  a  .dsc, it does so in a private area, where the
       transforming gitattributes are defused, to achieve this.)

       If transforming gitattributes are used, they can cause trouble, because the  working  tree
       files  can  differ from the git revision history (and therefore from the source packages).
       dgit warns if it finds a .gitattributes file (in a package  being  fetched  or  imported),
       unless the transforming gitattributes have been defused.

       dgit  clone  and  dgit  setup-new-tree  disable  transforming gitattributes by default, by
       creating a  suitable  .git/info/attributes.   See  dgit  setup-new-tree  and  dgit  setup-
       gitattributes in dgit(1).

       Note  that  dgit  does not disable gitattributes unless they would actually interfere with
       your work on dgit branches.  In particular, gitattributes which affect git archive are not
       disabled,  so  .origs  you  generate by hand can be wrong.  You should consider using git-
       deborig (1) which gets this right, suppressing the attributes.

PACKAGE SOURCE FORMATS

       If you are not the maintainer, you do not need to worry about the  source  format  of  the
       package.  You can just make changes as you like in git.  If the package is a `3.0 (quilt)'
       package, the patch stack will usually not be represented in the git history.

FILE EXECUTABILITY

       Debian source package formats do not always faithfully reproduce changes to executability.
       But dgit insists that the result of dgit clone is identical (as far as git can represent -
       see Limitations, above) to the result of dpkg-source -x.

       So files that are executable in your git tree must be executable in the  result  of  dpkg-
       source  -x  (but  often aren't).  If a package has such troublesome files, they have to be
       non-executable in dgit-compatible git branches.

FORMAT 3.0 (QUILT)

       For a format `3.0 (quilt)' source package, dgit may have to make a commit on your  current
       branch to contain metadata used by quilt and dpkg-source.

       This  is  because  `3.0  (quilt)'  source  format  represents  the patch stack as files in
       debian/patches/ actually inside the source tree.  This means that, taking the  whole  tree
       (as seen by git or ls) (i) dpkg-source cannot represent certain trees, and (ii) packing up
       a tree in `3.0 (quilt)' and then unpacking it does not always yield the same tree.

       dgit will automatically work around this for you when  building  and  pushing.   The  only
       thing  you  need  to  know  is that dgit build, sbuild, etc., may make new commits on your
       HEAD.  If you're not a quilt user this commit won't contain any changes to files you  care
       about.

       Simply  committing  to  source  files (whether in debian/ or not, but not to patches) will
       result in a branch that dgit quilt-fixup can linearise.  Other kinds of changes, including
       editing patches or merging, cannot be handled this way.

       You can explicitly request that dgit do just this fixup, by running dgit quilt-fixup.

       If  you  are  a  quilt  user  you  need to know that dgit's git trees are `patches applied
       packaging branches' and do not contain the .pc directory (which is used by quilt to record
       which  patches  are applied).  If you want to manipulate the patch stack you probably want
       to be looking at tools like git-debrebase, gbp pq, or git-dpm.

   quilt fixup error messages
       When dgit's quilt fixup fails, it prints messages like this:

       dgit: base trees orig=5531f03d8456b702eab6 o+d/p=135338e9cc253cc85f84
       dgit: quilt differences: src:  == orig ##     gitignores:  == orig ##
       dgit: quilt differences:      HEAD ## o+d/p               HEAD ## o+d/p
       starting quiltify (multiple patches, linear mode)

       dgit: error: quilt fixup cannot be linear.  Stopped at:
       dgit:  696c9bd5..84ae8f96: changed debian/patches/test-gitignore

       orig   is an import of the .orig tarballs dgit found, with the debian/ directory from your
              HEAD  substituted.   This is a git tree object, not a commit: you can pass its hash
              to git-diff but not git-log.

       o+d/p  is another tree object, which is the  same  as  orig  but  with  the  patches  from
              debian/patches applied.

       HEAD   is of course your own git HEAD.

       quilt differences
              shows whether each of the these trees differs from the others (i) in upstream files
              excluding .gitignore files;  (ii)  in  upstream  .gitignore  files.   ==  indicates
              equality; ## indicates inequality.

       dgit quilt-fixup --quilt=linear walks commits backwards from your HEAD trying to construct
       a linear set of additional patches, starting at the end.  It hopes to eventually  find  an
       ancestor whose tree is identical to o+d/p in all upstream files.

       In  the  error  message,  696c9bd5..84ae8f96  is  the first commit child-parent edge which
       cannot sensibly be either ignored, or turned into a  patch  in  debian/patches.   In  this
       example,  this  is  because  it  itself  changes  files in debian/patches, indicating that
       something unusual is going on and that continuing is not safe.  But  you  might  also  see
       other kinds of troublesome commit or edge.

       Your  appropriate  response depends on the cause and the context.  If you have been freely
       merging your git branch and do not need need a pretty linear  patch  queue,  you  can  use
       --quilt=single  or  --quilt=smash.   (Don't use the single-debian-patch dpkg source format
       option; it has strange properties.)  If you want a pretty linear series, and this  message
       is  unexpected,  it  can  mean  that  you  have unwittingly committed changes that are not
       representable by dpkg-source (such as some mode changes).  Or  maybe  you  just  forgot  a
       necessary --quilt= option.

       Finally,  this  problem  can  occur  if  you have provided Debian git tooling such as git-
       debrebase, git-dpm or git-buildpackage with upstream git commit(s) or tag(s) which are not
       100% identical to your orig tarball(s).

SPLIT VIEW AND SPLITTING QUILT MODES

       When  working with git branches intended for use with the `3.0 (quilt)' source format dgit
       can automatically convert a suitable maintainer-provided git branch (in one of  a  variety
       of formats) into a dgit branch.

       When  a splitting quilt mode is selected dgit build commands and dgit push-* will, on each
       invocation, convert the user's HEAD into the dgit view, so that it  can  be  built  and/or
       uploaded.

       Split  view  mode can also be enabled explicitly with the --split-view command line option
       and the .split-view access configuration key.

       When split view is  in  operation,  regardless  of  the  quilt  mode,  any  dgit-generated
       pseudomerges  and  any quilt fixup commits will appear only in the dgit view.  dgit push-*
       will push the dgit view to the dgit git server.  The dgit view is always a  descendant  of
       the maintainer view.  dgit push-* will also make a maintainer view tag according to DEP-14
       and push that to the dgit git server.

       Splitting quilt modes must be enabled explicitly (by the use  of  the  applicable  command
       line  options,  subcommands,  or  configuration).   This  is because it is not possible to
       reliably tell (for example) whether a git tree for a dpkg-source `3.0 (quilt)' package  is
       a patches-applied or patches-unapplied tree.

       Split  view conversions are cached in the ref dgit-intern/quilt-cache.  This should not be
       manipulated directly.

FILES IN THE ORIG TARBALL BUT NOT IN GIT - AUTOTOOLS ETC.

       This section is mainly of interest to maintainers who want to use dgit with their existing
       git history for the Debian package.

       Some  developers like to have an extra-clean git tree which lacks files which are normally
       found in source tarballs and therefore in Debian source  packages.   For  example,  it  is
       conventional to ship ./configure in the source tarball, but some people prefer not to have
       it present in the git view of their project.

       dgit requires that the source package unpacks to exactly the same files as are in the  git
       commit on which dgit push-* operates.  So if you just try to dgit push-* directly from one
       of these extra-clean git branches, it will fail.

       As the maintainer you therefore have the following options:

       •      Delete the files from your git branches, and your Debian source packages, and carry
              the deletion as a delta from upstream.  (With `3.0 (quilt)' this means representing
              the deletions as patches.  You may need to pass  --include-removal  to  dpkg-source
              --commit,  or pass corresponding options to other tools.)  This can make the Debian
              source package less useful for people without Debian build infrastructure.

       •      Persuade upstream that the source code in their git history  and  the  source  they
              ship as tarballs should be identical.  Of course simply removing the files from the
              tarball may make the tarball hard for people to use.

              One answer is to commit the (maybe autogenerated) files, perhaps with  some  simple
              automation  to  deal  with  conflicts and spurious changes.  This has the advantage
              that someone who clones the git repository finds the program just as easy to  build
              as someone who uses the tarball.

       Of  course  it  may also be that the differences are due to build system bugs, which cause
       unintended files to end up in the source package.  dgit will  notice  this  and  complain.
       You may have to fix these bugs before you can unify your existing git history with dgit's.

FILES IN THE SOURCE PACKAGE BUT NOT IN GIT - DOCS, BINARIES ETC.

       Some  upstream  tarballs  contain build artifacts which upstream expects some users not to
       want to rebuild (or indeed to find hard  to  rebuild),  but  which  in  Debian  we  always
       rebuild.

       Examples  sometimes  include  crossbuild  firmware  binaries  and documentation.  To avoid
       problems when building  updated  source  packages  (in  particular,  to  avoid  trying  to
       represent  as  changes  in  the  source  package  uninteresting or perhaps unrepresentable
       changes to such files) many maintainers arrange for the package  clean  target  to  delete
       these files.

       dpkg-source  does not (with any of the commonly used source formats) represent deletion of
       binaries (outside debian/) present in upstream.  Thus deleting such files in a dpkg-source
       working tree does not actually result in them being deleted from the source package.  Thus
       deleting the files in rules clean sweeps this problem under the rug.

       However, git does always properly record file deletion.  Since dgit's  principle  is  that
       the  dgit  git  tree  is the same of dpkg-source -x, that means that a dgit-compatible git
       tree always contains these files.

       For the non-maintainer, this can be observed in the following suboptimal occurrences:

       •      The package clean target often deletes these  files,  making  the  git  tree  dirty
              trying  to  build the source package, etc.  This can be fixed by using dgit -wg aka
              --clean=git, so that the package clean target is never run.

       •      The package build modifies these files, so that builds make  the  git  tree  dirty.
              This can be worked around by using `git reset --hard' after each build (or at least
              before each commit or push).

       From the maintainer's point of view,  the  main  consequence  is  that  to  make  a  dgit-
       compatible  git branch it is necessary to commit these files to git.  The maintainer has a
       few additional options for mitigation: for example, it may be possible for the rules  file
       to  arrange  to  do  the  build in a temporary area, which avoids updating the troublesome
       files; they can then be left in the git tree without seeing trouble.

PROBLEMS WITH PACKAGE CLEAN TARGETS ETC.

       A related problem is other unexpected  behaviour  by  a  package's  clean  target.   If  a
       package's  rules  modify  files  which are distributed in the package, or simply forget to
       remove certain files, dgit will complain that the tree is dirty.

       Again, the solution is to use dgit -wg aka --clean=git, which instructs dgit  to  use  git
       clean  instead  of  the package's build target, along with perhaps git reset --hard before
       each build.

       This is 100% reliable, but has the downside that if you forget to git add  or  to  commit,
       and then use dgit -wg or git reset --hard, your changes may be lost.