Skip to content

Latest commit

 

History

History
253 lines (187 loc) · 9.52 KB

bloatware-technical-debt.md

File metadata and controls

253 lines (187 loc) · 9.52 KB

% How to reduce bloatware technical debt?

Old software project

Software started in the early years 2000 share similar characteristics:

  • Files organized in the Visual C++ way (directories include and src are separated)
  • C++98 syntax (C++ has been stable for 20 years)
  • Utilities reinvented (e.g. FileSystem, Mutex...) because there was no alternative in mature widely-used libraries when project has started
  • Code refactoring is not regularly realized (Technical Debt is not cleaned)
  • Migration from CVS/SVN to one single Git repository
    • Use GitLab or other easy-to-use alternative
    • Repo contains many binary files (as third party dependencies)
    • Almost all 3rd parties are grouped together
    • But other external source code is mixed within the company source code
    • Despite the huge repo size, the git clone still takes less than 10 minutes on Linux (clonning on Windows is a nigthmare)
    • One big repo is easier to split, than several repositories to fusion
    • One single version for the different products within the repository
    • Easier to follow one version than dozens versions and dependencies
  • Growing over the years
    • bloatware
    • Many refactoring have been attempted but not finalized
    • Some other finalized refactoring have been forgetten in old branches
    • Build hell
    • Code style is not anymore consistent
  • Difficult to understand all the parts, because of
    • various programming languages
    • various underlying technologies
    • various modules and sub-modules
    • useful documentation not easy to read
  • Documentation reflects the mess
    • dispersed in different places
    • some documentation is obsolete
    • other is missing
    • other is redundant
    • but some components are well documented
  • Test is often better than documentation
    • various test tools developed
    • many tests (unit tests, functional tests...)
    • continuous testing does not run all the tests
    • code coverage is not sure (should be very low)
  • Multi-platform (Windows, Linux... Android...)
  • Current implementation does not enable high performance

Advices

This document provides some advices. Depending on the project specificities and on the team culture, some advices are relevant, some others can be skipped.

This change is part of the continuous improvement. Therefore, team members are encouraged to propose many more suggestions over the months.

Risk Management

  • Any change not impacting the delivery must be guaranteed by producing same binaries
  • Any change not impacting the behavior must be guaranteed by passing all non-regression tests
  • Any change improving performance must be guaranteed by benchmark

Therefore, before any change, implement combined "safety belt and shoulder harness" (ceinture et bretelles).

As these verifications have to be reproducible, script the building and comparison processes.

Artifact Repository

Provide recent tools

  • Package and deploy recent tools for all
  • Prevent several persons doing the same work in parallel (searching, downloading, configuring, building...)
  • Repository Git for the download/build script + Jenkins job (less effort, reusable)
  • Share packages with other teams
  • On Ubuntu Linux, the team may use Ubuntu Make

The team decides the list of tools. Example:

  • git
  • cmake
  • gradle
  • gcc
  • clang
  • vim

Rewrite outdated build scripts

  • Portability (e.g. CMake file should be usable without any specific environment variable)
  • Smaller files to maintain
  • Smaller modularity - Visible dependency tree
  • Generated files must be produced within the build sub-directories (do not mess the source code directories)

CMake

  • Separate Release and Debug builds (two different CMAKE_BUILD_TYPE values)
  • Coverage build is another CMAKE_BUILD_TYPE value
  • Allow building external library Boost from CMake

Source code Urbanization

  • Reorganize source code tree with clarification in mind
  • Rename files having same name
  • For each sub-module: put in near sub-folders documentation, unit test and source code (include and src)
  • Split too-big-modules in autonomous smaller ones
  • Extract third parties from core source code (keep only the code with company ownership)
  • Move all third party in artifact repositories (can be other Git repositories)
  • Produce exactly the same binaries from the same SHA1 (100% same content) => Do not embed timestamps in produced binaries
  • Add in the produced delivery the version of each 3rd party, the version of each tool, the OS version and other build dependencies

Improve documentation

  • Portal : One single entry point to access all project parts (Git repos, jenkins, redmine, any documentation)
  • Group together and clean the dispatched documentations
  • Encourage documentation across the source tree using README.md files (Markdown)
  • Generate documentation from source code using, for example, Doxygen + Clang, clang-doc or cldoc...
  • Show the tests (Unit Tests, Acceptance Tests...) as source code specification.

Test suite

Create or complete:

  • Unit tests
  • Integration tests
  • Functional tests
  • Performance/Load tests
  • Vulnerability tests

Decide the tests to add according to code-coverage and usage model.

  • Use ctest
  • Use static code analyzers (cppcheck, clang-tidy, clang-check...)
  • Test all the build types (Release, Debug and Coverage)
  • With and without valgrind
  • Test using
  • Increase #define GLIBCXX_DEBUG_MESSAGE_LENGTH

Continuous Testing

Several Continuous Integration passes

  • Fast pass checking the essentials (less than 10 minutes)
  • Intermediate completenesses (e.g. one hour period)
  • Paranoia night builds to perform deeper testing (also during the week-end)
  • One complete even slow (less than one week)
  • Endurance test for many weeks (e.g. one month)

Tricks

  • Share the cache of ccache across workers (across developers?)
  • Use distcc instead of several workers? (developers?)
  • Improve reports (Jenkins, SonarQube, GitLab CI)
  • Jenkins (or GitLab CI) builds and tests the pulled merge request on different platforms (integrator waits for green light)

Remove unnecessary code

  • Reduce both coverage test effort and maintenance cost
  • Localize dead code

More warnings

Performance

Continuous code cleaning & refactoring

  • Replace "Not Invented Here" libraries by widely-used libraries
  • Use modern C++ features
  • Use same language for all the parts when possible
  • Reduce the number of programming languages (rewrite/replace a module using another programming language)
    • When an issue in blocking
    • When implementing a major feature
  • Delete old branches
  • Use more recent third-parties versions
  • Use clang-modernize

Continuous Delivery

  • Go further than Continuous Integration
  • Not to be confused with Continuous Deployment

Copyright

  • Add tiny copyright notice on top of all company-owned files
  • Check third party licenses

Libre software

  • Spend time to improve and supply improvements to upstream
  • Remunerate (financial) valuable libre software
  • Convince managers to reverse a small portion of the benefits (mécénat)
  • The team members propose the candidates and vote