History

Alex Rickabaugh d5aa6b5bd6 style(compiler-cli): reformat of codebase with new clang-format version (#36520 )

This commit reformats the packages/compiler-cli tree using the new version
of clang-format.

PR Close #36520

2020-04-08 14:51:08 -07:00

src

style(compiler-cli): reformat of codebase with new clang-format version (#36520 )

2020-04-08 14:51:08 -07:00

api.ts

style(compiler-cli): reformat of codebase with new clang-format version (#36520 )

2020-04-08 14:51:08 -07:00

BUILD.bazel

fix(ivy): recompile on template change in ngc watch mode on Windows (#34015 )

2020-02-04 10:40:22 -08:00

index.ts

fix(ivy): don't run decorator handlers against declaration files (#34557 )

2020-01-10 15:54:51 -08:00

README.md

docs(ivy): move incremental package README file to the correct location (#34912 )

2020-01-23 13:30:10 -08:00

README.md

What is the `incremental` package?

This package contains logic related to incremental compilation in ngtsc.

In particular, it tracks dependencies between ts.SourceFiles, so the compiler can make intelligent decisions about when it's safe to skip certain operations.

What optimizations are made?

ngtsc currently makes two optimizations: reuse of prior analysis work, and the skipping of file emits.

Reuse of analyses

If a build has succeeded previously, ngtsc has available the analyses of all Angular classes in the prior program, as well as the dependency graph which outlines inter-file dependencies. This is known as the "last good compilation".

When the next build begins, ngtsc follows a simple algorithm which reuses prior work where possible:

For each input file, ngtsc makes a determination as to whether the file is "logically changed".

"Logically changed" means that either:

The file itself has physically changed on disk, or
One of the file's dependencies has physically changed on disk.

Either of these conditions invalidates the previous analysis of the file.

ngtsc begins constructing a new dependency graph.

For each logically unchanged file, its dependencies are copied wholesale into the new graph.

ngtsc begins analyzing each file in the program.

If the file is logically unchanged, ngtsc will reuse the previous analysis and only call the 'register' phase of compilation, to apply any necessary side effects.

If the file is logically changed, ngtsc will re-analyze it.

Skipping emit

ngtsc makes a decision to skip the emit of a file if it can prove that the contents of the file will not have changed since the last good compilation. To prove this, two conditions must be true.

The input file itself must not have changed since the previous compilation.
None of the files on which the input file is dependent have changed since the previous compilation.

The second condition is challenging to prove, as Angular allows statically evaluated expressions in lots of contexts that could result in changes from file to file. For example, the name of an @Pipe could be a reference to a constant in a different file. As part of analyzing the program, the compiler keeps track of such dependencies in order to answer this question.

The emit of a file is the most expensive part of TypeScript/Angular compilation, so skipping emits when they are not necessary is one of the most valuable things the compiler can do to improve incremental build performance.

The two dependency graphs

For both of the above optimizations, ngtsc makes use of dependency information extracted from the program. But these usages are subtly different.

To reuse previous analyses, ngtsc uses the prior compilation's dependency graph, plus the information about which files have changed, to determine whether it's safe to reuse the prior compilation's work.

To skip emit, ngtsc uses the current compilation's dependency graph, coupled with the information about which files have changed since the last successful build, to determine the set of outputs that need to be re-emitted.

How does incremental compilation work?

The initial compilation is no different from a standalone compilation; the compiler is unaware that incremental compilation will be utilized.

When an NgtscProgram is created for a subsequent compilation, it is initialized with the NgtscProgram from the previous compilation. It is therefore able to take advantage of any information present in the previous compilation to optimize the next one.

This information is leveraged in two major ways:

The previous ts.Program itself is used to create the next ts.Program, allowing TypeScript internally to leverage information from the previous compile in much the same way.
An IncrementalDriver instance is constructed from the old and new ts.Programs, and the previous program's IncrementalDriver.

The compiler then proceeds normally, using the IncrementalDriver to manage the reuse of any pertinent information while processing the new program. As a part of this process, the compiler (again) maps out all of the dependencies between files.

Determination of files to emit

The principle question the incremental build system must answer is "which TS files need to be emitted for a given compilation?"

To determine whether an individual TS file needs to be emitted, the compiler must determine 3 things about the file:

Have its contents changed since the last time it was emitted?
Has any resource file that the TS file depends on (like an HTML template) changed since the last time it was emitted?
Have any of the dependencies of the TS file changed since the last time it was emitted?

If the answer to any of these questions is yes, then the TS file needs to be re-emitted.

Tracking of changes

On every invocation, the compiler receives (or can easily determine) several pieces of information:

The set of ts.SourceFiles that have changed since the last invocation.
The set of resources (.html files) that have changed since the last invocation.

With this information, the compiler can perform rebuild optimizations:

The compiler uses the last good compilation's dependency graph to determine which parts of its analysis work can be reused, and an initial set of files which need to be re-emitted.
The compiler analyzes the rest of the program and generates an updated dependency graph, which describes the relationships between files in the program as they are currently.
Based on this graph, the compiler can make a final determination for each TS file whether it needs to be re-emitted or can safely be skipped. This produces a set called pendingEmit of every file which requires a re-emit.
The compiler cycles through the files and emits those which are necessary, removing them from pendingEmit.

Theoretically, after this process pendingEmit should be empty. As a precaution against errors which might happen in the future, pendingEmit is also passed into future compilations, so any files which previously were determined to need an emit (but have not been successfully produced yet) will be retried on subsequent compilations. This is mostly relevant if a client of ngtsc attempts to implement emit-on-error functionality.

However, normally the execution of these steps requires a correct input program. In the presence of TypeScript errors, the compiler cannot perform this process. It might take many invocations for the user to fix all their TypeScript errors and reach a compilation that can be analyzed.

As a result, the compiler must accumulate the set of these changes (to source files and resource files) from build to build until analysis can succeed.

This accumulation happens via a type called BuildState. This type is a union of two possible states.

`PendingBuildState`

This is the initial state of any build, and the final state of any unsuccessful build. This state tracks both pendingEmit files from the previous program as well as any source or resource files which have changed since the last successful analysis.

If a new build starts and inherits from a failed build, it will merge the failed build's PendingBuildState into its own, including the sets of changed files.

`AnalyzedBuildState`

After analysis is successfully performed, the compiler uses its dependency graph to evaluate the impact of any accumulated changes from the PendingBuildState, and updates pendingEmit with all of the pending files. At this point, the compiler transitions from a PendingBuildState to an AnalyzedBuildState, which only tracks pendingEmit. In AnalyzedBuildState this set is complete, and the raw changes can be forgotten.

If a new build is started after a successful build, only pendingEmit from the AnalyzedBuildState needs to be merged into the new build's PendingBuildState.

Component to NgModule dependencies

The dependency of a component on its NgModule is slightly problematic, because its arrow is in the opposite direction of the source dependency (which is from NgModule to the component, via declarations). This creates a scenario where, if the NgModule is changed to no longer include the component, the component still needs to be re-emitted because the module has changed.

This is one of very few cases where pendingEmit must be populated with the logical changes from the previous program (those files determined to be changed in step 1 under "Tracking of changes" above), and cannot simply be created from the current dependency graph.

What optimizations are possible in the future?

There is plenty of room for improvement here, with diminishing returns for the work involved.

Semantic dependency tracking

Currently the compiler tracks dependencies only at the file level, and will re-emit dependent files if they may have been affected by a change. Often times a change though does not require updates to dependent files.

For example, today a component's NgModule and all of the other components which consume that module's export scope are considered to depend on the component file itself. If the component's template changes, this triggers a re-emit of not only the component's file, but the entire chain of its NgModule and that module's export scope. This happens even though the template of a component does not have any impact on any components which consume it - these other emits are deeply unnecessary.

In contrast, if the component's selector changes, then all those dependent files do need to be updated since their directiveDefs might have changed.

Currently the compiler does not distinguish these two cases, and conservatively always re-emits the entire NgModule chain. It would be possible to break the dependency graph down into finer-grained nodes and distinguish between updates that affect the component, vs updates that affect its dependents. This would be a huge win, but is exceedingly complex.

Skipping template type-checking

For certain kinds of changes, it may be possible to avoid the cost of generating and checking the template type-checking file. Several levels of this can be imagined.

For resource-only changes, only the component(s) which have changed resources need to be re-checked. No other components could be affected, so previously produced diagnostics are still valid.

For arbitrary source changes, things get a bit more complicated. A change to any .ts file could affect types anywhere in the program (think declare global ...). If a set of affected components can be determined (perhaps via the import graph that the cycle analyzer extracts?) and it can be proven that the change does not impact any global types (exactly how to do this is left as an exercise for the reader), then type-checking could be skipped for other components in the mix.

If the above is too complex, then certain kinds of type changes might allow for the reuse of the text of the template type-checking file, if it can be proven that none of the inputs to its generation have changed. This is useful for two very important reasons.

Generating (and subsequently parsing) the template type-checking file itself is expensive.
Under ideal conditions, after an initial template type-checking program is created, it may be possible to reuse it for emit and type-checking in subsequent builds. This would be a pretty advanced optimization but would save creation of a second ts.Program on each valid rebuild.

README.md

What is the incremental package?

What optimizations are made?

Reuse of analyses

Skipping emit

The two dependency graphs

How does incremental compilation work?

Determination of files to emit

Tracking of changes

PendingBuildState

AnalyzedBuildState

Component to NgModule dependencies

What optimizations are possible in the future?

Semantic dependency tracking

Skipping template type-checking

What is the `incremental` package?

`PendingBuildState`

`AnalyzedBuildState`