fortran-lang · henilp105 · Oct 12, 2023 · Aug 10, 2023 · Aug 10, 2023 · Aug 10, 2023
diff --git a/source/learn/quickstart/gotchas.md b/source/learn/quickstart/gotchas.md
@@ -0,0 +1,206 @@
+Gotchas
+=======
+
+Fortran syntax is generally simple and consistent, but as with many languages it has some flaws. Sometimes because of legacy reasons (the evolution of the Fortran standard strongly emphasizes the backward compatibility because of the huge amount of legacy code that is still actively used), sometimes because of real flaws (poor choice at some point, which is difficult if not impossible to correct without breaking the backward compatibility), and sometimes just because Fortran is not C or C++ or Python or whatever: it has its own logic, with features that can sometimes be surprising to developers who are used to some other languages. 
+
+All code snippets are compiled with gfortran 13.
+
+Implicit typing
+---------------
+
+```
+program foo
+    integer :: nbofchildrenperwoman, nbofchildren, nbofwomen
+    nbofwomen = 10
+    nbofchildrenperwoman = 2
+    nbofchildren = nbofwomen * nbofchildrenperwoman
+    print*, "number of children:", nbofchildrem
+end program
+```
+The program compiles and the execution gives:
+```
+ number of children:           0
+```
+Wait... Fortran is unable to multiply two integer numbers?? Of course not... The problem here is a typo in the variable name when printing it: `nbofchildreM` instead of `nbofchildreN`. But why the compiler didn't catch the typo? Well, because by default Fortran uses implicit typing: when encountering a variable that hasn't been explicitly typed, the compiler infers the type according to the first letter of the name. Variables names starting by I, J, K, L, M, N, are of type `INTEGER`, and all other ones are of type `REAL` (hence the classic joke "`GOD` is `REAL`, unless declared as `INTEGER`").
+
+Implicit typing is as old as Fortran, in times where there was no explicit typing. Although it can still be convenient for quickly writing some test code, this practice is highly error prone and is discouraged. The strongly recommended good practice is to always disable implicit typing by stating `implicit none` (introduced in Fortran 90) at the beginning of all program units (main program, modules, and standalone routines):
+
+```
+program foo
+implicit none
+    integer :: nbofchildrenperwoman, nbofchildren, nbofwomen
+    nbofwomen = 10
+    nbofchildrenperwoman = 2
+    nbofchildren = nbofwomen * nbofchildrenperwoman
+    print*, "number of children:", nbofchildrem
+end program
+```
+And now the compilation fails, allowing to quickly correct the typo:
+```
+    7 |     print*, "number of children:", nbofchildrem
+      |                                               1
+Error: Symbol 'nbofchildrem' at (1) has no IMPLICIT type; did you mean 'nbofchildren'?
+```
+
+Implied save
+------------
+
+```
+subroutine foo()
+implicit none
+    integer :: c=0
+
+    c = c+1
+    print*, c
+end subroutine
+
+program main
+implicit none
+    integer :: i
+
+    do i = 1, 5
+        call foo()
+    end do
+end program
+```
+People used to C/C++ expect this program to print 5 times `1`, because they interpret `integer :: c=0` as the concatenation of a declaration and an assignment, as if it was:
+```
+integer :: c
+c = 0
+```
+**But it is not**. This program actually outputs:
+```
+1
+2
+3
+4
+5
+```
+`integer :: c=0` is actually a one-shot **compile time initialization**, and it makes the variable persistent between calls to `foo()`. It is actually equivalent to: 
+```
+integer, save :: c=0
+```
+The `save` attribute is equivalent to the C `static` attribute to make a variable persistent, and it is *implied* in the case the variable is initialized. This is a modernized syntax (introduced in Fortran 90) compared to the legacy (and still valid) syntax:
+```
+integer c
+data c /0/
+```
+Old fortraners just know that the modernized syntax is equivalent to the legacy one, even when `save` is not specified. But as a matter of fact the *implied save* can be misleading to newcomers who are used to the C logic. That's why it is generally recommended to **always** specify the `save` attribute:
+```
+integer, save :: c=0   ! save could be omitted, but it's clearer with it
+```
+
+*Note: an initialization expression of a derived type component is a fully different case:*
+```
+type bar
+    integer :: c = 0
+end type
+```
+*Here, the `c` component is initialized to zero each time a `type(bar)` variable is instantiated (**runtime** initialization).*
+
+
+Floating point literal constants
+---------------------------------
+
+The following code snippet defines a double precision constant `x` (which is on most systems a IEEE754 64 bits floating point, with 15 significant digits):
+```
+program foo
+implicit none
+    integer, parameter :: dp = kind(0d0)
+    real(kind=dp), parameter :: x = 9.3
+    print*, precision(x), x
+end program
+```
+The output is:
+```
+          15   9.3000001907348633
+```
+So, `x` has 15 significant digits as expected, and still the printed value is wrong from the 8th digit. The reason is that floating point literal constants have implicitely the default real kind, wich is usually the IEEE754 single precision floating point (with about 7 significant digits). The real number $9.3$ has no exact floating point representation, so it is first approximated to single precision up to the 7th digit, then casted to double precision before being assigned to `x`. But the previously lost digits are obviously not recovered.
+
+The solution is to explicitly specify the kind of the constant: 
+```
+real(kind=dp), parameter :: x = 9.3_dp
+```
+And now the output is correct up to the 15th digit:
+```
+          15   9.3000000000000007     
+```
+
+Floating point literal constants (again)
+---------------------------------
+
+Suppose now you need a floating point constant that is 1/3 (one-third). You may write:
+```
+program foo
+implicit none
+    integer, parameter :: dp = kind(0d0)
+    real(dp), parameter :: onethird = 1_dp / 3_dp
+    print*, onethird
+end program
+```
+Then the output is (!):
+```
+   0.0000000000000000     
+```
+The reason is that `1_dp` and `3_dp` are **integer** literal constants, despite the `_dp` suffix that is *supposed* to represent a floating point kind. Consequently the division is the integer division, with 0 as a result. The gotcha here is that the standard allows compilers to use identical kind values for `REAL` and `INTEGER` types. For instance with gfortran, on most platforms the value $8$ is both the double precision kind AND the 64 bits integer kind, so that `1_dp` is a fully valid integer constant. In constrast, the NAG compiler uses by default unique kind values, such that in the example above `1_dp` would produce a compilation error.
+
+The right way to denote floating point constants is to **always** include the point: 
+```
+    real(dp), parameter :: onethird = 1.0_dp / 3.0_dp
+```
+Then the ouput is:
+```
+  0.33333333333333331     
+```
+
+Leading space in prints
+-----------------------
+
+```
+program foo
+implicit none
+    print*, "Hello world!"
+end program
+```
+Ouput:
+```
+% gfortran hello.f90 && ./a.out
+ Hello world!
+```
+Note the extra leading space, which is not present in the string of the source code. Historically, the first character was containing a [carriage control code](https://en.wikipedia.org/wiki/ASA_carriage_control_characters) for the early printers, and it was not printed per se. The space " " was instructing the printer to perform a CR+LF sequence before printing the content, and was automatically prepended by the Fortran `print*` statement. Some compilers still do that, although the modern output devices do neither intercept nor use the control character, which is hence "printed". If this leading blank is a problem (it rarely is), then instead of the `*` (which means "let the compiler decide how to format the output") we can code an explicit format:
+```
+    print "(A)", "Hello world!"
+```
+In this case, the compiler does no longer prepend the leading space:
+```
+% gfortran hello.f90 && ./a.out
+Hello world!
+```
+
+Filename extension
+------------------
+
+Suppose we put the above "Hello world" program in the source file `hello.f`. Most compilers will produce many compilation errors:
+```
+% gfortran hello.f
+hello.f:1:1:
+
+ program foo
+ 1
+Error: Non-numeric character in statement label at (1)
+hello.f:1:1:
+
+ implicit none
+ 1
+Error: Non-numeric character in statement label at (1)
+hello.f:2:1:
+
+     print*, "Hello world!"
+     1
+Error: Non-numeric character in statement label at (1)
+
+...[truncated]
+```
+The reason is that the `.f` extension is by a widely accepted convention "reserved" for the legacy "fixed source form", which was designed for the punch card systems. In particular, the columns 1-6 were reserved for labels, continuation characters, and comments, and the actual statements and instructions had to be within the columns 7-72. The free source form removes all the limitations of the fixed source form, but since the latter cohabits with the former, the convention adopted by most compilers is to use by default the `.f90` extension for the free form sources. Note that this can generally be changed by some compiler switches, and that the most recent versions of the Fortran Package Manager (fpm) considers by default that all sources are free form, regardless the extension.
+
+*Note: a common misconception is that the `.f90` source files are restricted to the Fortran 90 revision of the standard and cannot contain features introduced in more recent revisions (Fortran 95/2003/2008/2018). **This is wrong and completely unrelated**: the only reason why `.f90` has been chosen is that the free form has been introduced in the Fortran 90 revision. Both `.f` and `.f90` source files can contain features from any revision of the standard.*
diff --git a/source/learn/quickstart/index.md b/source/learn/quickstart/index.md
@@ -15,4 +15,5 @@ Arrays and strings <arrays_strings>
 Operators and flow control <operators_control_flow>
 Organising code structure <organising_code>
 Derived Types <derived_types>
+Gotchas <gotchas>
 ```