Organisation: | Copyright (C) 2021-2025 Olivier Boudeville |
---|---|
Contact: | about (dash) howtos (at) esperide (dot) com |
Creation date: | Saturday, November 20, 2021 |
Lastly updated: | Sunday, January 5, 2025 |
Erlang is a concurrent, functional programming language available as free software; see its official website for more details.
Erlang is dynamically typed, and is executed by the BEAM virtual machine. This VM (Virtual Machine) operates on bytecodes and can perform Just-In-Time compilation. It powers also other related languages, such as Elixir and LFE.
Taken from this presentation:
Hint
What makes Elixir StackOverflow’s #4 most-loved language?
What makes Erlang and Elixir StackOverflow’s #3 and #4 best-paid languages?
How did WhatsApp scale to billions of users with just dozens of Erlang engineers?
What’s so special about Erlang that it powers CouchDB and RabbitMQ?
Why are multi-billion-dollar corporations like Bet365 and Klarna built on Erlang?
Why do PepsiCo, Cars.com, Change.org, Boston’s MBTA, and Discord all rely on Elixir?
Why was Elixir chosen to power a bank?
Why does Cisco ship 2 million Erlang devices each year? Why is Erlang used to control 90% of Internet traffic?
Erlang can be installed thanks to the various options listed in these guidelines.
Building Erlang from the sources of its latest stable version is certainly the best approach; for more control we prefer relying on our custom procedure.
For a development activity, we recommend also specifying the following options to our conf/install-erlang.sh script:
Run ./install-erlang.sh --help for more information.
Once installed, ensure that ~/Software/Erlang/Erlang-current-install/bin/ is in your PATH (e.g. by enriching your ~/.bashrc accordingly), so that you can run erl (the Erlang interpreter) from any location, resulting a prompt like:
$ erl Erlang/OTP 24 [erts-12.1.5] [source] [64-bit] [smp:8:8] [ds:8:8:10] [async-threads:1] [jit] Eshell V12.1.5 (abort with ^G) 1>
Then enter CTRL-C twice in order to come back to the (UNIX) shell.
Congratulations, you have a functional Erlang now!
To check from the command-line the version of an Erlang install:
$ erl -eval '{ok, V} = file:read_file( filename:join([code:root_dir(), "releases", erlang:system_info(otp_release), "OTP_VERSION"]) ), io:fwrite(V), halt().' -noshell 24.2
Ceylan users shall note that most of our related developments (namely Myriad, WOOPER, Traces, LEEC, Seaplus, Mobile, US-Common, US-Web and US-Main) depart significantly from the general conventions observed by most Erlang applications:
If it is as simple to run erl, we prefer, with Ceylan settings, running make shell in order to benefit from a well-initialized VM (notably with the full code path of the current layer and the ones below).
Refer then to the shell commands, notably for:
1> rr(code:lib_dir(xmerl) ++ "/include/xmerl.hrl").
See also the JCL mode (for Job Control Language) to connect and interact with other Erlang nodes.
The goal is to have processes, running on multiple Erlang VMs instantiated on various hosts of a network, interact (of course, as always, by message-passing based on Erlang processes).
Let's suppose that we have two hosts, foo.example.com (possibly on the LAN) and bar.other.info (for example a gateway available on the Internet - hence a bit secured, and with many settings activated), each having the latest version of Erlang installed.
To test whether VMs on either side can communicate, one may launch on foo.example.com for example (killing EMPD and using a specific, rather random port for it, for safer/paranoid testing):
$ killall epmd # Possibly safer to specify just '-name myfoo' rather directly the FQDN # ('-name myfoo@foo.example.com'), as we can check the name resolution used # by the VM (bogus values like '-name myfoo@ibm.com' would be accepted): # $ ERL_EPMD_PORT=4506 erl -name myfoo -setcookie abc Erlang/OTP 27 [erts-15.2] [source] [64-bit] [smp:8:8] [ds:8:8:10] [async-threads:1] [jit:ns] Eshell V15.2 (press Ctrl+G to abort, type help(). for help) (myfoo@foo.example.com)1>
And on bar.other.info:
$ killall epmd $ ERL_EPMD_PORT=4506 erl -name mybar@bar.other.info -setcookie abc Erlang/OTP 27 [erts-15.2] [source] [64-bit] [smp:8:8] [ds:8:8:10] [async-threads:1] [jit:ns] Eshell V15.2 (press Ctrl+G to abort, type help(). for help) (mybar@bar.other.info)1>
Then the following calls may be of interest, to interlink these VMs and check that they agree on interacting.
On foo.example.com:
1> net_adm:ping('mybar@bar.other.info'). pong
And on bar.other.info:
1> net_adm:ping('myfoo@foo.example.com'). pong
Then either of the two nodes can perform mostly anything on the other host (of course depending on the respective permissions of the users that run them); and of course Erlang cookies are not a security feature, they are just an ad hoc way of preventing unwanted interactions between nodes.
A lot more often than expected, pang is received (instead of pong), meaning that unfortunately the connection could not be created.
Among the many possible pitfalls, there are (approximately sorted by decreasing level of probability, according to our experience):
To focus a bit on developments relying on our conventions (notably the Ceylan-Myriad ones) about firewall settings, one may first inspect the /etc/iptables.settings-Gateway.sh file to check the firewall settings that are currently listed (e.g. grep 'myriad_default_epmd_port\|tcp_' iptables.settings-Gateway.sh).
This may result for example in:
myriad_default_epmd_port=4502 enable_unfiltered_tcp_range="true" tcp_unfiltered_low_port=60000 tcp_unfiltered_high_port=65000
(supposing they apply on both hosts)
One could ensure that these were indeed the ones applied (if ever the prior settings were changed yet with no firewall reloading afterwards), by checking the /root/.last-gateway-firewall-activation file.
Finally, one may run our iptables-inspect.sh script to request iptables to describe its current, actual state.
Then, to test the connectivity of corresponding VMs, first on foo.example.com:
$ killall epmd beam.smp $ ERL_EPMD_PORT=4502 erl -name myfoo@foo.example.com -setcookie abc -kernel inet_dist_listen_min 60000 inet_dist_listen_max 65000 Erlang/OTP 27 [erts-15.2] [source] [64-bit] [smp:8:8] [ds:8:8:10] [async-threads:1] [jit:ns] Eshell V15.2 (press Ctrl+G to abort, type help(). for help) (myfoo@foo.example.com)1>
And on bar.other.info, this could be:
$ export ERL_EPMD_PORT=4502 $ erl -name mybar@bar.other.info -setcookie abc -kernel inet_dist_listen_min 60000 inet_dist_listen_max 65000 Erlang/OTP 27 [erts-15.2] [source] [64-bit] [smp:8:8] [ds:8:8:10] [async-threads:1] [jit:ns] Eshell V15.2 (press Ctrl+G to abort, type help(). for help) (mybar@bar.other.info)1> net_adm:ping('myfoo@foo.example.com'). pong
At last it works!
For an overall application named Foobar, we recommend defining in conf/foobar.app.src an application specification template that, once properly filled regarding the version of that application and the modules that it comprises (possibly automatically done thanks to the Ceylan-Myriad logic), will result in an actual application specification file, foobar.app.
Such a file is necessary in all cases, to generate an OTP application (otherwise with rebar3 nothing will be built), an OTP release (otherwise the application dependencies will not be reachable), and probably an hex package as well.
This specification content is to end up in various places:
For an OTP active application of interest - that is one that provides an actual service, i.e. running processes, as opposed to a mere library application, which provides only code - such a specification defines, among other elements, which module will be used to start this application. We recommend to name this module according to the target application and to suffix it with _app, like in:
{application, foobar, [ [...] {mod, {foobar_app, [hello]}}, [...]
This implies that once a user code will call application:start(foobar), then foobar_app:start(_Type=normal, _Args=[hello]) will be called in turn.
This start/2 function, together with its stop/1 reciprocal, are the functions listed by the OTP (active) application behaviour; at least for clarity, it is better that foobar_app.erl comprises -behaviour(application).
The previous OTP callbacks may be called by specific-purpose launching code; we tend to define an exec/0 function for that: then, with the Myriad make system, executing on the command-line make foobar_exec results in foobar_app:exec/0 to be called.
Having such a pre-launch function is useful when having to set specific information beforehand (see application:set_env/{1,2}) and/or when starting by oneself applications (e.g. see otp_utils:start_applications/2).
In any case this should result in foobar_app:start/2 to be called at application startup, a function whose purpose is generally to spawn the root supervisor of this application.
Note that, alternatively (perhaps for some uncommon debugging needs), one may execute one's application (e.g. foo) by oneself, knowing that doing so requires starting beforehand the applications it depends on - be them Erlang-standard (e.g. kernel, stdlib) or user-provided (e.g. bar, buz); for that both their modules [3] and their .app file [4] must be found.
[3] | If using Ceylan-Myriad, run, from the root of foo, make copy-all-beams-to-ebins to populate the ebin directories of all layers (knowing that by default each module is only to be found directly from its source/build directory, and thus such a copy is usually unnecessary). |
[4] | If using Ceylan-Myriad, run, from the root of foo, make create-app-file. |
This can be done with:
$ erl -pa XXX/bar/ebin -pa YYY/buz/ebin -pa ZZZ/foo/ebin Erlang/OTP 26 [erts-14.2] [source] [64-bit] [smp:8:8] [ds:8:8:10] [async-threads:1] [jit:ns] Eshell V14.2 (press Ctrl+G to abort, type help(). for help) 1> application:ensure_all_started([kernel, stdlib, bar, buz, foo]).
Then the foo application shall be launched, and a shell be available to interact with the corresponding VM.
The purpose of supervisors is to ease the development of fault-tolerant applications by building hierarchical process structures called supervision trees.
For that, supervisors are to monitor their children, that may be workers (typically implementing the gen_{event,server,statem} behaviour) and/or other supervisors (they can thus be nested).
We recommend to define a foobar_sup:start_link/0 function (it is an user-level API, so any name and arity can be used). This foobar_sup module is meant to implement the supervisor behaviour (to be declared with -behaviour(supervisor).), which in practice requires an init/1 function to be defined.
So this results, in foobar_sup, in a code akin to:
-spec start_link() -> supervisor:startlink_ret(). start_link() -> % This will result in calling init/1 next: supervisor:start_link( _Registration={local, my_foobar_main_sup}, _Mod=?MODULE, _Args=[]). -spec init(list()) -> {'ok', {supervisor:sup_flags(), [child_spec()]}}. init(_Args=[]) -> [...] {ok, {SupSettings, ChildSpecs}}.
Our otp_utils module may help a bit defining proper restart strategies and child specifications, i.e. the information regarding the workers that will be supervised, here, by this root supervisor.
For example it could be:
init(_Args=[]) -> [...] SupSettings = otp_utils:get_supervisor_settings( _RestartStrategy=one_for_one, ExecTarget), % Always restarted in production: RestartSettings = otp_utils:get_restart_setting(ExecTarget), WorkerShutdownDuration = otp_utils:get_maximum_shutdown_duration(ExecTarget), % First child, the main Foobar worker process: MainWorkerChild = #{ id => foobar_main_worker_id, start => {_Mod=foobar, _Fun=start_link, _MainWorkerArgs=[A, B, C]}, restart => RestartSettings, shutdown => WorkerShutdownDuration, type => worker, modules => [foobar] }, ChildSpecs = [MainWorkerChild], {ok, {SupSettings, ChildSpecs}}.
Children are created synchronously and in the order of their specification [5].
[5] | Yet some interleaving is possible thanks to proc_lib:init_ack/1. |
So if ChildSpecs=[A, B, C], then a child according to the A spec is first created, then, once it is over (either its init/1 finished successfully, or it called proc_lib:init_ack/{1,2} [6] and then continued its own initialisation concurrently), a child according to the B spec is created, then once done a child according to the C spec.
[6] | Typically: proc_lib:init_ack(self()). |
Such a worker, which can be any Erlang process (implementing an OTP behaviour, like gen_server, or not) will thus be spawned here through a call to the foobar:start_link/3 function (another user-defined API) made by this supervisor. This is a mere call (an apply/3), not a spawn of a child process based on that function.
Therefore the called function is expected to create the worker process by itself, like, in the foobar module:
start_link(A, B ,C) -> [...] {ok, proc_lib:start_link(?MODULE, _Func=init, _Args=[U, V], _Timeout=infinity, SpawnOpts)}.
Here thus the spawned worker will start by executing foobar:init/2, a function not expected to return, often trapping EXIT signals (process_flag(trap_exit, true)), setting system flags and, once properly initialised, notifying its supervisor that it is up and running (e.g. proc_lib:init_ack(_Return=self())) before usually entering a tail-recursive loop.
Depending on the shutdown entry of its child specification, on application stop that worker may be terminated by different ways. We tend to prefer specifying a maximum shutdown duration: then the worker will be sent by its supervisor first a shutdown EXIT message, that this worker may handle, typically in its main loop:
receive [...] {'EXIT', _SupervisorPid, shutdown} -> % Just stop. [...]
If the worker fails to stop (on time, or at all) and properly terminate, it will then be brutally killed by its supervisor.
Non-OTP processes (e.g. WOOPER instances) can act as supervisors thanks to the supervisor_bridge module.
Such a process shall implement the supervisor_bridge behaviour, namely init/1 and terminate/2. If the former function spawns a process, the latter shall ensure that this process terminates in a synchronous manner, otherwise race conditions may happen.
See traces_bridge_sup for an example thereof.
One may refer to;
Metaprogramming is to be done in Erlang through parse transforms, which are user-defined modules that transform an AST (for Abstract Syntax Trees, an Erlang term that represents actual code; see the Abstract Format for more details) into another AST that is fed afterwards to the compiler.
See also:
A proper list is created from the empty one ([], also known as "nil") by appending (with the | operator, a.k.a. "cons") elements in turn; for example [1,2] is actually [1 | [2 | []]].
However, instead of enriching a list from the empty one, one can start a list with any other term than [], for example my_atom. Then, instead of [2|[]], [2|my_atom] may be specified and will be indeed a list - albeit an improper one.
Many recursive functions expect proper lists, and will fail (typically with a function clause) if given an improper list to process (e.g. lists:flatten/1).
So, why not banning such construct? Why even standard modules like digraph rely on improper lists?
The reason is that improper lists are a way to reduce the memory footprint of some datastructures, by storing a value of interest instead of the empty list.
Indeed, as explained in this post, a (proper) list of 2 elements will consume:
For a total of 4 words of memory (so, on a 64-bit architecture, it is 32 bytes).
As for an improper list of 2 elements, only 1 list cell (2 words of memory) will be consumed to store the first element and then the second one.
Such a solution is even more compact than a pair (a 2-element tuple), which consumes 2+1 = 3 words. Accessing the elements of an improper list is also faster (one handle to be inspected vs also an header to be inspected).
Finally, for sizes expressed in bytes:
1> system_utils:get_size([2,my_atom]). 40 2> system_utils:get_size({2,my_atom}). 32 3> system_utils:get_size([2|my_atom]). 24
See also the 1, 2 pointers for more information.
Everyone shall decide on whether relying on improper lists is a trick, a hack or a technique to prohibit.
Open Computing Language is a standard interface designed to program many processing architectures, such as CPUs, GPUs, DSPs, FPGAs.
OpenCL is notably a way of using one's GPU to perform more general-purpose processing than typically the rendering operations allowed by GLSL (even compared to its compute shaders).
In Erlang, the cl binding is available for that.
A notable user thereof is Wing3D; one may refer to the *.cl files in this directory, but also to its optional build integration as a source of inspiration, and to wings_cl.erl.
Erlang programs may fail, and this may result in mere (Erlang-level) crashes (the VM detects an error, and reports information about it, possibly in the form of a crash dump) or (sometimes, quite infrequently though) in more brutal, lower-level core dumps (the VM crashes as a whole, like any faulty program run by the operating system); this last case happens typically when relying on faulty NIFs.
To monitor a (live, Erlang) application, one may use:
If experiencing "only" an Erlang-level crash, a erl_crash.dump file is produced in the directory whence the executable (generally erl) was launched. The best way to study it is to use the cdv (refer to crashdump viewer) tool, available, from the Erlang installation, as lib/erlang/cdv [7].
[7] | Hence, according to the Ceylan-Myriad conventions, in ~/Software/Erlang/Erlang-current-install/lib/erlang/cdv. |
Using this debug tool is as easy as:
$ cdv erl_crash.dump
Then, through the wx-based interface, a rather large number of Erlang-level information will be available (processes, ports, ETS tables, nodes, modules, memory, etc.) to better understand the context of this crash and hopefully diagnose its root cause.
In the worst cases, the VM will crash like any other OS-level process, and generic (non Erlang-specific) tools will have to be used. Do not expect to be pointed to line numbers in Erlang source files anymore!
Refer to our general section dedicated to core dumps for that.
The two main approaches in order to integrate third-party code to Erlang are to:
Knowing that, in functional languages such as Erlang, terms ("variables") are immutable, why could not they be shared between local processes when sent through messages, instead of being copied in the heap of each of them, as it is actually the case with the Erlang VM?
The reason lies in the fact that, beyond the constness of these terms, their life-cycle has also to be managed. If they are copied, each process can very easily perform its (concurrent, autonomous) garbage collections. On the contrary, if terms were shared, then reference counting would be needed to deallocate them properly (neither too soon nor never at all), which, in a concurrent context, is bound to require locks.
So a trade-off between memory (due to data duplication) and processing (due to lock contention) has to be found and at least for most terms (excepted larger binaries), the sweet spot consists in sacrificing a bit of memory in favour of a lesser CPU load. Solutions like persistent_term may address situations where more specific needs arise.
This long-awaited feature, named BeamAsm and whose rationale and history have been detailed in these articles, has been introduced in Erlang 24 and shall transparently lead to increased performances for most applications.
Static type checking can be performed on Erlang code; the usual course of action is to use Dialyzer - albeit other solutions like Gradualizer and also eqWAlizer exist, and are mostly complementary (see also 1 and 2).
More precisely:
See EEP 61 for further typing-related information [8].
[8] | On a side note, the (newer) dynamic() type mentioned there is often used to mark "inherently dynamic code", like reading from ETS, message passing, deserialization and so on. |
Also a few statically-typed languages can operate on top of the Erlang VM, even if none has reached yet the popularity of Erlang or Elixir (that are dynamically-typed).
In addition to the increased type safety that statically-typed languages permit (possibly applying to sequential code but also to inter-process messages), it is unsure whether such extra static awareness may also lead to better performances (especially now that the standard compiler supports JIT).
Beyond mere code, the messages exchanges between processes could also be typed and checked. Version upgrades could also benefit from it. Of course type-related errors are only a subset of the software errors.
Note that developments that rely on parse-transforms (almost all ours, directly or not) shall be verified based on their BEAM files (hence their actual, final output) rather than on their sources (as the checking would be done on code not transformed yet). See also the Type-checking Myriad section.
Nothing is to be done, as Dialyzer is included in the standard Erlang distribution.
Our preferred options (beyond path specifications of course) are: -Wextra_return -Wmissing_return -Wno_return -Werror_handling -Wno_improper_lists -Wno_unused -Wunderspecs. See the DIALYZER_OPTS variable in Myriad's GNUmakevars.inc and the copiously commented options.
A problem is that typing errors tend to snowball: many false positives (functions that are correct but whose call is not because an error upward in the callgraph) may be reported (leading to the infamous Function f/N has no local return, which does not tell much).
We recommend focusing on the first error reported for each module, and re-running the static analysis once supposedly fixed.
We install Gradualizer that way:
$ cd ~/Software $ mkdir gradualizer && cd gradualizer $ git clone https://github.com/josefs/Gradualizer.git $ cd Gradualizer $ make escript
Then just ensure that the ~/Software/gradualizer/Gradualizer/bin directory is in your PATH (e.g. set in your .bashrc).
Our preferred options (beyond path specifications of course) are: --infer --fmt_location verbose --fancy --color always --stop_on_first_error. See the GRADUALIZER_OPTS variable in Myriad's GNUmakevars.inc and the copiously commented options..
It is a tool developed in Rust.
We install eqWAlizer that way:
$ cd ~/Software $ mkdir -p eqwalizer && cd eqwalizer $ wget https://github.com/WhatsApp/eqwalizer/releases/download/vx.y.z/elp-linux.tar.gz $ tar xvf elp-linux.tar.gz && mkdir -p bin && /bin/mv -f elp bin/
Then just ensure that the ~/Software/eqwalizer/bin directory is in your PATH (e.g. set in your .bashrc).
We use it out of a rebar3 context.
Settings are to be stored in a JSON file (e.g. conf/foobar-for-eqwalizer.json), to be designated thanks to the --project option.
See also the EQWALIZER_OPTS variable in Myriad's GNUmakevars.inc and its own myriad-for-eqwalizer.json project file.
(our first test was not successful, we will have to investigate more when time permits)
Type correctness is essential, yet of course it does not guarantee that a program is correct and relevant. Other approaches, like the checking of other properties (notably concurrency, see Concuerror) can be very useful.
Beyond checking, testing is also an invaluable help for bug-fixing. Various tools may help, including QuickCheck.
Finally, not all errors can be anticipated, from network outages, hardware failures to human factor. An effective last line of defence is to rely on (this time at runtime) supervision mechanisms in order to detect any kind of faults (bound to happen, whether expected or not), and overcome them. The OTP framework is an excellent example of such system, much useful to reach higher levels of robustness, including the well-known nine nines - that is an availability of 99.9999999%.
To better discover the inner workings of the Erlang compilation, one may look at the eplaypen online demo (whose project is here) and/or at the Compiler Explorer (which supports the Erlang language among others).
Both of them allow to read the intermediate representations involved when compiling Erlang code (BEAM stage, erl_scan, preprocessed sources, abstract code, Core Erlang, Static Single Assignment form, BEAM VM assembler opcodes, x86-64 assembler generated by the JIT, etc.).
Depending on how Erlang was built in a given environment, some modules may or may not be available.
A way of determining availability and/or version of a module (e.g. wx, cl, crypto) from the command-line:
$ erl -noshell -eval 'erlang:display(code:which(wx))' -s erlang halt "/home/bond/Software/Erlang/Erlang-24.2/lib/erlang/lib/wx-2.1.1/ebin/wx.beam" $ erl -noshell -eval 'erlang:display(code:which(cl))' -s erlang halt non_existing
A corresponding in-makefile test, taken from Wings3D:
# Check if OpenCL package is as external dependency: CL_PATH = $(shell $(ERL) -noshell -eval 'erlang:display(code:which(cl))' -s erlang halt) ifneq (,$(findstring non_existing, $(CL_PATH))) # Add it if not found: DEPS=cl endif
Some features appeared in later Erlang versions, and may be conditionally enabled.
For example:
FullSpawnOpts = case erlang:system_info(version) >= "8.1" of true -> [{message_queue_data, off_heap}|BaseSpawnOpts]; false -> BaseSpawnOpts end, [...]
To know which version of a library (here, wxWidgets) a given Erlang install is using (if any), one may run an Erlang shell (erl), collect the PID of this (UNIX) process (e.g. ps -edf | grep beam), then trigger a use of that library (e.g. for wx, execute wx:demo().) in order to force its dynamic binding.
Then determine its name, for example thanks to pmap ${BEAM_PID} | grep libwx).
This may indicate that for example libwx_gtk2u_core-3.0.so.0.5.0 is actually used.
Usually we apply the following conventions:
To better denote reciprocal operations, following namings for functions may be used:
Various tools are able to format Erlang code, see this page for a comparison thereof.
For projects already relying on rebar, one may use rebar3_format (as a plugin [9]) that way:
$ cd ~/Software $ git clone https://github.com/AdRoll/rebar3_format.git $ cd rebar3_format $ ERL_FLAGS="-enable-feature all" rebar3 format
[9] | Note that rebar3_format cannot be used as an escript (so no ERL_FLAGS="-enable-feature all" rebar3 as release escriptize shall be issued). |
Then {project_plugins, [rebar3_format]} shall be added to the project's rebar.config.
Another tool is erlfmt, which can be installed that way:
$ cd ~/Software $ git clone https://github.com/WhatsApp/erlfmt $ cd erlfmt $ rebar3 as release escriptize
And then ~/Software/erlfmt/_build/release/bin can be added to one's PATH.
Indeed erlfmt can be used as a rebar plugin or as a standalone escript - which we find useful, especially for projects whose build is not rebar-based.
Running it to reformat in-place source files is then as simple as:
$ erlfmt --write foo.hrl bar.erl
Experimental features (such as maybe in Erlang 25) of the compiler (once Erlang has been built, they are potentially available) may have to be specifically enabled at runtime, like in ERL_FLAGS="-enable-feature all" rebar3 as release escriptize.
LCO means here Last Call Optimisation. This consists simply when, in a given module, a (typically exported) function f ends by calling a local function g (i.e. has for last expression a call like g(...)), in not pushing on the stack the call to g, but instead replacing directly the stackframe of f (which can be skipped here, as returning from g will mean directly returning from f as well) with a proper one for g.
This trick spares the use of one level of stack at each ending local call, which is key for recursive functions [10] (typically for infinitely-looping server processes): they remain then in constant stack space, whereas otherwise the number of their stackframes would grow indefinitely, at each recursive call, and explode in memory.
[10] | When the last call of f branches to f itself, it is named TCO, for Tail Call Optimisation (which is thus a special - albeit essential - case of LCO). |
So LCO is surely not an option for a functional language like Erlang, yet it comes with a drawback: if f ends with a last call to g that ends with a last call to h and a runtime error happens in them, none of these functions will appear in the resulting stacktraces: supposing all these functions use a library foobar, it will be as if the VM directly jumped from the entry point in the user code (typically a function calling f) to the failing point in a function of foobar; there will be no line number pointing to the expression of f, g or h that triggers the faulty behaviour, whereas this is probably the information we are mostly interested in - as these functions may make numerous calls to foobar's function. This makes the debugging unnecessarily difficult.
Yet various workarounds exist (see this topic for more information) - just for debugging purposes - so that given "suspicious" functions (here f, g or h) are not LCO'ed:
The first workaround is probably the simplest, when operating on "suspicious" modules of interest (knowing that LCO is useful, and should still apply to most essential server-like processes).
One can nevertheless note that unfortunately LCO is not the only cause for the vanishing of calls in the stacktrace (even in the absence of any function inlining).
Within Ceylan-Myriad, if the (non-default) myriad_disable_lco compilation option is set (typically with the -Dmyriad_disable_lco command-line flag), this workaround is applied automatically on the modules on which the Myriad parse transform operates - i.e. all but bootstrapped modules (refer to the lco_disabling_clause_transform_fun/2 function of the myriad_parse_transform module).
When using run_erl, lines like the following are output:
Write pipe '/tmp/launch-erl-3822033.w' found, waiting 2 seconds to ensure start-up is successful indeed. ************************************************************** ** Node 'us_main' ready and running as a daemon. ** Use 'to_erl /tmp/launch-erl-3822033' to connect to that node. ** (then type CTRL-D to exit without killing the node) ************************************************************** (authbind success reported) EPMD names output (on default US-Main port ): epmd: up and running on port 4507 with data: name us_main at port 50002 name us_main_exec-xxx at port 60001
If connecting to that node with to_erl /tmp/launch-erl-3822033 (hence from the local host), a direct access to an Erlang shell on that node is granted.
Remember that exiting the interpreter as usual (hitting CTRL-C twice) thus means killing that node; so prefer CTRL-D (once) instead!
wx is now [11] the standard Erlang way of programming one's graphical user interface; it is a binding to the wxWidgets library.
[11] | wx replaced gs. To shelter already-developed graphical applications from any future change of backend, we developed MyriadGUI, an interface doing its best to hide the underlying graphical backend at hand: should the best option in Erlang change, that API would have to be ported to the newer backend, hopefully with little to (ideally) no change in the user applications. |
Here are some very general wx-related information that may be of help when programming GUIs with this backend:
{'_wxe_error_',710,{wxDC,setPen,2},{badarg,"This"}}
it is probably the sign that the user code attempted to perform an operation on an already-deallocated wx object; the corresponding life-cycle management might be error-prone, as some deallocations are implicit, others are explicit, and in a concurrent context race conditions easily happen
[12] | MyriadGUI took a different convention: whether an event will propagate by default depends on its type, knowing that most of the types are to propagate. Yet the user can override these default behaviours, by specifying either the trap_event or the propagate_event subscription option, or by calling either the trap_event/1 or the propagate_event/1 function. |
[13] | This is why MyriadGUI applies per-event type defaults, thus possibly trapping events; in this case, if the built-in backend mechanisms would still be of use, they can be triggered by calling the propagate_event/1 function from the user-defined handler, only once all its prerequisite operations have been performed (this is thus a way of restoring sequential operations). |
Extra information resources about wx (besides the documentation of its modules):
One may use our install-rebar3.sh script for that (installed from sources, or prebuilt).
To avoid having to perform a lookup in the documentation:
(no need for the erlang module to be explicitly specified for the first two functions, as both are auto-imported)
case BOOLEAN_EXPR of true -> DO_SOMETHING; false -> ok end
[14] | Note that it is not the case of the and and or operators, whose precedence is higher than, notably, the comparison operators. For example a clause defined as f(I) when is_integer(I) and I >= 0 -> ... will never be triggered as it is interpreted as f(I) when (is_integer(I) and I) >= 0 -> ..., and the and guard will always fail as I is an integer here, not a boolean. So such a clause should be defined as the (correct) f(I) when is_integer(I) andalso I >= 0 -> ... instead. |
with the equivalent (provided BOOLEAN_EXPR evaluates to either true or false - otherwise a bad argument exception will be thrown) yet shorter: BOOLEAN_EXPR andalso DO_SOMETHING; for example: [...], OSName =:= linux andalso fix_for_linux(), [...]
Similarly, orelse can be used to evaluate a target expression iff a boolean expression is false, to replace a longer expression like:
case BOOLEAN_EXPR of true -> ok; false -> DO_SOMETHING; end
with: BOOLEAN_EXPR orelse DO_SOMETHING; for example: [...], file_utils:exists("/etc/passwd") orelse throw(no_password_file), [...]
In both andalso / orelse cases, the DO_SOMETHING branch may be a single expression, or a sequence thereof (i.e. a body), in which case a begin/end block may be of use, like in:
file_utils:exists("/etc/passwd") orelse begin trace_utils:notice("No /etc/password found."), throw(no_password_file) end
Similarly, taking into account the aforementioned precedences, Count =:= ExpectedCount orelse throw({invalid_count, Count}) will perform the expected check.
Be wary of not having precedences wrong, lest bugs are introduced like the one in:
MaybeListenerPid =:= undefined orelse MaybeListenerPid ! {onDeserialisation, [self(), FinalUserData]}
(orelse having more priority than !, parentheses shall be added, otherwise, if having a PID, the message will actually be sent to any process that would be registered as true)
For the record, an alternative is [f(...) || BoolCondition], like in: _ = [put('$ancestors', Ancestors) || Shell =/= {}] (in group.erl).
Some of these elements have been adapted from the Wings3D coding guidelines.