TTree depth-first traversal

440bx

Hero Member
Posts: 4063

Re: TTree depth-first traversal

« Reply #15 on: September 22, 2019, 08:40:33 pm »

Quote from: simone on September 22, 2019, 05:45:45 pm

I have not revealed an absolute truth, but a typical situation:

I'm not sure if your post was a result of what I posted but, I agree.

Quote from: simone on September 22, 2019, 05:45:45 pm

in general recursive algorithms are slightly slower than iterative ones, because they require as many 'calls with return' as the number of recursive cycles.

That's when one has to be very careful. Some solutions can be implemented efficiently and elegantly using a recursive algorithm and, when such a recursive algorithm is restructured to avoid recursion, the implementation, as you pointed out in a recent post, tends to be more complicated.

Restructuring a simple, efficient and elegant recursive algorithm to eliminate recursion usually requires implementing a stack. In such cases, all that is saved is the push of the return address needed in the recursive algorithm. There is obviously a cost in pushing a return address to a memory location (the stack in this case) but, that operation isn't really very expensive.
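To make the restructuring concrete, here is a minimal sketch of both forms of a pre-order depth-first traversal. It assumes a hypothetical PNode record with a dynamic array of children, not the gtree classes discussed in this thread, and the names NewNode, AddChild, VisitRecursive, VisitIterative and FreeTree are made up for illustration. The iterative version replaces the call stack with an explicit array used as a stack.

```pascal
program dfs_sketch;
{$mode objfpc}{$H+}

type
  PNode = ^TNode;
  TNode = record
    Value: Integer;
    Children: array of PNode;
  end;

function NewNode(AValue: Integer): PNode;
begin
  New(Result);               // New also initialises the dynamic array field
  Result^.Value := AValue;
end;

procedure AddChild(Parent, Child: PNode);
begin
  SetLength(Parent^.Children, Length(Parent^.Children) + 1);
  Parent^.Children[High(Parent^.Children)] := Child;
end;

{ Recursive pre-order: the call stack remembers where to resume. }
procedure VisitRecursive(Node: PNode);
var
  i: Integer;
begin
  if Node = nil then Exit;
  Write(Node^.Value, ' ');
  for i := 0 to High(Node^.Children) do
    VisitRecursive(Node^.Children[i]);
end;

{ Iterative pre-order: an explicit stack replaces the call stack. }
procedure VisitIterative(Root: PNode);
var
  Stack: array of PNode;
  Top, i: Integer;
  Node: PNode;
begin
  if Root = nil then Exit;
  SetLength(Stack, 16);
  Top := 0;
  Stack[0] := Root;
  while Top >= 0 do
  begin
    Node := Stack[Top];
    Dec(Top);
    Write(Node^.Value, ' ');
    { Push children in reverse so the leftmost child is popped first. }
    for i := High(Node^.Children) downto 0 do
    begin
      Inc(Top);
      if Top > High(Stack) then
        SetLength(Stack, Length(Stack) * 2);   // grow the stack on demand
      Stack[Top] := Node^.Children[i];
    end;
  end;
end;

{ Releases a whole subtree (post-order). }
procedure FreeTree(Node: PNode);
var
  i: Integer;
begin
  if Node = nil then Exit;
  for i := 0 to High(Node^.Children) do
    FreeTree(Node^.Children[i]);
  Dispose(Node);
end;

var
  Root, A: PNode;
begin
  Root := NewNode(1);
  A := NewNode(2);
  AddChild(Root, A);
  AddChild(Root, NewNode(5));
  AddChild(A, NewNode(3));
  AddChild(A, NewNode(4));

  VisitRecursive(Root); WriteLn;   // prints 1 2 3 4 5
  VisitIterative(Root); WriteLn;   // prints 1 2 3 4 5
  FreeTree(Root);
end.
```

Both procedures visit the nodes in the same order. The iterative one needs the extra bookkeeping (Top, growing the array, reverse push), which is the added complexity mentioned above.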

Quote from: simone on September 22, 2019, 05:45:45 pm

These types of operations are very expensive for the CPU.

There is obviously a cost, but pushing a return address on the stack isn't a very expensive operation, whereas in many cases the added complexity resulting from converting a recursive algorithm into an iterative one can be considerable. Personally, if the recursive algorithm performs well, I will forgo the very small performance increase of an iterative algorithm because I don't think the small performance gain justifies the added complexity.

Quote from: simone on September 22, 2019, 05:45:45 pm

I mentioned authoritative sources in support of my thesis. If someone cites to me equally authoritative sources or shows me experimental evidence of the opposite sign, I am happy to change my mind, because I often use recursive algorithms.

Generally speaking, what you stated is true. The keyword being "generally".

« Last Edit: September 22, 2019, 08:42:14 pm by 440bx »

(FPC v3.0.4 and Lazarus 1.8.2) or (FPC v3.2.2 and Lazarus v3.2) on Windows 7 SP1 64bit.

simone

Hero Member
Posts: 573

Re: TTree depth-first traversal

« Reply #16 on: September 22, 2019, 08:49:20 pm »

I wrote:

Quote from: simone on September 22, 2019, 12:40:55 pm

... I note only that it would have the two typical small disadvantages of recursive algorithms. First, it would be slightly slower. Second, since recursive calls allocate resources in the system stack, in the case of traversals of very large trees, a stack overflow could happen during execution.

It seems to me that you agree.

Microsoft Windows 10 64 bit - Lazarus 3.0 FPC 3.2.2 x86_64-win64-win32/win64

avk

Hero Member
Posts: 752

Re: TTree depth-first traversal

« Reply #17 on: September 23, 2019, 03:34:58 am »

So, can anyone show an equivalent iterative DFS implementation that would be faster than the recursive version?

simone

Hero Member
Posts: 573

Re: TTree depth-first traversal

« Reply #18 on: September 23, 2019, 07:50:29 am »

As soon as possible I will do some tests to confirm (or deny) what is reported in scientific literature and on many programming forums. In the meantime, can you refute this thesis with technical arguments or quote authoritative sources, as I did?

simone

Hero Member
Posts: 573

Re: TTree depth-first traversal

« Reply #19 on: September 23, 2019, 02:31:04 pm »

I did a simple test. This is NOT a rigorous benchmark. I adapted the code to generate a tree with a very simple topology (two levels) but a large number of nodes, N=100000000: 1 root and N-1 children. This topology should be the best case for the recursive approach, because it should reduce the nesting level of the recursive calls (but this issue requires some further investigation). I wrote a recursive DFT algorithm (DFS is a special case of DFT). Unlike the iterative case, the procedure requires a Node parameter. On the first call this parameter must be the tree root, while in subsequent recursive calls it acts as the root of the sub-trees. To avoid noise, the callback procedure does nothing.

Using Lazarus 2.0.4+FPC 3.0.4 (64 bits) on a PC with an Intel Core i7, I got the following results (elapsed times are in milliseconds, as returned by TimestampToMSecs):

Recursive DFT
Start Timer
End Timer
Time Elapsed: 75345

Iterative DFT
Start Timer
End Timer
Time Elapsed: 55202

However it's only a simple test. I need further confirmations.

This is the code:

Code: Pascal

program project1;
{$mode objfpc}{$H+}
 
uses
    gtree, gvector, sysutils;
 
type
 
    TExpression = class(TObject)
    public
        value: integer;
        constructor Create(v: integer);
    end;
 
    TExpressionNode = class(specialize TTreeNode<TExpression>)
        destructor Destroy; override;
    end;
 
    { TExpressionTree }
 
    TExpressionTree = class(specialize TTree<TExpression>)
      procedure DepthFirstTraverseRecursive(Node : TTreeNodeType; Callback: TDepthFirstCallbackType);
    end;
 
var
    tree: TExpressionTree;
    n,n1 : TExpressionNode;
    e,e1 : TExpression;
    T0,T1 : Comp;
    ind : QWord;
 
procedure TimerOn;
begin
  Writeln('Start Timer');
  T0:=TimestampToMSecs(DateTimeToTimestamp(Now));
end;
 
procedure TimerOff;
begin
  T1:=TimestampToMSecs(DateTimeToTimestamp(Now));
  Writeln('End Timer');
  Writeln('Time Elapsed: '+IntToStr(QWord(T1-T0)));
end;
 
{ TExpressionTree }
 
procedure TExpressionTree.DepthFirstTraverseRecursive(Node : TTreeNodeType; Callback: TDepthFirstCallbackType);
var
  Child: TTreeNodeType;
begin
  if Assigned(Node) then
    begin
      Callback(Node.Data);
      for Child in Node.Children do
        DepthFirstTraverseRecursive(Child,Callback);
    end;
end;
 
 
constructor TExpression.Create(v: integer);
begin
    self.value := v;
end;
 
 
destructor TExpressionNode.Destroy;
begin
    self.Data.Free;
    inherited;
end;
 
 
procedure WriteCallback(const e: TExpression);
begin
  // Intentionally empty so that the callback adds no noise to the timing.
  //write(e.value, ' ');
end;
 
 
begin
    Tree:=TExpressionTree.Create;
 
    e1:=TExpression.Create(0);
    n1:=TExpressionNode.Create;
    n1.Data := e1;
    tree.Root := n1;
    for ind:=1 to 100000000 do
      begin
        e:= TExpression.Create(ind);
        n:= TExpressionNode.Create;
        n.Data:=e;
        n1.Children.PushBack(n);
      end;
 
    writeln('Recursive DFT');
    TimerOn;
    Tree.DepthFirstTraverseRecursive(n1,@WriteCallback);
    TimerOff;
    writeln;
 
    writeln('Iterative DFT');
    TimerOn;
    Tree.DepthFirstTraverse(@WriteCallback);
    TimerOff;
    writeln;
 
    tree.Free;
 
    readln;
 
end.
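A note on the timing itself: the timer above is built on Now, which is wall-clock time. A possible alternative, sketched here under the assumption that GetTickCount64 from SysUtils is available in the FPC version in use, is to take the difference of two tick counts. The loop being timed is only a placeholder for the traversal.

```pascal
program timing_sketch;
{$mode objfpc}{$H+}

uses
  SysUtils;

var
  StartTick, Elapsed: QWord;
  i: Integer;
  Sum: Int64;
begin
  Sum := 0;

  StartTick := GetTickCount64;          // millisecond tick counter
  for i := 1 to 100000000 do            // placeholder workload
    Sum := Sum + i;
  Elapsed := GetTickCount64 - StartTick;

  WriteLn('Sum: ', Sum);                // use the result so the loop is not pointless
  WriteLn('Time Elapsed (ms): ', Elapsed);
end.
```

It may also be worth running the two traversals in both orders, since whichever runs first pays for bringing the tree into the caches.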
 

avk

Hero Member
Posts: 752

Re: TTree depth-first traversal

« Reply #20 on: September 23, 2019, 02:42:30 pm »

My English is certainly terrible, but you should have noticed that I was interested specifically in the DFS implementation, for the reason that I already explained. I'm not going to refute the CS stars at all, and I trust their opinion; I am interested in the practical aspect: is it possible to somehow improve the situation I described above?
As for DepthFirstTraverse, I have already said that it is not equivalent to a recursive DFS (for example, yours).

simone

Hero Member
Posts: 573

Re: TTree depth-first traversal

« Reply #21 on: September 23, 2019, 03:48:09 pm »

Your English is certainly better than mine. I didn't understand exactly what you need. A search (for example using a DFS algorithm) on a data structure such as a list, a tree or a graph requires, unless the structure is ordered, a complete traversal of it (for example using a DFT algorithm). In our example, the library that implements the DFT allows the use of a callback procedure that can help to perform a search.

avk

Hero Member
Posts: 752

Re: TTree depth-first traversal

« Reply #22 on: September 23, 2019, 05:01:44 pm »

I'm trying to say that if we want to compare the performance of an iterative and a recursive version of some algorithm, these versions should be equivalent, otherwise the comparison does not make sense. Of course, DepthFirstTraverse does a complete traversal of the tree, without any doubt, but this is a different algorithm.

440bx

Hero Member
Posts: 4063

Re: TTree depth-first traversal

« Reply #23 on: September 23, 2019, 05:03:31 pm »

Quote from: avk on September 23, 2019, 02:42:30 pm

is it possible to somehow improve the situation I described above.

The answer to that is _very likely_ to be, yes. The performance gain in most cases will be minimal to negligible and, the code to implement an iterative solution will be noticeably more complex. It's simply not worth it.

Just FYI, Sedgewick, in the second edition of his book "Algorithms", presents both a recursive and an iterative implementation (Pascal pseudocode) of a DFS (neither is optimized.) As expected, the iterative implementation is more complex than the recursive one and, as presented in the book, it's unclear which one would actually perform better.

I _believe_ that, with careful optimization of both algorithms, the iterative algorithm would be a smidgen faster (most likely not humanly perceptible in the great majority of cases.) I believe that because, in a good iterative implementation, a push of the return address on the stack will no longer be necessary (in most cases, that isn't much of a performance gain.)

Quote from: simone on September 22, 2019, 08:49:20 pm

It seems to me that you agree.

In general, yes, I agree with what you've stated.

However, there is one statement you've made that I disagree with, which is:

Quote from: simone on September 22, 2019, 05:45:45 pm

These types of operations are very expensive for the CPU.

There is obviously a cost associated with pushing "items" on a stack but, they really aren't "very expensive".

Thaddy

Hero Member
Posts: 14377
Sensorship about opinions does not belong here.

Re: TTree depth-first traversal

« Reply #24 on: September 23, 2019, 05:17:55 pm »

In general, iterative algorithms run faster than their recursive equivalents. But it will only show on *huge* amounts of data.
If there is a noticeable discrepancy between the two, likely the slowest one has an inferior implementation.

(More on topic: left and right are a contract, keep to the contract and you are safe..)

Note

Quote

There is obviously a cost associated with pushing "items" on a stack but, they really aren't "very expensive".

Well, the obvious amazes me, since stack/queue are linked list based. You can't improve on that, currently.....

You are right, btw... But you should have added the details. Also, recursion is a hornet's nest of problems in the hands of all programmers....

« Last Edit: September 23, 2019, 05:32:44 pm by Thaddy »

Object Pascal programmers should get rid of their "component fetish" especially with the non-visuals.

simone

Hero Member
Posts: 573

Re: TTree depth-first traversal

« Reply #25 on: September 23, 2019, 05:54:44 pm »

Quote from: avk on September 23, 2019, 05:01:44 pm

I'm trying to say that if we want to compare the performance of an iterative and a recursive version of some algorithm, these versions should be equivalent, otherwise the comparison does not make sense. Of course, DepthFirstTraverse does a complete traversal of the tree, without any doubt, but this is a different algorithm.

Indeed, in my test I compared a recursive DFT algorithm with an iterative one.

Quote from: 440bx on September 23, 2019, 05:03:31 pm

Quote from: simone on September 22, 2019, 08:49:20 pm

It seems to me that you agree.

In general, yes, I agree with what you've stated.

However, there is one statement you've made that I disagree with, which is:

Quote from: simone on September 22, 2019, 05:45:45 pm

These types of operations are very expensive for the CPU.

There is obviously a cost associated with pushing "items" on a stack but, they really aren't "very expensive".

Recursion, as is well known, is implemented using a stack. Every recursive call allocates a new 'stack frame' on top of the stack. Each frame contains, among other things, the following information for each execution of the recursive procedure: the parameter values passed to the procedure; the return address of the caller; and the values of all the local variables of the procedure. When the recursion terminates, all these frames are deallocated in reverse order. These operations are very expensive, since they consume stack space and CPU time.

440bx

Hero Member
Posts: 4063

Re: TTree depth-first traversal

« Reply #26 on: September 23, 2019, 07:24:38 pm »

Quote from: simone on September 23, 2019, 05:54:44 pm

Recursion, as is well known, is implemented using a stack. Every recursive call allocates a new 'stack frame' on top of the stack. Each frame contains, among other things, the following information for each execution of the recursive procedure: the parameter values passed to the procedure; the return address of the caller; and the values of all the local variables of the procedure. When the recursion terminates, all these frames are deallocated in reverse order. These operations are very expensive, since they consume stack space and CPU time.

What you stated above is correct except for the "very expensive" part.

A "very expensive" sequence of instructions is, for instance, servicing an exception, _that_ is very expensive.

I suggest you do the following to give yourself an idea of how relatively "expensive" some instruction sequences are: in a loop - say about a million times - call a procedure. First with no parameters then, with an increasing number of parameters. Obviously, time that.

You may also want to do the above with a function that returns an ordinal type (integer for instance) and a function that returns a structured type of 32 bytes.

Compare the results you obtained above (for the various cases), with a loop (same number of executions) that simply executes the expression "a := b + c;" (assign random values to b and c before the loop or the compiler may optimize the code causing it to be executed only once.)
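The experiment described above could be sketched roughly as follows. This is only an illustration, not a rigorous benchmark: the procedure names and the Sink variable are made up, the iteration count is an assumption (raised well above a million so that a millisecond timer has something to measure), and GetTickCount64 from SysUtils is assumed to be available. It is best compiled without aggressive optimization so that the calls are not inlined.

```pascal
program call_cost_sketch;
{$mode objfpc}{$H+}

uses
  SysUtils;

const
  // Assumption: far more than a million so a millisecond timer registers.
  Iterations = 100000000;

type
  TBlock32 = record
    Data: array[0..31] of Byte;   // 32-byte structured result
  end;

var
  Sink: Int64 = 0;   // accumulates results so the work is not discarded
  a, b, c, i: Integer;
  Blk: TBlock32;
  T: QWord;

procedure NoParams;
begin
  Inc(Sink);
end;

procedure FourParams(p1, p2, p3, p4: Integer);
begin
  Inc(Sink, p1 + p2 + p3 + p4);
end;

function ReturnsInt(p: Integer): Integer;
begin
  Result := p + 1;
end;

function ReturnsBlock(p: Integer): TBlock32;
begin
  FillChar(Result, SizeOf(Result), 0);
  Result.Data[0] := p and $FF;
end;

begin
  Randomize;
  b := Random(100);   // random inputs so the sum is not a compile-time constant
  c := Random(100);
  a := 0;

  T := GetTickCount64;
  for i := 1 to Iterations do
    a := b + c;
  WriteLn('a := b + c           : ', GetTickCount64 - T, ' ms');
  Inc(Sink, a);

  T := GetTickCount64;
  for i := 1 to Iterations do
    NoParams;
  WriteLn('call, no parameters  : ', GetTickCount64 - T, ' ms');

  T := GetTickCount64;
  for i := 1 to Iterations do
    FourParams(i, 1, 2, 3);
  WriteLn('call, four parameters: ', GetTickCount64 - T, ' ms');

  T := GetTickCount64;
  for i := 1 to Iterations do
    Inc(Sink, ReturnsInt(i));
  WriteLn('function -> Integer  : ', GetTickCount64 - T, ' ms');

  T := GetTickCount64;
  for i := 1 to Iterations do
  begin
    Blk := ReturnsBlock(i);
    Inc(Sink, Blk.Data[0]);
  end;
  WriteLn('function -> 32 bytes : ', GetTickCount64 - T, ' ms');

  WriteLn('Sink: ', Sink);   // printed so the results are actually used
end.
```

The absolute numbers will depend on the machine and the compiler settings; what matters is how the call cases compare with the plain assignment loop.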

It is obviously true that there is a cost in setting up a stack frame but, it is _not_ a very expensive operation. The only time, setting up the stack frame is "very expensive" as you put it is when the total size of the local variables cause the stack pointer to "bump into" (or go past) the bottom of the stack. In that case, one or more exceptions will need to be serviced by the O/S to commit whatever additional memory is needed on the stack to accommodate the local variables. That is the only time when setting up a stack frame can legitimately be considered to be "very expensive".

It should also be noted that if a program is concerned about the amount of stack space it is going to consume during a time sensitive operation, it can specify a "stack commit size" in the PE file large enough to ensure that no exceptions will be necessary to ensure that enough memory has been committed to it.

HTH.

« Last Edit: September 23, 2019, 07:28:21 pm by 440bx »

simone

Hero Member
Posts: 573

Re: TTree depth-first traversal

« Reply #27 on: September 23, 2019, 07:36:25 pm »

'Expensive' is a relative term… However I agree with your statements.

julkas

Guest

Re: TTree depth-first traversal

« Reply #28 on: September 23, 2019, 08:20:35 pm »

What about {$optimization tailrec}? Any practical example?
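For reference, a minimal sketch of the kind of function that switch is aimed at. SumTo is a made-up example: the recursive call is the last thing the function does, so the compiler is allowed to turn it into a jump instead of a call. Whether it actually does so depends on the FPC version and settings, so the depth here is kept small enough to run either way.

```pascal
program tailrec_sketch;
{$mode objfpc}{$H+}
{$optimization tailrec}

{ Tail-recursive sum of 1..n: nothing remains to be done after the
  recursive call returns, so no frame needs to be kept for it. }
function SumTo(n, Acc: Int64): Int64;
begin
  if n = 0 then
    Result := Acc
  else
    Result := SumTo(n - 1, Acc + n);
end;

begin
  WriteLn(SumTo(10000, 0));   // prints 50005000
end.
```

Note that the recursive DFT posted earlier in this thread makes its recursive call inside a for loop, so the call is not in tail position (the loop still has to continue after it returns). It is therefore unlikely that this switch alone would remove that recursion.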

simone

Hero Member
Posts: 573

Re: TTree depth-first traversal

« Reply #29 on: September 23, 2019, 10:00:06 pm »

I did not know about this switch. Thanks for pointing it out to me. I will try it.

« Last Edit: September 23, 2019, 10:09:16 pm by simone »
