Print Page - Can't we get rid off circular unit reference?

Free Pascal => FPC development => Topic started by: Pascal on February 11, 2018, 12:22:21 pm

Title: Can't we get rid off circular unit reference?
Post by: Pascal on February 11, 2018, 12:22:21 pm

I like (free)pascal very much. But one of the most anoying things is the limitations due to circular unit references.
Why do we still need this? Inside a unit we can have foward declaration which can be used to
make two classes to reference each other. Why can't this work accross units? The compiler can stop iterating thru the
units when he raches a unit he has already paresed and behave like he does with forward declarations?

I do not see the reason for this limitation! Maybe someone can give me a clue?

I know that i can use base classes but i have to cast the the base classes to the real classes in the implematation section
anyway.

Title: Re: Can't we get rid off circular unit reference?
Post by: marcov on February 11, 2018, 01:07:23 pm

Quote from: Pascal on February 11, 2018, 12:22:21 pm

I like (free)pascal very much. But one of the most anoying things is the limitations due to circular unit references.
Why do we still need this? Inside a unit we can have foward declaration which can be used to
make two classes to reference each other. Why can't this work accross units? The compiler can stop iterating thru the
units when he raches a unit he has already paresed and behave like he does with forward declarations?

The advantage of this way is that you only import something that is fairly completely defined (except a few things like inline, and changing of procedure attributes in the implementation). You never have to assume what an identifier means (and backtrack if wrong when you parse the other source) of have workarounds or assumptions in the language to disambiguate this. (and think not just of work to compile correct code, but also the effort to get clear and correct error messages)

And I never saw it as a problem to begin with. I like a nice tight declaration without code in it (C++ and Java classes look messy to me, I can live with it, but consider it suboptimal), and ordering the uses clauses quickly becomes a second nature. I only hit the limit when I'm deliberately experimenting during refactoring rarely or never during normal coding.

Anyway, once you start down a road, regardless what other languages do, you are stuck to that model, or totally redesign and rewrite the way multiple sources are combined to one program, which is one of the hardest part of the program.

Exploring what works and what not, would be more a job for a new and small and focussed pascal compiler that can move more agilely than a twenty+ year old behemoth like Free Pascal with a zillion features that might need fixing and rethinking.

Quote

I know that i can use base classes but i have to cast the the base classes to the real classes in the implematation section
anyway.

Generics alleviate that somewhat. I was quite happy how my own container classes ported to it.

Title: Re: Can't we get rid off circular unit reference?
Post by: Thaddy on February 11, 2018, 01:51:14 pm

Note that iso mode has iirc such a feature to a certain extend. But only with all iso code. Can't mix it.
[edit] Ahh, I see, not implemented yet, since it is extended mode.

Title: Re: Can't we get rid off circular unit reference?
Post by: Pascal on February 11, 2018, 06:20:39 pm

Quote from: marcov on February 11, 2018, 01:07:23 pm

The advantage of this way is that you only import something that is fairly completely defined (except a few things like inline, and changing of procedure attributes in the implementation). You never have to assume what an identifier means (and backtrack if wrong when you parse the other source) of have workarounds or assumptions in the language to disambiguate this. (and think not just of work to compile correct code, but also the effort to get clear and correct error messages)

But it stays completely defined. It's just that two classes from two units can reference each other. Like it can be done with implicit forward declarations inside a unit allready.

Quote from: marcov on February 11, 2018, 01:07:23 pm

And I never saw it as a problem to begin with. I like a nice tight declaration without code in it (C++ and Java classes look messy to me, I can live with it, but consider it suboptimal), and ordering the uses clauses quickly becomes a second nature. I only hit the limit when I'm deliberately experimenting during refactoring rarely or never during normal coding.

To have two classes reference each other you have to put them in one unit which breaks the concept of clean and simple units.
Or you have to user base classes which makes you use casts in the implementation part which breaks the concept of type safety.
Or you use helper classes.

So why not do it in a straightforward and simple way and enable circular unit references?

Quote from: marcov on February 11, 2018, 01:07:23 pm

Anyway, once you start down a road, regardless what other languages do, you are stuck to that model, or totally redesign and rewrite the way multiple sources are combined to one program, which is one of the hardest part of the program.

Yes, but this inconvenience always forces you to find some way around the "circular unit reference problem". So you always have to build some kind of hack!
Imho this is contradictory to the clean concept of the pascal language.

Quote from: marcov on February 11, 2018, 01:07:23 pm

Exploring what works and what not, would be more a job for a new and small and focussed pascal compiler that can move more agilely than a twenty+ year old behemoth like Free Pascal with a zillion features that might need fixing and rethinking.

Shouldn't this be quite easy to implement? The compiler just has to stop parsing a unit that he already knows. So unit A can use classes of unit B and vice versa.

Title: Re: Can't we get rid off circular unit reference?
Post by: Thaddy on February 11, 2018, 08:00:18 pm

Note for allowing interface sections with multiple implementation sections we already have inc files.
The multi-platform support is even built on that.
Personally
- I can live with how it is now.
- I use include files when appropriate.
- If you have many circular references on the same platform your architecture is probably wrong and we can probably set you on a better track when we see some code..

Problem solved.

Title: Re: Can't we get rid off circular unit reference?
Post by: Pascal on February 12, 2018, 06:05:58 am

Quote from: Thaddy on February 11, 2018, 08:00:18 pm

Note for allowing interface sections with multiple implementation sections we already have inc files.
The multi-platform support is even built on that.
Personally
- I can live with how it is now.
- I use include files when appropriate.
- If you have many circular references on the same platform your architecture is probably wrong and we can probably set you on a better track when we see some code..

Problem solved.

I am talking about one platform. And i don't think that my architecture is wrong. The problem is that you always have to refactor your class layout when
you need to have two classes which need to reference each other. And this is only needed to circumvent the "circular unit problem". In Free Pascal it is
possible to have two classes reference each other as long as the two classes are in the same unit. So you are right, i can use include files but in fact that
will lead to one big unit.

I do not really see the point why that, what works inside on unit, is not allowed across two units. This really blows up the class layout and forces you to
build in some hacks. What is wrong with classes referencing each other when they are in different units instead of the same one?
You can not just say it's bad design!
There might have been reasons in the past why it should not be possible to have two units uses each other in the interface section. But i do not think
that we still need it today.

When the compiler finds a unit it already has parsed it can break stepping into this unit's interface section and proecess with the next used unit and
everything which is needed for building types should be there. If the parser the finds a class which is not defined it can do an implicit forward declaration and see if it will be solved until the interface section.
And that should be enough.

If i would know the compiler sources i would have done a try already. But unfortunately i do not! So maybe someone of the FPC team could give this idea a try?

Title: Re: Can't we get rid off circular unit reference?
Post by: Handoko on February 12, 2018, 06:34:30 am

Quote from: Thaddy on February 11, 2018, 08:00:18 pm

Personally
- I can live with how it is now.
- I use include files when appropriate.
- If you have many circular references on the same platform your architecture is probably wrong and we can probably set you on a better track when we see some code..

I know include files, but I never use it. Can you explain on what cases it is good to use include files instead of make them as units or simply merge them into the code?

Title: Re: Can't we get rid off circular unit reference?
Post by: Thaddy on February 12, 2018, 06:41:47 am

Suppose you have optimized implementations for arm and x86. You can include a single interface include file and two implementation include file.
If the interface is well defined you need just a single interface, platform independent, and implement the platform implementations as multiple includes.
That way the interface stays unified. Look at the structure of the rtl. Many such examples.

Title: Re: Can't we get rid off circular unit reference?
Post by: Handoko on February 12, 2018, 06:52:47 am

Okay, I understand now. Include files are good when used on writing cross platform codes. Thank you.

Title: Re: Can't we get rid off circular unit reference?
Post by: marcov on February 12, 2018, 09:51:28 am

Quote from: Pascal on February 11, 2018, 06:20:39 pm

Quote from: marcov on February 11, 2018, 01:07:23 pm
The advantage of this way is that you only import something that is fairly completely defined (except a few things like inline, and changing of procedure attributes in the implementation). You never have to assume what an identifier means (and backtrack if wrong when you parse the other source) of have workarounds or assumptions in the language to disambiguate this. (and think not just of work to compile correct code, but also the effort to get clear and correct error messages)

But it stays completely defined. It's just that two classes from two units can reference each other. Like it can be done with implicit forward declarations inside a unit allready.

Only if you would forward define it, which is the workaround thing again. And then only for reference types. (interfaces and classes, e.g. not TP objects)

Quote

To have two classes reference each other you have to put them in one unit which breaks the concept of clean and simple units.
Or you have to user base classes which makes you use casts in the implementation part which breaks the concept of type safety.
Or you use helper classes.

Or maybe even generics.

Quote

So why not do it in a straightforward and simple way and enable circular unit references?

I think it is not straightforward as implementation goes, and you will be dealing a long time with fallout. Moreover you will have to rewrite a significant and difficult part.

I'm not stopping you to try to come with a patch, but I would first research the consequences first.

Quote from: marcov on February 11, 2018, 01:07:23 pm

Yes, but this inconvenience always forces you to find some way around the "circular unit reference problem". So you always have to build some kind of hack!

Mutual reference is not "clean" by definition, and breaking it is actually clean. Doing it straightforward for simple cases is actually the "hack". Some languages allow it for ease of use, but it is trouble from a language design view.

Quote from: Pascal on February 11, 2018, 06:20:39 pm

Shouldn't this be quite easy to implement? The compiler just has to stop parsing a unit that he already knows. So unit A can use classes of unit B and vice versa.

Anything that goes over unit borders and is not quite defined is usually a nightmare. And soon the bugs for corner cases with this functionality will start to pile up. (somebody will try to do a 3-unit cycle etc) Anybody taking this on better be prepared to monitor it for a long time.

Title: Re: Can't we get rid off circular unit reference?
Post by: Pascal on February 12, 2018, 11:05:23 am

Quote from: marcov on February 12, 2018, 09:51:28 am

I'm not stopping you to try to come with a patch, but I would first research the consequences first.

That's my problem! I would like to do, but my knowledge of the compiler sources is very limited atm. If someone could lead me to the
significant places (unit interface parsing, forward declarations) i would like to give it a try.

Btw: Is there an up to date wiki about the compiler and its inner structure?

Quote from: marcov on February 12, 2018, 09:51:28 am

Mutual reference is not "clean" by definition, and breaking it is actually clean. Doing it straightforward for simple cases is actually the "hack". Some languages allow it for ease of use, but it is trouble from a language design view.

From the language point of view i can agree with you. But from the ease of use point of view i would like to see this "hack" across units.

Quote from: marcov on February 12, 2018, 09:51:28 am

Anything that goes over unit borders and is not quite defined is usually a nightmare. And soon the bugs for corner cases with this functionality will start to pile up. (somebody will try to do a 3-unit cycle etc) Anybody taking this on better be prepared to monitor it for a long time.

Atm i am just thinking about simple type references.

The problem here is that while parsing the used units the parser will find types which will be resolved in the interface of the unit it is parsing. So
we would need something like implicit forward declarations across units as explicit forward declarations will not work here (only work per unit).

Title: Re: Can't we get rid off circular unit reference?
Post by: marcov on February 12, 2018, 11:15:58 am

As an example of a corner case, think of things like

Code: Pascal [Select][+]

   type
      TFoo = class
                     anOhterFoo  : TOtherFoo;
                     property bah : integer read anohterfoo.bah;
                  end;
  

After TOtherfoo is defined in the other unit, you need to make sure that it has a "bah". Moreover you can't generate code till you have the full definition of TOtherFoo. Which can then have a circular reference much larger than 2 units, which already gives me a headache THINKING of it, let alone the implementation.

Title: Re: Can't we get rid off circular unit reference?
Post by: Pascal on February 12, 2018, 12:57:04 pm

Quote from: marcov on February 12, 2018, 11:15:58 am

As an example of a corner case, think of things like

Code: Pascal [Select][+][-]
type
TFoo = class
anOhterFoo : TOtherFoo;
property bah : integer read anohterfoo.bah;
end;

After TOtherfoo is defined in the other unit, you need to make sure that it has a "bah". Moreover you can't generate code till you have the full definition of TOtherFoo. Which can then have a circular reference much larger than 2 units, which already gives me a headache THINKING of it, let alone the implementation.

If all interfaces are okay then everything should be solved until the implementation part. If not, generate an error like:

Code: Text [Select][+]

unit1.pas(3,36) Error: Forward type not resolved "TOtherClass"

Code: Text [Select][+]

unit1.pas(4,61) Error: Unknown record field identifier "bah"

So pretty everything like it is done with forward declarations now.

The parser just has to insert an implicit forward declaration when it reaches an undefined type while parsing the interfaces of the used units.

Title: Re: Can't we get rid off circular unit reference?
Post by: Thaddy on February 12, 2018, 01:08:57 pm

Quote from: Pascal on February 12, 2018, 12:57:04 pm

So pretty everything like it is done with forward declarations now.

The parser just has to insert an implicit forward declaration when it reaches an undefined type while parsing the interfaces of the used units.

Well, as Marco implicitly explained this is not the case:
Suppose unitA has a type or procedure declared, like say, TPoint. And unitB re-declares TPoint. Now, the parser can only resolve its use based on unit order..
What you propose does not work with forward declarations, probably not even with a multi-pass compiler. At parse time the compiler does not know which scope you mean: A or B.
This is already a common scenario in the existing code base, where clashes already occur and need to be resolved by explicit scoping: unitB.Tpoint, unitA.Tpoint.
Maybe a record is not a proper example but the same goes for classes, procedures and functions.

Title: Re: Can't we get rid off circular unit reference?
Post by: Pascal on February 12, 2018, 01:16:22 pm

Quote from: Thaddy on February 12, 2018, 01:08:57 pm

Quote from: Pascal on February 12, 2018, 12:57:04 pm
So pretty everything like it is done with forward declarations now.

The parser just has to insert an implicit forward declaration when it reaches an undefined type while parsing the interfaces of the used units.
Well, as Marco implicitly explained this is not the case:
Suppose unitA has a type or procedure declared, like say, TPoint. And unitB re-declares TPoint. Now, the parser can only resolve its use based on unit order..
What you propose does not work with forward declarations, probably not even with a multi-pass compiler. At parse time the compiler does not know which scope you mean: A or B.
This is already a common scenario in the existing code base, where clashes already occur and need to be resolved by explicit scoping: unitB.Tpoint, unitA.Tpoint.
Maybe a record is not a proper example but the same goes for classes, procedures and functions.

Yes, but this is another issue. I unfortunately do not see what this has to do with my topic?

Title: Re: Can't we get rid off circular unit reference?
Post by: Ñuño_Martínez on February 12, 2018, 01:42:20 pm

Pascal was designed with top-down/bottom-up design (https://en.wikipedia.org/wiki/Top-down_and_bottom-up_design) in mind. I like this way of working. Circular reference is a consequence of that design, and I think it is good because it forces you to think better solutions (better because they're more encapsulated and errors will have less propagation. I hope you understand me).

My 2 cents.

Title: Re: Can't we get rid off circular unit reference?
Post by: Thaddy on February 12, 2018, 01:43:46 pm

Getting rid of circular references? O:-) O:-) O:-)
It may be that unitA causes a circular reference and the same re-declaration in unitB doesn't?
Think logically... Can you prevent that with your proposal? NO. Depends on unit order.

Title: Re: Can't we get rid off circular unit reference?
Post by: Thaddy on February 12, 2018, 01:45:03 pm

Quote from: Ñuño_Martínez on February 12, 2018, 01:42:20 pm

Pascal was designed with top-down/bottom-up design (https://en.wikipedia.org/wiki/Top-down_and_bottom-up_design) in mind. I like this way of working. Circular reference is a consequence of that design, and I think it is good because it forces you to think better solutions (better because they're more encapsulated and errors will have less propagation. I hope you understand me).

My 2 cents.

Indeed. It is also - often, not always - a warning for bad design: insufficient code separation. Most circular references can be factored out.

Title: Re: Can't we get rid off circular unit reference?
Post by: marcov on February 12, 2018, 01:53:42 pm

Quote from: Pascal on February 12, 2018, 01:16:22 pm

Yes, but this is another issue. I unfortunately do not see what this has to do with my topic?

Your title is more generic (all types) than the discussion (reference class types only, later limited also to a subset of options (e.g. not supporting the property case)).

Anyway, while with a lot of limitations some simple cases could be maybe done, it is a ton of work for the cases that are simple to solve in the first place.

Title: Re: Can't we get rid off circular unit reference?
Post by: Pascal on February 12, 2018, 02:37:59 pm

Quote from: marcov on February 12, 2018, 01:53:42 pm

Quote from: Pascal on February 12, 2018, 01:16:22 pm
Yes, but this is another issue. I unfortunately do not see what this has to do with my topic?

Your title is more generic (all types) than the discussion (reference class types only, later limited also to a subset of options (e.g. not supporting the property case)).

Well, yes.

Quote from: marcov on February 12, 2018, 01:53:42 pm

Anyway, while with a lot of limitations some simple cases could be maybe done, it is a ton of work for the cases that are simple to solve in the first place.

If you or someone else could lead me to the relevant places in the compiler/parser sources i will give it a try.
And as asked earlier: Is there an up to date wiki/doku of the inner structure/working of the compiler, which could help me understand the sources?

Title: Re: Can't we get rid off circular unit reference?
Post by: Thaddy on February 12, 2018, 03:09:12 pm

Quote from: Pascal on February 12, 2018, 02:37:59 pm

If you or someone else could lead me to the relevant places in the compiler/parser sources i will give it a try.
And as asked earlier: Is there an up to date wiki/doku of the inner structure/working of the compiler, which could help me understand the sources?

Well, there is "advanced documentation" in the sense that it is documented how the compiler itself can be compiled with debug info. So you can debug the compiler under fpc itself. So you can get any information you want following program flow. O:-)
The compiler sources are current documentation, I guess. (Not very helpful, but that's how I do it)

Title: Re: Can't we get rid off circular unit reference?
Post by: marcov on February 12, 2018, 03:47:09 pm

Quote from: Pascal on February 12, 2018, 02:37:59 pm

If you or someone else could lead me to the relevant places in the compiler/parser sources i will give it a try.
And as asked earlier: Is there an up to date wiki/doku of the inner structure/working of the compiler, which could help me understand the sources?

No, there is not much internal documentation. There are maybe some docs on specific features in the wiki, but the only attempt at all-encompassing documentation is pre 2005, and terribly old. (and even then it is more a guide to look up specific types/nodes, and less a "how to" manual).

So getting your feet wet by trying (bugfixing and simple features) is the typical way to go.

Title: Re: Can't we get rid off circular unit reference?
Post by: Pascal on February 13, 2018, 07:57:24 am

Okay, then let's start diving in :D

Title: Re: Can't we get rid off circular unit reference?
Post by: PascalDragon on February 17, 2018, 07:16:09 pm

If you want to play around you can try to use the attached patch (might result in conflicts cause it's a bit older already) to play around with the concept of formal classes (like they are done for the objcclass). It basically allows you to use the following code:

Code: Pascal [Select][+]

unit foo;
 
{$mode objfpc}
 
type
  TFromOtherUnit = class external; // originally declared in OtherUnit
 
  TMyType = class
    fField: TFromOtherUnit;
    constructor Create;
  end;
 
implementation
 
uses
  OtherUnit;
 
constructor TMyType.Create;
begin
  fField := TFromOtherUnit.Create;
end;
 
begin
end.
 

Please note that this has the restrictions that you can't inherit from an externally declared class or use fields or properties unless the declaring unit is in scope. It's essentially an opaque reference, because the compiler knows that classes are pointers it can treat it as such for quite some time.
There might also be unresolved problems regarding the RTTI and such which is why I haven't integrated this in trunk.

Trying to rework the unit loading so that circular references are supported is highly discouraged as this can lead to hard to solve bugs (as the introduction of inlining some years back has shown).

Lazarus

Free Pascal => FPC development => Topic started by: Pascal on February 11, 2018, 12:22:21 pm