FeedForwardNetwork is catastrophically broken #255

ntraft · 2022-11-20T02:22:08Z

One-Line Summary

On almost all evolved genomes, FeedForwardNetwork does not execute all nodes, and many nodes retain their initial value of 0.0. This manifests itself as a network always returning an output of all zeros.

More Details

Sorry for the melodramatic headline, but I wanted to make it clear that this isn't just any old bug. As far as I can tell, it would affect any and all feed-forward NEAT programs in a very dramatic way. I found the issue on the OpenAI Lunar Lander example. It's actually pretty interesting that the XOR and Cart-Pole examples still work well. The issue doesn't present itself until the network has evolved for a while, so it isn't immediately apparent.

I'm using the latest version of the repo (not the pip release). The code I'm looking at has been present since at least 2017, so I'm not exactly sure whether it's expected behavior.

The issue I'm seeing is basically that FeedForwardNetwork.node_evals does not actually contain all necessary nodes. In the net below, only the following nodes are being run: 2176, 1153, 1602, 1311. Notice that these are the only nodes which depend on an input, but do not depend on any other nodes (blue arrows).

There are many non-input nodes which have no predecessors (red arrows). These nodes are not being included in the initial set of eligible nodes in neat.graphs.feed_forward_layers(). The fix would be to add these nodes as the first "layer" of the computation. I intend to fix this in my own fork and submit a pull request.

However, I'm not sure whether this is the correct fix, or whether it indicates a deeper problem. Are these "dangling" nodes with no inputs expected? This seems related to #250. These nodes are largely unnecessary, yet they are not completely functionless (they essentially act as an additional "bias" parameter). Plus, they could become more integrated by the addition of new edges in later generations. So it seems like they belong there, but it is shocking that such a massive bug has existed for years, and that makes me very suspicious of whether my fix is correct.
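To make the failure mode concrete, here is a minimal sketch of the layering problem described above. It is a simplified stand-in written for this issue, not the actual code of neat.graphs.feed_forward_layers(); the function names `required_nodes` and `layers_sketch`, the `seed_dangling` flag, and the tiny three-connection graph are all hypothetical. Node 2 has no incoming connections, so output node 0 never becomes eligible unless node 2 is seeded as a first layer.

```python
def required_nodes(inputs, outputs, connections):
    """Non-input nodes from which some output is reachable (outputs included)."""
    required = set(outputs)
    frontier = set(outputs)
    while frontier:
        # Sources of connections feeding the current frontier.
        sources = {a for (a, b) in connections if b in frontier and a not in required}
        frontier = sources - set(inputs)
        required |= frontier
    return required


def layers_sketch(inputs, outputs, connections, seed_dangling=False):
    """Group required nodes into layers that can be evaluated in order.

    seed_dangling is a hypothetical switch for the proposed fix: nodes with
    no incoming connections are placed in a first layer of their own.
    """
    required = required_nodes(inputs, outputs, connections)
    has_incoming = {b for (_, b) in connections}
    layers = []
    done = set(inputs)

    if seed_dangling:
        dangling = {n for n in required if n not in has_incoming}
        if dangling:
            layers.append(dangling)
            done |= dangling

    while True:
        # Nodes fed by at least one already-evaluated node.
        candidates = {b for (a, b) in connections if a in done and b not in done}
        # Keep only those whose every input has already been evaluated.
        ready = {
            n for n in candidates
            if n in required and all(a in done for (a, b) in connections if b == n)
        }
        if not ready:
            break
        layers.append(ready)
        done |= ready
    return layers


# Input -1 feeds hidden node 1; hidden node 2 has no inputs; both feed output 0.
connections = [(-1, 1), (1, 0), (2, 0)]
broken = layers_sketch([-1], [0], connections)
fixed = layers_sketch([-1], [0], connections, seed_dangling=True)
print(broken)  # output node 0 is missing from the layers
print(fixed)   # node 2 is evaluated first, so node 0 becomes reachable
```

In the unseeded case the output node is never scheduled, which would leave it at its initial value of 0.0. Whether seeding is the right fix, as opposed to preventing such nodes from arising at all, is exactly the open question.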

To Reproduce

Steps to reproduce the behavior:

1. cd examples/openai-lander
2. python evolve.py
3. See a fitness.svg plot like the one below (I have modified it for more clarity). We can't achieve a positive reward (solving the task would be a reward of +200).


ntraft · 2022-11-20T03:04:47Z

As I look into this further, it really does seem like it's not intended for any node in the graph not to have inputs. Many of the aggregation functions throw an error if called with an empty list. Is there some place in the NEAT algorithm where these dangling no-input nodes should be filtered out?
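As a rough illustration of the empty-list behavior, the sketch below calls a few standard-library reductions on an empty input list. These are stand-ins chosen for illustration, not the library's own aggregation functions, and the helper `try_aggregate` is hypothetical.

```python
import statistics


def try_aggregate(fn, values):
    """Return fn(values), or the exception class name if the call raises."""
    try:
        return fn(values)
    except Exception as exc:
        return type(exc).__name__


# A node with no incoming connections would hand its aggregation an empty list.
no_inputs = []
results = {
    name: try_aggregate(fn, no_inputs)
    for name, fn in [("sum", sum), ("max", max), ("min", min), ("mean", statistics.mean)]
}
print(results)
```

A sum quietly returns 0 for an empty list, while max, min and mean raise, so a no-input node can either pass unnoticed or crash depending on which aggregation it happens to use.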

ntraft · 2022-11-21T05:04:24Z

I have read the original NEAT code (C++, from Stanley's thesis, 2001), and indeed have found this note before disabling a connection:

//We need to make sure that another gene connects out of the in-node
//Because if not a section of network will break off and become isolated

That's exactly what's happening here. Also, there is no "delete node" or "delete connection" mutation in the original work; there is only "toggle enable". So in the original NEAT there cannot be these dangling nodes without inputs.

I will likely code up a fix for this in the next week or so, but have bigger bugs to unravel first.

markste-in · 2023-08-09T09:23:13Z

maybe try out my 'fix'

It removes all the dangling nodes on every iteration. I ran a few tests and the results are somewhat more stable.

https://github.com/markste-in/neat-python/tree/remove_dangling_nodes
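For readers who want the general idea without opening the branch, here is a minimal sketch of one way to prune dangling nodes from a plain connection list. It is an assumption-laden illustration, not the code in the linked branch: the function name `prune_dangling` and the `(source, target)` tuple representation are hypothetical. Pruning repeats until nothing changes, because removing one dangling node can leave its successor without inputs.

```python
def prune_dangling(inputs, outputs, connections):
    """Drop connections that originate at non-input nodes with no incoming edges.

    Hypothetical helper: connections is a list of (source, target) pairs.
    Output nodes are never treated as dangling.
    """
    inputs, outputs = set(inputs), set(outputs)
    kept = list(connections)
    while True:
        has_incoming = {b for (_, b) in kept}
        dangling = {
            a for (a, _) in kept
            if a not in inputs and a not in outputs and a not in has_incoming
        }
        if not dangling:
            return kept
        # Removing these edges may expose new dangling nodes, so loop again.
        kept = [(a, b) for (a, b) in kept if a not in dangling]


# Node 3 has no inputs; once it is removed, node 2 has none either.
pruned = prune_dangling([-1], [0], [(-1, 1), (1, 0), (3, 2), (2, 0)])
print(pruned)
```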

Finebouche · 2024-05-20T14:17:05Z

I proposed a fix in #282 and added some code so that the computation doesn't take any dangling nodes into account.

This was referenced Nov 20, 2022

Neat disconnected #163

Open

NEAT returning zeros #165

Open

ntraft added a commit to ntraft/neat-python that referenced this issue Nov 20, 2022

Fix bug CodeReclaimers#255 and add corresponding unit tests.

2e16473

allinduetime mentioned this issue Jun 5, 2024

Many dangling nodes without a connection to an output are created / left -> network breaks the longer you run it #250

Open
