MegaGlest Forum

MegaGlest => Bug reports => Closed bug reports => Topic started by: titi on 5 August 2013, 23:24:35

Title: [fixed]3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: titi on 5 August 2013, 23:24:35
Today I had the third OOS with a human playing egypt. So far i never saw this with a egypt AI player, so it must be something that humans do. Its always the egypt faction which gets OOS.
Sadly I cannot give more details yet , just the fact that it looks like this bug sometimes causes crashes too ( https://forum.megaglest.org/index.php?topic=9165.0 ).

I know I need to get more details, I just want to mention that there still must be a OOS :-( .
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: filux on 6 August 2013, 04:38:22
From my observations that it is quite rare and regardless of faction.
The only thing repeated in the error messages is: problem with "isCancelPreMorphCommand".
For a very long time I haven't seen any OOS which aren't related with this command.

e.g. https://forum.megaglest.org/index.php?topic=9110.0 (https://forum.megaglest.org/index.php?topic=9110.0)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: titi on 7 August 2013, 22:20:40
YOu are right   :scared: its not only egypt, it happens with other factions too. I had one more OOS today without egypt. I hope I will find the time on weekend to let the computers play vs each other. Only problem is we still have a bug in the OOS sync logging itself :-/ .

Update: and one more OOS game :-(

Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: softcoder on 8 August 2013, 19:34:49
Ok please setup host and clients with world synch logging then provide the logs after getting OOS. MAke sure to quit and restart the game if a previous game plays successful.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: titi on 11 August 2013, 19:51:10
I tried a lot now, but I had no luck. Currently its logging much too much and this takes so much time that the client ( windows laptop ) always dropps out, because it falls back too much.

Is there any way to reduce the the things we log ?
Currently I get  for exmaple 4,6 GB for just some minutes of playing ( when second attack wave of playing AIs starts ).

Like this I cannot log the error,  :(

update:
A different idea how to find/log out of sync problems:  What about only calculating the CRCs for states on server and client instead of logging all. These CRCs are send to server and client. Once they are different we start to log on servers and clients.
By this we only log if things are different.
A problem is that we don't see which "commands" caused these problems,  If those are needed, Maybe a cyclic logging to memory for the last 50 frames or so helps. This is written to the logfile once we see that there is trouble.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: filux on 23 August 2013, 19:30:03
Another to collection.
I received it close to the end of the game hosted by Carla.

Code: [Select]
[] *ERROR* In [commander.cpp::buildCommand Line: 997]
Can not find command type for network command = [networkCommandType = 0
unitId = 500448
commandTypeId = 3
positionX = 73
positionY = 176
unitTypeId = 18
targetId = 0
wantQueue= 0
fromFactionIndex = 0
unitFactionUnitCount = 101
unitFactionIndex = 5, commandStateType = 0, commandStateValue = -1, unitCommandGroupId = -1]
Commands:  id = 0 id = 1 id = 2
for unit = 500448
[axe_thrower]
[
HP: 0/700
Armor: 20+5 (leather)
Sight: 12
Kills: 1]
actual local factionIndex = 5.
Unit Type Info:
[Unit Name: [axe_thrower] id = 2 maxHp = 700 hpRegeneration = 0 maxEp = 0 epRegeneration = 0 maxUnitCount = 0 fields index = 0 value = 1 fields index = 1 value = 0 properties index = 0 value = 0 properties index = 1 value = 0 armor = 20 armorType Name: [leather id = 1 light = 0 lightColor = x [0] y [0] z [0] multiSelect = 1 sight = 12 size = 1 height = 2 rotatedBuildPos = 0.0000000000000000 rotationAllowed = 1 skillTypes: [5] i = 0 Stop i = 1 Move i = 2 Attack i = 3 Morph i = 4 Die commandTypes: [3] i = 0 Stop i = 1 Move i = 2 Attack storedResources: [0] levels: [0] meetingPoint = 0 countInVictoryConditions = 0]
Network unit type:
[thunderbird]
isCancelPreMorphCommand: 0
Game out of synch.
[] *ERROR* In [game.cpp::update Line: 2671] Error [Error [#3]: Game is out of sync, please check log files for details.
Stack Trace:
./megaglest:Shared::Platform::megaglest_runtime_error::megaglest_runtime_error(std::string const&, bool)address [0xa20222] line: 237
./megaglest:Glest::Game::Commander::buildCommand(Glest::Game::NetworkCommand const*) constaddress [0x583851] line: 1014
./megaglest:Glest::Game::Commander::giveNetworkCommand(Glest::Game::NetworkCommand*) constaddress [0x585d6e] line: 863
./megaglest:Glest::Game::Commander::updateNetwork(Glest::Game::Game*)address [0x589acb] line: 548
./megaglest:Glest::Game::Game::update()address [0x5c98e0] line: 2147
./megaglest:Glest::Game::Program::loopWorker()address [0x6b9c71] line: 459
./megaglest:Glest::Game::glestMain(int, char**)address [0x6a9510] line: 5379
./megaglest:Glest::Game::glestMainSEHWrapper(int, char**)address [0x6ac226] line: 5626
/lib64/libc.so.6:__libc_start_main()address [0x7f5912199455] line: 0
./megaglest() [0x515a89]address [0x515a89]
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: tomreyn on 28 August 2013, 23:30:04
I had this happen again today. I was hosting on Linux on atibox, several clients with mixed Linux / Windows were connected, and we went OOS. I had not set any special settings, notably I had not activated techtree translation. What I did have set was Desert4, so a tileset using the new tileset features.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: titi on 19 September 2013, 08:48:27
Current state: Softcoder added a CRC check in the code now who instantly finds any kind of difference between client and server. Also it still does not work perfect it already helped to find another bug that caused OOS before.
Whats needed now are more play tests ( crossplatform windows/linux ) so we know where we stand. Yesterday we already had a bigger game and no problems showed up also I used egypt faction which typically shows this Problem very good . So if you have time, get the executable from here:( https://forum.megaglest.org/index.php?topic=8753.0 ) or compile for yourself and help testing.

Then report back all you have ! Success or no success, all is needed ( just regarding Out Of Sync of course )
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: tomreyn on 19 September 2013, 13:39:27
We played two more full (start to end) x-platform games last night on r4547 (and higher) with Egypt and couldn't reproduce OOS either.

One of these games involved using the fixed (r4548) CRC checks. We did not run into any mismatch (great!) but generally the checks are too slow to keep them enabled for a full game (especially but not only Windows users will time out due to the computational overhead), it makes the game really hard to play and impossible to enjoy. But then, we're just testing - and don't get me wrong: it's great that we have this optional feature available. ;)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: softcoder on 24 September 2013, 18:44:03
Let me know if this is reproducible.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: tomreyn on 24 September 2013, 23:56:28
Lyra (Windows 7-64, client) and I (Ubuntu 13.04 x86_64) played three games with all Egypt (2 humans vs. 2 Ultra CPU, 4.9x) and lots of splash. The first two games there was no CRC checksum mismatch, but there was on the last game. I've uploaded the combined logs (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20130925_1_1.tar.xz). It happened many minutes into the game, and lyra got disconnected with an error message (the server only witnessed what looked like a voluntary disconnect) and I stopped the game shortly after.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt (more details needed)
Post by: titi on 25 September 2013, 00:12:34
and one more game of Lyra and me which ended with a CRC:

Here are the logs:
Me host ( linux ): http://titi.megaglest.org/server.7z
Lyra client WIndows:  http://www.sendspace.com/file/ejkwo5
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 25 September 2013, 02:38:12
Lyra and I just played again, r4571 this time, but with a similar setup as before. Her Windows client lagged quite a bit again due to checksumming and Intel graphics, and we ran into a CRC mismatch. My (server) logs are uploaded here (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20130925_2.tar.xz). Lyra's (client) logs are uploaded there (http://www.sendspace.com/file/o850fv). Her cmd said this (http://pastebin.com/raw.php?i=2zEJsL4w), mine said that:

Code: [Select]
megaglest v3.8.0-dev
Compiled using: GNUC: 40703 [64bit] on: Sep 25 2013 03:00:50 platform: Linux-X64 endianness: little
SVN: [Rev: 4571M] - using STREFLOP [SSE] - [no-denormals]
radeon: Acquired access to Hyper-Z.
[] *ERROR* SOCKET WRITE TIMEOUT In [/home/user1/SCM/megaglest-trunk/source/shared_lib/sources/platform/posix/socket.cpp::isWritable Line: 1756] i = 0 sock = 16
[] *ERROR* SOCKET WRITE TIMEOUT In [/home/user1/SCM/megaglest-trunk/source/shared_lib/sources/platform/posix/socket.cpp::isWritable Line: 1756] i = 0 sock = 16
[] *ERROR* SOCKET WRITE TIMEOUT In [/home/user1/SCM/megaglest-trunk/source/shared_lib/sources/platform/posix/socket.cpp::isWritable Line: 1756] i = 0 sock = 16
*Note: Monitoring Network CRC NORMAL synchronization...
*TESTING*: START Waiting for lagging client playerIndex = 0 [lyra] clientLagCount = 11.000000 [0.000000]
*TESTING*: FINISHED Waiting for lagging client playerIndex = 0 [lyra]
*TESTING*: START Waiting for lagging client playerIndex = 0 [lyra] clientLagCount = 11.000000 [0.000000]
*TESTING*: FINISHED Waiting for lagging client playerIndex = 0 [lyra]
*TESTING*: START Waiting for lagging client playerIndex = 0 [lyra] clientLagCount = 11.000000 [0.000000]
*TESTING*: FINISHED Waiting for lagging client playerIndex = 0 [lyra]
*TESTING*: START Waiting for lagging client playerIndex = 0 [lyra] clientLagCount = 11.000000 [0.000000]
*TESTING*: FINISHED Waiting for lagging client playerIndex = 0 [lyra]
*TESTING*: START Waiting for lagging client playerIndex = 0 [lyra] clientLagCount = 11.000000 [0.000000]
*TESTING*: FINISHED Waiting for lagging client playerIndex = 0 [lyra]
*TESTING*: START Waiting for lagging client playerIndex = 0 [lyra] clientLagCount = 11.000000 [0.000000]
*TESTING*: FINISHED Waiting for lagging client playerIndex = 0 [lyra]
*TESTING*: START Waiting for lagging client playerIndex = 0 [lyra] clientLagCount = 11.000000 [0.000000]
*TESTING*: FINISHED Waiting for lagging client playerIndex = 0 [lyra]
*TESTING*: START Waiting for lagging client playerIndex = 0 [lyra] clientLagCount = 11.000000 [0.000000]
*TESTING*: FINISHED Waiting for lagging client playerIndex = 0 [lyra]
*TESTING*: START Waiting for lagging client playerIndex = 0 [lyra] clientLagCount = 11.000000 [0.000000]
*TESTING*: FINISHED Waiting for lagging client playerIndex = 0 [lyra]
Check save world CRC to log. isNetworkGame = 1 fileSuffix = _faction_0
Save to log debugCRCWorldLogFile = /home/user1/.megaglest/debugCRCWorld.log_faction_0
Check save world CRC to log. isNetworkGame = 1 fileSuffix = _server
Save to log debugCRCWorldLogFile = /home/user1/.megaglest/debugCRCWorld.log_server
Check save world CRC to log. isNetworkGame = 1 fileSuffix = _client
Save to log debugCRCWorldLogFile = /home/user1/.megaglest/debugCRCWorld.log_client
Check save world CRC to log. isNetworkGame = 1 fileSuffix = _client
Save to log debugCRCWorldLogFile = /home/user1/.megaglest/debugCRCWorld.log_client
radeon: Released access to Hyper-Z.

Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: titi on 25 September 2013, 19:58:16
rev 4572:

2 player game:

lyra windows client: http://www.sendspace.com/file/4386um

titi linux 64 server: http://titi.megaglest.org/server2.7z
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 25 September 2013, 23:17:31
Another game on r4572 with CRC mismatch. I was hosting (atibox), Titi was the Windows client, Atze (Linux 64) joined us.

Server logs (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20130925_4.tar.xz)
Titi's client logs (http://titi.megaglest.org/debugCRCWorld.log2_client.7z)
Atzes' client logs (http://s000.tinyupload.com/?file_id=09361330986063960499)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: titi on 26 September 2013, 22:21:05
rev 4574:
4 player cross platform game game:
we only have logs of 2 clients and the server:

Atze server linux: http://s000.tinyupload.com/?file_id=02890039838876336676
titi client windows:  http://titi.megaglest.org/debugCRCWorld.log_win_client.7z
lyra linux client: http://www.sendspace.com/file/3y8f7z
derek windows: log is missing, but got CRC in the same moment as I got it.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: softcoder on 27 September 2013, 04:11:48
Try svn as I have committed additional fixes.

Thanks
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 27 September 2013, 04:22:44
Thanks.

Lyra (W-7-64 client) and I (atibox) just tried r4575, but ran into another checksum mismatch. Rather quickly actually, but I set my game to German language, so this could be why.

My (server) logs (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20130927_1.tar.xz)

Lyra's logs (http://www.sendspace.com/file/75q745), Lyra's terminal output (http://pastebin.com/raw.php?i=i0xT8fXz)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: ctz on 27 September 2013, 05:41:27
A CRC error occurred on r4576 with tomreyn as the host, I (lyra) the client, same machines as last time (above post).
This error happened with the same settings as last time and a similar situation, with tomreyn walking a mummy to the opposing teams' area to provoke them and leading them around to fight each other.

tomreyn's server logs: http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20130927_2.tar.xz
my client log: http://www.sendspace.com/file/os3yni and terminal output: http://pastebin.com/raw.php?i=LvY27g2A
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 27 September 2013, 23:47:18
Today, Titi (Windows 7-64, I think) was running the client which connected to my (atibox) server, both running r4577.
It took us quite a while to go OOS, pleaying a very similar game (same settings) to the one lyra / ctz and I played yesterday.

My log (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20130928_1.tar.xz)
Titi's log (http://titi.megaglest.org/debugCRCWorld.log_client.7z)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: titi on 29 September 2013, 23:19:23
Lyra me and tomreyn teste rev 4583:

You nearly instantly get a CRC error now if  someone uses attack particles on the map. Lyra attacled his own worker with the Egypt Priest. 1 second later we got a CRC error.
We tried it again with same result.

Here are the logfiles for our first try:

<lyra`> http://www.sendspace.com/file/4tjivo    ( linux client )
<tomreyn> http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20130930_1.tar.xz  (  linux server )
<titi_linux> http://titi.megaglest.org/debugCRCWorld.log_client_windows.7z ( windows client )
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: ctz on 29 September 2013, 23:29:15
Correction to titi's report above:
I attacked my spearthrower with the priest, and had a spearman in the splash radius.  Then I moved the priest to stop the attack, and a few seconds later titi got disconnected by CRC error.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 2 October 2013, 23:06:25
Nig and I just had a CRC mismatch on r4589, windows 7-64 host (nig), linux-64 client (tomreyn on atibox).

client logs: http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20131003_1.tar.xz (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20131003_1.tar.xz)
server logs: http://www.fastshare.org/download/lo..7z (http://www.fastshare.org/download/lo..7z)

Code: (Client terminal output) [Select]
X**X Frame: 7840 faction: 1 local CRC: 4081333890 Remote CRC: 180790778
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 3 October 2013, 03:50:43
Now.. this may actually be fixed on r4592!

We played 2 cross-platform games (both hosted on atibox - it would be good to also host some on Windows), one with all Egypt and loads of splash, the other with mixed factions (but still Megapack) and plenty of fights (also including splash). Lyra lagged got disconnected both times even though I had the "wait for lagging player" option on, and there were no warnings ever, it just said she had disconnected.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 4 October 2013, 03:31:09
This has now less to do with the original topic of this thread (not about Egypt), but since the underlying issue is still related, I'll keep it here.

Lyra (Windows 64 bit client) and I (atibox hosting) were playing a game against each other on r4596. I think we never did this test scenario since checksumming was introduced.

Lyra was with a 1.5 Ultra bot, I was alone on the other side (and team), and Lyra experienced a checksum mismatch during the first larger attack. She also said she experienced a few pauses (followed by rapid catching-up) during this game, likely due to network interference.

My (server) log file (and terminal output) (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20131003_5.tar.xz)

Lyra's terminal output (http://pastebin.com/raw.php?i=7RS8nXLn)
Lyra's log file (http://www.sendspace.com/file/8azrlg)

Edit: Added revision #
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 4 October 2013, 22:46:40
We played another game on r4596 today. I was hosting on atibox, Filux (Linux 64 bit), Jammy (Windows 64-bit) were clients. I was on one side of the map with a CPU Mega (on the same team) and  Filux and Jammy formed the second team on the other side of the map. Roughly 10 minutes into the game, the game reported for both Filux and me that Filux (the Linux -64 system, so same platform!) had run into a checksum mismatch. Right at this time both Filux, and less of a second later, Jammy, got disconnected off the server. Jammy had mentioned that the game was sluggish for him from the very start.

My logs (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20131003_6.tar.xz)
Jammy's logs (https://www.dropbox.com/s/y42v1c9xwc0scq4/debugCRCWorld.7z)
Filux's screenshot (http://i.imgur.com/bm4CQEq.png) and logs (http://s000.tinyupload.com/index.php?file_id=36221391588615916660)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 4 October 2013, 23:16:56
Now we did the same game setup again, but just Filux vs me (hosting), no Jammy, no CPU. And we got a mismatch:

Code: [Select]
<tomreyn> filux was reported to have disconnected the very first moment our fighters met
<tomreyn> i wonder whether we saw the same  series of events there, filux. what i saw / did was this: i had two fakirs who discovered that you were building a new main building on the top left of the map.
<tomreyn> initially only one of the fakirs attacked this structure which your worker was just working on.
<filux> yes, and exactly in the disconnection moment I started another build on the bottom too
<tomreyn> then i made both fakirs attack your worker directly
<tomreyn> where i had two more units lurking
<filux> in the first disconnection I produced some workers
<filux> then it can be related with main norsemen building
<tomreyn> if you were building very close to the bottom gold spot then my units there should have seen yours
<filux> or fighting :)

My logs (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20131005_2.tar.xz)
Filux's logs
 (http://s000.tinyupload.com/index.php?file_id=84929853734172017778)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 8 October 2013, 20:21:39
r4598 today, I was hosting on atibox, MuWuM (Windows 7-64) was client, and had a CPU ultra team mate. Around the middle of this Prax game, he ran into a CRC mismatch during the second CPU attack wave (neither MuWuM nor I had "visited" the other side of this Conflict map, yet).

Linux server logs (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20131008_1.tar.xz)
Windows client logs (http://muwum.org/glest/debugCRCWorld.7z)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 8 October 2013, 20:55:44
Another one, same setup as before, just the Windows client was Nig's (again W7-64) this time.

Linux server logs (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20131008_2.tar.xz)
Windows client log (http://www.fastshare.org/download/debugCRCWorl4..7z)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 10 October 2013, 18:41:04
r4601, atibox hosting, Carolinux' Linux system (not sure which architecture) was the client, he ran into a checksum mismatch around mid-game.

Server logs (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20131010_1.tar.xz)
Client logs (http://fs03n1.sendspace.com/dl/45c9ba77bd14b7144054a7b6f40505d0/5256f34c7d85e994/rzo4he/debugCRCWorld.log_client.tar.lzma)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: softcoder on 11 October 2013, 01:44:05
The link to client logs is not working (404)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 11 October 2013, 06:05:57
Hmm, that's right. Two problems there:
1. I provided the wrong URL, the correct one is http://www.sendspace.com/file/rzo4he
2. At the correct URL it doesn't work either: "The file has been deleted by the uploader and it cannot be restored. Please contact the sender and ask them to upload the file again."

We played another test game tonight, though.
I (atibox) was hosting, Lyra (linux-64) and Herbstvisage (Windows 7-64) were clients.

Tomreyn's server logs (http://tomreyn.megaglest.org/debugCRCWorld_tomreyn_20131011_1.tar.xz) (includes terminal output)
Lyra's client logs (http://tomreyn.megaglest.org/debugCRCWorld_lyra_20131011_1.xz) (terminal output (http://pastebin.com/raw.php?i=MpwAGa9z))
Herbstvisage's client logs (http://tomreyn.megaglest.org/debugCRCWorld_herbstvisage_20131011_1.zip)

The checksum mismatch happened about mid-game on this player vs player game on a 3vs3 map. I was playing against the two, there was no CPU.
Lyra recieved a message that she ran into a CRC mismatch. Herbstvisage said he "also had the message" (I'm not sure whether he got a report that he ran into the mismatch or that Lyra did). For the server, a checksum mismatch was reported but I forgot for which player. Currently this is not written to the terminal (might be good to change this) but just reported on screen, so I'm afraid I cannot tell.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 12 October 2013, 03:07:57
More games, more logs: We played two games and ran into checksum mismatches both times, both same and cross platform.

We, that's:
tomreyn (atibox) hosting
Lyra (Linux x86_64) - client
Jammy (Linux x86_64) - client
Herbstvisage (Windows 7-64) - client

Game 1:
Game 2:
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: MuwuM on 16 October 2013, 20:37:54
r4606 today, tomreyn was hosting on atibox, I (MuwuM, Windows 7-64) was client, we where both in one Team and had 2 CPU enemies. At the first CPU attack wave, I ran into a CRC mismatch (neither tomreyn nor I had "visited" the other side of this map, yet).

CRC debug files:

tomreyn (atibox): http://tomreyn.megaglest.org/debugcrcworld_tomreyn_4696_20131015.tar.xz (http://tomreyn.megaglest.org/debugcrcworld_tomreyn_4696_20131015.tar.xz)

MuwuM (win7 64 bit): http://glest.muwum.org/debugCRCWorld-2013-10-16.7z (http://glest.muwum.org/debugCRCWorld-2013-10-16.7z)
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: softcoder on 17 October 2013, 02:52:24
Updated svn with more info please try again and post the logs
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 17 October 2013, 04:48:56
Lyra and I just tested on 4609, I was hosting (atibox), she was on Windows 7-64, and ran into a CRC mismatch quite early.
Server logs (http://tomreyn.megaglest.org/debugcrcworld_tomreyn_4696_20131017.tar.xz)
Client logs (http://www.sendspace.com/file/ucqrw1) (terminal output (http://pastebin.com/raw.php?i=NkK6E67p))

Thanks!
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 17 October 2013, 20:04:34
This time, I tested with Derek on r4610.
Server logs (http://tomreyn.megaglest.org/debugcrcworld_tomreyn_20131017_2.tar.xz)
Client logs (shall be added by Derek when he's finished uploading.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 18 October 2013, 14:09:09
Lyra and I tried r4612 today. Her Windows client (using the cudgellady snapshots) crashed when connecting to my host (running a full rebuild on atibox). We tried this once, I quit and restarted, and we tried again with the same effect.

Lyra's terminal outputs:
http://pastebin.com/raw.php?i=haz5NbNj
http://pastebin.com/raw.php?i=UynFEw48
http://pastebin.com/raw.php?i=07fWk4wM

Each time the Windows client crashed, two DMP files were written. Here's the stack traces for these:
http://megaglest.nopaste.dk/p63026
http://megaglest.nopaste.dk/p63027
http://megaglest.nopaste.dk/p63028
http://megaglest.nopaste.dk/p63029

Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: softcoder on 18 October 2013, 15:47:04
Should be fixed in svn now, sorry about that
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 19 October 2013, 18:45:13
r4616: CRC mismatch with four players. Linux-64 host (atibox), Lyra on Linux-64, Nig and CyberneticPony on Windows-64.

All logs: http://tomreyn.megaglest.org/debugcrcworld_tomreyn_20131019_1.tar.xz (http://tomreyn.megaglest.org/debugcrcworld_tomreyn_20131019_1.tar.xz)

Before this one, I played two x-platform games with a total of 2 human players each, and there were no problems.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: tomreyn on 20 October 2013, 17:16:53
I played a couple r4618 games today, some with Typedef (Linux 3.x-64), some with Nig (Windows 7-64). There were no checksum mismatches. However, we had serious problems with FPS dropping remarkably after the first third/half of the game. Both ends were then constantly playing at 1-3 FPS, i.e. unplayable. This happens whether or not we are using CRC checksumming, with different techtrees, different maps, different AI settings, different tilesets, and whether Nig or I were hosting.

Yesterday I took part in three games where this wasn't a problem, and where no mismatches occured either. But those were only 2 player games, like today. So it may be too early to tell whether everything is fixed. And large games are not possible with the current slowdowns.
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: titi on 29 October 2013, 00:55:21
I switched all CRC/logging off now ( I was the server ) and I played 2 games with Jammyjamman ( rev

EnableNetworkGameSynchChecks=0
EnableNetworkGameSynchMonitor=0

It was very smooth and performat (rev 4641)
We just need to play a real big 8 player game now. Then we set this as default and we are nearly ready for a release. Just some translation needed. ( and tiny data changes which I don't want to do now )
 
Title: Re: 3.8.0-beta1 OOS when playing windows/linux and using egypt
Post by: titi on 16 November 2013, 14:18:19
Yeah , I can mark this as fixed !!!  :O :O! :| ::) :P :-* :-* :-*