Page Index Toggle Pages: 1 2 [3]  Send TopicPrint
Hot Topic (More than 35 Replies) It Must Be Getting Bad (Read 20371 times)
Frank
Puppy Farmer
****
Offline


I Love Drama!

Posts: 1429
Joined: Jan 14th, 2016
Re: It Must Be Getting Bad
Reply #50 - Apr 9th, 2016 at 2:52pm
Print Post  
Meursault wrote on Apr 9th, 2016 at 2:02pm:
Interesting, would you mind showing me the last few lines of your traceroute and explaining how you're getting Quest from it?

Looked like Qwest to me also when I ran a few traces right after the move.  The last hop before I hit their server was a Qwest address.
I've lost the host IP they provided and I'm not really interested in looking it up, so I can't show you my traceroute.  You might be using the old host address?  It does not look familiar to me, but then memory might not be serving.
  

No, let me be Frank.

Digimonk wrote on Dec 8th, 2016 at 12:39pm:
I've had multiple RoSS rot just from doing the explorer points in Sands across multiple lives since it is exclusive.


Smart players know how to use buyback.

Digimonk wrote on Dec 22nd, 2016 at 1:58pm:
I will not chone.

I did what I did intentionally and while my primary purpose was not to annoy the other Vaulties, I acknowledge that it was a side effect.


Smart people don't elicit "side effects."  They understand in advance the consequences of their actions.
Back to top
 
IP Logged
 
kum-gulp
VoD Slasher
*****
Offline


I Love Tired Vault Memes!

Posts: 1812
Joined: Jul 16th, 2010
Re: It Must Be Getting Bad
Reply #51 - Apr 9th, 2016 at 3:26pm
Print Post  
I can't get a connection at all now, not even to log in to check if it's playable yet. I'm just about out of patience.
  
Back to top
 
IP Logged
 
Flav
Vault Frog
*
Offline


One Frog to Rule them
All!

Posts: 10011
Location: Land of the Frogs
Joined: Aug 29th, 2010
Gender: Male
Re: It Must Be Getting Bad
Reply #52 - Apr 10th, 2016 at 1:17am
Print Post  
Meursault wrote on Apr 9th, 2016 at 2:02pm:
Interesting, would you mind showing me the last few lines of your traceroute and explaining how you're getting Quest from it? I'm seeing
Code
Select All
10  207.88.14.162.ptr.us.xo.net (207.88.14.162)  48.116 ms  53.413 ms  54.832 ms
11  207.88.14.186.ptr.us.xo.net (207.88.14.186)  55.468 ms  58.141 ms  61.545 ms
12  205.197.230.6 (205.197.230.6)  23.541 ms  26.270 ms  31.128 ms
13  border1.pc1-bbnet1.nyj001.pnap.net (216.52.95.9)  23.484 ms border1.pc2-bbnet2.nyj001.pnap.net (216.52.95.73)  26.263 ms  29.349 ms
14  turbine-14.border1.nyj001.pnap.net (70.42.39.234)  33.037 ms  36.248 ms  40.144 ms
 


And a whois on hop 13, the last before Turbine, showed the registrant as "Internap Network Services Corporation".

So what am I doing wrong?


now that's interesting... Do you trace on gls.ddo.com or on an IP ?

Code
Select All
C:\Users\Guillaume>tracert gls.ddo.com

Détermination de l'itinéraire vers gls.ddo.com [198.252.160.23]
avec un maximum de 30 sauts :

  1    <1 ms    <1 ms    <1 ms  <private ip>
  2    16 ms    15 ms    16 ms  <access router ip>
  3    16 ms    16 ms    15 ms  10.123.109.138
  4    17 ms    17 ms    16 ms  ae41-0.niaub201.Aubervilliers.francetelecom.net [193.252.98.161]
  5    19 ms    23 ms    23 ms  81.253.184.182
  6    19 ms    19 ms    19 ms  tengige0-14-0-9.auvtr1.Aubervilliers.opentransit.net [193.251.242.217]
  7    98 ms    98 ms    99 ms  tengige0-6-1-0.nyktr1.NewYork.opentransit.net [193.251.240.202]
  8    95 ms    95 ms    94 ms  nyc2-brdr-01.inet.qwest.net [63.146.27.5]
  9    96 ms    96 ms    96 ms  ewr-cntr-11.inet.qwest.net [205.171.17.2]
 10   105 ms   105 ms   106 ms  206.103.215.50
 11    93 ms    93 ms    92 ms  63.236.3.130
 12     *        *        *     Délai d'attente de la demande dépassé.
 13   105 ms   105 ms   106 ms  198.252.160.23
 14   106 ms   106 ms   105 ms  198.252.160.23

Itinéraire déterminé.

C:\Users\Guillaume> 



ARIN sais that 63.236.3.130 belongs to Qwest.
hop 12 is the (in)famous Turbine border router that spend most of it's time not answering to traceroutes and when answering gives an ip in the 10.0.0.0/4 private range.

There is no reasons why I would move from France Telecom backbone ( OpenTransit ) to Qwest if the Datacenter belongs to PNAP. If it was the case I'd move to PNAP backbone at hop 8.

if you go by PNAP up to the (in)famous Turbine border router it means that they have done the two circuit trick for DDO too ( which is logical... it wouldn't be tied to a game, but to everything in that datacenter ), and that each circuit is using a different carrier ( one PNAP, the other Qwest ).
  

Yes my avatar is an Hermine eating a Greenland Lemming for brunch.
Back to top
 
IP Logged
 
Meursault
Horoluth Raider
****
Offline


I Love Freedom of Speech

Posts: 2410
Location: Hartford, CT; USA
Joined: Aug 22nd, 2013
Gender: Male
Re: It Must Be Getting Bad
Reply #53 - Apr 10th, 2016 at 1:50pm
Print Post  
Flav wrote on Apr 10th, 2016 at 1:17am:
now that's interesting... Do you trace on gls.ddo.com or on an IP ?

I traced on gls.ddo.com, it resolves to 198.252.160.23, just like yours.

Code
Select All
date && traceroute gls.ddo.com
Sun Apr 10 13:22:32 EDT 2016
traceroute to gls.ddo.com (198.252.160.23), 30 hops max, 60 byte packets
 1  XXX  0.285 ms  0.384 ms  0.483 ms
 2  XXX  13.739 ms  16.697 ms  20.580 ms
 3  XXX  26.464 ms  30.386 ms  31.347 ms
 4  XXX   32.551 ms  35.994 ms  39.430 ms
 5  ae6---0.cor01.nycm.ny.frontiernet.net (74.40.5.209)  53.096 ms  56.280 ms  59.446 ms
 6  ae0---0.cor02.nycm.ny.frontiernet.net (74.40.4.66)  63.373 ms  64.133 ms  67.046 ms
 7  ae12---0.cor02.asbn.va.frontiernet.net (74.40.5.238)  69.936 ms  23.622 ms  23.090 ms
 8  ae1---0.cbr01.asbn.va.frontiernet.net (74.40.2.178)  25.854 ms  28.771 ms  32.158 ms
 9  74.40.26.179 (74.40.26.179)  36.649 ms  39.976 ms  42.869 ms
10  207.88.14.162.ptr.us.xo.net (207.88.14.162)  46.143 ms  52.021 ms  53.944 ms
11  207.88.14.186.ptr.us.xo.net (207.88.14.186)  55.766 ms  58.442 ms  61.860 ms
12  205.197.230.6 (205.197.230.6)  25.422 ms  27.561 ms  30.509 ms
13  border1.pc1-bbnet1.nyj001.pnap.net (216.52.95.9)  22.745 ms border1.pc2-bbnet2.nyj001.pnap.net (216.52.95.73)  25.554 ms  28.967 ms
14  turbine-14.border1.nyj001.pnap.net (70.42.39.234)  32.417 ms  35.349 ms  42.481 ms
15  * * *
16  198.252.160.23 (198.252.160.23)  60.166 ms  60.396 ms  59.451 ms
17  198.252.160.23 (198.252.160.23)  62.858 ms  23.739 ms  26.624 ms
 



So this strikes me as odd. I can trace to both the IP and to the name, and get identical results. I hit the web and ran some public trace tools. Pingdom gets the same results I do
http://tools.pingdom.com/ping/?target=gls.ddo.com&o=1&id=8789498

but Network Tools goes through Qwest like you do
http://network-tools.com/default.asp?prog=trace&host=gls.ddo.com

All 5 (yours, my IP, my name, and the 2 web tools tracing by name) resolve to the same IP for the final destination, but the last steps before the Turbine border are different.

I might expect this from a CDN, but these are interactive servers, there can't be 2 copies, right?
  

Turdbin, keep changing the DDO rules, because McDonalds sold over 200 billion hamburgers by changing the recipe for their Special Sauce every couple of months to keep interest up.
Back to top
 
IP Logged
 
m4lacka
Horoluth Raider
****
Offline


Coitus ergo sum.

Posts: 2494
Joined: Mar 19th, 2014
Re: It Must Be Getting Bad
Reply #54 - Apr 10th, 2016 at 2:25pm
Print Post  
I actually visit wayfinder every 2-3 days and it seems its getting fucking lag everywhere, every single minute.

I guess my first guess was not far off with "datacenter move" being a simple server swapping.

I'm gooooooood! Cool
  

Vaultaccount wrote on Aug 29th, 2016 at 7:06pm:
The most broken epic PL in the game is the ridiculous AC buff from martial PLs. But when you include all the inherent benefits from reincarnation, the thing is above the roof. The benefit from TR was supposed to be minimal, but now it is game breaking. start a new character and you will feel, you will get raped in heroics.

Personally I think lower heroics on new characters are harder than Legendary Elite content on a character with good gear and has all destinies farmed.


Arkat wrote on Apr 2nd, 2014 at 4:14pm:
And someone in another thread called ME delusional recently!

Wow.

Have you EVER gone to Law School Feyn? Have you EVER been asked by a United States Senator who is now the Vice President of the United States to write a paper just for him on a particular topic of Constitutional Law for his use as a Senator? Have you ever been asked to help a very notable Harvard PhD turned Constitutional scholar with his very-well received book on Legal Philosophy and Constitutional Revolutions? Have you ever been offered a job as a Deputy Attorney General BEFORE you even graduated Law School?

No? Then STFU you fucking amateur.


Meat-Head wrote on Jul 29th, 2015 at 12:03am:
Thx. I was semi-waiting because windows 10 will be here soon. but, the crashing is making me want to punch old people.


IMARANGER wrote on Sep 8th, 2014 at 5:40pm:
Pretty sure that the fact that a service can be hacked doesn't make it the "community chest".
Back to top
 
IP Logged
 
Flav
Vault Frog
*
Offline


One Frog to Rule them
All!

Posts: 10011
Location: Land of the Frogs
Joined: Aug 29th, 2010
Gender: Male
Re: It Must Be Getting Bad
Reply #55 - Apr 11th, 2016 at 4:24am
Print Post  
Meursault wrote on Apr 10th, 2016 at 1:50pm:
So this strikes me as odd.


Nothing odd.

Meursault wrote on Apr 10th, 2016 at 1:50pm:
but the last steps before the Turbine border are different.

I might expect this from a CDN, but these are interactive servers, there can't be 2 copies, right?


In the old datacenter they didn't have any choice, the last 2/3 hops were always PNAP because the peering room was not in the same building or Turbines border router couldn't get more than one circuit  ( in peering term ) up towards Internet.

In the new datacenter the border router is the peering router so Turbine can gets several circuits ( in peering term ) towards several ISPs.
Protocols like BGP will then tell the Carriers what to do with the traffic : send it towards one peering link or the other.
We probably can try to dig out more if more people play around with tracert, but I suspect that for some location ( say CONUS ) they use PNAP as the carrier, and for other locations ( say EMEA ) they use Qwest as carrier.

from my point of view it's an improvement over what they had before ( because if they have an issue with one of the carrier they can switch to the other in a matter of seconds ).

Now if only they could get their fingers out of their collective asses and kill the Virtualization ( put real servers instead ) so that the game is back to playable.
( now with the June rumor that might be what they are going to do... getting servers can take time )
  

Yes my avatar is an Hermine eating a Greenland Lemming for brunch.
Back to top
 
IP Logged
 
Asheras
Completionist (i.t.p.)
******
Offline


This is why we can't have
nice things.

Posts: 10230
Location: Ohio
Joined: Jun 9th, 2010
Gender: Male
Re: It Must Be Getting Bad
Reply #56 - Apr 11th, 2016 at 9:21am
Print Post  
Flav wrote on Apr 11th, 2016 at 4:24am:
Now if only they could get their fingers out of their collective asses and kill the Virtualization ( put real servers instead ) so that the game is back to playable.
( now with the June rumor that might be what they are going to do... getting servers can take time )


It isn't really necessary to "kill virtualization".  There is nothing intrinsically better about a hardware solution over a virtual solution.  A poorly designed hardware solution will suck ass.  As will a poorly designed virtual solution.  A well designed hardware solution can be amazing.  As can a well designed virtual solution.  The only difference is that 95% of systems architects understand proper hardware solution design wheraas maybe 15-20% have a handle on proper virtualization design.  The average number of years of experience with hardware solutions is probably 3-4 times more than virtuals.  This lack of available experience often flows down through the whole project team.  Often the idea of preceding a project of this sort with training and education at a certified training center is skipped.  So the risk associated with virtual solutions is higher. 

We moved our apps to virtuals and they have never performed better.  We get almost twice the density we were getting on hardware.  But our hardware was ancient (purchased in 2007/2008).  And we did 6 months of testing the apps starting with functional tests through full stress tests to determine the proper configuration before we did the transition.  And we did a soft cut over where we migrated the customer base over the course of 90 days vs. a hard cut like Turbine did.  So we could adjust as actual user load started increasing.  Just in case our testing missed anything.  (Real users do shit you can't predict 100%.  No matter how good your test scripts are.)

This isn't necessarily a decision error in hardware vs virtual.  I believe it to be more of an execution error.

Regardless, they need to do something.  The lag is still killing raids.  I don't have much problem in 6 man content, but geez.  The raids are annoying as hell.
  
Back to top
 
IP Logged
 
archon
Korthos Resident
*
Offline


Hugs! Not Drugs!

Posts: 42
Joined: Mar 29th, 2012
Re: It Must Be Getting Bad
Reply #57 - Apr 11th, 2016 at 1:08pm
Print Post  
This data center move is like the U.S. healthcare web rollout disaster.  Only thing is, there ain't no hot google engineer gonna show up to fix this shit.
  
Back to top
 
IP Logged
 
Flav
Vault Frog
*
Offline


One Frog to Rule them
All!

Posts: 10011
Location: Land of the Frogs
Joined: Aug 29th, 2010
Gender: Male
Re: It Must Be Getting Bad
Reply #58 - Apr 11th, 2016 at 1:37pm
Print Post  
Asheras wrote on Apr 11th, 2016 at 9:21am:
There is nothing intrinsically better about a hardware solution over a virtual solution.  A poorly designed hardware solution will suck ass.  As will a poorly designed virtual solution.  A well designed hardware solution can be amazing.  As can a well designed virtual solution.  The only difference is that 95% of systems architects understand proper hardware solution design wheraas maybe 15-20% have a handle on proper virtualization design.


I will agree with you... in general terms.

In DDO Case it was just stupid to move towards virtualization. The software was is designed for that, and the architecture is probably not designed either.
It's probably would have been cheaper and less damaging to just replace the hardware and be done with it during the move instead of going to virtualized stuff.

  

Yes my avatar is an Hermine eating a Greenland Lemming for brunch.
Back to top
 
IP Logged
 
noamineo
Completionist (i.t.p.)
******
Offline


All men fear time, but
time fears the pyramids

Posts: 9183
Location: Titan
Joined: Jul 2nd, 2014
Re: It Must Be Getting Bad
Reply #59 - Apr 11th, 2016 at 3:15pm
Print Post  
One wonders why more of these things aren't tested beforehand. You know, with things like specialized servers, just for testing.

"Hey, what if we moved Lamania over to a VM, then had everyone get on it and see how it works?" - Said no one, apparently.
  

I'll never understand the propensity of people to brag about being good at a video game. Its a toy you play with for fun. The only person who should be proud of you is your mother. If you're 3.
Back to top
 
IP Logged
 
Revaulting
Completionist (i.t.p.)
******
Offline



Posts: 10143
Location: Not in my pants
Joined: Apr 3rd, 2014
Gender: Male
Re: It Must Be Getting Bad
Reply #60 - Apr 11th, 2016 at 8:44pm
Print Post  
noamineo wrote on Apr 11th, 2016 at 3:15pm:
One wonders why more of these things aren't tested beforehand. You know, with things like specialized servers, just for testing.

"Hey, what if we moved Lamania over to a VM, then had everyone get on it and see how it works?" - Said no one, apparently.

We'd wonder that if we didn't know Turbine, sure. Their corporate structure essentially guarantees that they didn't know they were going virtual until the bean counters decided. At which time, it was far too late for testing.
  

Silence is golden, but I only get silver rolls.
Back to top
 
IP Logged
 
Flav
Vault Frog
*
Offline


One Frog to Rule them
All!

Posts: 10011
Location: Land of the Frogs
Joined: Aug 29th, 2010
Gender: Male
Re: It Must Be Getting Bad
Reply #61 - Apr 12th, 2016 at 3:22am
Print Post  
noamineo wrote on Apr 11th, 2016 at 3:15pm:
One wonders why more of these things aren't tested beforehand. You know, with things like specialized servers, just for testing.

"Hey, what if we moved Lamania over to a VM, then had everyone get on it and see how it works?" - Said no one, apparently.


Except that Lamania's load doesn't come near Wayfinder load... so L-Space might have been virtualized for ages and nobody noticed it.
Trying to simulate a real load is a bitch...
Been there done that, crashed the process that was badly coded every morning so that the coder could code a new one in the afternoon, for a week;
Acceptance tests, with a single test system as load, were ok, when I put 6 lives systems process crashed. ( and it wasn't even virtualized )
  

Yes my avatar is an Hermine eating a Greenland Lemming for brunch.
Back to top
 
IP Logged
 
Frank
Puppy Farmer
****
Offline


I Love Drama!

Posts: 1429
Joined: Jan 14th, 2016
Re: It Must Be Getting Bad
Reply #62 - Apr 12th, 2016 at 7:07am
Print Post  
Flav wrote on Apr 12th, 2016 at 3:22am:
Except that Lamania's load doesn't come near Wayfinder load... so L-Space might have been virtualized for ages and nobody noticed it.
Trying to simulate a real load is a bitch...
Been there done that, crashed the process that was badly coded every morning so that the coder could code a new one in the afternoon, for a week;
Acceptance tests, with a single test system as load, were ok, when I put 6 lives systems process crashed. ( and it wasn't even virtualized )

In this case it might have actually done what it was attempting to do.  The player load does not seem to be any part of the issue.
  

No, let me be Frank.

Digimonk wrote on Dec 8th, 2016 at 12:39pm:
I've had multiple RoSS rot just from doing the explorer points in Sands across multiple lives since it is exclusive.


Smart players know how to use buyback.

Digimonk wrote on Dec 22nd, 2016 at 1:58pm:
I will not chone.

I did what I did intentionally and while my primary purpose was not to annoy the other Vaulties, I acknowledge that it was a side effect.


Smart people don't elicit "side effects."  They understand in advance the consequences of their actions.
Back to top
 
IP Logged
 
Meursault
Horoluth Raider
****
Offline


I Love Freedom of Speech

Posts: 2410
Location: Hartford, CT; USA
Joined: Aug 22nd, 2013
Gender: Male
Re: It Must Be Getting Bad
Reply #63 - Apr 12th, 2016 at 7:22am
Print Post  
Flav wrote on Apr 12th, 2016 at 3:22am:
Except that Lamania's load doesn't come near Wayfinder load... so L-Space might have been virtualized for ages and nobody noticed it.
Trying to simulate a real load is a bitch...

Agreed, but if they announced that they were planning a big change and asked people to help out with stress testing the numbers would have been much higher. People are used to bugs found on Lamania going live regardless, so why waste time testing something with a character that is going to get wiped anyway.

And with a VM it is easy to simulate a lower capacity machine - if the live servers are slated to get 1/8 of a 20 GB pipe, 128 GB of RAM, and 32 cores then constrain your Lamania test to a 500MB pipe, 8 GB ram, and 2 cores.
  

Turdbin, keep changing the DDO rules, because McDonalds sold over 200 billion hamburgers by changing the recipe for their Special Sauce every couple of months to keep interest up.
Back to top
 
IP Logged
 
Flav
Vault Frog
*
Offline


One Frog to Rule them
All!

Posts: 10011
Location: Land of the Frogs
Joined: Aug 29th, 2010
Gender: Male
Re: It Must Be Getting Bad
Reply #64 - Apr 12th, 2016 at 8:49am
Print Post  
Frank wrote on Apr 12th, 2016 at 7:07am:
In this case it might have actually done what it was attempting to do.  The player load does not seem to be any part of the issue.


I should have said it another way...
The process that crashed was the communication process between the server ( alarm gathering server ) and the telecom equipment ( that generated the alarms )
with the server connected to a single test telecom equipment, that generated test alarms when needed by the test plan, everything was fine.
with the server connected to 6 telecom equipment, that generate live alarms from a real telecom network, the process crashed in a matter of seconds.
with the server connected back to the single test telecom equipment, and a frustrated flav in control of all the parts, the process crashed in a matter of minutes.

It took one week for the developer that got assigned to me to fix the problem...
at the rate of one flav generated crash in the morning, one new version of the daemon that crashed late in the afternoon.
Once the daemon survived a telecom equipment restart ( and the 100 pages of alarms it generates in such an event )
as well as a flav tinkering with other applications and settings at the same time, it was considered fit for duty.
( that's the best I could come up with a test bed sized single telecom exchange to simulate a real 6 exchange network )

player load is similar... you try the thing with 30 players and everything is fine... you try it with 300 and everything go to hell.
  

Yes my avatar is an Hermine eating a Greenland Lemming for brunch.
Back to top
 
IP Logged
 
Page Index Toggle Pages: 1 2 [3] 
Send TopicPrint