Computer failure. Solved. Now playing with 1070 ti!

124

Comments

  • kyoto kidkyoto kid Posts: 41,851
    ebergerly said:

    It is unfortunate that Daz does not warn people that Iray is very hard on Video cards and really good cooling is a must with Iray or Octaine. Oh by the way I'm a Alienware User myself. I like the cool looking cases.

     

    While it might seem intuitively obvious that rendering/iray is "hard" on video cards, I'm not sure that's necessarily the case.

    Keep in mind that electronic components have a design life, and if you operate them, even continuously, within their "normal" design ratings (like temperature, frequency, etc.) they should last for their design life. A bit like how the human body can operate fine for over 100 years with a continuous temperature of 37C (98.6F). Which, BTW, isn't all that far from the temps some GPU's attain during rendering...

    So, for example, if during rendering your GPU temps get around 50C (which is probably in the "normal" or "continuous" rating), and it renders at that temp continously for a week, it should still last for its design life as long as the temps stay in the normal range. And if that design life is, say, 10 years, it's probable the GPU will be fine for 10 years. Doesn't mean it won't fail for some (other) reason, since everything has an unpredictable/probabilistic chance of failure (called "Mean Time Between Failure", MTBF).

    But whether you render for 1 minute or 1 week or 1 month at that normal temperature it doesn't (or shouldn't) really matter. The GPU's fan cooling is designed for it, and it's thermal protection is designed for it, and the components work fine at that temp.

    So yes, rendering raises the GPU's temperature, and it can be for a long time, but that doesn't mean it's a bad thing. Now, if you somehow exceed the normal range (which isn't easy if all is working as it should), and get near the GPU's "maximum" ratings, the GPU (and other electronic components) have built-in protection to either throttle performance, or shut down the computer so stuff doesn't get damaged. But the way the GPU is designed is to provide enough fan cooling so that this doesn't happen, and the temps should be continuously regulated to stay within the safe, normal, continuous range.

    So you'd need the fan/cooling to fail, and also the thermal protection to fail in order for meltdown. Unless you bought a poorly designed cheapo piece of junk I suppose.

    So if a GPU on a render machine fails it could be for many reasons, unrelated to the fact that it's used for rendering. If you want peace of mind, just monitor your temps while rendering to see if all is well. And if it fails during rendering while the temps are in the normal range, something else is probably going on.

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    ...unfortunately few of us can afford the multi four and even five digit price tags for the cards that are best suited to the task.  So, we have to take other precautions as you mention. Water cooling, yes that will help keep temperatures down but again adds to the complexity and cost (though not as bad as it did six years ago). I have enough room in the case for such a system if I can find an aftermarket one with a backplate.

    I've installed several additional fans in my case to promote even more airflow in support the Titan-X, but the biggest benefit is the 400mm one one on the side panel which draws in fresh air right next to the GPU (and has a filter screen).  Part of the reason I'll never get rid of this case, even if I build an entirely new system as there are so few cases with this feature today (most have that [in my book useless] window on the left panel now).

    So does MSI afterburner work with any brand or just MSI cards?

  • LeatherGryphonLeatherGryphon Posts: 12,085
    hphoenix said:
    kyoto kid said:

    ...indeed, that is how I ended up with my Titan X. The last "new" card I bought was my old 460 when I built my system. I was saving for a 1070 just before the prices shot into the stratosphere.

    Sadly most EVGA 1070s and 1070 Ti's I am seeing at Newegg are still over 500$. There are a couple Ti's for about 490$ but some reviews issues with the cooling fans and no backplate.

    There is one 1070 Hybrid liquid cooled card for 489$

    https://www.newegg.com/Product/Product.aspx?Item=N82E16814487264

    Also comes with a free 550w PSU that could serve as a backup or auxiliary.

    I know absolutely nothing about liquid cooled cards. Do you have to do anything to maintain them at all?

    Just clicked the link, it's listing at $580.  It's a closed-loop water cooling solution, so no maintainence issues.  Just blow out the radiator/fan combo with compressed air every so often (like all fans).

     

    So what is the difference between a card and its "TI" version? What is this ti of which we speak?

    Im think that a 1070 might be an option. Just digging for more details.

    The "TI" suffix was added by a card manufacturer way back in Earlier GeForce days, it was an abbreviation for "Titanium", which meant it was a better spec GPU (usually meant it could be overclocked better, and was overclocked from the factory.)

    Today, it usually indicates that the GPU is actually a higher-end GPU chip that had some cores marked as bad.  So they disable those cores, put them on a lower platform name, and call it a "Ti" card.  It means it runs at the clock speed of the better card, has more cores than the regular model (but not as many as the better one).  It allows the company to reduce the losses from chips that come off the fab line with some small issues (a few CUDA cores are bad, etc.)  They have to design the chips in such a way that certain blocks of the chip can be disabled, but the extra design and circuitry is more than offset by the loss recovery.

    In fact, many of the cards today (1060, 1070, 1080) are all the same basic GPU, just some had failures in some blocks, so they disable those and put them in the lower spec card and at lower speed.  There may be ONE bad CUDA core in an nvidia GP-104 chip, but they have to disable the whole block of CUDA cores it is in.  So instead of being a 1080, it becomes a 1070 Ti.  If They have to disable two blocks of CUDA cores, then it becomes a 1070.  If three or more blocks have bad cores, it's probably a loss.  The 1060 has several tiers in it, and are GP-106 chips.  The 1080Ti and the Titan X and Xp are all GP-102 chips.  Just differing in how well they test coming out of the fabrication plant. 

    Hmmm, interesting.  Back in the old days when we still used slings, arrows, spears, and discrete transistors, the plethora of transister designation numbers were due to the uncertainty of the manufacturing process.  They'd make a bunch of transisters, designed with a desired response curve in mind, then put them on a testing machine and chart its actual amplification characteristics and throw them into buckets depending on the shape of their response curves, and label them "2N3055" or some other related number in the series depending on the actual shape of the response curve.

  • tj_1ca9500btj_1ca9500b Posts: 2,057

    Newegg is running their pre-fourth of July sale right now.  Definitely take a look.

    www.newegg.com

  • kyoto kidkyoto kid Posts: 41,851

    ...according to Nvida the recommended power supply is 500w (the card peaks at 180w).  I would still got a little higher particularly if you are considering other future upgrades.

  • ebergerlyebergerly Posts: 3,255
    edited July 2018

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    I realize that's a popular belief, but do you have any references to support any of that? Because there are a lot of people who successfully use their GPU's on the standard fan profiles for rendering iray for years, and I don't know of any facts to support the belief that their cards would have lasted longer if they were just gaming and not doing iray. Electronics have thermal limits because of physical characteristics of the silicon, etc. And they have continuous ratings, because that's what they can withstand continuously. For some reason some like to believe that using something at the continuous ratings for a longer period is worse. I don't think the facts support that. My GPU's fans keep the temps at or below about 80C. Whether I'm rendering for 20 minutes or 20 hours. 80C max. That's not a damaging temperature. Just check the equipment specs.   

     

    Post edited by ebergerly on
  • AnotherUserNameAnotherUserName Posts: 2,727

    Guess what im currently toying with!? yeslaugh

  • TaozTaoz Posts: 10,256
    ebergerly said:

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    I realize that's a popular belief, but do you have any references to support any of that? Because there are a lot of people who successfully use their GPU's on the standard fan profiles for rendering iray for years, and I don't know of any facts to support the belief that their cards would have lasted longer if they were just gaming and not doing iray. Electronics have thermal limits because of physical characteristics of the silicon, etc. And they have continuous ratings, because that's what they can withstand continuously. For some reason some like to believe that using something at the continuous ratings for a longer period is worse. I don't think the facts support that. My GPU's fans keep the temps at or below about 80C. Whether I'm rendering for 20 minutes or 20 hours. 80C max. That's not a damaging temperature. Just check the equipment specs.   

    I used to think so too but there must be a reason why nVidia reduces the warranty to 90 days (normally 3 years for gaming cards) on cards designed specifically for mining:

    "What’s being stripped away? Details are scarce, but the manufacturers are removing HDMI and/or Display ports as image outputs from the mining-specific hardware. They will also come with reduced 90-day warranty periods due to the intensive 24-hour operation that the bitcoin mining GPUs are likely to see."

    https://www.ccn.com/nvidia-amd-to-release-cheaper-bitcoin-mining-gpus/

  • ebergerlyebergerly Posts: 3,255
    sapat said:

    EVGA specs say it draws less than 250 watts. They say a 500watt PS is minimum, though I'm sure that assumes worst case and that you're using a ton of other stuff simultaneously. Your best bet IMO is to buy a $30 power meter and measure what your computer actually requires. My 1080ti specs say it draws 250watts max, and with a  GTX1070 in parallel my computer barely ever gets to 400watts, and I have a 750 watt PS. 

    Often you'll hear that you need to get a massive power supply. Well, if you're planning on adding a lot of stuff later on then maybe. But most of your components draw relatively little power. The GPU is typically the biggest by far. 

  • AnotherUserNameAnotherUserName Posts: 2,727
    sapat said:

    Sapat, if there is any way that you can ditch that old card of yours and summon up the funds, defenitely look into it. Im still just doing random tests. What I can see already is improved render speeds, better navigation when dealing with the geometry on screen, far more stability as far as memory use is concerned. Ive already rendered a couple sets that the old card wouldve just crashed on or dumped to cpu and wouldve taken a ridiculous amount of time to render.

    Ill toss up some examples...

    I did come attempt one render on a set however that still threatened to maximize the cards recources and dump to cpu if not straight out crash it. But only one. It had atmospheric fog and bloated texture files, so...

  • ebergerlyebergerly Posts: 3,255
    edited July 2018
    Taoz said:
    ebergerly said:

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    I realize that's a popular belief, but do you have any references to support any of that? Because there are a lot of people who successfully use their GPU's on the standard fan profiles for rendering iray for years, and I don't know of any facts to support the belief that their cards would have lasted longer if they were just gaming and not doing iray. Electronics have thermal limits because of physical characteristics of the silicon, etc. And they have continuous ratings, because that's what they can withstand continuously. For some reason some like to believe that using something at the continuous ratings for a longer period is worse. I don't think the facts support that. My GPU's fans keep the temps at or below about 80C. Whether I'm rendering for 20 minutes or 20 hours. 80C max. That's not a damaging temperature. Just check the equipment specs.   

    I used to think so too but there must be a reason why nVidia reduces the warranty to 90 days (normally 3 years for gaming cards) on cards designed specifically for mining:

    "What’s being stripped away? Details are scarce, but the manufacturers are removing HDMI and/or Display ports as image outputs from the mining-specific hardware. They will also come with reduced 90-day warranty periods due to the intensive 24-hour operation that the bitcoin mining GPUs are likely to see."

    https://www.ccn.com/nvidia-amd-to-release-cheaper-bitcoin-mining-gpus/

    Of course they reduce the warranty for mining cards. Have you ever seen the racks of GPUs side by side in the miming warehouses, where they run a flexible pipe from the aircon and hope it works? The heat they have to dissipate with 100 GPUs side by side is enormous and the built in fans-are probably incapable of dissipating the heat. Far different from a typical PC in controlled envvironment.
    Post edited by ebergerly on
  • AnotherUserNameAnotherUserName Posts: 2,727
    edited July 2018

    This render I stopped at 28% after 10 minutes. What gets me about this render is that I didnt hide or delete a single item in the set, I didnt reduce a single texture size on the set or the figure. Anyone who knows this set also knows that it has alot of glossy surfaces. I attempted to render this set with the 770 a couple times. I had to delete or hide as many items and surfaces as I possibly could and I had to resize the textures at least twice. Even after that I never finished the render because it was super grainy and taking an impracticle amount of time to get anywhere with it. That was without any figures or extras in the scene.

    test.png
    1024 x 819 - 1M
    Post edited by AnotherUserName on
  • ebergerlyebergerly Posts: 3,255
    edited July 2018
    By the way heres a good analogy for GPUs. Is running your car continuously at 70 MPH bad for it? Of course not. You can drive across country for days and days at 70 mph and it wont overheat or get damaged, as long as its in the environment the engineers assumed. On a smooth highway with ambient temps less than say 120F or something. They design the cooling to protect the engine under those conditions. Now if you drive off road on dirt roads and drive up a steep mountain when ambient temps are 150F, all bets are off.
    Post edited by ebergerly on
  • AnotherUserNameAnotherUserName Posts: 2,727
    edited July 2018

    Heres something just for fun. Complete render time = 43 secs.

    Test 2.png
    632 x 819 - 327K
    Post edited by AnotherUserName on
  • namffuaknamffuak Posts: 4,406
    ebergerly said:

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    I realize that's a popular belief, but do you have any references to support any of that? Because there are a lot of people who successfully use their GPU's on the standard fan profiles for rendering iray for years, and I don't know of any facts to support the belief that their cards would have lasted longer if they were just gaming and not doing iray. Electronics have thermal limits because of physical characteristics of the silicon, etc. And they have continuous ratings, because that's what they can withstand continuously. For some reason some like to believe that using something at the continuous ratings for a longer period is worse. I don't think the facts support that. My GPU's fans keep the temps at or below about 80C. Whether I'm rendering for 20 minutes or 20 hours. 80C max. That's not a damaging temperature. Just check the equipment specs.   

     

    I have both a 980 TI and a 1080 TI in a very well ventilated case. I also have a very intensive render that will run both cards to thermal limit in about 10 minutes on the default fan settings. At thermal limit (85 C for both) the clocks throttle back - eventually to less than half speed. With a custom fan setting the 980 TI stays under 75 C and the 1080 TI stays close to 65 C and both will run at full clock speed and 97% gpu utilization.

    I don't see any point in running the cards hot if I don't have to.

  • AnotherUserNameAnotherUserName Posts: 2,727
    namffuak said:
    ebergerly said:

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    I realize that's a popular belief, but do you have any references to support any of that? Because there are a lot of people who successfully use their GPU's on the standard fan profiles for rendering iray for years, and I don't know of any facts to support the belief that their cards would have lasted longer if they were just gaming and not doing iray. Electronics have thermal limits because of physical characteristics of the silicon, etc. And they have continuous ratings, because that's what they can withstand continuously. For some reason some like to believe that using something at the continuous ratings for a longer period is worse. I don't think the facts support that. My GPU's fans keep the temps at or below about 80C. Whether I'm rendering for 20 minutes or 20 hours. 80C max. That's not a damaging temperature. Just check the equipment specs.   

     

    I have both a 980 TI and a 1080 TI in a very well ventilated case. I also have a very intensive render that will run both cards to thermal limit in about 10 minutes on the default fan settings. At thermal limit (85 C for both) the clocks throttle back - eventually to less than half speed. With a custom fan setting the 980 TI stays under 75 C and the 1080 TI stays close to 65 C and both will run at full clock speed and 97% gpu utilization.

    I don't see any point in running the cards hot if I don't have to.

    So maybe this is a no-brainer, but should I run my fans at 100% while I render or is that unnecessary? I never have anything super complex going on...

  • TaozTaoz Posts: 10,256
    ebergerly said:
    Taoz said:
    ebergerly said:

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    I realize that's a popular belief, but do you have any references to support any of that? Because there are a lot of people who successfully use their GPU's on the standard fan profiles for rendering iray for years, and I don't know of any facts to support the belief that their cards would have lasted longer if they were just gaming and not doing iray. Electronics have thermal limits because of physical characteristics of the silicon, etc. And they have continuous ratings, because that's what they can withstand continuously. For some reason some like to believe that using something at the continuous ratings for a longer period is worse. I don't think the facts support that. My GPU's fans keep the temps at or below about 80C. Whether I'm rendering for 20 minutes or 20 hours. 80C max. That's not a damaging temperature. Just check the equipment specs.   

    I used to think so too but there must be a reason why nVidia reduces the warranty to 90 days (normally 3 years for gaming cards) on cards designed specifically for mining:

    "What’s being stripped away? Details are scarce, but the manufacturers are removing HDMI and/or Display ports as image outputs from the mining-specific hardware. They will also come with reduced 90-day warranty periods due to the intensive 24-hour operation that the bitcoin mining GPUs are likely to see."

    https://www.ccn.com/nvidia-amd-to-release-cheaper-bitcoin-mining-gpus/

     

    Of course they reduce the warranty for mining cards. Have you ever seen the racks of GPUs side by side in the miming warehouses, where they run a flexible pipe from the aircon and hope it works? The heat they have to dissipate with 100 GPUs side by side is enormous and the built in fans-are probably incapable of dissipating the heat. Far different from a typical PC in controlled envvironment.

    Well it's obvious that they run 24/7 but how well they're cooled is hard to tell from a picture. Wonder if there are some data out there about this.

  • kyoto kidkyoto kid Posts: 41,851

    Heres something just for fun. Complete render time = 43 secs.

    ...very nice.

  • TaozTaoz Posts: 10,256
    namffuak said:
    ebergerly said:

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    I realize that's a popular belief, but do you have any references to support any of that? Because there are a lot of people who successfully use their GPU's on the standard fan profiles for rendering iray for years, and I don't know of any facts to support the belief that their cards would have lasted longer if they were just gaming and not doing iray. Electronics have thermal limits because of physical characteristics of the silicon, etc. And they have continuous ratings, because that's what they can withstand continuously. For some reason some like to believe that using something at the continuous ratings for a longer period is worse. I don't think the facts support that. My GPU's fans keep the temps at or below about 80C. Whether I'm rendering for 20 minutes or 20 hours. 80C max. That's not a damaging temperature. Just check the equipment specs.   

     

    I have both a 980 TI and a 1080 TI in a very well ventilated case. I also have a very intensive render that will run both cards to thermal limit in about 10 minutes on the default fan settings. At thermal limit (85 C for both) the clocks throttle back - eventually to less than half speed. With a custom fan setting the 980 TI stays under 75 C and the 1080 TI stays close to 65 C and both will run at full clock speed and 97% gpu utilization.

    I don't see any point in running the cards hot if I don't have to.

    So maybe this is a no-brainer, but should I run my fans at 100% while I render or is that unnecessary? I never have anything super complex going on...

    IMO the most important thing is case cooling, for if it's too hot inside the case even the best or fastest CPU or GPU coolers/fans are not worth much. Room temperature also matters.

    I have a GTX 1070 with two dynamic cooling fans, in a well cooled case (2 x 120 mm fans in and 3 x 140 mm out). The card is usually at 60-65 C and the GPU fans run at 50-60% speed no matter how long time I render, depending on room temperature which is usually between 25-30 C. The 2 x 140 mm top fans in the case are also running at high speed when I render (the case has a manual switch where you can set their speed to low or high).

    I haven't noticed any difference in GPU clock speeds or temperature whether the scene is complex or not so it doesn't look like that matters.

     

  • DaikatanaDaikatana Posts: 830
    edited July 2018
    Taoz said:
    ebergerly said:

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    I realize that's a popular belief, but do you have any references to support any of that? Because there are a lot of people who successfully use their GPU's on the standard fan profiles for rendering iray for years, and I don't know of any facts to support the belief that their cards would have lasted longer if they were just gaming and not doing iray. Electronics have thermal limits because of physical characteristics of the silicon, etc. And they have continuous ratings, because that's what they can withstand continuously. For some reason some like to believe that using something at the continuous ratings for a longer period is worse. I don't think the facts support that. My GPU's fans keep the temps at or below about 80C. Whether I'm rendering for 20 minutes or 20 hours. 80C max. That's not a damaging temperature. Just check the equipment specs.   

    I used to think so too but there must be a reason why nVidia reduces the warranty to 90 days (normally 3 years for gaming cards) on cards designed specifically for mining:

    "What’s being stripped away? Details are scarce, but the manufacturers are removing HDMI and/or Display ports as image outputs from the mining-specific hardware. They will also come with reduced 90-day warranty periods due to the intensive 24-hour operation that the bitcoin mining GPUs are likely to see."

    https://www.ccn.com/nvidia-amd-to-release-cheaper-bitcoin-mining-gpus/

    First off, a lot of mining setups are not engineered well. The airflow is not directed correctly.  In fact, some are not even set up in proper cases but simply mounted on a backplane and running in ambient room air.  That’s going to shorten the life of any electronics.  In an intelligently designed pc, the case itself assists with cooling as in constrains the airflow brought in and routed through by intelligently placed fans.  This is just simple engineering at work and it’s a beautiful thing.  My pc case for example has cool air from almost floor level pulled in by a fan and that airflow is directed up and over what needs to be cooled  before being pushed out of the case by a strategically placed exhaust fan at the top rear of the case.  Now if I had 3 or more gpu’s in my case I would consider a liquid cooling setup for the gpu’s necessary because you can only do so much with engineered airflow.  However for a single gpu ,or two gpu’s in a well designed case with intelligent fan placement and direction, ambient room temperature of about 68f-75f is perfectly fine for both hours of gaming and long rendering stretches.  I think a lot of people get alarmed when the thermal sensors cause the fans to ramp up but that’s what they were designed and built to do.  

    Post edited by Daikatana on
  • SimonJMSimonJM Posts: 6,067
    kyoto kid said:
    ebergerly said:

    It is unfortunate that Daz does not warn people that Iray is very hard on Video cards and really good cooling is a must with Iray or Octaine. Oh by the way I'm a Alienware User myself. I like the cool looking cases.

     

    While it might seem intuitively obvious that rendering/iray is "hard" on video cards, I'm not sure that's necessarily the case.

    Keep in mind that electronic components have a design life, and if you operate them, even continuously, within their "normal" design ratings (like temperature, frequency, etc.) they should last for their design life. A bit like how the human body can operate fine for over 100 years with a continuous temperature of 37C (98.6F). Which, BTW, isn't all that far from the temps some GPU's attain during rendering...

    So, for example, if during rendering your GPU temps get around 50C (which is probably in the "normal" or "continuous" rating), and it renders at that temp continously for a week, it should still last for its design life as long as the temps stay in the normal range. And if that design life is, say, 10 years, it's probable the GPU will be fine for 10 years. Doesn't mean it won't fail for some (other) reason, since everything has an unpredictable/probabilistic chance of failure (called "Mean Time Between Failure", MTBF).

    But whether you render for 1 minute or 1 week or 1 month at that normal temperature it doesn't (or shouldn't) really matter. The GPU's fan cooling is designed for it, and it's thermal protection is designed for it, and the components work fine at that temp.

    So yes, rendering raises the GPU's temperature, and it can be for a long time, but that doesn't mean it's a bad thing. Now, if you somehow exceed the normal range (which isn't easy if all is working as it should), and get near the GPU's "maximum" ratings, the GPU (and other electronic components) have built-in protection to either throttle performance, or shut down the computer so stuff doesn't get damaged. But the way the GPU is designed is to provide enough fan cooling so that this doesn't happen, and the temps should be continuously regulated to stay within the safe, normal, continuous range.

    So you'd need the fan/cooling to fail, and also the thermal protection to fail in order for meltdown. Unless you bought a poorly designed cheapo piece of junk I suppose.

    So if a GPU on a render machine fails it could be for many reasons, unrelated to the fact that it's used for rendering. If you want peace of mind, just monitor your temps while rendering to see if all is well. And if it fails during rendering while the temps are in the normal range, something else is probably going on.

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    <snip>

    So does MSI afterburner work with any brand or just MSI cards?

    Pretty sure it does, at least it works onmy cards and i have NO idea of their actual make (PC was made up for me by a company).

  • ebergerlyebergerly Posts: 3,255
    Daikatana said:

     

    irst off, a lot of mining setups are not engineered well. 

    To say the least... 

    Consider this: a GTX 1080ti uses 250watts, which means it generates 250 watts of heat. So four of those next to each other is like a 1,000 watt toaster oven on HIGH. Certainly not what the GPU designers designed for. 

    Also, keep in mind that many/most of these mining operations are do-it-yourself operations where a group of guys hear that they can make a ton of money if they buy a bunch of GPU's. So they build some racks and get some power supplies and get it running as quickly as possible. It's not like they can buy a pre-made and pre-engineered mining rack system from Dell or HP or NVIDIA. They rent some space and throw something together and hope for the best. Totally do-it-yourself. Just imagine trying to figure how to cool the equivalent of 20 toaster ovens on HIGH, when all you have is an office building space with standard A/C vents in the ceiling. That's why youtube has tons of "hey, here's how I cooled my mining rig" videos. 

    It's not the continuous rendering that causes problems. Its about putting your PC in a pizza oven and hoping the GPU fans will keep it cool while it's rendering.  

     

  • Dim ReaperDim Reaper Posts: 687

    AnotherUserName said:

    So maybe this is a no-brainer, but should I run my fans at 100% while I render or is that unnecessary? I never have anything super complex going on...

     

    Firstly, congratulations on your new card - good to see you enjoying the new speed. 

    I have a very similar setup to Nanffuak who posted earlier, and I also use a custom fan profile.  I have used MSI Afterburner for quite a long time, but I've recently had to stop using it and go with the software that came with my card because Riva Tuner Statistics Server is used to display the graphs and I'm having problems with it since the update to Windows 10 1803.  But if you're not runing that build of Windows 10 then I would definitely recommend it.  I usually runthe fans at 30% at 50 Celsius and at 60% at 70 Celsius and this is enough to keep them at around 70 Celsius during a prolonged render.  If it's a very hot day then I turn the GPU fans up for long renders.

  • Silver DolphinSilver Dolphin Posts: 1,638
    ebergerly said:

    Nvidia video cards for consumers are designed for gaming not Iray. The fan profiles and cooling solutions are designed with this in mind. This is why even a really well build card with excellent cooling is gonna go over designed heating threshold when subjected to Iray. The way video cards are designed is like american cars they are built to break>> Consequently we are left with the need to do everything in our power to expand the MTBF life of our investment in these expensive video cards as best we can. Use MSI afterburner and open case and point a fan at computer to blow cool air at case especially during the summer months.

    I realize that's a popular belief, but do you have any references to support any of that? Because there are a lot of people who successfully use their GPU's on the standard fan profiles for rendering iray for years, and I don't know of any facts to support the belief that their cards would have lasted longer if they were just gaming and not doing iray. Electronics have thermal limits because of physical characteristics of the silicon, etc. And they have continuous ratings, because that's what they can withstand continuously. For some reason some like to believe that using something at the continuous ratings for a longer period is worse. I don't think the facts support that. My GPU's fans keep the temps at or below about 80C. Whether I'm rendering for 20 minutes or 20 hours. 80C max. That's not a damaging temperature. Just check the equipment specs.   

     

    If video card makers wanted they could use something other than solder to connect the GPU to the video card which is the Achilles heel of the video card most failure is due to the constant cooling and reheating of solder connecting this to the card. When this starts to happen your GPU becomes disconnected from video board and that's it for your video card (time to by another). Keeping the card cool lengthens the life of these video cards solder points. If solder was a good solution we would have them on expensive desktop CPU sockets but most good motherboard manufactures don't solder CPU's because this is the hottest part of the system and the most expensive. Consequenlty, we have sockets that make the direct connection to CPU without solder and really good air or water cooling for the CPU. Solder is a cheap way for companies to attach something to a board and it breaks down!  Do CPU and GPU wear out? Yes they do but most of the time it is bad solder contacts or bad resistors on the boards that go bad that bring the CPU motherboard system or video card down. Yes I'm over simplifing but it would take pages and pages to expain why things are engineered to break just ask any good engineer about the problem Henry Ford ran into for making a car that could run too long without breaking.

    Oh, Kyoto Kid yes MSI Afterburner will work with any video card.

  • AnotherUserNameAnotherUserName Posts: 2,727

    AnotherUserName said:

    So maybe this is a no-brainer, but should I run my fans at 100% while I render or is that unnecessary? I never have anything super complex going on...

     

    Firstly, congratulations on your new card - good to see you enjoying the new speed. 

    I have a very similar setup to Nanffuak who posted earlier, and I also use a custom fan profile.  I have used MSI Afterburner for quite a long time, but I've recently had to stop using it and go with the software that came with my card because Riva Tuner Statistics Server is used to display the graphs and I'm having problems with it since the update to Windows 10 1803.  But if you're not runing that build of Windows 10 then I would definitely recommend it.  I usually runthe fans at 30% at 50 Celsius and at 60% at 70 Celsius and this is enough to keep them at around 70 Celsius during a prolonged render.  If it's a very hot day then I turn the GPU fans up for long renders.

    Good to know, thanks. Im pretty sure that I have fan controllers somewhere in this computer.

  • ebergerlyebergerly Posts: 3,255
    edited July 2018

    Regarding the real causes of failure of GPU's, here's a quote from a Former Chief Product Architect at AMD, when asked "What are the most common reasons for GPU cards to fail?". His answer (I've highlighted some pertinent parts): 

    "Overheating. Which could be caused by intentional overclocking, insufficient ventilation, or something overriding the built-in power throttling that the GPU boards to prevent overheating. If you are having problems with early failure, check the LFM (linear feet per minute) requirement of the board and see what your system supplies.

    Many GPU cards are designed for a single-width or double-width PCI slot. The original standard power limit for PCI was 25 watts per slot, but in the competition for higher performance, GPU cards have gone far past that limit using on-board fans and other tricks. Many high-end GPU cards are over 200 watts these days. The safest thing is to use an external card with its own cooling system and not depend on the airflow in an existing PC. Or buy a PC from a reputable manufacturer that has the GPU card configured in the product, since the engineers will have worked out what it takes to keep the card functioning properly.

    Some effects of overheating are immediate, since it can cause temperature-dependent devices to go out of spec and then the software will fail. Other effects are gradual because high heat can cause ions in transistors to diffuse and eventually fall out of spec. Graphics boards are usually loaded with capacitors that don't do well with high temperatures, either.

    As long as a GPU card is kept within its operating parameters for temperature, it should last as long as the computer system it is plugged into.

    I was the Chief Product Architect for AMD's graphics business unit (formerly ATI), and I do not recall any clamor over GPU cards experiencing early failures. Maybe there's a faulty product on the market now; that kind of mistake can happen in any industry. GPU boards should last at least seven years.
    "

    Basically he's echoing the points I've been making. As long as you don't operate outside the continuous ratings (by overclocking or damaging the cooling system), your GPU should be fine. 

    Post edited by ebergerly on
Sign In or Register to comment.