Diffstat (limited to 'zarb-ml/mageia-sysadm/2012-April/004368.html')
-rw-r--r-- | zarb-ml/mageia-sysadm/2012-April/004368.html | 147 |
1 files changed, 147 insertions, 0 deletions
diff --git a/zarb-ml/mageia-sysadm/2012-April/004368.html b/zarb-ml/mageia-sysadm/2012-April/004368.html new file mode 100644 index 000000000..accfe5f63 --- /dev/null +++ b/zarb-ml/mageia-sysadm/2012-April/004368.html @@ -0,0 +1,147 @@ +<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN"> +<HTML> + <HEAD> + <TITLE> [Mageia-sysadm] questions about our infrastructure setup & costs + </TITLE> + <LINK REL="Index" HREF="index.html" > + <LINK REL="made" HREF="mailto:mageia-sysadm%40mageia.org?Subject=Re%3A%20%5BMageia-sysadm%5D%20questions%20about%20our%20infrastructure%20setup%20%26%09costs&In-Reply-To=%3C20120402194156.GC21938%40mars-attacks.org%3E"> + <META NAME="robots" CONTENT="index,nofollow"> + <META http-equiv="Content-Type" content="text/html; charset=us-ascii"> + <LINK REL="Previous" HREF="004362.html"> + <LINK REL="Next" HREF="004363.html"> + </HEAD> + <BODY BGCOLOR="#ffffff"> + <H1>[Mageia-sysadm] questions about our infrastructure setup & costs</H1> + <B>nicolas vigier</B> + <A HREF="mailto:mageia-sysadm%40mageia.org?Subject=Re%3A%20%5BMageia-sysadm%5D%20questions%20about%20our%20infrastructure%20setup%20%26%09costs&In-Reply-To=%3C20120402194156.GC21938%40mars-attacks.org%3E" + TITLE="[Mageia-sysadm] questions about our infrastructure setup & costs">boklm at mars-attacks.org + </A><BR> + <I>Mon Apr 2 21:41:56 CEST 2012</I> + <P><UL> + <LI>Previous message: <A HREF="004362.html">[Mageia-sysadm] questions about our infrastructure setup & costs +</A></li> + <LI>Next message: <A HREF="004363.html">[Mageia-sysadm] perl modules shipped by mageia - the web site! 
+</A></li> + <LI> <B>Messages sorted by:</B> + <a href="date.html#4368">[ date ]</a> + <a href="thread.html#4368">[ thread ]</a> + <a href="subject.html#4368">[ subject ]</a> + <a href="author.html#4368">[ author ]</a> + </LI> + </UL> + <HR> +<!--beginarticle--> +<PRE>On Mon, 02 Apr 2012, Romain d'Alverny wrote: + +><i> On Mon, Apr 2, 2012 at 17:49, nicolas vigier <<A HREF="https://www.mageia.org/mailman/listinfo/mageia-sysadm">boklm at mars-attacks.org</A>> wrote: +</I>><i> > Using paid hosting will not remove problems like bad RJ45 or switch +</I>><i> > that stop working. If we want good availability, we need more servers +</I>><i> > in different places. +</I>><i> +</I>><i> In paid hosting, (physical) server and link failure is to be directly +</I>><i> handled by people that have a financial incentive to have it work. I +</I>><i> expect (but may be wrong) that the availability will be higher than +</I>><i> what we have today, and that it is still affordable for _some_ +</I>><i> services. It's not about going full speed to paid services or to spend +</I>><i> unnecessarily money, it's about using what we can (it includes money) +</I>><i> to improve our systems availability. +</I> +It doesn't matter whether it's paid hosting or not: if a switch stops +working on Friday evening, and if there's nobody available to go to the +datacenter to replace it, then everything will be offline until one of +us has time to go to the datacenter. We can pay for very expensive datacenter +hosting, but they won't replace our switch if it stops working. + +We can also pay for expensive hosting in a datacenter and have a power outage, +network problems because of a flood or some other reason, air-conditioning +problems, etc.
+ +And we can also pay for expensive hosting at EC2 and have 2 days of downtime: +<A HREF="http://www.pcworld.com/businesscenter/article/226327/what_your_business_can_learn_from_the_amazon_cloud_outage.html">http://www.pcworld.com/businesscenter/article/226327/what_your_business_can_learn_from_the_amazon_cloud_outage.html</A> + +In 1 year we had only one major unexpected downtime on our servers, +because of a bad network cable, on Friday evening, and hopefully this +kind of problem does not happen very often. Before this we had more +downtime on the servers hosted at Gandi, because of problems on their +storage servers that affected all their customers. + +><i> +</I>><i> The point is that: I don't know and I don't have the data to get an +</I>><i> idea about that; and I'm not even sure the data needed is compiled +</I>><i> somewhere at this time. And I suspect I'm not alone in this case. If I +</I>><i> don't ask, someone else will later. Or even worse than that, won't +</I>><i> dare to ask. +</I>><i> +</I>><i> That's why I'm asking for this for those two purposes: explaining more +</I>><i> how it works, understanding how it could work. +</I>><i> - functional split list => your skills/job +</I>><i> - needs per functional unit => same +</I>><i> - dependencies between units => same [1] +</I>><i> - cost per unit in different contexts => can be spread around +</I> +We don't have a lot of servers, so there is no need for a complex dependency graph +to see that all of the servers are critical, and downtime of any of the +servers will cause problems somewhere. If we want to reduce the risk of +having a lot of services down at the same time, then we need more +servers, hosted in different places. + +><i> +</I>><i> And yes, it may be too expensive. Or it may not. But I suspect we +</I>><i> don't know, or it's not obvious enough. On the other hand, having one, +</I>><i> or several server downtime like this for 2/3 days also costs a lot to +</I>><i> the project (loss of time, and reputation shift).
+</I> +If we can't afford a 2-day downtime, then we should probably stop +everything now and do something else. + +Projects with more money and more machines than us also have unexpected +server downtime. + +Fedora had almost 1 day of downtime on their buildsystem in December: +<A HREF="http://lists.fedoraproject.org/pipermail/devel-announce/2011-December/000867.html">http://lists.fedoraproject.org/pipermail/devel-announce/2011-December/000867.html</A> +And if we read their mailing list archives we can see 2 hours on many +services in January 2012, 1 hour for the build system in February 2012, 2 hours +in January 2012, etc. + +In April 2010 Debian had their buildd.debian.org server down on Friday +and restored on Monday, wiki.debian.org for one day, forums.debian.org +for a few days: +<A HREF="http://lists.debian.org/debian-infrastructure-announce/2010/04/msg00001.html">http://lists.debian.org/debian-infrastructure-announce/2010/04/msg00001.html</A> +wiki down for an unknown time in January 2010: +<A HREF="http://lists.debian.org/debian-infrastructure-announce/2010/01/msg00001.html">http://lists.debian.org/debian-infrastructure-announce/2010/01/msg00001.html</A> +ftp-master in January 2011: +<A HREF="http://lists.debian.org/debian-infrastructure-announce/2011/01/msg00000.html">http://lists.debian.org/debian-infrastructure-announce/2011/01/msg00000.html</A> + +And I think it's the same for most projects. + +</PRE> + + + + + + + + + + +<!--endarticle--> + <HR> + <P><UL> + <!--threads--> + <LI>Previous message: <A HREF="004362.html">[Mageia-sysadm] questions about our infrastructure setup & costs +</A></li> + <LI>Next message: <A HREF="004363.html">[Mageia-sysadm] perl modules shipped by mageia - the web site! 
+</A></li> + <LI> <B>Messages sorted by:</B> + <a href="date.html#4368">[ date ]</a> + <a href="thread.html#4368">[ thread ]</a> + <a href="subject.html#4368">[ subject ]</a> + <a href="author.html#4368">[ author ]</a> + </LI> + </UL> + +<hr> +<a href="https://www.mageia.org/mailman/listinfo/mageia-sysadm">More information about the Mageia-sysadm +mailing list</a><br> +</body></html> |