Ronnie Sahlberg [Thu, 21 May 2009 04:10:45 +0000 (14:10 +1000)]
change the socket we use for sending gratuitous ARPs from AF_INET/SOCK_PACKET to AF_PACKET/SOCK_RAW
(This used to be ctdb commit
2c4c20d7803f4449f8d463314c40d4734ec80e2f)
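As an illustration of the socket change above, the following Python sketch builds a gratuitous ARP frame and sends it on an AF_PACKET/SOCK_RAW socket. It is not the ctdb C code: the function names, the use of an ARP request rather than a reply, and the interface name passed by the caller are all assumptions made for the example. Sending requires Linux and raw-socket privileges; building the frame does not.

```python
import socket
import struct

ETH_P_ARP = 0x0806   # EtherType for ARP
ETH_P_IP = 0x0800    # protocol type carried in the ARP payload


def build_gratuitous_arp(mac: bytes, ip: str) -> bytes:
    """Return a 42-byte Ethernet frame announcing that `ip` is at `mac`."""
    if len(mac) != 6:
        raise ValueError("mac must be 6 bytes")
    addr = socket.inet_aton(ip)
    # Ethernet header: broadcast destination, our source, ARP EtherType.
    eth_header = b"\xff" * 6 + mac + struct.pack("!H", ETH_P_ARP)
    arp = struct.pack(
        "!HHBBH6s4s6s4s",
        1,            # hardware type: Ethernet
        ETH_P_IP,     # protocol type: IPv4
        6, 4,         # hardware / protocol address lengths
        1,            # opcode 1 = request (assumed choice for this sketch)
        mac, addr,    # sender hardware / protocol address
        b"\x00" * 6,  # target hardware address: unknown
        addr,         # target IP equals sender IP, which makes it gratuitous
    )
    return eth_header + arp


def send_gratuitous_arp(ifname: str, mac: bytes, ip: str) -> None:
    """Send the frame on `ifname`. Linux only; needs raw-socket privileges."""
    sock = socket.socket(socket.AF_PACKET, socket.SOCK_RAW,
                         socket.htons(ETH_P_ARP))
    try:
        sock.bind((ifname, 0))
        sock.send(build_gratuitous_arp(mac, ip))
    finally:
        sock.close()
```

With AF_PACKET/SOCK_RAW the caller supplies the full link-layer header, which is why the sketch builds the Ethernet header itself.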
Ronnie Sahlberg [Thu, 21 May 2009 01:49:16 +0000 (11:49 +1000)]
Whitespace changes to the previous patch, and use of the
CTDB_NO_MEMORY() macro in it.
(This used to be ctdb commit
d623ea7c04daa6349b42d50862843c9f86115488)
Sumit Bose [Wed, 20 May 2009 10:08:13 +0000 (12:08 +0200)]
add missing checks on so far ignored return values
Most of these were found during a review by Jim Meyering <meyering@redhat.com>
(This used to be ctdb commit
3aee5ee1deb4a19be3bd3a4ce3abbe09de763344)
Sumit Bose [Wed, 20 May 2009 10:02:27 +0000 (12:02 +0200)]
structure member node_list_file is not used anywhere
(This used to be ctdb commit
0e84ea23d1d998d4d4ac7d8a858b3d8294f056cb)
Sumit Bose [Wed, 20 May 2009 09:47:34 +0000 (11:47 +0200)]
structure member logfile is not used anywhere
(This used to be ctdb commit
4f86c991812c2d0bddbe3de9a9906cf5df118cd4)
Sumit Bose [Wed, 20 May 2009 07:17:01 +0000 (09:17 +0200)]
fix a configure warning while checking for netfilter.h
(This used to be ctdb commit
fa5afee8e9a8fba6017bc58f87bc040de7206e63)
Sumit Bose [Wed, 20 May 2009 06:59:00 +0000 (08:59 +0200)]
added a missing dependency
(This used to be ctdb commit
1d833163b57853b84f098dffdb3c5f50164fcc73)
Ronnie Sahlberg [Mon, 18 May 2009 22:55:42 +0000 (08:55 +1000)]
Change the loglevel of "registered tcp client for ..." to INFO
instead of ERR
(This used to be ctdb commit
92b5580c38c23b99c1692708540983b0c0fcd6cf)
Ronnie Sahlberg [Mon, 18 May 2009 22:47:19 +0000 (08:47 +1000)]
From: Flavio Carmo Junior <carmo.flavio@gmail.com>
Add a helper function that checks whether a unix domain socket exists
and there is a daemon LISTENING to it similar to the existing function
to check for a daemon LISTENING to a tcp/ip socket.
(This used to be ctdb commit
025a836ab3be3c078fccd8c10b10dfffbfdd94d0)
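A check of this kind can be sketched in Python as follows. This is not the helper added by the commit (its name and implementation are not given here); `unix_socket_is_listening` is a hypothetical name. The idea is that a socket file can exist without any daemon behind it, so the check tries an actual connect.

```python
import os
import socket
import stat


def unix_socket_is_listening(path: str, timeout: float = 1.0) -> bool:
    """Return True if `path` is a unix domain socket that accepts connections."""
    try:
        mode = os.stat(path).st_mode
    except OSError:
        return False                      # path does not exist or is unreadable
    if not stat.S_ISSOCK(mode):
        return False                      # exists, but is not a socket
    sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    sock.settimeout(timeout)
    try:
        sock.connect(path)                # succeeds only if someone is listening
        return True
    except OSError:
        return False                      # stale socket file, refused, or timed out
    finally:
        sock.close()
```

A stale socket file left behind by a dead daemon passes the existence test but fails the connect, which is the case the existence test alone would miss.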
Volker Lendecke [Fri, 15 May 2009 20:08:21 +0000 (22:08 +0200)]
Fix http://ctdb.samba.org/download.html
(This used to be ctdb commit
177295ba400fcaf47f026653f27a42a8ff798d36)
Christian Ambach [Wed, 6 May 2009 17:01:58 +0000 (19:01 +0200)]
Remove error messages about a non-existing /var/log/log.ctdb when running ctdb with logging to syslog
(This used to be ctdb commit
afdbf3c0df02decd823615134294abf2c8a8a5f3)
Ronnie Sahlberg [Thu, 14 May 2009 08:25:00 +0000 (18:25 +1000)]
add additional log info to track if/why we can't switch to client mode.
(This used to be ctdb commit
722171fc94a36ffe9e0a5c64502b916fde0a13a4)
Ronnie Sahlberg [Thu, 14 May 2009 00:33:25 +0000 (10:33 +1000)]
Track how long it takes to take out the recovery lock from both the main daemon and also from the recovery daemon.
Log this in "ctdb statistics".
Also add a variable "RecLockLatencyMs" that will log an error every time it takes longer than this to access the reclock file.
(This used to be ctdb commit
042377ed803bb8f7ca9d6ea1a387427b7b8ba45a)
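The latency check described above amounts to timing an operation and logging when it exceeds a threshold. A minimal Python sketch of that pattern follows; `timed_call`, its parameters, and the log message are hypothetical and are not taken from ctdb.

```python
import time


def timed_call(fn, threshold_ms, log, clock=time.monotonic):
    """Run fn(), measure its latency, and log if it exceeds threshold_ms.

    `clock` is injectable so the behaviour can be checked without sleeping.
    Returns (result, latency_ms).
    """
    start = clock()
    result = fn()
    latency_ms = (clock() - start) * 1000.0
    if latency_ms > threshold_ms:
        log(f"operation took {latency_ms:.0f} ms (limit {threshold_ms} ms)")
    return result, latency_ms
```

Here `fn` stands in for the reclock file access and `threshold_ms` for the value of a tunable such as RecLockLatencyMs.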
Ronnie Sahlberg [Wed, 13 May 2009 22:55:40 +0000 (08:55 +1000)]
new version 1.0.82
(This used to be ctdb commit
82ee458329968001bb03b2aec42e65f532f007b3)
Ronnie Sahlberg [Wed, 13 May 2009 22:55:05 +0000 (08:55 +1000)]
use scope host when adding the interface to loopback so we don't respond to ARPs for this ip
(This used to be ctdb commit
fcd6226a6c00cf657532aa76804bfe029df21ba6)
Ronnie Sahlberg [Wed, 13 May 2009 22:12:48 +0000 (08:12 +1000)]
change the prefix NATGW_ to CTDB_NATGW_
(This used to be ctdb commit
b7ed7fd4a5fbd344d41caa1afa100b1f24506173)
Michael Adam [Tue, 12 May 2009 05:56:23 +0000 (07:56 +0200)]
ping pong: fix logic for mmap reads vs. preads
Michael
(This used to be ctdb commit
0c88fa41bc3c629052bc137ed30c473ed10522fd)
Michael Adam [Tue, 12 May 2009 20:59:35 +0000 (22:59 +0200)]
maketarball.sh: add GPL license header
Michael
(This used to be ctdb commit
13270a011016bf20bbf721f6d083b2f113fdbc79)
Michael Adam [Tue, 12 May 2009 20:59:08 +0000 (22:59 +0200)]
makerpms.sh: add GPL license header
Michael
(This used to be ctdb commit
7498e176817719eadd91201bbd0d9ceb91eefdae)
Michael Adam [Thu, 26 Mar 2009 18:03:03 +0000 (19:03 +0100)]
Remove generated binary files.
Noted by Mathieu Parent <math.parent@gmail.com>
Michael
(This used to be ctdb commit
b321dfd1d23492169ac25ed901d49d7c69ad5340)
Ronnie Sahlberg [Tue, 12 May 2009 08:21:26 +0000 (18:21 +1000)]
remove NATGW_PRIVATE_IFACE from the documentation since we do not need
it any more.
(This used to be ctdb commit
c967b234f59e5998bc8f2250062f4b0d1f39d820)
Ronnie Sahlberg [Tue, 12 May 2009 08:42:13 +0000 (18:42 +1000)]
assign the natgw address to loopback and not the private network so that natgw will still work even when public and private networks are one and the same
(This used to be ctdb commit
2bd796b8a098074502fe20e3ab69098b2109c133)
Ronnie Sahlberg [Tue, 12 May 2009 08:39:34 +0000 (18:39 +1000)]
add extra debug statements to the log to make it easier to see when a recovery daemon has hung due to the underlying filesystem hanging.
(This used to be ctdb commit
5b0067a4e335cbbf6e606646e612d4bfcfdb7441)
Ronnie Sahlberg [Tue, 12 May 2009 08:32:41 +0000 (18:32 +1000)]
check that a node is banned before trying to unban it.
(This used to be ctdb commit
4467b5f88d749d455854512f60a5d313cafa828b)
Martin Schwenke [Fri, 3 Apr 2009 01:54:26 +0000 (12:54 +1100)]
51_ctdb_bench.sh now allows a 2% difference between positive and
negative. ctdb_bench.c checks to ensure the timer has advanced from 0
before dividing.
Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit
723413f246399b25166462d2018237920515655f)
Martin Schwenke [Tue, 21 Apr 2009 06:50:37 +0000 (16:50 +1000)]
Avoid floating point divide by 0 in ctdb_fetch.c's bench_fetch().
Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit
3c67e164eb92591f8763883430490805c1dfa9ed)
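The guard behind this kind of fix can be sketched in a few lines of Python. The function name and the choice to report 0.0 when no time has elapsed are assumptions for the example, not the actual code in ctdb_fetch.c.

```python
def ops_per_second(count: int, elapsed_seconds: float) -> float:
    """Return a rate, or 0.0 if the timer has not advanced yet."""
    if elapsed_seconds <= 0.0:
        return 0.0          # avoid dividing by zero on a very fast run
    return count / elapsed_seconds
```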
Martin Schwenke [Fri, 1 May 2009 07:40:45 +0000 (17:40 +1000)]
Bug fixes for tests: simple/12_ctdb_getdebug.sh and scripts/test_wrap.
simple/12_ctdb_getdebug.sh now recognises output with multi-digit node
numbers.
Sharing the ctdb directory via NFS and testing on a real cluster by
setting CTDB_TEST_REAL_CLUSTER didn't work by default. The fix is to
hack scripts/test_wrap so that it tries to find a valid bin directory
next to the directory it is in.
Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit
ea2ca769e1d1068fbbad843750b19acfd87360e0)
Ronnie Sahlberg [Mon, 11 May 2009 22:59:49 +0000 (08:59 +1000)]
From: Sumit Bose <sbose@redhat.com>
fix handling of AC_INIT
(This used to be ctdb commit
1c31fea7432b870169fb839c1fbba5a33dec8e8a)
Martin Schwenke [Mon, 11 May 2009 04:43:17 +0000 (14:43 +1000)]
Fix lvsmaster and natgwlist nodespecs.
They both need to use a -Y option to ctdb and for natgwlist we only
want the 1st line.
Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit
e781ff61e17d733349021bb036514f823c7cbfbb)
Martin Schwenke [Mon, 11 May 2009 04:14:11 +0000 (14:14 +1000)]
Updated onnode docs to reflect recent changes.
Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit
cdf092d69a710310d82d1d67baa0ffb19f676927)
Martin Schwenke [Mon, 11 May 2009 03:39:31 +0000 (13:39 +1000)]
New lvs/lvsmaster and natgw/natgwlist nodespecs for onnode.
Some code re-factoring to implement this and to make it easy to
implement new ones. New simpler implementation of echo_nth() no
longer uses deleted get_nth() function.
Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit
29559f5dd099bec210e98909c9b2e048461b7c81)
Martin Schwenke [Wed, 6 May 2009 03:17:34 +0000 (13:17 +1000)]
New option "-o <prefix>" saves stdout from each node to file <prefix>.<ip>.
Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit
a0f5148ac749758e2dfbd6099e829c5bf1d900e6)
Martin Schwenke [Tue, 5 May 2009 06:02:30 +0000 (16:02 +1000)]
Use ctdb_fetch_lock rather than ctdb_call.
Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit
5f3d2e29a446972ac244e20a08e48f4c2f4ffef4)
Martin Schwenke [Mon, 11 May 2009 04:50:28 +0000 (14:50 +1000)]
41.httpd event script workaround for RHEL5-ism.
RHEL5 can SIGKILL httpd when stopping it, causing it to leak
semaphores. This means that eventually a node runs out of semaphores
and httpd can't be started. So, before we attempt to start httpd we
clean up any semaphores owned by apache. We also try to restart httpd
in the monitor event if httpd has gone away.
Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit
2d3fbbbb63f443686f9fec42c0bc2058d115806e)
Ronnie Sahlberg [Mon, 11 May 2009 04:44:59 +0000 (14:44 +1000)]
Add a -Y machinereadable flag to "lvsmaster"
(This used to be ctdb commit
bbae698656d5da9a4a5b0fbfc3003844f246d54b)
Ronnie Sahlberg [Mon, 11 May 2009 03:56:28 +0000 (13:56 +1000)]
in the "lvsmaster" command, return -1 if there is no lvsmaster
(This used to be ctdb commit
ce6afbdef36e3c386b75709f73ef55efe0bd1987)
Ronnie Sahlberg [Fri, 8 May 2009 07:29:57 +0000 (17:29 +1000)]
new version 1.0.81
(This used to be ctdb commit
a8019f20cd42a1965410fef5bac2c5b73657b38e)
Ronnie Sahlberg [Wed, 6 May 2009 10:32:39 +0000 (20:32 +1000)]
From: Sumit Bose <sbose@redhat.com>
fix handling of AC_INIT and read version from ctdb.spec
(This used to be ctdb commit
f7f64f92e26a0757af210d33288162eefcd07d79)
Michael Adam [Tue, 5 May 2009 11:16:38 +0000 (13:16 +0200)]
ping_pong: add GPL comment header with Tridge's copyright
Michael
(This used to be ctdb commit
a87ef6a9206820d5110a7117240f743af010ff19)
Michael Adam [Wed, 29 Apr 2009 22:35:55 +0000 (00:35 +0200)]
ping_pong: get pread/pwrite prototypes from unistd.h
by defining _XOPEN_SOURCE to be 500 before including headers
Michael
(This used to be ctdb commit
96c79bddf7895e57ccf90f0d250bd08b7c4daf40)
Michael Adam [Wed, 29 Apr 2009 16:03:03 +0000 (18:03 +0200)]
ping_pong: reduce a couple of prototype warnings
Michael
(This used to be ctdb commit
fce851621fe2099c9692acfbfaade24c3d69727a)
Michael Adam [Wed, 29 Apr 2009 15:58:17 +0000 (17:58 +0200)]
packaging: also package ping_pong
Michael
(This used to be ctdb commit
300e84f7023e9194b313e96db943e4050bd64e68)
Michael Adam [Wed, 29 Apr 2009 15:57:43 +0000 (17:57 +0200)]
build: also build and install ping_pong
Michael
(This used to be ctdb commit
200de8f299c8fa44d6dc696532f1a947132e7ec4)
Michael Adam [Wed, 29 Apr 2009 15:50:38 +0000 (17:50 +0200)]
add tridge's ping_pong.c to the utils folder
Michael
(This used to be ctdb commit
fe59ecb697fb4686ad8ea2fe4ec1cc7b4629e74f)
Ronnie Sahlberg [Wed, 6 May 2009 00:29:07 +0000 (10:29 +1000)]
From Sumit Bose <sbose@redhat.com>
add more 64bit platforms to configure.ac and preserve cli settings
(This used to be ctdb commit
8a86f65826b58c2ee3f07f221a4fc82193beec81)
Andrew Tridgell [Tue, 5 May 2009 06:06:58 +0000 (16:06 +1000)]
added link to Michael's SambaXP papers
(This used to be ctdb commit
48c011188c624f10c9a754d4ead27db558088fd4)
Andrew Tridgell [Tue, 5 May 2009 06:49:05 +0000 (16:49 +1000)]
allow pages in subdirs
(This used to be ctdb commit
68da42c4ee92fcdfe65baf04c1a2d6446583858b)
Andrew Tridgell [Tue, 5 May 2009 06:52:24 +0000 (16:52 +1000)]
more subdir html support
(This used to be ctdb commit
9ce9a500543de4f0aef5e8c28cda9bbc3c9d1b77)
Andrew Tridgell [Tue, 5 May 2009 22:18:21 +0000 (08:18 +1000)]
use less intrusive smbstatus call in periodic connections cleanup
(This used to be ctdb commit
a152fdc79e3360049aee66c3e628237a91df181f)
root [Tue, 5 May 2009 06:33:21 +0000 (16:33 +1000)]
change the talloc hierarchy for the main transaction_start context and the individual transaction_all handles
(This used to be ctdb commit
919b29850671b59bcf748aec25658ea09d8b4f1c)
root [Tue, 5 May 2009 21:32:25 +0000 (07:32 +1000)]
fixed a problem with clients disconnecting during a traverse
When a client (such as smbstatus) is killed, it may have outstanding
traverse children on remote nodes. We need to catch the client
disconnect in ctdbd and send a control to all nodes telling them to
kill those outstanding traverse children.
(This used to be ctdb commit
f2fb2df4619a14f7f6c11f9132ee7d793028042c)
root [Fri, 1 May 2009 02:37:52 +0000 (12:37 +1000)]
new version 1.0.80
(This used to be ctdb commit
bf1b76955db6ba00ec64686b53084268573ba6a0)
root [Fri, 1 May 2009 02:30:26 +0000 (12:30 +1000)]
when tracking the ctdb statistics, only decrement num_clients and pending_calls IFF the counter is >0
Otherwise there is a chance that the statistics are reset to zero after the counter has been incremented (a client connects), and when that client disconnects we decrement the counter to a negative number.
This is a purely cosmetic patch with no operational impact on ctdb.
(This used to be ctdb commit
72f1c696ee77899f7973878f2568a60d199d4fea)
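The sequence that produced negative counters, and the guard that prevents it, can be modelled with a small Python sketch. The class and method names are hypothetical; only the counter name num_clients comes from the commit message.

```python
class Statistics:
    """Toy model of a resettable counter that must never go negative."""

    def __init__(self) -> None:
        self.num_clients = 0

    def client_connected(self) -> None:
        self.num_clients += 1

    def client_disconnected(self) -> None:
        # Only decrement if the counter is > 0; a reset may have zeroed it
        # while this client was still connected.
        if self.num_clients > 0:
            self.num_clients -= 1

    def reset(self) -> None:
        self.num_clients = 0
```

Connecting, resetting, then disconnecting leaves the counter at 0 rather than -1.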
root [Thu, 30 Apr 2009 15:18:27 +0000 (01:18 +1000)]
Add a new variable VerifyRecoveryLock which can be used to disable the test that the recovery daemon holds the lock properly when performing a recovery
(This used to be ctdb commit
329df9e47e6ca8ab5143985a999e68f37c6d88a5)
Ronnie Sahlberg [Thu, 30 Apr 2009 07:38:30 +0000 (17:38 +1000)]
don't unconditionally kill/restart ctdb when given "service ctdb start". Only start ctdb if it is not already running, and print an error message otherwise
(This used to be ctdb commit
94343309992929a592348c936e09a7b4f8b512c1)
Ronnie Sahlberg [Sat, 25 Apr 2009 22:47:38 +0000 (08:47 +1000)]
we only need to have transaction nesting disabled when we start the new transaction for the recovery
(This used to be ctdb commit
bf8dae63d10498e6b6179bbacdd72f1ff0fc60be)
Ronnie Sahlberg [Sat, 25 Apr 2009 22:42:54 +0000 (08:42 +1000)]
set the TDB_NO_NESTING flag for the tdb before we start a transaction from within recovery
(This used to be ctdb commit
1b2029dbb055ff07367ebc1f307f5241320227b2)
Ronnie Sahlberg [Sat, 25 Apr 2009 22:38:37 +0000 (08:38 +1000)]
add TDB_NO_NESTING. When this flag is set tdb will not allow any nested transactions and tdb_transaction_start() will implicitly _cancel() any pending transactions before starting any new ones.
(This used to be ctdb commit
459e4ee135bd1cd24c15e5325906eb4ecfd550ec)
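The semantics described for TDB_NO_NESTING can be modelled with a toy in-memory store. The Python sketch below is only a model of the behaviour stated in the commit message; `ToyDB` and its methods are hypothetical, and real tdb transactions (including how a nested cancel affects the outer transaction) are more involved.

```python
class ToyDB:
    """Toy key/value store with optional 'no nesting' transactions."""

    def __init__(self, no_nesting: bool = False) -> None:
        self.no_nesting = no_nesting
        self.data = {}
        self._pending = None   # working copy while a transaction is open
        self._depth = 0        # nesting depth of open transactions

    def transaction_start(self) -> None:
        if self._depth > 0:
            if not self.no_nesting:
                self._depth += 1           # nested start: just go one deeper
                return
            self.transaction_cancel()      # no nesting: drop the pending one
        self._pending = dict(self.data)
        self._depth = 1

    def store(self, key, value) -> None:
        if self._depth == 0:
            raise RuntimeError("no transaction in progress")
        self._pending[key] = value

    def transaction_cancel(self) -> None:
        self._pending = None
        self._depth = 0

    def transaction_commit(self) -> None:
        if self._depth == 0:
            raise RuntimeError("no transaction in progress")
        self._depth -= 1
        if self._depth == 0:               # only the outermost commit writes
            self.data = self._pending
            self._pending = None
```

With `no_nesting=True`, a second `transaction_start()` discards whatever the first transaction had stored, so recovery starts from a clean state.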
Ronnie Sahlberg [Fri, 24 Apr 2009 08:23:48 +0000 (18:23 +1000)]
add a tuneable RecoveryDropAllIPs so it is possible to control how long a node that has been stuck in recovery will wait before it yields all public addresses.
This now defaults to 60 seconds.
This is useful if a split brain occurs due to network partitioning, since it makes sure that the "other half" of the cluster that does not contain the recovery master will eventually release all ips, thus avoiding a duplicate ip situation for the public addresses
(This used to be ctdb commit
70f21428c9eec96bcc787be191e7478ad68956dc)
Ronnie Sahlberg [Fri, 24 Apr 2009 08:09:51 +0000 (18:09 +1000)]
increase the loglevel for the message we print when we automatically release all ips when we have been in recovery for too long
(This used to be ctdb commit
7af060ded5113a49832f6a08a942523a202586b3)
Ronnie Sahlberg [Fri, 24 Apr 2009 04:41:21 +0000 (14:41 +1000)]
tweak some timeouts so that we do trigger a banning even if the control hangs/times out
(This used to be ctdb commit
1860a365e6ba8212e15c33016c80a2adcf8d10f4)
Ronnie Sahlberg [Fri, 24 Apr 2009 03:58:32 +0000 (13:58 +1000)]
If we can not pull a database from a node during recovery, mark this node as a "culprit" so that it will eventually become banned.
(This used to be ctdb commit
69dc3bf60b86d8df6dc5c7c6ebf303e847fb2ba9)
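The "culprit" mechanism can be pictured as a per-node failure counter that leads to a ban once it reaches some limit. The Python sketch below is a toy model under that assumption; the class name, the threshold parameter, and its value are hypothetical, since the commit message only says the node will "eventually" become banned.

```python
class CulpritTracker:
    """Toy model: count failures per node and ban at a threshold."""

    def __init__(self, ban_threshold: int) -> None:
        self.ban_threshold = ban_threshold   # hypothetical limit
        self.counts = {}
        self.banned = set()

    def mark_culprit(self, pnn: int) -> bool:
        """Record one failure for node `pnn`; return True if it is now banned."""
        self.counts[pnn] = self.counts.get(pnn, 0) + 1
        if self.counts[pnn] >= self.ban_threshold:
            self.banned.add(pnn)
        return pnn in self.banned
```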
Andrew Tridgell [Thu, 23 Apr 2009 01:35:42 +0000 (11:35 +1000)]
change shutdown level for ctdb to be 01
We want ctdb to shut down first, as it manages many other
services. With the old level of 32 the NFS service would shut down
first, and that would trigger ctdb to do a recovery. Then ctdb itself
would be shut down a few seconds later, which causes a lot of error
messages in the other nodes' logs
(This used to be ctdb commit
2f952af1a12e81a652ec9a4794db96f9593f2676)
Andrew Tridgell [Thu, 23 Apr 2009 01:00:16 +0000 (11:00 +1000)]
Merge commit 'ronnie/master'
(This used to be ctdb commit
373a04d6a47a5c7cf8d822d60be1d9b9ab4df784)
Ronnie Sahlberg [Wed, 8 Apr 2009 02:56:52 +0000 (12:56 +1000)]
new version 1.0.79
(This used to be ctdb commit
6c900aa343096c5e1e297e055c36832ffa5028dd)
Ronnie Sahlberg [Wed, 8 Apr 2009 02:49:28 +0000 (12:49 +1000)]
create a function "remote_ip" which can be used from scripts to remove a single ip from an interface.
use this function from the natgw eventscript
(This used to be ctdb commit
feab5f30b2d6cebf4dd28abc5a81f93424a4c852)
Ronnie Sahlberg [Wed, 8 Apr 2009 00:45:00 +0000 (10:45 +1000)]
set libdir to ../lib64 on x86-64 platforms
(This used to be ctdb commit
a9f851caec2525ccbb3a6d6283eaef52b89a4eb2)
Ronnie Sahlberg [Tue, 7 Apr 2009 23:34:20 +0000 (09:34 +1000)]
install ctdb.pc from the RPM
(This used to be ctdb commit
1b47ddc97373376b416a50939b74dc8c926fc917)
Ronnie Sahlberg [Tue, 7 Apr 2009 23:21:11 +0000 (09:21 +1000)]
From Mathieu Parent <math.parent@gmail.com>
Install the pkgconfig file
(This used to be ctdb commit
7c4389cc0baa43a0ffa9fb08944c253db7885807)
Mathieu Parent [Tue, 7 Apr 2009 23:14:20 +0000 (09:14 +1000)]
Ronnie Sahlberg [Tue, 7 Apr 2009 22:48:55 +0000 (08:48 +1000)]
install /etc/ctdb/notify.sh as executable.
this addresses bug 6250
(This used to be ctdb commit
b8be5b06c3359d037db336dc12d38e0018349951)
Andrew Tridgell [Tue, 7 Apr 2009 07:07:41 +0000 (17:07 +1000)]
Merge commit 'ronnie/master'
(This used to be ctdb commit
43c718ebd402abe2e1e6360ce21b08cc13030c88)
Ronnie Sahlberg [Mon, 6 Apr 2009 04:03:09 +0000 (14:03 +1000)]
we only need to switch into client mode from the eventscript child if we are running the monitor event
(This used to be ctdb commit
13e2c9044950f21918e4610726e73ed3d8f76920)
Ronnie Sahlberg [Mon, 6 Apr 2009 04:00:41 +0000 (14:00 +1000)]
increase the listen queue. Now that the eventscripts may become clients and connect back to the server we do get a lot more concurrent connection attempts (takeip/releaseip are performed in parallel)
(This used to be ctdb commit
018f8b0b1823ef59b46f1a671aec5309d10628f4)
Ronnie Sahlberg [Mon, 6 Apr 2009 03:16:36 +0000 (13:16 +1000)]
use _exit() and not exit() when we terminate a failed eventscript child process
(This used to be ctdb commit
33b296cee177adc61edc911caec8c24b3efa8441)
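The reason for preferring _exit() in a forked child is that exit() runs the exit handlers and flushes the buffers the child inherited from its parent. The same distinction exists in Python between sys.exit() and os._exit(). The sketch below is a Unix-only illustration; `run_child` is a hypothetical helper, not ctdb code.

```python
import os


def run_child(task) -> int:
    """Fork, run task() in the child, and return the child's exit status."""
    pid = os.fork()
    if pid == 0:
        # Child process: leave with os._exit() so that atexit handlers and
        # cleanup code inherited from the parent do not run a second time.
        status = 1
        try:
            task()
            status = 0
        except Exception:
            status = 1
        finally:
            os._exit(status)
    # Parent process: wait for the child and decode its exit status.
    _, wait_status = os.waitpid(pid, 0)
    return os.WEXITSTATUS(wait_status)
```

A failing task yields a nonzero status in the parent without the child ever returning into the parent's code path.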
Ronnie Sahlberg [Mon, 6 Apr 2009 02:00:22 +0000 (12:00 +1000)]
We don't need to verify the nodemap on remote nodes that are banned
(This used to be ctdb commit
7f8f9385deee6eff2b7303147bc6412bbdc122df)
Ronnie Sahlberg [Thu, 2 Apr 2009 03:50:43 +0000 (14:50 +1100)]
if we can't pull the remote nodemap off a node we should mark it as a culprit so it eventually becomes banned.
(This used to be ctdb commit
0889ae3c237bdb3bd72d45f2f64f5e5d8420870c)
Ronnie Sahlberg [Wed, 1 Apr 2009 06:21:38 +0000 (17:21 +1100)]
Change the (dodgy) seqnumfrequency variable to have ms resolution instead of second resolution.
Rename the variable to SeqnumInterval for two reasons:
1, it is an interval and not a 1/interval unit
2, so that we catch when people use the old variable and can update the sysconfig file, instead of silently changing the semantics of this variable
this is a real dodgy variable
(This used to be ctdb commit
68eac459e5d2b6b534f72821036675ffe5d7a350)
Ronnie Sahlberg [Wed, 1 Apr 2009 06:13:48 +0000 (17:13 +1100)]
remove a prototype for a function no longer used
(This used to be ctdb commit
9ac9745ba9296d01e3b18148ae8c3240e51cf090)
Ronnie Sahlberg [Tue, 31 Mar 2009 09:04:45 +0000 (20:04 +1100)]
new release 1.0.78
(This used to be ctdb commit
00d2213613822b758939019361a619bd7d7f4984)
Ronnie Sahlberg [Tue, 31 Mar 2009 09:00:00 +0000 (20:00 +1100)]
we should also install the 11.natgw eventscript if we want to be able to use it
(This used to be ctdb commit
42e2797271bc1cdb4eecf1227d4c2db668587193)
Ronnie Sahlberg [Tue, 31 Mar 2009 03:38:52 +0000 (14:38 +1100)]
install a default /etc/ctdb/notify.sh script as example on how to use
snmptrap/email to notify that a node has changed health status
(This used to be ctdb commit
ee52c0866e2b26c396fe60946159c559d47199eb)
Ronnie Sahlberg [Tue, 31 Mar 2009 03:23:31 +0000 (14:23 +1100)]
add a mechanism where the ctdb daemon will run a user-controlled script when the node status changes to/from UNHEALTHY state.
This would allow a sysadmin to set up ctdb to send an email/snmptrap/... when the status of the node changes.
(This used to be ctdb commit
ce534a83a05dbd40238e4eee0669d60ff396f935)
Ronnie Sahlberg [Tue, 31 Mar 2009 00:42:10 +0000 (11:42 +1100)]
new version 1.0.77
(This used to be ctdb commit
274a4a1fe2e016f33296ebfc5ed6337ce3141d06)
Ronnie Sahlberg [Tue, 31 Mar 2009 00:33:28 +0000 (11:33 +1100)]
we must also try to set the routes when we release an ip. During the release/10.interfaces there can actually be a window where the kernel decides to remove all addresses (before we manually add them back in 10.interfaces), and during this window the kernel may also decide to delete all routes, since there are no gateways reachable through this interface anymore.
(This used to be ctdb commit
34633223a46caaa079da233663f9c6dcc1803f87)
Ronnie Sahlberg [Wed, 25 Mar 2009 03:52:08 +0000 (14:52 +1100)]
new version 1.0.76
(This used to be ctdb commit
56b7095994d1de95e40a223ed503b5572ea9d1b9)
Ronnie Sahlberg [Wed, 25 Mar 2009 03:46:05 +0000 (14:46 +1100)]
change the ctdb command table to allow us to describe commands which can be run independently of the ctdb daemon.
create a new debugging command xpnn which discovers the pnn of the local node and which works even if the local daemon is not running
(This used to be ctdb commit
cd78765f9400d7abce7929a2dd199f65226e7664)
Ronnie Sahlberg [Wed, 25 Mar 2009 02:46:41 +0000 (13:46 +1100)]
update the documentation for NATGW to reflect that you can now use
multiple natgw groups in one cluster
(This used to be ctdb commit
e059df6d3cd81c67e5505e8ef2d6d0ef9a287b31)
Ronnie Sahlberg [Wed, 25 Mar 2009 02:37:57 +0000 (13:37 +1100)]
update how the NATGW configuration works.
allow the cluster to be partitioned into multiple disjoint natgw subsets
(This used to be ctdb commit
1046885cd22b5001e0251de2e536b5f6793459be)
Ronnie Sahlberg [Tue, 24 Mar 2009 08:02:00 +0000 (19:02 +1100)]
web: fix typo
Conflicts:
web/index.html
(This used to be ctdb commit
95d22e4cf265d2119f72200ab0ec708f095853df)
Ronnie Sahlberg [Tue, 24 Mar 2009 07:59:27 +0000 (18:59 +1100)]
update the documentation with all the new commands we support in the
ctdb tool
(This used to be ctdb commit
ae317b2013eee01c4c0a5108c03f4024bea9e313)
Ronnie Sahlberg [Tue, 24 Mar 2009 07:23:56 +0000 (18:23 +1100)]
fix the html so that my name and obnox's name are shown
(This used to be ctdb commit
0840aa2bd31b2da95342dca8ff35786a3d998688)
Ronnie Sahlberg [Tue, 24 Mar 2009 06:49:55 +0000 (17:49 +1100)]
Merge branch 'obnox'
(This used to be ctdb commit
972036a5d510fb9b399f1ee34a8861dee4221267)
Ronnie Sahlberg [Tue, 24 Mar 2009 03:08:57 +0000 (14:08 +1100)]
new version 1.0.75
(This used to be ctdb commit
857733ae2bdfa0037af224abfabc020e2ac384c7)
Ronnie Sahlberg [Tue, 24 Mar 2009 03:05:31 +0000 (14:05 +1100)]
create a variant of kill_tcp_connections that only kills off the local side of a connection
(This used to be ctdb commit
dc2f28f7c988364b5d45f3048be4db3e5ff113b3)
Ronnie Sahlberg [Tue, 24 Mar 2009 02:51:32 +0000 (13:51 +1100)]
set --single-public-ip when lvs is used
(This used to be ctdb commit
292fff6eace39141591871e12f9a64e3441237be)
Ronnie Sahlberg [Tue, 24 Mar 2009 02:45:11 +0000 (13:45 +1100)]
we need to set the port properly in the parse_ip helper
(This used to be ctdb commit
43fe18d86995744ba61c7a6405b70edcb265930a)
Ronnie Sahlberg [Mon, 23 Mar 2009 10:44:35 +0000 (21:44 +1100)]
add Michael Adam as one of the ctdb developers on the main ctdb webpage
(This used to be ctdb commit
be50059c33845fec260ca53975d421a890303880)
Michael Adam [Mon, 23 Mar 2009 09:07:44 +0000 (10:07 +0100)]
Merge commit 'ctdb-ronnie/master'
(This used to be ctdb commit
39a972b0d6d0d70282c25c54a124b67431467e77)
root [Mon, 23 Mar 2009 08:07:45 +0000 (19:07 +1100)]
add a new command "ctdb scriptstatus"
this command shows which eventscripts were executed during the last monitoring cycle and the status from each eventscript.
If an eventscript timed out or returned an error we also
show the output from the eventscript.
Example :
[root@rcn1 ctdb-git]# ./bin/ctdb scriptstatus
6 scripts were executed last monitoring cycle
00.ctdb Status:OK Duration:0.021 Mon Mar 23 19:04:32 2009
10.interface Status:OK Duration:0.048 Mon Mar 23 19:04:32 2009
20.multipathd Status:OK Duration:0.011 Mon Mar 23 19:04:33 2009
40.vsftpd Status:OK Duration:0.011 Mon Mar 23 19:04:33 2009
41.httpd Status:OK Duration:0.011 Mon Mar 23 19:04:33 2009
50.samba Status:ERROR Duration:0.057 Mon Mar 23 19:04:33 2009
OUTPUT:ERROR: Samba tcp port 445 is not responding
Add a new helper function "switch_from_server_to_client()" which can be used both by
the recovery daemon and by the child process we start for running the actual eventscripts.
Create several new controls, both for the eventscript child process to inform the master daemon of the current status of the scripts as well as for the ctdb tool to extract this information from the running daemon.
(This used to be ctdb commit
c98f90ad61c9b1e679116fbed948ddca4111968d)
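Output in the shape of the "ctdb scriptstatus" example above can be picked apart with a small parser, for instance to list failing scripts. The Python sketch below assumes the line layout is exactly as in that example (script name, Status:, Duration:, timestamp, optional OUTPUT: line); the real tool's output may differ, and `parse_scriptstatus` is a hypothetical helper.

```python
import re

# One result line, e.g. "50.samba Status:ERROR Duration:0.057 Mon Mar 23 ..."
LINE_RE = re.compile(
    r"^(?P<script>\S+)\s+Status:(?P<status>\S+)"
    r"\s+Duration:(?P<duration>[0-9.]+)\s+(?P<when>.+)$"
)


def parse_scriptstatus(text: str) -> list:
    """Return one dict per eventscript line found in `text`."""
    results = []
    for raw in text.splitlines():
        line = raw.strip()
        match = LINE_RE.match(line)
        if match:
            results.append({
                "script": match.group("script"),
                "status": match.group("status"),
                "duration": float(match.group("duration")),
                "when": match.group("when"),
                "output": None,
            })
        elif line.startswith("OUTPUT:") and results:
            # An OUTPUT line belongs to the script reported just before it.
            results[-1]["output"] = line[len("OUTPUT:"):]
    return results
```

Filtering the result for entries whose status is not "OK" gives the scripts that failed in the last monitoring cycle.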