[Pkg-ganeti-devel] Bug#510965: Acknowledgement (ganeti: Ganeti does not work with python-twisted 8.1)
Iustin Pop
iustin at google.com
Tue Jan 27 09:44:48 UTC 2009
On Tue, Jan 27, 2009 at 10:29:19AM +0100, Daniel Schreiber wrote:
> Iustin Pop schrieb:
>> On Mon, Jan 26, 2009 at 02:28:51PM +0100, Daniel Schreiber wrote:
>>>
>>> It blocks forever after printing the twisted version.
>>
>> This is strange then. The breakage occurs even before printin the first result,
>> which means it does not work at all (so it's different from the bug that we
>> fixed in 1.2.5).
>>
>> Just to be sure, can you confirm this also happens when trying against
>> localhost? And do you get anything in /var/log/ganeti/node-daemon.log?
>
> Traceback (most recent call last):
> File "/usr/sbin/ganeti-noded", line 635, in <module>
> main()
> File "/usr/sbin/ganeti-noded", line 631, in main
> reactor.run()
> File "/usr/lib/python2.5/site-packages/twisted/internet/base.py", line
> 1048, in run
> self.mainLoop()
> --- <exception caught here> ---
> File "/usr/lib/python2.5/site-packages/twisted/internet/base.py", line
> 1060, in mainLoop
> self.doIteration(t)
> File
> "/usr/lib/python2.5/site-packages/twisted/internet/selectreactor.py",
> line 126, in doSelect
> self._preenDescriptors()
> File
> "/usr/lib/python2.5/site-packages/twisted/internet/selectreactor.py",
> line 88, in _preenDescriptors
> self._disconnectSelectable(selectable, e, False)
> File "/usr/lib/python2.5/site-packages/twisted/internet/posixbase.py",
> line 196, in _disconnectSelectable
> selectable.connectionLost(failure.Failure(why))
> File "/usr/lib/python2.5/site-packages/twisted/internet/posixbase.py",
> line 150, in connectionLost
> os.close(fd)
> exceptions.OSError: [Errno 9] Bad file descriptor
>
> This happens right after startup of the node-daemon. Same trace on all
> nodes. Nothing else is logged after that.
>
>> I would be interested at this point in:
>> - the output of "python -v call_version.py ...."
>> - an strace of "python call_version.py ..."
>> - an strace of ganeti-noded while doing the above
>
> Attached.
Thanks. This is starting to look like a different problem.
The errno 9 we had before, but it should be solved. The strace shows
that the ganeti-noded is not actually listening, and the call_version
talks to a different host - it completes a connect(), while the node
daemon doesn't get any traffic.
I'll try to understand why you get the errno 9 error, but in the
meantime also a "lsof -p $pid_of_node_daemon" and an strace of the node
daemon startup would be helpful to understand in what state a node
daemon is.
iustin
More information about the Pkg-ganeti-devel
mailing list