Hi all,
We recently reactivated an OST, and now the 450 nodes send this OST many
more I/O requests than they send to the other OSTs. We have 4 servers
and 48 OSTs. The other OSSs have a load of about 100. The server with
this OST has a load of 1500 and logs many "filter_commitrw_write())
scratch-OST002f: slow i_mutex 30s" messages. Thus, I set qos_prio_free
to 0 in /proc/fs/lustre/lov/scratch-MDT0000-mdtlov.
Does anyone have an estimate of how long it will take for the load on
this server to go down? Hours? Days?
Regards, Michael
--
Dr.-Ing. Michael Kluge
Technische Universität Dresden
Center for Information Services and
High Performance Computing (ZIH)
D-01062 Dresden
Germany
Contact:
Willersbau, Room A 208
Phone: (+49) 351 463-34217
Fax: (+49) 351 463-37773
e-mail: michael.kluge(a)tu-dresden.de
WWW:
http://www.tu-dresden.de/zih