Hi Atul,
Need some more info,
1- What were the size of files which are missing ? What was the stripe
count(I assume -1) ?
2- Does 'lfs df' show any size difference on OSTs and MDT after and before
size missing ?
3- What operations did you perform or were going on(i.e- recovery, fail
over, node shut down, power off, network down, etc..) after which you
started seeing the issue ? Please give some detailed info or logs, if you
have any, about the occurrence of the problem.
Also please search on
https://jira.hpdd.intel.com/secure/Dashboard.jspa for
the similar type of bug if it may help, like LU-6945 etc.
Thanks.
On Tue, Dec 20, 2016 at 7:35 PM, Atul Yadav <atulyadavtech(a)gmail.com> wrote:
Hi Team,
Need your help in diagnosing and resolving the issue.
Issue: Data files are missing.
Environment details
Server
kernel-devel-2.6.32-431.5.1.el6_lustre.x86_64
lustre-osd-ldiskfs-2.5.1-2.6.32_431.5.1.el6_lustre.x86_64.x86_64
kernel-headers-2.6.32-431.5.1.el6_lustre.x86_64
lustre-modules-2.5.1-2.6.32_431.5.1.el6_lustre.x86_64.x86_64
perf-2.6.32-431.5.1.el6_lustre.x86_64
kernel-2.6.32-431.5.1.el6_lustre.x86_64
lustre-2.5.1-2.6.32_431.5.1.el6_lustre.x86_64.x86_64
python-perf-2.6.32-431.5.1.el6_lustre.x86_64
kernel-firmware-2.6.32-431.5.1.el6_lustre.x86_64
Client
lustre-client-2.5.1-2.6.32_431.5.1.el6.x86_64.x86_64
lustre-client-modules-2.5.1-2.6.32_431.5.1.el6.x86_64.x86_64
0 UP osd-ldiskfs lustre-OST0000-osd lustre-OST0000-osd_UUID 5
1 UP mgc MGC172.16.0.51@o2ib 75c7aa53-15bd-25f8-ec4d-5fb638705823 5
2 UP ost OSS OSS_uuid 3
3 UP obdfilter lustre-OST0000 lustre-OST0000_UUID 47
4 UP lwp lustre-MDT0000-lwp-OST0000 lustre-MDT0000-lwp-OST0000_UUID 5
5 UP osd-ldiskfs lustre-OST0003-osd lustre-OST0003-osd_UUID 5
6 UP osd-ldiskfs lustre-MDT0000-osd lustre-MDT0000-osd_UUID 14
7 UP mgs MGS MGS 51
8 UP obdfilter lustre-OST0003 lustre-OST0003_UUID 47
9 UP lwp lustre-MDT0000-lwp-OST0003 lustre-MDT0000-lwp-OST0003_UUID 5
10 UP mds MDS MDS_uuid 3
11 UP lod lustre-MDT0000-mdtlov lustre-MDT0000-mdtlov_UUID 4
12 UP mdt lustre-MDT0000 lustre-MDT0000_UUID 59
13 UP mdd lustre-MDD0000 lustre-MDD0000_UUID 4
14 UP qmt lustre-QMT0000 lustre-QMT0000_UUID 4
15 UP osp lustre-OST0000-osc-MDT0000 lustre-MDT0000-mdtlov_UUID 5
16 UP osp lustre-OST0001-osc-MDT0000 lustre-MDT0000-mdtlov_UUID 5
17 UP osp lustre-OST0002-osc-MDT0000 lustre-MDT0000-mdtlov_UUID 5
18 UP osp lustre-OST0003-osc-MDT0000 lustre-MDT0000-mdtlov_UUID 5
19 UP osp lustre-OST0004-osc-MDT0000 lustre-MDT0000-mdtlov_UUID 5
20 UP osp lustre-OST0005-osc-MDT0000 lustre-MDT0000-mdtlov_UUID 5
21 UP lwp lustre-MDT0000-lwp-MDT0000 lustre-MDT0000-lwp-MDT0000_UUID 5
lustre-MDT0000-mdc-ffff880fe0916800: active
lustre-OST0000-osc-ffff880fe0916800: active
lustre-OST0001-osc-ffff880fe0916800: active
lustre-OST0002-osc-ffff880fe0916800: active
lustre-OST0003-osc-ffff880fe0916800: active
lustre-OST0004-osc-ffff880fe0916800: active
lustre-OST0005-osc-ffff880fe0916800: active
UUID bytes Used Available Use% Mounted on
lustre-MDT0000_UUID 1.2T 5.3G 1.1T 0% /home[MDT:0]
lustre-OST0000_UUID 9.3T 8.1T 743.3G 92% /home[OST:0]
lustre-OST0001_UUID 9.3T 8.1T 742.4G 92% /home[OST:1]
lustre-OST0002_UUID 9.3T 8.1T 743.3G 92% /home[OST:2]
lustre-OST0003_UUID 9.3T 8.1T 742.8G 92% /home[OST:3]
lustre-OST0004_UUID 9.3T 8.1T 742.7G 92% /home[OST:4]
lustre-OST0005_UUID 9.3T 8.2T 726.4G 92% /home[OST:5]
filesystem summary: 56.0T 48.9T 4.3T 92% /home
cat /proc/fs/lustre/health_check
healthy
Thank You
Atul Yadav
_______________________________________________
HPDD-discuss mailing list
HPDD-discuss(a)lists.01.org
https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.0
1.org_mailman_listinfo_hpdd-2Ddiscuss&d=DgICAg&c=IGDlg0lD0b-
nebmJJ0Kp8A&r=gj56200g3Czb2-u9fAr58lagcEHgzKLnvXnmaB36Mt4&m=
ih8qdca13LHh5nOVvwjBylEF2znZC1arXjRXuX5DLgo&s=d9_SnfLM1C-TWb
MECsE1km3SqcuWSdi6ffsEmD0xk3M&e=