Commit Graph

2305 Commits

Author SHA1 Message Date
Bryan Boreham
ec6da3acae Prometheus metrics: optionally export total CPU instead of per-CPU
Per-CPU stats are more expensive to transport and store, and that
level of detail is not required in many cases.

We export overall total cpu in the same metric as per-cpu, so that
dashboards which previously summed over cpu will work identically.
2018-02-20 13:58:44 +00:00
David Ashpole
6116f26530
Merge pull request #1882 from dashpole/overlay_upper
monitor only the upper directory for overlay
2018-02-08 11:18:43 -08:00
David Ashpole
eb15723cb0
Merge pull request #1886 from miaoyq/singleton-client
All handlers of containerd containers share a single client instance
2018-02-08 10:46:41 -08:00
Yanqiang Miao
1dd9469469 All handlers of containerd containers share a single client instance
Signed-off-by: Yanqiang Miao <miao.yanqiang@zte.com.cn>
2018-02-08 12:21:51 +08:00
David Ashpole
a31881621b
Merge pull request #1885 from Random-Liu/no-uuid-error-out
Do not error out when /dev/disk/by-uuid is not found.
2018-02-07 11:33:28 -08:00
Lantao Liu
ef7f0696fe Do not error out when /dev/disk/by-uuid is not found.
Signed-off-by: Lantao Liu <lantaol@google.com>
2018-02-07 08:34:19 +00:00
David Ashpole
aa6d6f1195
Merge pull request #1871 from jsravn/connect-to-docker-forever-on-startup
Connect to docker with no timeout on startup
2018-02-06 15:29:26 -08:00
James Ravn
e660d8b8b7 Retry docker status on startup
For https://github.com/google/cadvisor/issues/1866.
2018-02-06 10:31:11 +00:00
David Ashpole
812cc819ac monitor only the upper directory for overlay 2018-02-05 15:49:27 -08:00
David Ashpole
828ac5eb11
Merge pull request #1878 from dims/fix-warning-to-warningf
Fix Warning->Warningf for better logging
2018-02-02 16:42:02 -08:00
Davanum Srinivas
b1656b253f Fix Warning->Warningf for better logging 2018-02-02 19:19:07 -05:00
David Ashpole
05f9f74aa4
Merge pull request #1868 from dashpole/migrate_prow
Migrate to prow
2018-02-02 14:47:00 -08:00
David Ashpole
66e12ca652 migrate to prow, which uses node-e2e to run tests 2018-02-01 15:20:53 -08:00
Derek Carr
b111f65653
Merge pull request #1861 from vikaschoudhary16/bump_runc
Bump runc to d5b4a3eddbe4c890843da971b64f45a0f023f4db
2018-01-12 15:18:01 -05:00
vikaschoudhary16
13d955d6a9 Bump runc to d5b4a3eddbe4c890843da971b64f45a0f023f4db
Signed-off-by: vikaschoudhary16 <vichoudh@redhat.com>
2018-01-12 13:56:11 -05:00
David Ashpole
5a2e3d06d6
Merge pull request #1845 from dashpole/fix_ui
Fix small UI issues
2018-01-02 15:20:15 -08:00
David Ashpole
aacb48c7d0
Merge pull request #1847 from dims/skip-get-vfs-stats-when-mountpoint-does-not-exist
Skip getVfsStats when file does not exist
2018-01-02 15:08:53 -08:00
Davanum Srinivas
5f8eea99dd Skip getVfsStats when file does not exist
There are a lot of spurious exceptions in the kubernetes kubelet logs
like:
E1018 21:03:09.616581   22780 fs.go:332] Stat fs failed. Error: no such
file or directory

Since we know that calling syscall.Statfs will just fail when the path
does not exist, we should just skip making the call.

NOTE: fixing 2017->2018 problems in build by running `./build/jenkins_e2e.sh`
2018-01-02 17:52:38 -05:00
David Ashpole
9ffa37396f
Merge pull request #1806 from sentinelt/master
fix #1607; use container creation time provided by Docker handler
2017-12-20 11:25:35 -08:00
David Ashpole
e917087ad5 fix small logo and slash in container 2017-12-19 13:28:26 -08:00
David Ashpole
1e567c2ac3
Merge pull request #1835 from dashpole/changelog_0.28.3
changelog for v0.28.3
2017-12-07 09:52:08 -08:00
David Ashpole
e6b4e4c38a changelog for v0.28.3 2017-12-07 09:46:35 -08:00
David Ashpole
fc6d4b920c
Merge pull request #1830 from jsravn/add-docker-timeouts
Add timeouts for docker queries
2017-12-07 09:37:53 -08:00
James Ravn
57e17d8be2 Add timeouts for docker queries
As these can otherwise block indefinitely due to docker issues.

This is to fix https://github.com/kubernetes/kubernetes/issues/53207,
where kubelet relies on cadvisor for gathering docker information as
part of its periodic node status update.
2017-12-05 13:50:48 +00:00
David Ashpole
0bde1c615c
Merge pull request #1831 from brian-brazil/prometheus-labels
Ensure all Prometheus metrics have the same labelnames.
2017-11-30 09:54:44 -08:00
Brian Brazil
27f103b266 Ensure all Prometheus metrics have the same labelnames.
Fixes #1704
2017-11-30 16:33:37 +00:00
David Ashpole
7d11f4243f
Merge pull request #1827 from tallclair/logging
Clean up cAdvisor logging
2017-11-29 10:16:58 -08:00
David Ashpole
b26bf6ebb2
Merge pull request #1826 from mindprince/gpu-docs
Add docs for using nvidia gpu monitoring.
2017-11-28 17:49:29 -08:00
Tim Allclair
1eb1355ae6
Default logging to V(2) 2017-11-27 19:49:49 -08:00
Tim Allclair
5b435b4b70
Clean up cAdvisor logging 2017-11-27 19:48:05 -08:00
Tim Allclair
3a40bbfc5c
Raise verbosity on runtime registration failure 2017-11-27 19:48:04 -08:00
Rohit Agarwal
6ba3fa4e8c Add docs for using nvidia gpu monitoring. 2017-11-27 17:43:14 -08:00
David Ashpole
49440c7e0a
Merge pull request #1818 from dashpole/changelog
changelog for v0.28.2
2017-11-21 16:32:31 -08:00
David Ashpole
9689d84e7f changelog for v0.28.2 2017-11-21 16:27:22 -08:00
David Ashpole
e420065e7d
Merge pull request #1817 from dashpole/util_clock
Switch from apimachinery clock to k8s.io/utils/clock
2017-11-21 16:24:51 -08:00
David Ashpole
3166cdae87 add utils/clock dependency 2017-11-21 16:19:57 -08:00
David Ashpole
3a347ec3fe Revert "add apimachinery clock dependency"
This reverts commit fd43dc16ba.
2017-11-21 14:21:47 -08:00
David Ashpole
5831d72df8
Merge pull request #1814 from mindprince/accelerator-data-race
Avoid race in accessing nvidiaDevices between Setup() and GetCollector()
2017-11-21 14:03:18 -08:00
Rohit Agarwal
3c3845e92f Avoid race in accessing nvidiaDevices between Setup() and GetCollector() 2017-11-21 13:53:47 -08:00
David Ashpole
7cb3faad02
Merge pull request #1811 from dashpole/changelog_0_28_1
changelog for v0.28.1
2017-11-20 15:13:49 -08:00
David Ashpole
1cd2620be6 changelog for v0.28.1 2017-11-20 15:08:11 -08:00
David Ashpole
17dcf1ca98
Merge pull request #1779 from dashpole/on_demand_metrics
On-Demand container metrics
2017-11-20 15:06:26 -08:00
David Ashpole
3d6ad6dd86 on demand metrics 2017-11-20 14:51:04 -08:00
David Ashpole
fd43dc16ba add apimachinery clock dependency 2017-11-20 13:15:15 -08:00
David Ashpole
ece1334172 update testify dependency 2017-11-17 16:15:28 -08:00
David Ashpole
a27bed7b9d
Merge pull request #1807 from dashpole/revert_1760
Revert "fix #1708; move from inotify to fsnotify"
2017-11-17 15:03:42 -08:00
David Ashpole
577f63f3da
Merge pull request #1808 from dashpole/update_ui
Update jquery and bootstrap dependencies
2017-11-17 14:42:58 -08:00
David Ashpole
ee8cbf1054 update jquery and bootstrap 2017-11-17 13:17:51 -08:00
David Ashpole
6988e70a3d Revert "fix #1708; move from inotify to fsnotify"
This reverts commit e6b6a1ac57.
2017-11-17 10:28:28 -08:00
David Ashpole
5231853e71
Merge pull request #1805 from andyxning/marshal_device_name_to_json_output
marshal device name to json output
2017-11-15 16:36:04 -08:00