Skip to content

Commit 3a1c8fc

Browse files
authored
Merge pull request #1121 from run-ai/log-collection-218
Merge pull request #1120 from run-ai/log-collection
2 parents e3e73c4 + 0ff3e7b commit 3a1c8fc

File tree

3 files changed

+46
-64
lines changed

3 files changed

+46
-64
lines changed
Lines changed: 44 additions & 57 deletions
Original file line numberDiff line numberDiff line change
@@ -1,35 +1,35 @@
11

2+
# Logs Collection
23

34
This article provides instructions for IT administrators on collecting Run:ai logs for support, including prerequisites, CLI commands, and log file retrieval. It also covers enabling verbose logging for Prometheus and the Run:ai Scheduler.
45

56
## Collect logs to send to support
67

7-
To collect Run:ai logs, follow these steps precisely:
8+
To collect Run:ai logs, follow these steps:
89

910
### Prerequisites
1011

1112
* Ensure that you have administrator-level access to the Kubernetes cluster where Run:ai is installed.
12-
* The Run:ai Administrator Command-Line Interface (CLI) must be [installed](..//config/cli-admin-install.md).
13-
* You must be logged into the Run:ai CLI with the correct permissions.
13+
* The Run:ai [Administrator Command-Line Interface](../config/cli-admin-install.md) (CLI) must be installed.
1414

15-
### Step-by-Step Instructions
15+
#### Step-by-Step Instructions
1616

17-
1. Open a terminal on your local machine (or any machine that has network access to the Kubernetes cluster) where the Run:ai Administrator CLI is installed.
18-
2. Log in to the Run:ai CLI (if required)
19-
3. Collect the Logs:
20-
Execute the command to collect the logs:
17+
1. Run the Command from your local machine or a Bastion Host (secure server)
18+
Open a terminal on your local machine (or any machine that has network access to the Kubernetes cluster) where the Run:ai Administrator CLI is installed.
19+
2. Collect the Logs
20+
Execute the following command to collect the logs:
2121

2222
``` bash
2323
runai-adm collect-logs
2424
```
2525

2626
This command gathers all relevant Run:ai logs from the system and generate a compressed file.
2727

28-
5. Locate the Generated File
28+
3. Locate the Generated File
2929
After running the command, note the location of the generated compressed log file. You can retrieve and send this file to Run:ai Support for further troubleshooting.
3030

3131
!!! Note
32-
The tar file packages the logs of Run:ai components only. It does __not__ include logs of researcher containers that may contain private information.
32+
The tar file packages the logs of Run:ai components only. It does not include logs of researcher containers that may contain private information
3333

3434
## Logs verbosity
3535

@@ -44,70 +44,57 @@ Before you begin, ensure you have the following:
4444
* kubectl installed and configured:
4545
* The Kubernetes command-line tool, `kubectl`, must be installed and configured to interact with the cluster.
4646
* Sufficient privileges to edit configurations and view logs.
47-
* Administrative access to Run:ai’s installation settings.
4847
* Monitoring Disk Space
4948
* When enabling verbose logging, ensure adequate disk space to handle the increased log output, especially when enabling debug or high verbosity levels.
5049

5150
### Adding verbosity
5251

53-
#### Adding verbosity to Prometheus
52+
??? "Adding verbosity to Prometheus"
53+
To increase the logging verbosity for Prometheus, follow these steps:
5454

55-
To increase the logging verbosity for Prometheus, follow these steps:
55+
1. Edit the `RunaiConfig` to adjust Prometheus log levels. Copy the following command to your terminal:
5656

57-
1. Edit the `RunaiConfig` to adjust Prometheus log levels. Copy the following command to your terminal:
58-
2. Bash
59-
60-
```
61-
kubectl edit runaiconfig runai -n runai
62-
```
57+
``` bash
58+
kubectl edit runaiconfig runai -n runai
59+
```
6360

64-
4.
65-
In the configuration file that opens, add or modify the following section to set the log level to `debug`:
66-
5. Bash
61+
2. In the configuration file that opens, add or modify the following section to set the log level to `debug`:
6762

68-
```
69-
spec:
70-
prometheus:
63+
``` yaml
7164
spec:
72-
logLevel: debug
73-
```
74-
75-
7.
76-
Save the changes. To view the Prometheus logs with the new verbosity level, run:
77-
8. Bash
78-
79-
```
80-
kubectl logs -n runai prometheus-runai-0
81-
```
65+
prometheus:
66+
spec:
67+
logLevel: debug
68+
```
69+
70+
3. Save the changes. To view the Prometheus logs with the new verbosity level, run:
8271

83-
10.
72+
``` bash
73+
kubectl logs -n runai prometheus-runai-0
74+
```
75+
8476
This command streams the last 100 lines of logs from Prometheus, providing detailed information useful for debugging.
8577

86-
#### Adding verbosity to the scheduler
87-
88-
To enable extended logging for the Run:ai scheduler:
89-
90-
1. Edit the `RunaiConfig` to adjust scheduler verbosity:
91-
2. Bash
78+
??? "Adding verbosity to the scheduler"
9279

93-
```
94-
kubectl edit runaiconfig runai -n runai
95-
```
80+
To enable extended logging for the Run:ai scheduler:
9681

97-
4.
98-
Add or modify the following section under the scheduler settings:
99-
5. Bash
82+
1. Edit the `RunaiConfig` to adjust scheduler verbosity:
10083

101-
```
102-
runai-scheduler:
103-
args:
104-
verbosity: 6
105-
```
84+
``` bash
85+
kubectl edit runaiconfig runai -n runai
86+
```
87+
88+
2 Add or modify the following section under the scheduler settings:
10689

107-
7.
108-
This increases the verbosity level of the scheduler logs to provide more detailed output.
90+
``` yaml
91+
runai-scheduler:
92+
args:
93+
verbosity: 6
94+
```
10995

110-
Warning
96+
This increases the verbosity level of the scheduler logs to provide more detailed output.
11197

112-
Enabling verbose logging can significantly increase disk space usage. Monitor your storage capacity and adjust the verbosity level as necessary.
98+
!!! Warning
99+
Enabling verbose logging can significantly increase disk space usage. Monitor your storage capacity and adjust the verbosity level as necessary.
113100

docs/home/overview.md

Lines changed: 1 addition & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -41,13 +41,7 @@ Run:ai cloud availability is monitored at [status.run.ai](https://status.run.ai)
4141

4242
## Collect Logs to Send to Support
4343

44-
As an IT Administrator, you can collect Run:ai logs to send to support:
45-
46-
* Install the [Run:ai Administrator command-line interface](../admin//config/cli-admin-install.md).
47-
* Run `runai-adm collect-logs`. The command will generate a compressed file containing all of the existing Run:ai log files.
48-
49-
!!! Note
50-
The tar file packages the logs of Run:ai components only. It does __not__ include logs of researcher containers that may contain private information.
44+
As an IT Administrator, you can collect Run:ai logs to send to support. For more information see [logs collection](../admin/troubleshooting/logs-collection.md).
5145

5246
## Example Code
5347

mkdocs.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -251,6 +251,7 @@ nav:
251251
- 'User Identity in Container' : 'admin/authentication/non-root-containers.md'
252252
- 'Troubleshooting' :
253253
# - 'Cluster Health' : 'admin/troubleshooting/cluster-health-check.md'
254+
- 'Logs Collection' : 'admin/troubleshooting/logs-collection.md'
254255
- 'Troubleshooting' : 'admin/troubleshooting/troubleshooting.md'
255256
- 'Diagnostics' : 'admin/troubleshooting/diagnostics.md'
256257

0 commit comments

Comments
 (0)