Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: 🎸 disable two datasets to avoid load on the Hub #2120

Merged
merged 1 commit into from
Nov 15, 2023

Conversation

severo
Copy link
Collaborator

@severo severo commented Nov 15, 2023

a lot of calls to the Hub are pending because of "expand" in the tree
API by the datasets library

> a lot of calls to the Hub are pending because of "expand" in the tree
API by the datasets library
Copy link

ArgoCD Diff for commit 9197576

Updated at 11/15/2023, 1:22:37 PM CEST

App: datasets-server-prod
YAML generation: Success 🟢
App sync status: Out of Sync ⚠️

===== apps/Deployment datasets-server/prod-datasets-server-admin ======
--- /tmp/argocd-diff3292380020/prod-datasets-server-admin-live.yaml	2023-11-15 12:22:36.272071412 +0000
+++ /tmp/argocd-diff3292380020/prod-datasets-server-admin	2023-11-15 12:22:36.268071315 +0000
@@ -542,7 +542,7 @@
               name: datasets-server-prod-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -601,7 +601,7 @@
           value: "9"
         - name: ADMIN_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-admin:sha-827b9de
+        image: huggingface/datasets-server-services-admin:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-api ======
--- /tmp/argocd-diff2973402961/prod-datasets-server-api-live.yaml	2023-11-15 12:22:36.292071896 +0000
+++ /tmp/argocd-diff2973402961/prod-datasets-server-api	2023-11-15 12:22:36.288071799 +0000
@@ -337,7 +337,7 @@
               name: datasets-server-prod-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -386,7 +386,7 @@
           value: "9"
         - name: API_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-api:sha-827b9de
+        image: huggingface/datasets-server-services-api:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-rows ======
--- /tmp/argocd-diff2162735304/prod-datasets-server-rows-live.yaml	2023-11-15 12:22:36.316072477 +0000
+++ /tmp/argocd-diff2162735304/prod-datasets-server-rows	2023-11-15 12:22:36.312072380 +0000
@@ -422,7 +422,7 @@
               name: datasets-server-prod-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -473,7 +473,7 @@
           value: "8080"
         - name: ROWS_INDEX_MAX_ARROW_DATA_IN_MEMORY
           value: "300_000_000"
-        image: huggingface/datasets-server-services-rows:sha-827b9de
+        image: huggingface/datasets-server-services-rows:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-search ======
--- /tmp/argocd-diff1740726350/prod-datasets-server-search-live.yaml	2023-11-15 12:22:36.332072864 +0000
+++ /tmp/argocd-diff1740726350/prod-datasets-server-search	2023-11-15 12:22:36.332072864 +0000
@@ -408,7 +408,7 @@
               name: datasets-server-prod-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -459,7 +459,7 @@
           value: refs/convert/parquet
         - name: DUCKDB_INDEX_CACHE_DIRECTORY
           value: /storage/duckdb-index
-        image: huggingface/datasets-server-services-search:sha-827b9de
+        image: huggingface/datasets-server-services-search:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-sse-api ======
--- /tmp/argocd-diff1523091073/prod-datasets-server-sse-api-live.yaml	2023-11-15 12:22:36.344073154 +0000
+++ /tmp/argocd-diff1523091073/prod-datasets-server-sse-api	2023-11-15 12:22:36.344073154 +0000
@@ -279,7 +279,7 @@
               name: datasets-server-prod-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -322,7 +322,7 @@
           value: "1"
         - name: API_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-sse-api:sha-827b9de
+        image: huggingface/datasets-server-services-sse-api:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-storage-admin ======
--- /tmp/argocd-diff2328410997/prod-datasets-server-storage-admin-live.yaml	2023-11-15 12:22:36.360073541 +0000
+++ /tmp/argocd-diff2328410997/prod-datasets-server-storage-admin	2023-11-15 12:22:36.356073445 +0000
@@ -326,7 +326,7 @@
         helm.sh/chart: datasets-server
     spec:
       containers:
-      - image: huggingface/datasets-server-services-storage-admin:sha-827b9de
+      - image: huggingface/datasets-server-services-storage-admin:sha-0334b86
         imagePullPolicy: IfNotPresent
         name: prod-datasets-server-storage-admin
         resources:

===== apps/Deployment datasets-server/prod-datasets-server-worker-heavy ======
--- /tmp/argocd-diff1859254247/prod-datasets-server-worker-heavy-live.yaml	2023-11-15 12:22:36.392074316 +0000
+++ /tmp/argocd-diff1859254247/prod-datasets-server-worker-heavy	2023-11-15 12:22:36.384074122 +0000
@@ -604,7 +604,7 @@
   uid: 30e540ee-faae-4944-9b76-3f3d91b7f42b
 spec:
   progressDeadlineSeconds: 600
-  replicas: 0
+  replicas: 6
   revisionHistoryLimit: 10
   selector:
     matchLabels:
@@ -660,7 +660,7 @@
               name: datasets-server-prod-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -804,7 +804,7 @@
           value: "1"
         - name: WORKER_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-worker:sha-827b9de
+        image: huggingface/datasets-server-services-worker:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-worker-light ======
--- /tmp/argocd-diff435946901/prod-datasets-server-worker-light-live.yaml	2023-11-15 12:22:36.420074993 +0000
+++ /tmp/argocd-diff435946901/prod-datasets-server-worker-light	2023-11-15 12:22:36.416074896 +0000
@@ -604,7 +604,7 @@
   uid: a9aa0b42-d117-48cd-8ee0-6ce0b9231b8d
 spec:
   progressDeadlineSeconds: 600
-  replicas: 0
+  replicas: 10
   revisionHistoryLimit: 10
   selector:
     matchLabels:
@@ -660,7 +660,7 @@
               name: datasets-server-prod-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -804,7 +804,7 @@
           value: "1"
         - name: WORKER_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-worker:sha-827b9de
+        image: huggingface/datasets-server-services-worker:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-worker-medium ======
--- /tmp/argocd-diff2749300187/prod-datasets-server-worker-medium-live.yaml	2023-11-15 12:22:36.452075767 +0000
+++ /tmp/argocd-diff2749300187/prod-datasets-server-worker-medium	2023-11-15 12:22:36.448075671 +0000
@@ -604,7 +604,7 @@
   uid: 53b50e80-3bf2-4fd8-873e-cddf5b0f6f2d
 spec:
   progressDeadlineSeconds: 600
-  replicas: 0
+  replicas: 30
   revisionHistoryLimit: 10
   selector:
     matchLabels:
@@ -660,7 +660,7 @@
               name: datasets-server-prod-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -804,7 +804,7 @@
           value: "1"
         - name: WORKER_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-worker:sha-827b9de
+        image: huggingface/datasets-server-services-worker:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== batch/CronJob datasets-server/prod-datasets-server-job-backfill ======
--- /tmp/argocd-diff3925698176/prod-datasets-server-job-backfill-live.yaml	2023-11-15 12:22:36.468076154 +0000
+++ /tmp/argocd-diff3925698176/prod-datasets-server-job-backfill	2023-11-15 12:22:36.464076058 +0000
@@ -182,7 +182,7 @@
                   name: datasets-server-prod-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -201,7 +201,7 @@
               value: CreateCommitError,LockedDatasetTimeoutError,ExternalServerError
             - name: LOG_LEVEL
               value: debug
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-backfill
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-cache-metrics-collector ======
--- /tmp/argocd-diff3562743093/prod-datasets-server-job-cache-metrics-collector-live.yaml	2023-11-15 12:22:36.476076348 +0000
+++ /tmp/argocd-diff3562743093/prod-datasets-server-job-cache-metrics-collector	2023-11-15 12:22:36.472076251 +0000
@@ -178,7 +178,7 @@
                   name: datasets-server-prod-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -193,7 +193,7 @@
                   optional: false
             - name: CACHE_MAINTENANCE_ACTION
               value: collect-cache-metrics
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-cache-metrics-collector
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-clean-duckdb-cache ======
--- /tmp/argocd-diff1896877483/prod-datasets-server-job-clean-duckdb-cache-live.yaml	2023-11-15 12:22:36.484076542 +0000
+++ /tmp/argocd-diff1896877483/prod-datasets-server-job-clean-duckdb-cache	2023-11-15 12:22:36.484076542 +0000
@@ -229,7 +229,7 @@
                   name: datasets-server-prod-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -252,7 +252,7 @@
               value: cache/*
             - name: DIRECTORY_CLEANING_EXPIRED_TIME_INTERVAL_SECONDS
               value: "259200"
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-clean-duckdb-cache
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-clean-duckdb-downloads ======
--- /tmp/argocd-diff817981543/prod-datasets-server-job-clean-duckdb-downloads-live.yaml	2023-11-15 12:22:36.496076832 +0000
+++ /tmp/argocd-diff817981543/prod-datasets-server-job-clean-duckdb-downloads	2023-11-15 12:22:36.492076735 +0000
@@ -229,7 +229,7 @@
                   name: datasets-server-prod-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -252,7 +252,7 @@
               value: downloads/*
             - name: DIRECTORY_CLEANING_EXPIRED_TIME_INTERVAL_SECONDS
               value: "259200"
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-clean-duckdb-downloads
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-clean-duckdb-job-runner ======
--- /tmp/argocd-diff2270826793/prod-datasets-server-job-clean-duckdb-job-runner-live.yaml	2023-11-15 12:22:36.508077123 +0000
+++ /tmp/argocd-diff2270826793/prod-datasets-server-job-clean-duckdb-job-runner	2023-11-15 12:22:36.504077026 +0000
@@ -229,7 +229,7 @@
                   name: datasets-server-prod-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -252,7 +252,7 @@
               value: job_runner/*
             - name: DIRECTORY_CLEANING_EXPIRED_TIME_INTERVAL_SECONDS
               value: "10800"
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-clean-duckdb-job-runner
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-clean-hf-datasets-cache ======
--- /tmp/argocd-diff3542807313/prod-datasets-server-job-clean-hf-datasets-cache-live.yaml	2023-11-15 12:22:36.516077316 +0000
+++ /tmp/argocd-diff3542807313/prod-datasets-server-job-clean-hf-datasets-cache	2023-11-15 12:22:36.516077316 +0000
@@ -187,7 +187,7 @@
           containers:
           - env:
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -210,7 +210,7 @@
               value: '*/datasets/*'
             - name: DIRECTORY_CLEANING_EXPIRED_TIME_INTERVAL_SECONDS
               value: "10800"
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-clean-hf-datasets-cache
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-clean-stats-cache ======
--- /tmp/argocd-diff706415861/prod-datasets-server-job-clean-stats-cache-live.yaml	2023-11-15 12:22:36.524077509 +0000
+++ /tmp/argocd-diff706415861/prod-datasets-server-job-clean-stats-cache	2023-11-15 12:22:36.524077509 +0000
@@ -187,7 +187,7 @@
           containers:
           - env:
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -210,7 +210,7 @@
               value: '*'
             - name: DIRECTORY_CLEANING_EXPIRED_TIME_INTERVAL_SECONDS
               value: "10800"
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-clean-stats-cache
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-delete-obsolete-cache ======
--- /tmp/argocd-diff940919862/prod-datasets-server-job-delete-obsolete-cache-live.yaml	2023-11-15 12:22:36.532077703 +0000
+++ /tmp/argocd-diff940919862/prod-datasets-server-job-delete-obsolete-cache	2023-11-15 12:22:36.532077703 +0000
@@ -222,7 +222,7 @@
             - name: CACHED_ASSETS_STORAGE_PROTOCOL
               value: s3
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -271,7 +271,7 @@
               value: delete-obsolete-cache
             - name: LOG_LEVEL
               value: info
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-delete-obsolete-cache
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-post-messages ======
--- /tmp/argocd-diff2418885657/prod-datasets-server-job-post-messages-live.yaml	2023-11-15 12:22:36.544077993 +0000
+++ /tmp/argocd-diff2418885657/prod-datasets-server-job-post-messages	2023-11-15 12:22:36.544077993 +0000
@@ -190,7 +190,7 @@
                   name: datasets-server-prod-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -217,7 +217,7 @@
               value: post-messages
             - name: LOG_LEVEL
               value: info
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-post-messages
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-queue-metrics-collector ======
--- /tmp/argocd-diff2744946186/prod-datasets-server-job-queue-metrics-collector-live.yaml	2023-11-15 12:22:36.552078187 +0000
+++ /tmp/argocd-diff2744946186/prod-datasets-server-job-queue-metrics-collector	2023-11-15 12:22:36.552078187 +0000
@@ -179,7 +179,7 @@
                   name: datasets-server-prod-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -194,7 +194,7 @@
                   optional: false
             - name: CACHE_MAINTENANCE_ACTION
               value: collect-queue-metrics
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-queue-metrics-collector
             resources:

App: datasets-server-staging
YAML generation: Success 🟢
App sync status: Out of Sync ⚠️

===== apps/Deployment datasets-server/staging-datasets-server-admin ======
--- /tmp/argocd-diff2567131915/staging-datasets-server-admin-live.yaml	2023-11-15 12:22:37.288095997 +0000
+++ /tmp/argocd-diff2567131915/staging-datasets-server-admin	2023-11-15 12:22:37.284095900 +0000
@@ -534,7 +534,7 @@
               name: datasets-server-staging-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -593,7 +593,7 @@
           value: "1"
         - name: ADMIN_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-admin:sha-827b9de
+        image: huggingface/datasets-server-services-admin:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-api ======
--- /tmp/argocd-diff4066393990/staging-datasets-server-api-live.yaml	2023-11-15 12:22:37.300096287 +0000
+++ /tmp/argocd-diff4066393990/staging-datasets-server-api	2023-11-15 12:22:37.300096287 +0000
@@ -319,7 +319,7 @@
               name: datasets-server-staging-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -363,7 +363,7 @@
           value: "1"
         - name: API_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-api:sha-827b9de
+        image: huggingface/datasets-server-services-api:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-rows ======
--- /tmp/argocd-diff2992957992/staging-datasets-server-rows-live.yaml	2023-11-15 12:22:37.328096965 +0000
+++ /tmp/argocd-diff2992957992/staging-datasets-server-rows	2023-11-15 12:22:37.324096868 +0000
@@ -425,7 +425,7 @@
               name: datasets-server-staging-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -471,7 +471,7 @@
           value: "8080"
         - name: ROWS_INDEX_MAX_ARROW_DATA_IN_MEMORY
           value: "300_000_000"
-        image: huggingface/datasets-server-services-rows:sha-827b9de
+        image: huggingface/datasets-server-services-rows:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-search ======
--- /tmp/argocd-diff1279743229/staging-datasets-server-search-live.yaml	2023-11-15 12:22:37.344097352 +0000
+++ /tmp/argocd-diff1279743229/staging-datasets-server-search	2023-11-15 12:22:37.340097255 +0000
@@ -418,7 +418,7 @@
               name: datasets-server-staging-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -464,7 +464,7 @@
           value: refs/convert/parquet
         - name: DUCKDB_INDEX_CACHE_DIRECTORY
           value: /storage/duckdb-index
-        image: huggingface/datasets-server-services-search:sha-827b9de
+        image: huggingface/datasets-server-services-search:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-sse-api ======
--- /tmp/argocd-diff4204477999/staging-datasets-server-sse-api-live.yaml	2023-11-15 12:22:37.356097642 +0000
+++ /tmp/argocd-diff4204477999/staging-datasets-server-sse-api	2023-11-15 12:22:37.352097545 +0000
@@ -287,7 +287,7 @@
               name: datasets-server-staging-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -325,7 +325,7 @@
           value: "1"
         - name: API_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-sse-api:sha-827b9de
+        image: huggingface/datasets-server-services-sse-api:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-storage-admin ======
--- /tmp/argocd-diff65664166/staging-datasets-server-storage-admin-live.yaml	2023-11-15 12:22:37.372098029 +0000
+++ /tmp/argocd-diff65664166/staging-datasets-server-storage-admin	2023-11-15 12:22:37.368097932 +0000
@@ -326,7 +326,7 @@
         helm.sh/chart: datasets-server
     spec:
       containers:
-      - image: huggingface/datasets-server-services-storage-admin:sha-827b9de
+      - image: huggingface/datasets-server-services-storage-admin:sha-0334b86
         imagePullPolicy: IfNotPresent
         name: staging-datasets-server-storage-admin
         resources:

===== apps/Deployment datasets-server/staging-datasets-server-worker-all ======
--- /tmp/argocd-diff749506909/staging-datasets-server-worker-all-live.yaml	2023-11-15 12:22:37.404098804 +0000
+++ /tmp/argocd-diff749506909/staging-datasets-server-worker-all	2023-11-15 12:22:37.396098610 +0000
@@ -678,7 +678,7 @@
               name: datasets-server-staging-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -822,7 +822,7 @@
           value: "1"
         - name: WORKER_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-worker:sha-827b9de
+        image: huggingface/datasets-server-services-worker:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-worker-light ======
--- /tmp/argocd-diff1731890799/staging-datasets-server-worker-light-live.yaml	2023-11-15 12:22:37.436099577 +0000
+++ /tmp/argocd-diff1731890799/staging-datasets-server-worker-light	2023-11-15 12:22:37.432099481 +0000
@@ -678,7 +678,7 @@
               name: datasets-server-staging-secrets
               optional: false
         - name: COMMON_BLOCKED_DATASETS
-          value: open-llm-leaderboard/*
+          value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
           value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
         - name: COMMON_HF_ENDPOINT
@@ -822,7 +822,7 @@
           value: "1"
         - name: WORKER_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-worker:sha-827b9de
+        image: huggingface/datasets-server-services-worker:sha-0334b86
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== batch/CronJob datasets-server/staging-datasets-server-job-cache-metrics-collector ======
--- /tmp/argocd-diff3741650360/staging-datasets-server-job-cache-metrics-collector-live.yaml	2023-11-15 12:22:37.444099771 +0000
+++ /tmp/argocd-diff3741650360/staging-datasets-server-job-cache-metrics-collector	2023-11-15 12:22:37.444099771 +0000
@@ -177,7 +177,7 @@
                   name: datasets-server-staging-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -192,7 +192,7 @@
                   optional: false
             - name: CACHE_MAINTENANCE_ACTION
               value: collect-cache-metrics
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-cache-metrics-collector
             resources:

===== batch/CronJob datasets-server/staging-datasets-server-job-clean-duckdb-cache ======
--- /tmp/argocd-diff3245443721/staging-datasets-server-job-clean-duckdb-cache-live.yaml	2023-11-15 12:22:37.456100062 +0000
+++ /tmp/argocd-diff3245443721/staging-datasets-server-job-clean-duckdb-cache	2023-11-15 12:22:37.452099965 +0000
@@ -228,7 +228,7 @@
                   name: datasets-server-staging-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -251,7 +251,7 @@
               value: cache/*
             - name: DIRECTORY_CLEANING_EXPIRED_TIME_INTERVAL_SECONDS
               value: "600"
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-clean-duckdb-cache
             resources:

===== batch/CronJob datasets-server/staging-datasets-server-job-clean-duckdb-downloads ======
--- /tmp/argocd-diff1986403016/staging-datasets-server-job-clean-duckdb-downloads-live.yaml	2023-11-15 12:22:37.468100352 +0000
+++ /tmp/argocd-diff1986403016/staging-datasets-server-job-clean-duckdb-downloads	2023-11-15 12:22:37.464100255 +0000
@@ -228,7 +228,7 @@
                   name: datasets-server-staging-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -251,7 +251,7 @@
               value: downloads/*
             - name: DIRECTORY_CLEANING_EXPIRED_TIME_INTERVAL_SECONDS
               value: "600"
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-clean-duckdb-downloads
             resources:

===== batch/CronJob datasets-server/staging-datasets-server-job-clean-duckdb-job-runner ======
--- /tmp/argocd-diff2425015271/staging-datasets-server-job-clean-duckdb-job-runner-live.yaml	2023-11-15 12:22:37.476100545 +0000
+++ /tmp/argocd-diff2425015271/staging-datasets-server-job-clean-duckdb-job-runner	2023-11-15 12:22:37.476100545 +0000
@@ -228,7 +228,7 @@
                   name: datasets-server-staging-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -251,7 +251,7 @@
               value: job_runner/*
             - name: DIRECTORY_CLEANING_EXPIRED_TIME_INTERVAL_SECONDS
               value: "600"
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-clean-duckdb-job-runner
             resources:

===== batch/CronJob datasets-server/staging-datasets-server-job-clean-hf-datasets-cache ======
--- /tmp/argocd-diff3511254880/staging-datasets-server-job-clean-hf-datasets-cache-live.yaml	2023-11-15 12:22:37.484100739 +0000
+++ /tmp/argocd-diff3511254880/staging-datasets-server-job-clean-hf-datasets-cache	2023-11-15 12:22:37.484100739 +0000
@@ -186,7 +186,7 @@
           containers:
           - env:
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -209,7 +209,7 @@
               value: '*/datasets/*'
             - name: DIRECTORY_CLEANING_EXPIRED_TIME_INTERVAL_SECONDS
               value: "600"
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-clean-hf-datasets-cache
             resources:

===== batch/CronJob datasets-server/staging-datasets-server-job-clean-stats-cache ======
--- /tmp/argocd-diff270474141/staging-datasets-server-job-clean-stats-cache-live.yaml	2023-11-15 12:22:37.496101029 +0000
+++ /tmp/argocd-diff270474141/staging-datasets-server-job-clean-stats-cache	2023-11-15 12:22:37.496101029 +0000
@@ -186,7 +186,7 @@
           containers:
           - env:
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -209,7 +209,7 @@
               value: '*'
             - name: DIRECTORY_CLEANING_EXPIRED_TIME_INTERVAL_SECONDS
               value: "600"
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-clean-stats-cache
             resources:

===== batch/CronJob datasets-server/staging-datasets-server-job-delete-obsolete-cache ======
--- /tmp/argocd-diff3594199896/staging-datasets-server-job-delete-obsolete-cache-live.yaml	2023-11-15 12:22:37.508101320 +0000
+++ /tmp/argocd-diff3594199896/staging-datasets-server-job-delete-obsolete-cache	2023-11-15 12:22:37.504101223 +0000
@@ -221,7 +221,7 @@
             - name: CACHED_ASSETS_STORAGE_PROTOCOL
               value: s3
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -270,7 +270,7 @@
               value: delete-obsolete-cache
             - name: LOG_LEVEL
               value: info
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-delete-obsolete-cache
             resources:

===== batch/CronJob datasets-server/staging-datasets-server-job-post-messages ======
--- /tmp/argocd-diff1589074057/staging-datasets-server-job-post-messages-live.yaml	2023-11-15 12:22:37.516101513 +0000
+++ /tmp/argocd-diff1589074057/staging-datasets-server-job-post-messages	2023-11-15 12:22:37.512101417 +0000
@@ -189,7 +189,7 @@
                   name: datasets-server-staging-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -216,7 +216,7 @@
               value: post-messages
             - name: LOG_LEVEL
               value: info
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-post-messages
             resources:

===== batch/CronJob datasets-server/staging-datasets-server-job-queue-metrics-collector ======
--- /tmp/argocd-diff1544895190/staging-datasets-server-job-queue-metrics-collector-live.yaml	2023-11-15 12:22:37.524101707 +0000
+++ /tmp/argocd-diff1544895190/staging-datasets-server-job-queue-metrics-collector	2023-11-15 12:22:37.520101610 +0000
@@ -178,7 +178,7 @@
                   name: datasets-server-staging-secrets
                   optional: false
             - name: COMMON_BLOCKED_DATASETS
-              value: open-llm-leaderboard/*
+              value: open-llm-leaderboard/*,taesiri/arxiv_qa,enzostvs/stable-diffusion-tpu-generations
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
               value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},togethercomputer/RedPajama-Data-V2,andreped/*'
             - name: COMMON_HF_ENDPOINT
@@ -193,7 +193,7 @@
                   optional: false
             - name: CACHE_MAINTENANCE_ACTION
               value: collect-queue-metrics
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-827b9de
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-0334b86
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-queue-metrics-collector
             resources:

Legend Status
The app is synced in ArgoCD, and diffs you see are solely from this PR.
⚠️ The app is out-of-sync in ArgoCD, and the diffs you see include those changes plus any from this PR.
🛑 There was an error generating the ArgoCD diffs due to changes in this PR.

@severo
Copy link
Collaborator Author

severo commented Nov 15, 2023

asked by @Kakulukian

@lhoestq @mariosasko is it related to huggingface/huggingface_hub#1809 and huggingface/huggingface_hub#1815?

To be honest, I don't really understand the issue. Feel free to chime in.

@severo severo merged commit 912690e into main Nov 15, 2023
4 checks passed
@severo severo deleted the disable_two_datasets branch November 15, 2023 13:45
@lhoestq
Copy link
Member

lhoestq commented Nov 15, 2023

Yes we plan to remove all the expand calls in datasets soon

@severo severo mentioned this pull request Nov 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants