
Send requests to both nodes in the integration test; don't preload the model, since exo should be robust against that
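As a rough sketch of the idea in this commit message, the test can send the same kind of request to each node's API port without a warm-up call first. Ports 8000 and 8001, the endpoint path, and the model name come from the diff; the function names `query_node` and `check_both_nodes`, the prompt text, and the empty-response check are illustrative assumptions, not part of the actual config.

```shell
#!/usr/bin/env bash
# Hypothetical sketch: query each node directly, with no preload request.
# Ports 8000 and 8001 appear in the diff; everything else here is illustrative.

# Send one chat completion request to the node listening on the given port.
query_node() {
  local port="$1"
  curl -s "http://localhost:${port}/v1/chat/completions" \
    -H "Content-Type: application/json" \
    -d '{
      "model": "llama-3.1-8b",
      "messages": [{"role": "user", "content": "Keep responses concise. Say hello."}],
      "temperature": 0.7
    }'
}

# Query both nodes in turn and fail as soon as one returns nothing.
check_both_nodes() {
  local port response
  for port in 8000 8001; do
    response="$(query_node "$port")"
    if [ -z "$response" ]; then
      echo "node on port ${port} returned an empty response" >&2
      return 1
    fi
    echo "node on port ${port} responded"
  done
}
```

Calling `check_both_nodes` once both processes are running would exercise each node as an entry point; a real test would presumably also inspect the response body rather than only checking that it is non-empty.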

Alex Cheema, 1 year ago
Parent
Commit
32bb44b3a2
1 changed file with 1 addition and 13 deletions
+ 1 - 13
.circleci/config.yml

@@ -117,18 +117,6 @@ jobs:
             # Check processes before proceeding
             check_processes
 
-            # first one to load the model
-            curl -s http://localhost:8000/v1/chat/completions \
-                -H "Content-Type: application/json" \
-                -d '{
-                  "model": "llama-3.1-8b",
-                  "messages": [{"role": "user", "content": "Keep responses concise. Placeholder to load model..."}],
-                  "temperature": 0.7
-                }'
-
-            # Check processes after model load
-            check_processes
-
             response_1=$(curl -s http://localhost:8000/v1/chat/completions \
               -H "Content-Type: application/json" \
               -d '{
@@ -141,7 +129,7 @@ jobs:
             # Check processes after first response
             check_processes
 
-            response_2=$(curl -s http://localhost:8000/v1/chat/completions \
+            response_2=$(curl -s http://localhost:8001/v1/chat/completions \
               -H "Content-Type: application/json" \
               -d '{
                 "model": "llama-3.1-8b",