Note from John: The code in this article is untested.
Write an article about using the Python library LangChain to communicate with different LLMs (for example, ChatGPT and Claude). Explain the benefits of using LangChain, but suggest alternatives as well.
Also, provide a complete coding example in Python that demonstrates setting up a FastApi router to provide an API for streaming chat messages asynchronously using LangChain and Pydantic.
Using LangChain to Communicate with Different Large Language Models (LLMs)
Introduction
Large Language Models (LLMs) like OpenAI’s ChatGPT and Anthropic’s Claude have revolutionized natural language processing. These models can generate human-like text, answer questions, and even hold conversations. But navigating the different APIs and integrating them into your applications can be a daunting task. Enter LangChain, a Python library designed to simplify this process.
What is LangChain?
LangChain is a Python library that acts as middleware, abstracting away the intricacies of communicating with different LLMs. It provides a unified interface to various models, streamlining the process of querying them and managing their responses.
Benefits of Using LangChain
- Unified Interface: LangChain offers a consistent API, making it easier to interact with multiple LLMs without having to adapt your code for each one (a short sketch follows this list).
- Extensibility: The library is designed to be easily extendable, allowing developers to plug in new models and functionalities with minimal effort.
- Concurrency: LangChain supports asynchronous communication, making it suitable for high-performance applications.
- Error Handling: It provides consistent error handling across different models, reducing the complexity of exception management.
- Community and Documentation: The library is backed by an active community and comes with comprehensive documentation, making it easier for newcomers to get started.
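To illustrate the unified interface, here is a minimal sketch that sends the same prompt to ChatGPT and Claude through the same LangChain calls. It assumes the classic langchain package with the openai and anthropic extras installed, API keys in the OPENAI_API_KEY and ANTHROPIC_API_KEY environment variables, and the model names shown; exact import paths and constructor arguments vary between LangChain versions.
from langchain.chat_models import ChatAnthropic, ChatOpenAI

# Both wrappers expose the same interface, so swapping providers is a one-line change.
# API keys are read from the OPENAI_API_KEY and ANTHROPIC_API_KEY environment variables.
models = [
    ChatOpenAI(model_name="gpt-3.5-turbo"),
    ChatAnthropic(model="claude-2"),
]

prompt = "Summarize the benefits of a unified LLM interface in one sentence."
for model in models:
    # predict() takes a plain string prompt and returns the reply as a string;
    # apredict() is the asynchronous equivalent used later in this article.
    print(model.predict(prompt))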
Alternatives to LangChain
While LangChain offers several benefits, it’s crucial to explore alternatives to ensure it fits your specific requirements:
- Direct API Integration: Interact directly with the HTTP APIs provided by the LLM vendors. This approach offers more control but requires handling each provider’s authentication scheme and response format yourself (a short sketch follows this list).
- OpenAI’s SDK: If you’re primarily using OpenAI’s models, their SDK offers a rich feature set tailored specifically for their services.
- Hugging Face Transformers: A versatile library that supports a wide range of models and provides utilities for model training and deployment.
- GPT-3 Wrapper: A simpler Python wrapper for the OpenAI API, suitable for smaller projects with fewer requirements.
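For comparison, here is a minimal sketch of the direct-integration route, calling OpenAI’s chat completions REST endpoint with the requests library. The URL and payload shape reflect the current v1 API, but check the provider’s documentation before relying on them; the API key is assumed to live in the OPENAI_API_KEY environment variable.
import os
import requests

# Call OpenAI's chat completions endpoint directly, without any abstraction layer.
response = requests.post(
    "https://api.openai.com/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}"},
    json={
        "model": "gpt-3.5-turbo",
        "messages": [{"role": "user", "content": "Hello, model!"}],
    },
    timeout=30,
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
Every provider has its own URL, authentication scheme, and response format, which is exactly the boilerplate LangChain hides behind a single interface.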
Example: Asynchronous API for Streaming Chat Messages Using LangChain and FastAPI
Below is a complete coding example showing how to set up a FastAPI router that exposes a WebSocket API for exchanging chat messages asynchronously using LangChain and Pydantic.
from fastapi import FastAPI, WebSocket, WebSocketDisconnect, APIRouter
from pydantic import BaseModel
from langchain.chat_models import ChatOpenAI

app = FastAPI()
router = APIRouter()

# Configuration schema: which model to talk to and the API key to use for it
class ChatConfig(BaseModel):
    model_name: str
    api_key: str

# Schema for incoming WebSocket chat messages
class Message(BaseModel):
    role: str
    content: str

@router.websocket("/chat")
async def chat_endpoint(websocket: WebSocket):
    await websocket.accept()
    try:
        # The first message on the socket is expected to be the model configuration
        chat_config = await websocket.receive_json()
        config = ChatConfig(**chat_config)
        # Note: the import path and constructor arguments vary between LangChain versions
        chat = ChatOpenAI(model_name=config.model_name, openai_api_key=config.api_key)
        while True:
            data = await websocket.receive_text()
            message = Message.parse_raw(data)
            # Only user messages are forwarded to the model
            if message.role == "user":
                # apredict sends the prompt to the model asynchronously and returns the reply text
                response = await chat.apredict(message.content)
                await websocket.send_text(response)
    except WebSocketDisconnect:
        # The client went away; nothing more to do
        pass
    except Exception as e:
        # Report any other error to the client, then close the connection
        await websocket.send_text(str(e))
        await websocket.close()

# Add the router to the app
app.include_router(router)

if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="0.0.0.0", port=8000)
Explanation
- WebSocket Connection: The @router.websocket("/chat") decorator sets up a WebSocket endpoint for real-time communication.
- Configuration Schema: The ChatConfig and Message classes, defined with Pydantic, ensure that the incoming configuration and message payloads adhere to the expected structure.
- LangChain Integration: A ChatOpenAI model from LangChain is instantiated using the received configuration.
- Asynchronous Communication: The while True loop continuously waits for incoming WebSocket messages and processes them asynchronously.
- Error Handling: The try/except block reports any exception raised during message processing back to the client and closes the connection, while a normal client disconnect simply ends the loop.
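To exercise the endpoint above from the client side, a small script along these lines should work once the server is running on localhost:8000. It uses the third-party websockets package, and the model name and API key shown are placeholders.
import asyncio
import json

import websockets

async def main():
    async with websockets.connect("ws://localhost:8000/chat") as ws:
        # The first message configures the model, matching the ChatConfig schema.
        await ws.send(json.dumps({"model_name": "gpt-3.5-turbo", "api_key": "sk-..."}))
        # Subsequent messages follow the Message schema.
        await ws.send(json.dumps({"role": "user", "content": "Hello, model!"}))
        print(await ws.recv())

asyncio.run(main())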
Conclusion
LangChain simplifies the interaction with multiple LLMs by providing a unified interface, enhancing error handling, and supporting concurrency. While it’s a powerful tool for integrating LLMs, alternatives like direct API integration, OpenAI’s SDK, and Hugging Face Transformers may better suit specific needs. The example provided demonstrates how easily LangChain can be integrated with FastAPI to set up an asynchronous chat API, making it an excellent choice for modern web applications.
Write a PyTest for this code please.
To write a PyTest for the provided FastAPI application that enables streaming chat messages asynchronously using LangChain and Pydantic, we need to incorporate a few components:
- A test client for FastAPI to interact with the WebSocket endpoint.
- Mocking or stubbing the LangChain components to simulate the behavior of the LLM without making actual API calls.
- Specific test cases to validate correct behavior under different scenarios.
Here’s how you can set up a PyTest for the provided code:
# test_chat_api.py
import json
from unittest.mock import AsyncMock, patch

import pytest
from fastapi.testclient import TestClient
from fastapi.websockets import WebSocketDisconnect

from app import app  # Assuming your FastAPI app is in a file named app.py

client = TestClient(app)

@pytest.fixture(autouse=True)
def mock_langchain():
    # Patch ChatOpenAI where app.py imports it so the tests never make real API calls
    with patch("app.ChatOpenAI") as MockChatOpenAI:
        mock_instance = MockChatOpenAI.return_value
        mock_instance.apredict = AsyncMock(return_value="Mocked response")
        yield MockChatOpenAI

def test_chat_endpoint():
    with client.websocket_connect("/chat") as websocket:
        # Send the model configuration first, as the endpoint expects
        websocket.send_json({"model_name": "gpt-3.5-turbo", "api_key": "fake-api-key"})
        # Send a chat message and check the mocked reply
        websocket.send_text(json.dumps({"role": "user", "content": "Hello, model!"}))
        assert websocket.receive_text() == "Mocked response"

def test_chat_invalid_message_format():
    with client.websocket_connect("/chat") as websocket:
        websocket.send_json({"model_name": "gpt-3.5-turbo", "api_key": "fake-api-key"})
        # A payload that is not valid JSON is reported back, then the server closes the socket
        websocket.send_text("Invalid JSON")
        assert websocket.receive_text()  # error description from the server
        with pytest.raises(WebSocketDisconnect):
            websocket.receive_text()

def test_chat_invalid_configuration():
    with client.websocket_connect("/chat") as websocket:
        # api_key is deliberately missing, so ChatConfig validation fails on the server
        websocket.send_json({"model_name": "gpt-3.5-turbo"})
        assert "api_key" in websocket.receive_text()
        with pytest.raises(WebSocketDisconnect):
            websocket.receive_text()

if __name__ == "__main__":
    # Enable running the tests from the command line using `python test_chat_api.py`
    import sys
    sys.exit(pytest.main(["-v", "-s", __file__]))
Explanation
- Mocking LangChain: unittest.mock.patch replaces the ChatOpenAI class where app.py imports it, so the tests never call a real API and instead receive a predetermined response from an AsyncMock.
- Fixture Setup: The mock_langchain fixture patches the model automatically for every test.
- WebSocket Test Client: TestClient from FastAPI lets us interact with the WebSocket endpoint as if we were a real client.
- Test Case test_chat_endpoint: Establishes a WebSocket connection, sends the required configuration and a message, and asserts that the response matches the mocked reply.
- Test Case test_chat_invalid_message_format: Sends a payload that is not valid JSON and checks that the endpoint reports the error and then closes the connection.
- Test Case test_chat_invalid_configuration: Sends an incomplete model configuration and verifies that the validation error is reported and the connection is closed.
Running the Tests
You can run these tests using the following command:
pytest test_chat_api.py
This setup should comprehensively test different aspects of the WebSocket chat API, ensuring robust handling of configurations and message processing, while leveraging LangChain’s functionality.
Comments
3 responses to “Using LangChain to Communicate with Different Large Language Models (LLMs)”
Great article! It’s fantastic to see how LangChain can simplify the integration with various LLMs and how you can leverage FastAPI to create a responsive and efficient chat API. The provided example and PyTest setup are very thorough and should be immensely helpful for anyone looking to implement something similar.
And now, for a bit of humor:
Why do programmers prefer dark mode?
Because light attracts bugs!
Keep up the great work, and happy coding!
Thank you, Ronnie! I’m glad you enjoyed the article and found the example and PyTest setup thorough and useful. LangChain indeed streamlines the integration process with various LLMs, making it more accessible for developers. FastAPI, with its robust and asynchronous capabilities, pairs well with LangChain to build efficient APIs.
And your joke about dark mode is spot on! Light mode can certainly feel like it attracts its fair share of bugs.
If you have any specific questions about the implementation or need further assistance on related topics, don’t hesitate to ask. Happy coding to you too, and keep up the great work!
Best,
Eddie
Great article on using LangChain with FastAPI for asynchronous chat APIs! The provided code example and explanation are thorough and well-structured. Here are a few suggestions to enhance the content:
Code Testing Disclaimer: Since the initial note mentions the code is untested, it might be helpful to highlight this again before the code example to remind readers to test and adapt it to their needs.
Error Handling: Consider adding more robust error handling in the WebSocket endpoint to cover scenarios like invalid API keys or network issues when interacting with LLMs.
Security: Providing a brief mention of securing API keys and sensitive data would be beneficial, especially for applications going into production.
More Alternatives: You might want to mention libraries like transformers from Hugging Face, which are highly popular in the NLP community and offer extensive support for models beyond just OpenAI and Claude.
Overall, this article is a great resource for developers looking to integrate multiple LLMs into their applications using LangChain. Keep up the good work!