Commit 1b7bc87

Merge branch 'main' into adnaan/dow-100-update-langchain-agent-to-use-agent-factory

2 parents 97b1349 + 5dc841a

13 files changed: +380 −141 lines

docs/mint.json (+2)

```diff
@@ -69,6 +69,7 @@
         "open-source/python-quickstart",
         "open-source/telephony",
         "open-source/create-your-own-agent",
+        "open-source/conversation-mechanics",
         "open-source/langchain-agent",
         "open-source/action-agents",
         "open-source/action-phrase-triggers",
@@ -80,6 +81,7 @@
         "open-source/playground",
         "open-source/turn-based-conversation",
         "open-source/language-support",
+        "open-source/logging-with-loguru",
         "open-source/agent-factory"
       ]
     },
```
docs/open-source/conversation-mechanics.mdx (new file, +53)

---
title: "Conversation Mechanics"
description: "How to tune the responsiveness in Vocode conversations"
---

Building two-way conversations with an AI is a highly use-case-specific task - how realistic the conversation feels depends greatly on the nature of the conversation itself. In this guide, we'll cover some of the dials you can turn to configure the mechanics of a conversation in Vocode.

# Endpointing

Endpointing is the process of determining when someone has finished speaking. The `EndpointingConfig` controls how this is done. There are a couple of different ways to configure endpointing:

We provide `DeepgramEndpointingConfig()`, which has reasonable defaults and knobs to suit most use-cases (but only works with the Deepgram transcriber).

```python
class DeepgramEndpointingConfig(EndpointingConfig, type="deepgram"):  # type: ignore
    vad_threshold_ms: int = 500
    utterance_cutoff_ms: int = 1000
    time_silent_config: Optional[TimeSilentConfig] = Field(default_factory=TimeSilentConfig)
    use_single_utterance_endpointing_for_first_utterance: bool = False
```

- `vad_threshold_ms`: translates to [Deepgram's `endpointing` feature](https://developers.deepgram.com/docs/endpointing#enable-feature)
- `utterance_cutoff_ms`: uses [Deepgram's Utterance End feature](https://developers.deepgram.com/docs/utterance-end)
- `time_silent_config`: a Vocode-specific parameter that marks an utterance final if we haven't seen any new words in X seconds
- `use_single_utterance_endpointing_for_first_utterance`: uses `is_final` instead of `speech_final` for endpointing on the first utterance (works really well for outbound conversations, where the user's first utterance is something like "Hello?") - see [this doc on Deepgram](https://developers.deepgram.com/docs/understand-endpointing-interim-results) for more info.
26+
27+
Endpointing is highly use-case specific - building a realistic experience for this greatly depends on the person speaking to the AI. Here are few paradigms that we've used to help you along the way:
28+
29+
- Time-based endpointing: This method considers the speaker to be finished when there is a certain duration of silence.
30+
- Punctuation-based endpointing: This method considers the speaker to be finished when there is a certain duration of silence after a punctuation mark.
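As an illustrative sketch of these two paradigms (this is not Vocode's implementation, and the silence thresholds are hypothetical):

```python
def time_based_is_finished(
    last_word_time: float, now: float, silence_threshold_s: float = 1.0
) -> bool:
    """Time-based: the speaker is done once enough silence has elapsed."""
    return (now - last_word_time) >= silence_threshold_s


def punctuation_based_is_finished(
    transcript: str, last_word_time: float, now: float, silence_threshold_s: float = 0.3
) -> bool:
    """Punctuation-based: require a sentence-ending mark plus a (shorter) silence."""
    ends_sentence = transcript.rstrip().endswith((".", "!", "?"))
    return ends_sentence and (now - last_word_time) >= silence_threshold_s
```

Punctuation-based endpointing can tolerate a shorter silence threshold because the transcriber's punctuation already signals a likely utterance boundary.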
# Interruptions

When the AI speaks in a `StreamingConversation`, it can be interrupted by the user. `AgentConfig` provides a parameter called `interrupt_sensitivity` that controls how sensitive the AI is to interruptions. Interrupt sensitivity has two options: low (the default) and high. Low sensitivity makes the bot ignore backchannels (e.g. "sure", "uh-huh") while the bot is speaking. High sensitivity makes the agent treat any word from the human as an interruption.

The implementation of this configuration is in `StreamingConversation.TranscriptionsWorker` - to tune it beyond these two options you may need to fork Vocode and override the behavior, but it provides a good starting place for most use-cases.
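The idea can be sketched as follows (illustrative only - the real logic lives in `TranscriptionsWorker`, and this backchannel word list is hypothetical):

```python
# Hypothetical set of backchannel words; the real list lives inside Vocode.
BACKCHANNELS = {"sure", "uh-huh", "mhm", "yeah", "right", "ok", "okay"}


def is_interruption(
    human_utterance: str, bot_is_speaking: bool, interrupt_sensitivity: str = "low"
) -> bool:
    """Decide whether the human's words should cut the bot off."""
    if not bot_is_speaking:
        return False
    if interrupt_sensitivity == "high":
        # High sensitivity: any word from the human interrupts the bot.
        return True
    # Low sensitivity: ignore utterances made up entirely of backchannels.
    words = human_utterance.lower().strip(".,!? ").split()
    return not all(word in BACKCHANNELS for word in words)
```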
Stay tuned, more dials to come here soon!

# Conversation Speed

`StreamingConversation` also exposes a parameter called `speed_coefficient`, which controls the length of endpointing pauses, i.e. how long the bot will wait before responding to the human. This includes normal utterances from the human as well as interruptions.

The amount of time the bot waits scales inversely with the `speed_coefficient` value: a bot with a `speed_coefficient` of 2 responds in half the time compared to a `speed_coefficient` of 1, and a `speed_coefficient` of 0.5 means the bot takes twice as long to respond.
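That inverse scaling can be sketched as a simple division (`base_delay_seconds` here is a hypothetical stand-in for Vocode's internal endpointing pause):

```python
def response_delay(base_delay_seconds: float, speed_coefficient: float) -> float:
    """The bot's wait time scales inversely with speed_coefficient."""
    return base_delay_seconds / speed_coefficient


response_delay(1.0, 2.0)  # half the wait of speed_coefficient=1
response_delay(1.0, 0.5)  # twice the wait
```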
```python
conversation = StreamingConversation(
    speed_coefficient=2,
    ...
)
```

Based on the speed of the user's speech (we calculate the WPM from each final utterance that goes through the pipeline), the `speed_coefficient` updates throughout the course of the conversation - see `vocode.streaming.utils.speed_manager` for the implementation!
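As a rough sketch of the idea (the real implementation is in `vocode.streaming.utils.speed_manager`; the 150 WPM baseline and smoothing factor here are assumptions for illustration):

```python
def estimate_wpm(num_words: int, duration_seconds: float) -> float:
    """Words per minute for a single final utterance."""
    return num_words / (duration_seconds / 60.0)


def update_speed_coefficient(
    current: float, observed_wpm: float, baseline_wpm: float = 150.0, alpha: float = 0.5
) -> float:
    """Exponentially smooth the coefficient toward the speaker's pace."""
    target = observed_wpm / baseline_wpm  # faster speech -> higher coefficient
    return (1 - alpha) * current + alpha * target


wpm = estimate_wpm(30, 10.0)  # 30 words in 10 seconds = 180 WPM
new_coefficient = update_speed_coefficient(1.0, wpm)  # moves toward 180/150 = 1.2
```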

docs/open-source/events-manager.mdx (+38 −65)
Original file line numberDiff line numberDiff line change
````diff
@@ -5,87 +5,60 @@ description: "How events are emitted and consumed."
 
 ## What is the Events Manager
 
-The Events Manager is a class designed to facilitate asynchronous handling of events in the application. It allows for non-blocking actions on events, such as processing transcripts, managing phone calls, and other tasks. The main components of the Events Manager are the `EventsManager` class and several `Event` subclasses representing various event types.
+The Events Manager consumes realtime events during conversations - it provides a framework to consume and take action on these events asynchronously.
 
-## EventsManager Class
-
-The `EventsManager` class is responsible for managing the event queue and handling events asynchronously. The class provides methods for publishing events, starting the event loop, handling events, and ending the event loop.
-
-### Initialization
+## Current Event Types
 
-```python
-def __init__(self, subscriptions: List[EventType] = []):
-    self.queue = asyncio.Queue()
-    self.subscriptions = set(subscriptions)
-    self.active = False
-```
+The current event types include:
 
-The `EventsManager` constructor accepts an optional list of `EventType` subscriptions. By default, it initializes an empty set of subscriptions, an asynchronous queue, and sets the `active` attribute to `False`.
+1. `TRANSCRIPT`: Indicates a partial transcript for the conversation has been received.
+2. `TRANSCRIPT_COMPLETE`: Indicates the transcript is complete (i.e. the conversation has ended).
+3. `ACTION`: Indicates that a Vocode action has begun or completed.
+4. `PHONE_CALL_CONNECTED`: Indicates a phone call has been connected (only sent during `PhoneConversation`s).
+5. `PHONE_CALL_ENDED`: Indicates a phone call has ended.
 
-### Publishing Events
+## Usage
 
-```python
-def publish_event(self, event: Event):
-    if event.type in self.subscriptions:
-        self.queue.put_nowait(event)
-```
+Using the events manager to take action when events fire requires that you subclass `vocode.streaming.utils.EventsManager` and override the `handle_event` method.
 
-The `publish_event` method takes an `Event` object as input and adds it to the queue if its type is in the set of subscribed event types.
+You can also configure which events your `EventsManager` is subscribed to by using the `subscriptions` property (see example).
 
-### Starting the Event Loop
+### Example
 
 ```python
-async def start(self):
-    self.active = True
-    while self.active:
-        try:
-            event: Event = await self.queue.get()
-        except asyncio.QueueEmpty:
-            await asyncio.sleep(1)
-        self.handle_event(event)
-```
+from vocode.streaming.models.events import Event, EventType
+from vocode.streaming.models.transcript import TranscriptCompleteEvent
+from vocode.streaming.utils.events_manager import EventsManager
 
-## Current Event Types
 
-The current event types include:
-
-1. `TRANSCRIPT`: Indicates a partial transcript for the conversation has been received.
-2. `TRANSCRIPT_COMPLETE`: Indicates the transcript is complete (ie conversation has ended).
-3. `PHONE_CALL_CONNECTED`: Indicates a phone call has been connected.
-4. `PHONE_CALL_ENDED`: Indicates a phone call has ended.
-5. `RECORDING`: (Vonage Only) Indicates a secure URL containing a recording of the call is available. Requires `recording=true` in `VonageConfig`.
+class CustomEventsManager(EventsManager):
+    def __init__(self):
+        super().__init__([EventType.TRANSCRIPT_COMPLETE])
 
-## Example Usage
+    async def handle_event(self, event: Event):
+        if isinstance(event, TranscriptCompleteEvent):
+            print("The call has finished, the transcript was", event.transcript.to_string())
+```
 
-The following example demonstrates how the `EventsManager` class can be used to consume the `TRANSCRIPT_COMPLETE` event and save the transcript to a file using the `add_transcript` method:
+In this example, we create a custom `EventsManager` subclass with a subscription to the `TRANSCRIPT_COMPLETE` event and print the transcript when we receive it.
 
-```python
-import logging
-from fastapi import FastAPI
-from vocode.streaming.models.events import Event, EventType, TranscriptCompleteEvent
-from vocode.streaming.utils import events_manager
-from call_transcript_utils import add_transcript
+To use `CustomEventsManager`, you can pass it into any Conversation, e.g.
 
-app = FastAPI(docs_url=None)
+```python
+...
+conversation = StreamingConversation(
+    ...,
+    events_manager=CustomEventsManager()
+)
+```
 
-logging.basicConfig()
-logger = logging.getLogger(__name__)
-logger.setLevel(logging.DEBUG)
+You can also pass it into a `TelephonyServer`, like:
 
-class CustomEventsManager(events_manager.EventsManager):
-    def __init__(self):
-        super().__init__(subscriptions=[EventType.TRANSCRIPT_COMPLETE])
-
-    def handle_event(self, event: Event):
-        if event.type == EventType.TRANSCRIPT_COMPLETE:
-            transcript_complete_event = typing.cast(TranscriptCompleteEvent, event)
-            add_transcript(
-                transcript_complete_event.conversation_id,
-                transcript_complete_event.transcript,
-            )
-
-events_manager_instance = CustomEventsManager()
-await events_manager_instance.start()
-```
-
-In this example, a custom `EventsManager` subclass is created with a subscription to the `TRANSCRIPT_COMPLETE` event. The `handle_event` method is overridden to save the transcript to a file using the `add_transcript` method when the `TRANSCRIPT_COMPLETE` event is received.
+```python
+server = TelephonyServer(
+    ...,
+    events_manager=CustomEventsManager()
+)
+```
````
docs/open-source/logging-with-loguru.mdx (new file, +73)

---
title: "Logging with Loguru"
description: "Make logging setup less painful for local and production usage!"
---

Loguru is a powerful and flexible logging library for Python that simplifies logging setup and usage. It provides a more intuitive and feature-rich alternative to Python's built-in `logging` module.

## Why Use Loguru?

Loguru offers several advantages over the standard `logging` module:

- **Ease of Use**: Loguru simplifies the process of setting up and using loggers.
- **Rich Features**: It provides advanced features like automatic exception catching, structured logging, and more.
- **Flexibility**: Loguru allows for easy configuration of different logging formats and destinations.
## Using the Vocode Implementation

The Vocode implementation of Loguru provides a seamless way to integrate logging into your application. It includes custom handlers and configuration functions to streamline the setup process. When you use the JSON logging configuration, it also pulls relevant context such as `conversation_id` into the JSON output for better production debugging!

### Setting Up Logging

To set up logging in your application, use the provided configuration functions. Here's how to configure pretty printing for local development and JSON logging for production:

#### Pretty Printing Locally

To enable pretty printing locally, use the `configure_pretty_logging` function. This sets up Loguru to output logs with colored formatting, making them easier to read during development.

```python
from vocode.logging import configure_pretty_logging

configure_pretty_logging()
```

#### JSON Logging in Production

For production environments, you may want to log in JSON format for better integration with logging systems and easier parsing. Use the `configure_json_logging` function to set this up.

```python
from vocode.logging import configure_json_logging

configure_json_logging()
```

### Why Use Different Setups?

Using different logging setups for local and production environments is beneficial for several reasons:

- **Readability**: Pretty printing makes logs easier to read during development, helping you quickly identify issues.
- **Structured Logging**: JSON logging produces structured logs that are easier to parse and analyze in production, especially with log aggregation and monitoring tools.

## Example Snippet

Here's an example of how you can set up logging in your application:

```python
import os

from vocode.logging import configure_json_logging, configure_pretty_logging

DEPLOYED_ENVIRONMENTS = ["production", "staging"]
ENVIRONMENT = os.environ.get("ENVIRONMENT", "development")


def configure_logging() -> None:  # pragma: no cover
    """Configures logging based on the environment."""
    if ENVIRONMENT in DEPLOYED_ENVIRONMENTS:
        configure_json_logging()
    else:
        configure_pretty_logging()


# Configure logging based on the environment
configure_logging()

# Your application code here
```

docs/open-source/react-quickstart.mdx (+15 −25)
````diff
@@ -13,7 +13,7 @@ Or, start from our [Replit template](https://replit.com/@vocode/Simple-Conversat
 
 ## Setting up the conversation
 
-Our self-hosted backend allows you to expose a websocket route in the same format that our hosted backend does. This allows you to deploy any agent you'd like into the conversation.
+Our self-hosted backend allows you to expose a websocket route that operates like `StreamingConversation`.
 
 To get started, clone the Vocode repo or copy the [client backend app](https://github.com/vocodedev/vocode-python/tree/main/apps/client_backend) directory.
 
@@ -56,35 +56,25 @@ uvicorn main:app --port 3000
 
 You now have a server with a Vocode websocket route at localhost:3000! You can now use the `useConversation` hook with your self-hosted backend as follows:
 
-```javascript
-const { status, start, stop, analyserNode } = useConversation({
+```typescript
+import { useConversation } from "vocode";
+
+const { status, start, stop, error, analyserNode } = useConversation({
   backendUrl: "<YOUR_BACKEND_URL>", // looks like ws://localhost:3000/conversation or wss://asdf1234.ngrok.app/conversation if using ngrok
   audioDeviceConfig: {},
 });
 ```
 
-# Demo installation and setup
-
-Clone the `vocode-react-demo` [repository](https://github.com/vocodedev/vocode-react-demo).
-
-```
-$ git clone https://github.com/vocodedev/vocode-react-demo.git
-```
-
-Run npm install inside the directory to download all of the dependencies.
-
-```
-$ npm install
-```
-
-Set your Client SDK key inside of your `.env`
+Use the `status`, `start`, and `stop` objects within your React components to control conversations with your self-hosted backend, e.g.
 
-```
-REACT_APP_VOCODE_API_KEY=YOUR KEY HERE
-```
-
-Start the application
+```jsx
+<>
+  {status === "idle" && <p>Press me to talk!</p>}
+  {status === "error" && error && <p>{error.message}</p>}
 
-```
-$ npm start
+  <button
+    disabled={["connecting"].includes(status)}
+    onClick={status === "connected" ? stop : start}
+  ></button>
+</>
 ```
````
